AceForge

AceForge is a local-first AI music workstation for macOS, built for Apple Silicon and powered by ACE-Step.

Status: ALPHA


Features

  • 100% Local (only needs to download models once)
  • Music Generation with ACE-Step prompts
    • Use stem separation to rebalance vocals vs. instrumentals
    • Use existing audio as a reference (optional)
    • Train ACE-Step LoRAs from your own datasets
      • Mass-create _prompt.txt / _lyrics.txt files
      • Auto-tag datasets using MuFun-ACEStep (experimental)
  • Stem Splitting using Demucs for high-quality audio separation
  • Voice Cloning TTS using XTTS v2
  • MIDI Generation using basic-pitch for audio-to-MIDI transcription
  • Embedded Music Player to explore generation catalog
  • Manage and reuse prompt presets

System requirements

Minimum

  • macOS 12.0 (Monterey) or later
  • Apple Silicon (M1/M2/M3) or Intel Mac with AMD GPU
  • 16 GB unified memory (Apple Silicon) or 16 GB RAM (Intel)
  • ~10–12 GB VRAM/unified memory (more = more headroom)
  • SSD with tens of GB free (models + audio + datasets)

Recommended

  • Apple Silicon M1 Pro/Max/Ultra, M2 Pro/Max/Ultra, or M3 Pro/Max
  • 32 GB+ unified memory
  • Fast SSD
  • Comfort reading terminal logs when something goes wrong

Install and run

Download a Pre-built macOS Release

Download the latest App from the Releases page.

Installation:

  1. Download AceForge-macOS.dmg from the latest release
  2. Open the DMG file
  3. Drag AceForge.app to your Applications folder (or any location on your Mac)

To Launch:

  • Double-click AceForge.app

Note: On first launch, macOS may show a security warning because the app is not notarized by Apple. Go to System Settings > Privacy & Security and click Open Anyway. This is normal for apps downloaded from the internet that are not distributed through the Mac App Store.

Note: If macOS prevents the app from opening with a "damaged" error, run the following command:
sudo xattr -cr /Applications/AceForge.app

Note: The app bundle does NOT include the large model files. On first run, it will download the ACE-Step models (several GB) automatically. You can monitor the download progress in the Terminal window or in the Server Console panel in the web interface.

Using AceForge (high-level workflow)

  1. Launch AceForge and wait for the UI
  2. Go to Generate → create tracks from prompt (and lyrics if desired)
  3. Browse/manage tracks in Music Player
  4. (Optional) Use stem controls to adjust vocal/instrumental balance
  5. (Optional) Stem Splitting: Separate any audio file into individual stems
  6. (Optional) Voice Clone: TTS voice cloning using reference clips
  7. (Optional) Build a dataset and train a LoRA in Training

Generation basics

  • Prompt: your main ACE-Step tags / description (genre, instruments, mood, context)
  • Instrumental mode:
    • Lyrics are not used
    • AceForge uses the [inst] token so ACE-Step focuses on backing tracks
  • Vocal mode:
    • Provide lyrics using markers like [verse], [chorus], [solo], etc. (see the example below)
  • Presets let you save/load a whole “knob bundle” (text + sliders)
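
For example, a vocal-mode lyric sheet using those markers might look like this (the words themselves are only an illustration):

[verse]
Neon rain on the midnight train
Counting echoes in my head

[chorus]
Hold the line, hold the light
We're not done yet

[solo]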

Stem separation (vocals vs instrumentals)

AceForge can run audio-separator as a post-processing step so you can rebalance:

  • Vocals level (dB)
  • Instrumental level (dB)

For fast iteration, generate with both gains at 0 dB, then enable stem separation only once you like a track.

First use requires downloading a large stem model, and separation adds a heavy processing step.
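
For the curious, the rebalance is conceptually a separate, adjust, remix pipeline. Here is a minimal sketch, assuming the python-audio-separator and pydub packages (file names, gain values, and the output order of separate() are illustrative; this is not AceForge's actual code):

from audio_separator.separator import Separator
from pydub import AudioSegment

separator = Separator()
separator.load_model()  # downloads the default separation model on first use

# separate() writes the stem files and returns their paths; which path is
# vocals vs. instrumental is assumed here purely for illustration
vocals_path, instrumental_path = separator.separate("track.mp3")[:2]

vocals = AudioSegment.from_file(vocals_path) + 3.0              # vocals gain, in dB
instrumental = AudioSegment.from_file(instrumental_path) - 2.0  # instrumental gain, in dB

remix = instrumental.overlay(vocals)  # mix the gain-adjusted stems back together
remix.export("track_rebalanced.mp3", format="mp3")  # requires ffmpeg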

Stem Splitting (Demucs)

The Stem Splitting tab uses Demucs for high-quality audio separation of any audio file.

Features:

  • 2-stem mode: Separate vocals and instrumentals
  • 4-stem mode: Separate vocals, drums, bass, and other instruments
  • 6-stem mode: Even finer separation including piano and guitar

Upload any audio file (MP3, WAV, etc.) and AceForge will split it into individual stem tracks that appear in the Music Player.

First use requires downloading the Demucs model (~80 MB). Processing time varies with file length and your device (Apple Silicon MPS is faster than CPU).
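
For reference, the three modes map onto Demucs roughly like this sketch (using the demucs package's documented scripting entry point; the input file name is illustrative):

import demucs.separate

# 2-stem mode: vocals vs. everything else
demucs.separate.main(["--two-stems", "vocals", "song.mp3"])

# 4-stem mode (default htdemucs model): vocals, drums, bass, other
demucs.separate.main(["song.mp3"])

# 6-stem mode: adds piano and guitar
demucs.separate.main(["-n", "htdemucs_6s", "song.mp3"])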

Voice cloning (XTTS v2)

The Voice Clone tab uses XTTS v2 to synthesize speech in a cloned voice.

  • Upload a short reference clip (MP3, WAV), enter the text, and generate.
  • Output is converted to MP3 and shown in the Music Player.

ffmpeg must be installed (e.g. brew install ffmpeg) for non-WAV references.

First use requires downloading a large XTTS model (~1.9 GB), so please be patient.
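
Under the hood, a clone-and-speak call with the Coqui TTS package looks roughly like this sketch (text and file names are illustrative):

from TTS.api import TTS

tts = TTS("tts_models/multilingual/multi-dataset/xtts_v2")  # triggers the ~1.9 GB download on first use

tts.tts_to_file(
    text="Hello from AceForge.",
    speaker_wav="reference.wav",  # short clip of the voice to clone
    language="en",
    file_path="cloned_output.wav",
)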

MIDI Generation (basic-pitch)

The MIDI Generation tab uses basic-pitch (by Spotify) to convert audio files to MIDI format through automatic music transcription.

Features:

  • Convert any audio file (MP3, WAV, etc.) to MIDI format
  • Adjustable transcription parameters:
    • Onset threshold and frame threshold for note detection
    • Minimum note length and minimum frequency
    • Tempo estimation
    • Pitch bend detection
    • Melodia trick (improved pitch tracking)

The basic-pitch models are bundled with the app, so no download is required. Processing time varies based on file length and your device.
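
The parameters listed above map directly onto basic-pitch's predict() helper. A minimal sketch (the values shown are the library defaults; the file names are illustrative):

from basic_pitch.inference import predict

model_output, midi_data, note_events = predict(
    "song.mp3",
    onset_threshold=0.5,         # note-onset detection sensitivity
    frame_threshold=0.3,         # sustained-note detection sensitivity
    minimum_note_length=127.7,   # in milliseconds
    minimum_frequency=None,      # Hz; None means no lower bound
    multiple_pitch_bends=False,  # per-note pitch bend detection
    melodia_trick=True,          # improved pitch tracking
)

midi_data.write("song.mid")  # midi_data is a pretty_midi.PrettyMIDI object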

LoRA training

Switch to the Training tab to configure and start LoRA runs.

Dataset structure

Datasets must live under:

<AceForge root>/training_datasets

For each audio file (foo.mp3 or foo.wav), provide:

  • foo_prompt.txt — ACE-Step prompt/tags for that track
  • foo_lyrics.txt — lyrics, or [inst] for instrumentals
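
For example, a small dataset with two tracks would look like:

training_datasets/
  foo.mp3
  foo_prompt.txt   # ACE-Step tags for foo.mp3
  foo_lyrics.txt   # lyrics, or just [inst]
  bar.wav
  bar_prompt.txt
  bar_lyrics.txt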

AceForge includes tools to bulk-create these files (and optionally auto-generate them with MuFun-ACEStep).

Training parameters (examples)

  • Adapter name (experiment name)
  • LoRA config preset (JSON from training_config)
  • Epochs / max steps
  • Learning rate (commonly 1e-4 to 1e-5)
  • Max clip seconds (lower can reduce VRAM and speed up training)
  • Optional SSL loss weighting (set to 0 for some instrumental datasets)
  • Checkpoint/save cadence
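
To make those knobs concrete, a preset could bundle values like the following sketch (all key names here are hypothetical placeholders; the real ones come from the JSON presets in training_config):

# Hypothetical preset values, for illustration only
preset = {
    "adapter_name": "my_first_lora",
    "epochs": 10,
    "learning_rate": 1e-4,      # commonly 1e-4 to 1e-5
    "max_clip_seconds": 30,     # lower values reduce memory use and speed up training
    "ssl_loss_weight": 0.0,     # 0 for some instrumental datasets
    "save_every_n_steps": 500,  # checkpoint cadence
}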

Experimental: MuFun-ACEStep dataset analyzer

MuFun-ACEStep can auto-generate _prompt.txt and _lyrics.txt files from audio. It’s powerful but:

  • The model is large (tens of GB)
  • Outputs aren’t perfect—skim and correct weird tags/lyrics before training

Troubleshooting

Common Issues

  • First launch takes forever: Check terminal for pip/model download errors; verify disk space and network
  • No tracks found: Generate a track or run Voice Clone; confirm Output Directory matches the Music Player folder
  • Memory issues:
    • Reduce target length during generation
    • Reduce max clip seconds during training
    • Lower batch/grad accumulation if you changed them

Building Releases

Pre-built macOS application bundles are automatically created via GitHub Actions. To build locally, use the provided scripts.

Code Signing: The build includes automated code signing to prevent macOS security warnings that would otherwise require running sudo xattr -cr /Applications/AceForge.app. By default, the script uses ad-hoc signing (no Apple Developer certificate required). For distribution, you can provide a Developer ID certificate. See build/macos/README.md for detailed documentation on code signing options.

The build process creates a self-contained macOS application that includes:

  • Python runtime and all dependencies
  • Static files (HTML, CSS, JS)
  • Configuration files
  • Documentation

Note: The app bundle does NOT include the large AI model files (several GB). These are downloaded automatically on first run.

Contributing

Issues and PRs welcome. If you’re changing anything related to training, model setup, or packaging, please include:

  • what GPU/driver you tested on
  • exact steps to reproduce any bug you fixed

(Consider adding CONTRIBUTING.md once you have preferred norms.)

License

This project’s source code is licensed under the Apache License 2.0. See LICENSE.