ACE-Step: A Step Towards Music Generation Foundation Model

Check this box to enable audio-to-audio generation from a reference audio clip.

LoRA Name or Path
Supports tags, descriptions, and scenes. Separate tags with commas.
Tag and lyric examples come from the AI music generation community.
Preset
Supports lyric structure tags such as [verse], [chorus], and [bridge] to separate parts of the lyrics.
Use [instrumental] or [inst] to generate instrumental music. Genre structure tags are not supported in lyrics.
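
As an illustration of the input format described above, here is a minimal Python sketch of a comma-separated tag string and a lyrics block with structure tags. The tag and lyric text are made up, and `split_sections` is a hypothetical helper written for this example, not part of ACE-Step; it only shows how the bracketed tags divide the lyrics.

```python
import re

# Hypothetical example inputs; the tag and lyric text are made up for illustration.
tags = "pop, upbeat, female vocals, 120 bpm"
lyrics = """[verse]
City lights are calling out my name
[chorus]
We run, we run into the night
[bridge]
Hold on until the morning comes
"""


def split_sections(text):
    """Group lyric lines under the most recent [tag] line (hypothetical helper)."""
    sections = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        match = re.fullmatch(r"\[(\w+)\]", line)
        if match:
            # A structure tag starts a new section.
            sections.append((match.group(1).lower(), []))
        elif sections:
            # A lyric line belongs to the section opened last.
            sections[-1][1].append(line)
    return sections


sections = split_sections(lyrics)

# An instrumental request is a single tag with no lyric lines.
instrumental = split_sections("[instrumental]")
```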
Scheduler Type

Scheduler type for generation. euler is recommended; heun takes more time.
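
To show why heun is slower, here is a generic Python sketch of the two textbook integration steps on a toy equation dx/dt = -x. It is not ACE-Step's scheduler code: the function names and the toy problem are assumptions for illustration. Heun's method evaluates the derivative (in a diffusion sampler, the model) twice per step, while Euler evaluates it once.

```python
import math


def euler_step(f, x, t, dt):
    # One derivative evaluation per step.
    return x + dt * f(x, t)


def heun_step(f, x, t, dt):
    # Euler predictor, then a corrector that averages the two slopes.
    k1 = f(x, t)
    k2 = f(x + dt * k1, t + dt)
    return x + 0.5 * dt * (k1 + k2)


def integrate(step, f, x0, t0, t1, n_steps):
    dt = (t1 - t0) / n_steps
    x, t = x0, t0
    for _ in range(n_steps):
        x = step(f, x, t, dt)
        t += dt
    return x


calls = {"n": 0}


def decay(x, t):
    # Toy derivative dx/dt = -x; the counter stands in for model evaluations.
    calls["n"] += 1
    return -x


exact = math.exp(-1.0)

calls["n"] = 0
x_euler = integrate(euler_step, decay, 1.0, 0.0, 1.0, 10)
euler_calls = calls["n"]

calls["n"] = 0
x_heun = integrate(heun_step, decay, 1.0, 0.0, 1.0, 10)
heun_calls = calls["n"]
```

With the same number of steps, the Heun run makes twice as many derivative calls and lands closer to the exact value on this toy problem.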

CFG Type

CFG type for generation. apg is recommended; cfg and cfg_star behave almost the same.
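
For reference, the standard classifier-free guidance (CFG) combination can be sketched in a few lines of Python. This shows only the plain cfg formula on lists of floats; `cfg_combine` is a hypothetical name, and the apg and cfg_star variants modify this update in ways not shown here.

```python
def cfg_combine(cond, uncond, scale):
    """Plain classifier-free guidance: move from the unconditional
    prediction toward the conditional one by a factor of `scale`."""
    return [u + scale * (c - u) for c, u in zip(cond, uncond)]


# Toy predictions, chosen only for illustration.
cond = [1.0, 2.0]
uncond = [0.5, 1.0]

guided = cfg_combine(cond, uncond, 3.0)
```

A scale of 1 returns the conditional prediction unchanged, and a scale of 0 returns the unconditional one; larger scales push further along the difference between the two.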

Use Entropy Rectifying Guidance (ERG) for tags. It multiplies the attention by a temperature, which weakens the tag condition and improves diversity.

The same, but applied to the lyric encoder's attention.

The same, but applied to the diffusion model's attention.
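
The effect of a temperature on attention can be sketched as follows. This is a generic illustration that assumes the temperature multiplies the attention scores before the softmax; the exact place where ACE-Step applies it is not stated above, and the function names and score values are made up.

```python
import math


def softmax(scores, temperature=1.0):
    # Assumption: the temperature multiplies the scores before the softmax.
    scaled = [s * temperature for s in scores]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def entropy(p):
    # Shannon entropy in nats; higher means a flatter distribution.
    return -sum(x * math.log(x) for x in p if x > 0)


# Toy attention scores for one query over four keys.
scores = [4.0, 1.0, 0.5, 0.0]

sharp = softmax(scores)                  # no temperature
soft = softmax(scores, temperature=0.5)  # temperature below 1 flattens the weights
```

With a temperature below 1, the attention weights spread more evenly across the keys, so no single conditioning token dominates, which matches the weaker condition and higher diversity described above.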
