What are the considerations when using SongGeneration?

2025-08-14

Note the following when using SongGeneration:

Input prompt: Avoid providing both prompt_audio_path and descriptions at the same time. Otherwise, the two inputs may conflict and degrade generation quality.
Lyrics format: Lyrics must be divided into structural segments with tags (e.g. [verse], [chorus]). Non-lyric segments (such as [intro-short]) should not contain lyrics.
Reference audio: For the best musicality, use the chorus of a song as the reference clip and keep it to 10 seconds or less.
Hardware requirements: The base model needs 10GB of GPU memory; using reference audio raises this to 16GB.
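
The notes above can be turned into a small pre-flight check. The following is a minimal sketch, not part of SongGeneration itself: the function names, the `lyrics` key, and the request dict layout are hypothetical, and the list of non-lyric tag prefixes is an assumption (only [intro-short] is named above). It does not check the length of the reference clip.

```python
import re

# Assumption: tags with these prefixes mark non-lyric segments. Only
# [intro-short] is named in the notes; the other prefixes are illustrative.
NON_LYRIC_PREFIXES = ("intro", "inst", "outro")

# Matches "[tag] text" pairs, where text runs up to the next "[".
SEGMENT_RE = re.compile(r"\[([^\]]+)\]([^\[]*)")


def check_request(item):
    """Return a list of warnings for one hypothetical request dict."""
    warnings = []

    # Input prompt: reference audio and a text description should not be combined.
    if item.get("prompt_audio_path") and item.get("descriptions"):
        warnings.append("prompt_audio_path and descriptions are both set")

    # Lyrics format: lyrics must be split into tagged segments.
    segments = SEGMENT_RE.findall(item.get("lyrics", ""))
    if not segments:
        warnings.append("lyrics contain no structure tags such as [verse]")
    for tag, text in segments:
        # Ignore whitespace and separator characters between segments.
        text = text.strip(" ;\n\t")
        if tag.startswith(NON_LYRIC_PREFIXES) and text:
            warnings.append(f"[{tag}] should not contain lyrics")

    return warnings


def required_gpu_memory_gb(item):
    """Hardware: 10GB for the base model, 16GB when reference audio is used."""
    return 16 if item.get("prompt_audio_path") else 10
```

For example, a request that sets both prompt_audio_path and descriptions and puts text after [intro-short] would produce two warnings, and `required_gpu_memory_gb` would report 16 for it.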