Operational processes for efficient generation of alternatives:
- Batch generation technology: Write more than one in a single JSONL file
idx
Entries, each fine-tuneddescriptions
Parameters (e.g. change of genre/instrument) - style matrix approach: Combine different BPMs (80/120/160), genders (male/female), and emotions (happy/sad) to generate a base version.
- automation script: modified in a Python loop
lyrics.jsonl
file, replacing one variable value at a time and calling thegenerate.sh
- Rapid screening mechanism: Add a
metadata
Tags (e.g.v1_jazz_110bpm
), feature clustering was automatically extracted with an audio analysis tool (librosa) - Reference Audio Multiplexing: Fixing the premium version of the
prompt_audio_path
The lyrics are only tweaked to generate variants
This answer comes from the articleSongGeneration: open-source AI model for generating high-quality music and lyricsThe