Efficient ways to use cues
To make FantasyTalking respond accurately to expression and motion cues:
- Sentence Structure:adoption
主体+状态+动作
The canonical syntax of"The woman is smiling gently while nodding occasionally"
- Strength Grading:Use adverbs of degree (gently/moderately/strongly) for precise control:
Minor movements →slight head movements
Medium →moderate hand gestures
Strong performance →dramatic facial expressions
- Multi-conditional combinations:Separate multiple action descriptions with commas, but the total number is recommended to be no more than 3
- A guide to avoiding the pit:Avoid contradictory instructions (e.g.
static
together withwaving
), cartoon style needs to be addedcartoon-style
prefix (linguistics)
Experiments have shown that matching--prompt_cfg_scale=4.5
Response to cues is most stable when
This answer comes from the articleFantasyTalking: an open-source tool for generating realistic speaking portraitsThe