ActAvatar
Temporally-Aware Precise Action Control
for Talking Avatars
We present ActAvatar, a novel framework that achieves precise temporal control over talking avatar actions at the phase level. While existing methods generate avatars from holistic text descriptions without temporal grounding, ActAvatar introduces structured prompts with explicit temporal boundaries, enabling users to specify exactly when and what actions should occur during speech.