Overseas access: www.kdjingpai.com
Bookmark Us
Current Position:fig. beginning " AI Answers

MOSS-TTSD supports up to 960 seconds of one-shot speech generation and zero-sample two-person speech cloning.

2025-08-19 475
Link directMobile View
qrcode

MOSS-TTSD offers significant technical advantages in voice generation. It supports single-shot speech generation up to 960 seconds, a feature that makes it particularly suitable for podcasts or long-form content production. On the other hand, its zero-sample two-person voice cloning feature can accurately clone the target speaker's tone and apply it to dialog scenarios without additional training. Users only need to provide a 10-second target audio clip, and the model can generate dialog voices that match the timbre, effectively distinguishing between different speakers.

Recommended

Can't find AI tools? Try here!

Just type in the keyword Accessibility Bing SearchYou can quickly find all the AI tools on this site.

Top