🎵

ZipVoice

Zero-shot text-to-speech with instant voice cloning

💻 CPU Mode
Step 1: Drop your reference voice (1–3 s)
Step 2: Transcribe the prompt or let ZipVoice auto-transcribe
Step 3: Write the target text and generate
🎤 Voice Prompt

Tip: use a clear 1–3 second sample for best results.
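The 1–3 second recommendation above can be checked before uploading. The following is a minimal sketch, not part of ZipVoice itself: it assumes the prompt is an uncompressed PCM WAV file, and the helper names `wav_duration_seconds` and `is_recommended_prompt_length` are hypothetical. It uses only Python's standard `wave` module.

```python
import wave

# Recommended prompt length from the tip above (seconds).
MIN_PROMPT_SECONDS = 1.0
MAX_PROMPT_SECONDS = 3.0


def wav_duration_seconds(path: str) -> float:
    """Return the duration of a PCM WAV file in seconds."""
    with wave.open(path, "rb") as wav:
        # Duration = number of audio frames / frames per second.
        return wav.getnframes() / wav.getframerate()


def is_recommended_prompt_length(path: str) -> bool:
    """True when the clip falls inside the recommended 1-3 second range."""
    duration = wav_duration_seconds(path)
    return MIN_PROMPT_SECONDS <= duration <= MAX_PROMPT_SECONDS
```

For example, `is_recommended_prompt_length("prompt.wav")` (a hypothetical file name) could be called before dropping the clip into the Voice Prompt box. Other formats such as MP3 or FLAC would need a different reader.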

📝 Prompt transcription
✍️ Text to Synthesize
Model

zipvoice = highest fidelity · zipvoice_distill = faster generation

Speaking speed: 0.5–2
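The form inputs on this page can be summarized as a small validated record. This is only an illustrative sketch: `SynthesisRequest` and its field names are hypothetical and not ZipVoice's API, the 0.5 and 2 values are assumed to be the bounds of the speaking-speed slider, and the default speed of 1.0 is also an assumption.

```python
from dataclasses import dataclass

# Model names as listed on the page.
MODELS = ("zipvoice", "zipvoice_distill")
# Assumed speaking-speed slider bounds.
MIN_SPEED, MAX_SPEED = 0.5, 2.0


@dataclass(frozen=True)
class SynthesisRequest:
    """Hypothetical container mirroring the demo's input fields."""

    prompt_wav: str         # path to the reference voice clip
    target_text: str        # text to synthesize
    prompt_text: str = ""   # empty means "let the demo auto-transcribe"
    model: str = "zipvoice"
    speed: float = 1.0      # assumed default, not stated on the page

    def __post_init__(self) -> None:
        # Reject values the form would not accept.
        if self.model not in MODELS:
            raise ValueError(f"model must be one of {MODELS}")
        if not MIN_SPEED <= self.speed <= MAX_SPEED:
            raise ValueError(f"speed must be in [{MIN_SPEED}, {MAX_SPEED}]")
        if not self.target_text.strip():
            raise ValueError("target_text must not be empty")
```

Validating in `__post_init__` keeps invalid combinations from ever being constructed, which mirrors how the form constrains the model dropdown and the slider.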
🔊 Result & Status

Ready to synthesize. Upload a voice prompt and click Generate.

⚡ Quick Examples
Try a scenario in one click
Drop or select an audio file · Model · Speaking speed