🎵
ZipVoice
Zero-shot text-to-speech with instant voice cloning
💻 CPU Mode
Step 1 / 步驟一
Drop your reference voice (1–3 s) / 拖放 1–3 秒的參考語音
Step 2 / 步驟二
Transcribe the prompt or let ZipVoice auto-transcribe / 手動或自動生成轉寫
Step 3 / 步驟三
Write the target text and generate / 輸入目標文本並開始合成
🎤 Voice Prompt / 參考語音
Tip: use a clear 1–3 second sample for best results. 提示:請使用 1–3 秒的清晰語音,以獲得最佳效果。
📝 Prompt transcription / 提示文本
✍️ Text to Synthesize / 合成文本
Model / 模型
zipvoice = highest fidelity · zipvoice_distill = faster generation / zipvoice = 最高音質 · zipvoice_distill = 更快生成
0.5 2
🔊 Result & Status / 輸出與狀態
Ready to synthesize. Please upload a prompt and click generate! / 準備就緒:請上傳參考語音並開始合成。
⚡ Quick Examples / 快速範例
Try a scenario in one click / 一鍵體驗範例
| Drop or select an audio file / 拖放或選擇音頻文件 | Model / 模型 | Speaking speed / 語速 |
|---|
Created with ❤️ by the ZipVoice team on Gradio / 由 ZipVoice 團隊基於 Gradio 構建