🎵

ZipVoice

Zero-shot text-to-speech with instant voice cloning

💻 CPU Mode

Step 1 / 步驟一 Drop your reference voice (1–3 s) / 拖放 1–3 秒的參考語音

Step 2 / 步驟二 Transcribe the prompt or let ZipVoice auto-transcribe / 手動或自動生成轉寫

Step 3 / 步驟三 Write the target text and generate / 輸入目標文本並開始合成

🎤 Voice Prompt / 參考語音

Drop or select an audio file / 拖放或選擇音頻文件

Tip: use a clear 1–3 second sample for best results. 提示：請使用 1–3 秒的清晰語音，以獲得最佳效果。

📝 Prompt transcription / 提示文本

Textbox

✍️ Text to Synthesize / 合成文本

Textbox

🔊 Result & Status / 輸出與狀態

Playback / 播放

Ready to synthesize. Please upload a prompt and click generate! / 準備就緒：請上傳參考語音並開始合成。

⚡ Quick Examples / 快速範例

Try a scenario in one click / 一鍵體驗範例

	Drop or select an audio file / 拖放或選擇音頻文件		Model / 模型	Speaking speed / 語速