F5 TTS (CPU Basic)
This is a CPU-optimized web UI for F5 TTS with advanced batch processing support. This app supports the following TTS models:
- F5-TTS (A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching)
- E2 TTS (Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS)
The checkpoint was trained with Polish British English American English German Russian Ukrainian other languages may not work correctly.
If you're having issues, try converting your reference audio to WAV or MP3, clipping it to 10s, and shortening your prompt.
NOTE: This runs on CPU Basic. Generation will be slower than GPU. Reference text will be automatically transcribed with Whisper if not provided. For best results, keep your reference clips short (<15s). Ensure the audio is fully uploaded before generating.
Batched TTS
#Select Reference Language
Choose Language
#Select Synthesized Language
Choose Language