tts
Neural text-to-speech synthesis
TLDR
Synthesize speech from text and write it to an audio file
SYNOPSIS
tts [--text text] [--out_path file] [--model_name model] [options]
DESCRIPTION
tts is the command-line interface for Coqui TTS, an open-source deep learning text-to-speech toolkit. It converts text input into natural-sounding speech audio using neural network models, supporting multiple languages and voice styles.
The tool ships with a variety of pre-trained models that can be listed and selected by name. Multi-speaker models allow choosing between different voice characteristics via speaker indices. Audio output can be saved to WAV files or played directly.
Models are downloaded automatically on first use and cached locally. While CPU inference is supported, a GPU is recommended for faster synthesis, especially with larger models.
PARAMETERS
--text TEXT
Input text to synthesize.
--out_path FILE
Output audio file.
--model_name MODEL
TTS model to use, by name.
--list_models
List the available pre-trained models.
--speaker_idx ID
Speaker selection for multi-speaker models.
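The flags above combine in a few typical ways. The following is a minimal sketch, not an official recipe: the helper names (speak, list_models, speak_with_model, speak_with_speaker) are hypothetical wrappers introduced here for illustration, and only the flags listed in this section are used.

```shell
# Sketch only: these helpers are hypothetical wrappers around the flags
# listed in PARAMETERS; they are not part of Coqui TTS itself.

# Synthesize with the default model.
# $1 = text to speak, $2 = output WAV path
speak() {
    tts --text "$1" --out_path "$2"
}

# Print the names of the available pre-trained models.
list_models() {
    tts --list_models
}

# Synthesize with a specific model.
# $3 = model name exactly as printed by list_models
speak_with_model() {
    tts --text "$1" --out_path "$2" --model_name "$3"
}

# Synthesize with a specific voice of a multi-speaker model.
# $4 = speaker ID accepted by the chosen model
speak_with_speaker() {
    tts --text "$1" --out_path "$2" --model_name "$3" --speaker_idx "$4"
}
```

A call might look like speak_with_model "Hello, world." hello.wav tts_models/en/ljspeech/tacotron2-DDC, where the model name is only an illustration and should be checked against the output of list_models on the installed version.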
CAVEATS
Requires Python. Model downloads can be large. A GPU is recommended for faster synthesis, though CPU inference works.
HISTORY
Coqui TTS is an open-source deep learning text-to-speech toolkit that grew out of Mozilla's TTS project and provides multiple model architectures.
