LinuxCommandLibrary

tts

Neural text-to-speech synthesis

TLDR

Synthesize speech

$ tts --text "[Hello world]"
Save to file
$ tts --text "[Hello]" --out_path [output.wav]
List models
$ tts --list_models
Use specific model
$ tts --text "[text]" --model_name [tts_models/en/ljspeech/tacotron2-DDC]
List speakers
$ tts --model_name [model] --list_speaker_idxs

SYNOPSIS

tts [--text text] [--out_path file] [--model_name model] [options]

DESCRIPTION

tts is the command-line interface for Coqui TTS, an open-source deep learning text-to-speech toolkit. It converts text input into natural-sounding speech audio using neural network models, supporting multiple languages and voice styles.
The tool ships with a variety of pre-trained models that can be listed with --list_models and selected by name with --model_name. Multi-speaker models offer several voices, chosen with --speaker_idx. Audio output can be saved to WAV files or played directly.
Models are downloaded automatically on first use and cached locally. While CPU inference is supported, a GPU is recommended for faster synthesis, especially with larger models.
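The single-shot usage described above can be extended to a small batch job. The following is a minimal sketch, not part of Coqui TTS: batch_tts and TTS_BIN are hypothetical names introduced here, and the function uses only the --text, --model_name and --out_path flags shown on this page. It assumes one utterance per line of the input file.

```shell
# Sketch: synthesize each non-empty line of a text file into its own WAV file.
# batch_tts and TTS_BIN are hypothetical names, not part of Coqui TTS.

TTS_BIN="${TTS_BIN:-tts}"   # override to point at a different executable

batch_tts() {
    # $1 = input text file, $2 = output directory, $3 = model name
    mkdir -p "$2" || return 1
    _n=0
    # The "|| [ -n ... ]" part also handles a last line without a newline.
    while IFS= read -r _line || [ -n "$_line" ]; do
        [ -n "$_line" ] || continue      # skip blank lines
        _n=$((_n + 1))
        # stdin is redirected so the command cannot consume the input file.
        "$TTS_BIN" --text "$_line" --model_name "$3" \
            --out_path "$(printf '%s/line_%03d.wav' "$2" "$_n")" < /dev/null || return 1
    done < "$1"
    printf 'synthesized %d file(s)\n' "$_n"
}
```

For example, batch_tts script.txt out tts_models/en/ljspeech/tacotron2-DDC would write out/line_001.wav, out/line_002.wav, and so on. The model is fetched only on the first call, but every call starts a new process and loads the model again, so this approach suits short batches.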

PARAMETERS

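--text TEXT
Text to synthesize.
--out_path FILE
Path of the output audio (WAV) file.
--model_name MODEL
Name of the pre-trained model to use, as printed by --list_models.
--list_models
List the available pre-trained models.
--speaker_idx ID
Speaker to use with a multi-speaker model.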
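
The options above can be combined in a small wrapper that adds the optional flags only when they are set. This is an illustrative sketch under stated assumptions: speak, TTS_BIN, TTS_MODEL and TTS_SPEAKER are hypothetical names chosen for this example, and only the flags listed in this section are passed.

```shell
# Sketch: build a tts command line from the flags listed above.
# speak, TTS_BIN, TTS_MODEL and TTS_SPEAKER are hypothetical names.

TTS_BIN="${TTS_BIN:-tts}"   # override to point at a different executable

speak() {
    if [ -z "${1:-}" ]; then
        echo "usage: speak TEXT [OUTPUT.wav]" >&2
        return 2
    fi
    # Start from the two flags that are always passed.
    set -- --text "$1" --out_path "${2:-output.wav}"
    # Append optional flags only when the matching variable is non-empty.
    if [ -n "${TTS_MODEL:-}" ]; then
        set -- "$@" --model_name "$TTS_MODEL"
    fi
    if [ -n "${TTS_SPEAKER:-}" ]; then
        set -- "$@" --speaker_idx "$TTS_SPEAKER"
    fi
    "$TTS_BIN" "$@"
}
```

With no variables set, speak "Hello" runs the default model and writes output.wav. Setting TTS_MODEL to a multi-speaker model and TTS_SPEAKER to one of its speaker IDs selects a specific voice.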

CAVEATS

Requires Python. Models can be large and are downloaded on first use. CPU inference works, but a GPU is recommended for faster synthesis.
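
A script that depends on tts can check these prerequisites before doing any work. The helper below is a hypothetical sketch (require_cmds is not part of Coqui TTS). It only tests whether the named commands are on PATH, and the assumption that the interpreter is called python3 may not hold on every system.

```shell
# Sketch: fail early when a required command is not on PATH.
# require_cmds is a hypothetical helper name.

require_cmds() {
    _missing=0
    for _cmd in "$@"; do
        if ! command -v "$_cmd" >/dev/null 2>&1; then
            echo "missing: $_cmd" >&2
            _missing=1
        fi
    done
    # Returns 0 when every command was found, 1 otherwise.
    return "$_missing"
}
```

A typical call would be require_cmds python3 tts || exit 1 at the top of a script.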

HISTORY

Coqui TTS grew out of Mozilla's TTS project and continues it as an open-source deep learning text-to-speech toolkit with multiple model architectures.

SEE ALSO

espeak(1), festival(1), piper(1)
