Piper Hub




Advanced Settings
More about auto play

Flag to have the audio play automatically after synthesizing.

More about speaking rate

Controls how fast the voice speaks the text. A value of 1 is the speed of the training dataset. Values less than 1 are faster, and values greater than 1 are slower.

More about audio volatility

The amount of noise added to the generated audio (0-1). Can help mask audio artifacts from the voice model. Multi-speaker models tend to sound better with a lower amount of noise than single-speaker models.

More about phoneme volatility

The amount of noise used to generate phoneme durations (0-1). Allows for variable speaking cadence, with a value closer to 1 being more variable. Multi-speaker models tend to sound better with a lower amount of phoneme variability than single-speaker models.
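
As a rough illustration of how the settings above might be checked before synthesis, the sketch below keeps the two volatility values inside the 0-1 range described above and rejects a speaking rate that is not positive. The SynthesisSettings class, its field names, and its default values are hypothetical and are not taken from Piper Hub or Piper.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SynthesisSettings:
    """Hypothetical container for the advanced settings (names are assumptions)."""

    # Illustrative defaults only; they are not Piper Hub's actual defaults.
    speaking_rate: float = 1.0       # 1 = training speed, <1 faster, >1 slower
    audio_volatility: float = 0.5    # noise added to the generated audio (0-1)
    phoneme_volatility: float = 0.5  # noise used for phoneme durations (0-1)
    auto_play: bool = False          # play the audio right after synthesizing

    def __post_init__(self) -> None:
        # A rate of zero or below has no meaningful interpretation.
        if self.speaking_rate <= 0:
            raise ValueError("speaking_rate must be greater than 0")
        # Both volatility settings are documented as lying in the range 0-1.
        for name in ("audio_volatility", "phoneme_volatility"):
            value = getattr(self, name)
            if not 0.0 <= value <= 1.0:
                raise ValueError(f"{name} must be between 0 and 1, got {value}")


def describe_rate(settings: SynthesisSettings) -> str:
    """Summarize the speaking rate relative to the training dataset."""
    if settings.speaking_rate < 1:
        return "faster than the training dataset"
    if settings.speaking_rate > 1:
        return "slower than the training dataset"
    return "same speed as the training dataset"
```

For example, SynthesisSettings(speaking_rate=0.8) describes speech that is faster than the training data, while SynthesisSettings(audio_volatility=1.5) raises a ValueError.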

Upload a Voice
More about JSON File

The Piper configuration JSON file for the model, with the extension '.onnx.json'.

More about the ONNX File

The Open Neural Network Exchange (ONNX) file. This is the model file for the voice.

More about Model Card

The MODEL_CARD file for each voice contains important licensing information. Piper is intended for text-to-speech research, and does not impose any additional restrictions on voice models. Some voices may have restrictive licenses, however, so please review them carefully!

More about voice name

Use the voice name to override the default name, which is taken from the dataset entry in Piper's config. Some models don't follow the Piper schema for the meaning of the dataset entry in the config.
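
The override described above could be sketched as follows. This is a minimal illustration, not Piper Hub's implementation: the resolve_voice_name function is hypothetical, and it assumes the '.onnx.json' config is a JSON object whose "dataset" key normally holds the name as a string. The fallback to the file name is also an assumption.

```python
import json
from pathlib import Path
from typing import Optional

CONFIG_SUFFIX = ".onnx.json"


def resolve_voice_name(config_path: Path, override: Optional[str] = None) -> str:
    """Pick a display name for a voice (hypothetical helper).

    Order of preference: the user-supplied voice name, then the "dataset"
    entry in the config, then the config file name without '.onnx.json'.
    """
    # A non-empty voice name entered by the user always wins.
    if override and override.strip():
        return override.strip()

    config = json.loads(Path(config_path).read_text(encoding="utf-8"))

    # Assumption: a schema-compliant config stores the name as a string here.
    dataset = config.get("dataset") if isinstance(config, dict) else None
    if isinstance(dataset, str) and dataset.strip():
        return dataset.strip()

    # Non-compliant config: fall back to the file name.
    file_name = Path(config_path).name
    if file_name.endswith(CONFIG_SUFFIX):
        return file_name[: -len(CONFIG_SUFFIX)]
    return Path(config_path).stem
```

Calling resolve_voice_name(path, "My Voice") returns "My Voice" regardless of what the config contains, which is the case the voice name field is meant to cover.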





Installed Voices

Name        Lang       Quality       Speakers  Uninstall
Test Voice  Test Lang  Test Quality  1


Available Voices for Download:

Name        Lang       Quality       Speakers  Install
Test Voice  Test Lang  Test Quality  5