Voice settings are configured under avatar_persona.voice_settings when creating a session token. LiveAvatar supports two TTS providers.
Shared Settings
These apply to all providers:
| Parameter | Type | Range | Default | Description |
|---|
speed | number | 0.8–1.2 | 1 | Speaking speed. Below 1 slows down, above 1 speeds up. |
ElevenLabs
| Parameter | Type | Range | Default | Description |
|---|
stability | number | 0–1 | 0.75 | Voice consistency. Higher = more predictable. |
similarity_boost | number | 0–1 | 0.75 | Likeness to original voice. |
style | number | 0–1 | 0 | Style intensity exaggeration. |
use_speaker_boost | boolean | — | true | Voice clarity enhancement. |
model | string | — | — | eleven_flash_v2_5 (low latency) or eleven_multilingual_v2. |
Fish Audio
Fish Audio is currently in beta. Supports English, Chinese, and Japanese only.
| Parameter | Options | Default | Description |
|---|
latency_mode | balanced, normal | balanced | Speed vs. quality tradeoff. |
model | s1, s2 | s2 | Speech generation model. |
Example
{
"avatar_persona": {
"voice_id": "<voice_id>",
"voice_settings": {
"speed": 1.0,
"stability": 0.8,
"similarity_boost": 0.75,
"model": "eleven_flash_v2_5"
}
}
}