Configuring Voice Settings - LiveAvatar Documentation

Voice settings are configured under avatar_persona.voice_settings when creating a session token. LiveAvatar supports two TTS providers.

Shared Settings

These apply to all providers:

Parameter	Type	Range	Default	Description
`speed`	number	0.8–1.2	1	Speaking speed. Below 1 slows down, above 1 speeds up.

ElevenLabs

Parameter	Type	Range	Default	Description
`stability`	number	0–1	0.75	Voice consistency. Higher = more predictable.
`similarity_boost`	number	0–1	0.75	Likeness to original voice.
`style`	number	0–1	0	Style intensity exaggeration.
`use_speaker_boost`	boolean	—	true	Voice clarity enhancement.
`model`	string	—	—	`eleven_flash_v2_5` (low latency) or `eleven_multilingual_v2`.

Fish Audio

Fish Audio is currently in beta. Supports English, Chinese, and Japanese only.

Parameter	Options	Default	Description
`latency_mode`	`balanced`, `normal`	`balanced`	Speed vs. quality tradeoff.
`model`	`s1`, `s2`	`s2`	Speech generation model.

Example

{
  "avatar_persona": {
    "voice_id": "<voice_id>",
    "voice_settings": {
      "speed": 1.0,
      "stability": 0.8,
      "similarity_boost": 0.75,
      "model": "eleven_flash_v2_5"
    }
  }
}

Configuration

Push-to-Talk

⌘I

​Shared Settings

​ElevenLabs

​Fish Audio

​Example

Shared Settings

ElevenLabs

Fish Audio

Example