Every LITE Mode session follows three phases: starting, managing, and ending.
1. Starting a Session
- Generate a session token configured for LITE Mode on your backend
- Start the session using that token
- The avatar streams into the specified WebRTC room after initialization
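The token exchange above can be sketched as follows. This is a minimal illustration, not the documented LiveAvatar API: the request fields (`mode`, `avatar_id`) and the response key (`session_token`) are assumed names.

```python
import json

def build_session_request(avatar_id: str) -> str:
    """Build the JSON body your backend would POST to request a LITE Mode
    session token. Field names here are illustrative assumptions."""
    return json.dumps({"mode": "LITE", "avatar_id": avatar_id})

def extract_session_token(response_body: str) -> str:
    """Pull the session token out of a (hypothetical) token response body."""
    return json.loads(response_body)["session_token"]
```

Your client then starts the session with that token, and the avatar joins the WebRTC room once initialization completes.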
2. Managing the Session
LITE Mode provides a WebSocket connection for controlling the avatar. The typical flow:
- User speaks — audio is sent to the room
- Your agent processes — your STT/LLM/TTS pipeline handles the input
- Agent constructs response audio — your TTS generates the speech
- Agent streams audio via WebSocket — send audio chunks to LiveAvatar
- LiveAvatar renders video — avatar video frames are sent to the room
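The agent-side steps of this flow can be sketched as one conversational turn, with stand-in STT/LLM/TTS stages. The function names and bodies are placeholders for your own pipeline components, not LiveAvatar APIs:

```python
def transcribe(user_audio: bytes) -> str:
    """Stand-in STT: your speech-to-text component goes here."""
    return "hello avatar"

def generate_reply(text: str) -> str:
    """Stand-in LLM: your language model goes here."""
    return f"You said: {text}"

def synthesize(reply: str) -> bytes:
    """Stand-in TTS: your text-to-speech component goes here.
    Real output must match the audio format LiveAvatar expects."""
    return reply.encode("utf-8")

def handle_turn(user_audio: bytes) -> bytes:
    """One turn: STT -> LLM -> TTS. The returned audio is what you
    stream to LiveAvatar over the WebSocket."""
    return synthesize(generate_reply(transcribe(user_audio)))
```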
Audio format
Audio sent to LiveAvatar must be PCM 16-bit, 24 kHz, Base64-encoded. The recommended chunk size is ~1 second, with a maximum of 1 MB per WebSocket packet.
Latency
LiveAvatar generates avatar video in real time as audio arrives — there is no batch processing or queuing delay on the LiveAvatar side.
- Plugin path — end-to-end latency depends on your pipeline (STT, LLM, TTS, and network). LiveAvatar adds minimal overhead on top of your existing stack.
- Connector path — LiveAvatar manages the full connection to your voice agent and optimizes latency on your behalf.
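The audio-format constraints described above can be enforced with a small chunking helper. This is a sketch that assumes mono audio (the channel count is not specified above):

```python
import base64

SAMPLE_RATE = 24_000          # 24 kHz, as required by LiveAvatar
BYTES_PER_SAMPLE = 2          # 16-bit PCM
MAX_PACKET_BYTES = 1_000_000  # stay under the 1 MB WebSocket packet limit

def chunk_audio(pcm: bytes, seconds: float = 1.0):
    """Split raw PCM 16-bit 24 kHz audio into ~1 s Base64-encoded chunks.

    Assuming mono, 1 s of audio is 24_000 * 2 = 48_000 raw bytes,
    which is ~64 KB after Base64 — comfortably under the 1 MB limit.
    """
    step = int(SAMPLE_RATE * BYTES_PER_SAMPLE * seconds)
    for start in range(0, len(pcm), step):
        chunk = base64.b64encode(pcm[start:start + step]).decode("ascii")
        if len(chunk) > MAX_PACKET_BYTES:
            raise ValueError("chunk exceeds 1 MB WebSocket packet limit")
        yield chunk
```

Each yielded string is ready to be sent as one WebSocket packet.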
WebSocket commands
Through the WebSocket, you can:
- Command the avatar to speak (by sending audio)
- Interrupt avatar responses
- Modify avatar poses (listening, idle)
- Keep sessions alive
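These commands are typically sent as JSON messages over the WebSocket. A hypothetical sketch of the framing follows; the `type` values and field names are illustrative assumptions, not the documented LiveAvatar protocol:

```python
import json

def speak(b64_audio: str) -> str:
    """Send a Base64 audio chunk for the avatar to speak (hypothetical framing)."""
    return json.dumps({"type": "speak", "audio": b64_audio})

def interrupt() -> str:
    """Cut off the avatar's current response (hypothetical framing)."""
    return json.dumps({"type": "interrupt"})

def set_pose(pose: str) -> str:
    """Switch the avatar pose, e.g. "listening" or "idle" (hypothetical framing)."""
    return json.dumps({"type": "pose", "pose": pose})

def keepalive() -> str:
    """Keep the session alive during silence (hypothetical framing)."""
    return json.dumps({"type": "keepalive"})
```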
3. Ending the Session
When a session ends:
- The avatar is removed from the LiveKit room
- The room is torn down (if created by LiveAvatar)
- The WebSocket connection closes