Gemini Live Connector - LiveAvatar Documentation

The Gemini Live Connector bridges the Gemini Live API with LiveAvatar. Gemini handles speech-to-speech orchestration while LiveAvatar renders the avatar video.

Requirements

Google AI Studio / Gemini API key with access to the Live API
Optional: a Context that supplies the system prompt / instructions

Setup

1. Register your Gemini API key as a secret:

curl -X POST https://api.liveavatar.com/v1/secrets \
  -H "X-API-KEY: <YOUR_API_KEY>" \
  -H "content-type: application/json" \
  -d '{
    "secret_type": "GEMINI_API_KEY",
    "secret_value": "<gemini_api_key>",
    "secret_name": "Gemini Live Key"
  }'
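The same request can be sketched in Python using only the standard library. This is a minimal sketch that mirrors the curl call above; the function name `build_secret_request` is hypothetical, and the response format is not described on this page, so the sketch only builds the request and leaves sending it (and reading the returned secret ID) to the caller.

```python
import json
import urllib.request

SECRETS_URL = "https://api.liveavatar.com/v1/secrets"  # endpoint from the curl example


def build_secret_request(liveavatar_api_key: str, gemini_api_key: str,
                         name: str = "Gemini Live Key") -> urllib.request.Request:
    """Build (but do not send) the POST request that registers a Gemini key."""
    body = {
        "secret_type": "GEMINI_API_KEY",
        "secret_value": gemini_api_key,
        "secret_name": name,
    }
    return urllib.request.Request(
        SECRETS_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "X-API-KEY": liveavatar_api_key,
            "content-type": "application/json",
        },
        method="POST",
    )


# Sending is left to the caller, for example:
#   with urllib.request.urlopen(build_secret_request(key, gemini_key)) as resp:
#       print(resp.read().decode("utf-8"))
# The response shape is an unknown here; inspect it to find the secret ID.
```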

2. Start a session with the connector:

{
  "mode": "LITE",
  "avatar_id": "<avatar_id>",
  "gemini_realtime_config": {
    "secret_id": "<secret_id>",
    "context_id": "<context_id>",
    "voice": "Puck",
    "model": "gemini-3.1-flash-live-preview",
    "temperature": 0.8
  }
}
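If you assemble this body in code, a small helper keeps the documented defaults in one place. The sketch below is illustrative only: `build_session_payload` is a hypothetical name, and it assumes that an unused optional `context_id` can simply be left out of the body. The endpoint that receives this body is not shown on this page, so the sketch stops at building the JSON.

```python
import json
from typing import Optional


def build_session_payload(avatar_id: str, secret_id: str,
                          context_id: Optional[str] = None,
                          voice: str = "Puck",
                          model: str = "gemini-3.1-flash-live-preview",
                          temperature: float = 0.8) -> dict:
    """Return a session body shaped like the JSON example above."""
    gemini_config = {
        "secret_id": secret_id,
        "voice": voice,
        "model": model,
        "temperature": temperature,
    }
    # Assumption: context_id is optional, so it is omitted when not provided.
    if context_id is not None:
        gemini_config["context_id"] = context_id
    return {
        "mode": "LITE",
        "avatar_id": avatar_id,
        "gemini_realtime_config": gemini_config,
    }


if __name__ == "__main__":
    print(json.dumps(build_session_payload("<avatar_id>", "<secret_id>"), indent=2))
```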

Configuration

Field	Type	Default	Description
`secret_id`	uuid	required	Secret ID referencing your `GEMINI_API_KEY` secret.
`context_id`	uuid	—	Context that supplies system prompt / instructions.
`voice`	enum	`Puck`	Gemini Live voice. See Voices below.
`model`	enum	`gemini-3.1-flash-live-preview`	Gemini Live model. See Models below.
`temperature`	number	`0.8`	Sampling temperature. Range: `0.0` – `2.0`.
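The constraints in this table can be checked on the client before a session is started. The following is a rough sketch, not LiveAvatar's own validation: `validate_gemini_config` is a hypothetical helper, and it assumes the temperature range is inclusive at both ends.

```python
import uuid
from typing import List


def _is_uuid(value) -> bool:
    """True if value parses as a UUID (uuid.UUID also accepts unhyphenated hex)."""
    try:
        uuid.UUID(str(value))
        return True
    except ValueError:
        return False


def validate_gemini_config(config: dict) -> List[str]:
    """Return a list of problems found in a gemini_realtime_config dict."""
    errors = []

    # secret_id is the only required field in the table.
    secret_id = config.get("secret_id")
    if secret_id is None:
        errors.append("secret_id is required")
    elif not _is_uuid(secret_id):
        errors.append("secret_id must be a UUID")

    # context_id is optional, but must be a UUID when present.
    context_id = config.get("context_id")
    if context_id is not None and not _is_uuid(context_id):
        errors.append("context_id must be a UUID")

    # Assumption: the documented range 0.0 to 2.0 is inclusive.
    temperature = config.get("temperature", 0.8)
    is_number = isinstance(temperature, (int, float)) and not isinstance(temperature, bool)
    if not is_number or not 0.0 <= temperature <= 2.0:
        errors.append("temperature must be a number between 0.0 and 2.0")

    return errors
```

An empty list means the config passed these local checks; the API remains the final authority on what it accepts.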

Models

Currently supported models:

gemini-3.1-flash-live-preview (default)
gemini-2.5-flash-native-audio-preview-12-2025

See Google’s Gemini Live API docs for the authoritative list and model capabilities.

Voices

Currently supported voices: Achernar, Achird, Algenib, Algieba, Alnilam, Aoede, Autonoe, Callirrhoe, Charon, Despina, Enceladus, Erinome, Fenrir, Gacrux, Iapetus, Kore, Laomedeia, Leda, Orus, Pulcherrima, Puck, Rasalgethi, Sadachbia, Sadaltager, Schedar, Sulafat, Umbriel, Vindemiatrix, Zephyr, Zubenelgenubi. See Google’s Gemini Live API voice reference for the authoritative list and characteristics.
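A client can guard against typos by checking `model` and `voice` against the names listed on this page. The sketch below copies those lists verbatim; the helper name `check_model_and_voice` is hypothetical, exact case-sensitive matching is an assumption, and the hard-coded sets will go stale whenever the supported lists change.

```python
# Values copied from the Models and Voices sections of this page.
SUPPORTED_MODELS = {
    "gemini-3.1-flash-live-preview",
    "gemini-2.5-flash-native-audio-preview-12-2025",
}

SUPPORTED_VOICES = {
    "Achernar", "Achird", "Algenib", "Algieba", "Alnilam", "Aoede",
    "Autonoe", "Callirrhoe", "Charon", "Despina", "Enceladus", "Erinome",
    "Fenrir", "Gacrux", "Iapetus", "Kore", "Laomedeia", "Leda", "Orus",
    "Pulcherrima", "Puck", "Rasalgethi", "Sadachbia", "Sadaltager",
    "Schedar", "Sulafat", "Umbriel", "Vindemiatrix", "Zephyr",
    "Zubenelgenubi",
}


def check_model_and_voice(model: str = "gemini-3.1-flash-live-preview",
                          voice: str = "Puck") -> None:
    """Raise ValueError if model or voice is not in the lists above.

    Assumption: names are matched exactly, including case.
    """
    if model not in SUPPORTED_MODELS:
        raise ValueError(f"unsupported model: {model!r}")
    if voice not in SUPPORTED_VOICES:
        raise ValueError(f"unsupported voice: {voice!r}")
```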

How it works

LiveAvatar creates a WebRTC room on your behalf; no WebSocket endpoint is returned
LiveAvatar bridges your session to the Gemini Live API using your credentials
User audio is streamed to Gemini Live; the model’s audio drives avatar lip sync

For protocol-level details — turn detection, tool calls, input/output modalities — see the Gemini Live API docs.

Billing

LiveAvatar: 1 credit per session minute (avatar video only)
Gemini: charged separately against your Google account for Live API usage
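For budgeting, the LiveAvatar side of the bill can be estimated from session length. This is a back-of-the-envelope sketch: `estimate_liveavatar_credits` is a hypothetical helper, and how partial minutes are billed is not stated here, so rounding up is offered only as an optional, conservative assumption. Gemini usage is billed by Google and is not covered.

```python
import math

CREDITS_PER_SESSION_MINUTE = 1  # rate stated in the Billing section


def estimate_liveavatar_credits(session_seconds: float,
                                round_up: bool = False) -> float:
    """Estimate LiveAvatar credits for one session (avatar video only)."""
    if session_seconds < 0:
        raise ValueError("session_seconds must be non-negative")
    minutes = session_seconds / 60
    if round_up:
        # Assumption: partial minutes might be billed as whole minutes.
        minutes = math.ceil(minutes)
    return minutes * CREDITS_PER_SESSION_MINUTE
```

For example, a 10-minute session (600 seconds) comes to 10 credits under either rounding mode.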
