The Gemini Live Connector bridges the Gemini Live API with LiveAvatar. Gemini handles speech-to-speech orchestration while LiveAvatar renders the avatar video.Documentation Index
Fetch the complete documentation index at: https://docs.liveavatar.com/llms.txt
Use this file to discover all available pages before exploring further.
Requirements
- Google AI Studio / Gemini API key with access to the Live API
- Optional Context for system prompt / instructions
Setup
1. Register your Gemini API key as a secret:Configuration
| Field | Type | Default | Description |
|---|---|---|---|
secret_id | uuid | required | Secret ID referencing your GEMINI_API_KEY secret. |
context_id | uuid | — | Context that supplies system prompt / instructions. |
voice | enum | Puck | Gemini Live voice. See voices. |
model | enum | gemini-3.1-flash-live-preview | Gemini Live model. See models. |
temperature | number | 0.8 | Sampling temperature. Range: 0.0 – 2.0. |
Models
Currently supported models:gemini-3.1-flash-live-preview(default)gemini-2.5-flash-native-audio-preview-12-2025
Voices
Currently supported voices:Achernar, Achird, Algenib, Algieba, Alnilam, Aoede, Autonoe, Callirrhoe, Charon, Despina, Enceladus, Erinome, Fenrir, Gacrux, Iapetus, Kore, Laomedeia, Leda, Orus, Pulcherrima, Puck, Rasalgethi, Sadachbia, Sadaltager, Schedar, Sulafat, Umbriel, Vindemiatrix, Zephyr, Zubenelgenubi. See Google’s Gemini Live API voice reference for the authoritative list and characteristics.
How it works
- A WebRTC room is created by LiveAvatar on your behalf (no WebSocket endpoint returned)
- LiveAvatar bridges your session to the Gemini Live API using your credentials
- User audio is streamed to Gemini Live; the model’s audio drives avatar lip sync
For protocol-level details — turn detection, tool calls, input/output modalities — see the Gemini Live API docs.
Billing
- LiveAvatar: 1 credit per session minute (avatar video only)
- Gemini: charged separately against your Google account for Live API usage