The Gemini Live Connector bridges the Gemini Live API with LiveAvatar. Gemini handles speech-to-speech orchestration while LiveAvatar renders the avatar video.

Requirements

  • Google AI Studio / Gemini API key with access to the Live API
  • Optional: a Context that supplies the system prompt / instructions

Setup

1. Register your Gemini API key as a secret:
curl -X POST https://api.liveavatar.com/v1/secrets \
  -H "X-API-KEY: <YOUR_API_KEY>" \
  -H "content-type: application/json" \
  -d '{
    "secret_type": "GEMINI_API_KEY",
    "secret_value": "<gemini_api_key>",
    "secret_name": "Gemini Live Key"
  }'
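The same request can be prepared from Python. This is a minimal sketch using only the standard library: `build_secret_request` is a hypothetical helper name, and because the response format is not described on this page, the sketch only builds the request and leaves sending and response handling to the caller.

```python
import json
import urllib.request

SECRETS_URL = "https://api.liveavatar.com/v1/secrets"


def build_secret_request(liveavatar_api_key: str, gemini_api_key: str,
                         name: str = "Gemini Live Key") -> urllib.request.Request:
    """Prepare (but do not send) the POST request from the curl example."""
    body = {
        "secret_type": "GEMINI_API_KEY",
        "secret_value": gemini_api_key,
        "secret_name": name,
    }
    return urllib.request.Request(
        SECRETS_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "X-API-KEY": liveavatar_api_key,
            "content-type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    req = build_secret_request("<YOUR_API_KEY>", "<gemini_api_key>")
    print(req.get_method(), req.full_url)
    # To send it: urllib.request.urlopen(req). The response body is not
    # parsed here because its shape is not documented on this page.
```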
2. Start a session with the connector by including gemini_realtime_config in the session request body:
{
  "mode": "LITE",
  "avatar_id": "<avatar_id>",
  "gemini_realtime_config": {
    "secret_id": "<secret_id>",
    "context_id": "<context_id>",
    "voice": "Puck",
    "model": "gemini-3.1-flash-live-preview",
    "temperature": 0.8
  }
}
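As an illustration, the request body above could be assembled programmatically. This is a sketch, not an official client: `build_session_body` is a hypothetical helper, the defaults mirror the documented ones, and leaving out `context_id` when no Context is used is an assumption based on it being described as optional.

```python
import json
from typing import Any, Dict, Optional


def build_session_body(
    avatar_id: str,
    secret_id: str,
    context_id: Optional[str] = None,
    voice: str = "Puck",
    model: str = "gemini-3.1-flash-live-preview",
    temperature: float = 0.8,
) -> Dict[str, Any]:
    """Return the session request body with a gemini_realtime_config block."""
    config: Dict[str, Any] = {
        "secret_id": secret_id,
        "voice": voice,
        "model": model,
        "temperature": temperature,
    }
    # Assumption: context_id can simply be left out when it is not used.
    if context_id is not None:
        config["context_id"] = context_id
    return {
        "mode": "LITE",
        "avatar_id": avatar_id,
        "gemini_realtime_config": config,
    }


if __name__ == "__main__":
    body = build_session_body("<avatar_id>", "<secret_id>", context_id="<context_id>")
    print(json.dumps(body, indent=2))
```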

Configuration

| Field | Type | Default | Description |
| --- | --- | --- | --- |
| secret_id | uuid | required | Secret ID referencing your GEMINI_API_KEY secret. |
| context_id | uuid | - | Context that supplies system prompt / instructions. |
| voice | enum | Puck | Gemini Live voice. See voices. |
| model | enum | gemini-3.1-flash-live-preview | Gemini Live model. See models. |
| temperature | number | 0.8 | Sampling temperature. Range: 0.0 to 2.0. |

Models

Currently supported models:
  • gemini-3.1-flash-live-preview (default)
  • gemini-2.5-flash-native-audio-preview-12-2025
See Google’s Gemini Live API docs for the authoritative list and model capabilities.

Voices

Currently supported voices: Achernar, Achird, Algenib, Algieba, Alnilam, Aoede, Autonoe, Callirrhoe, Charon, Despina, Enceladus, Erinome, Fenrir, Gacrux, Iapetus, Kore, Laomedeia, Leda, Orus, Pulcherrima, Puck, Rasalgethi, Sadachbia, Sadaltager, Schedar, Sulafat, Umbriel, Vindemiatrix, Zephyr, Zubenelgenubi. See Google’s Gemini Live API voice reference for the authoritative list and characteristics.
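A client-side check can catch an unsupported voice, model, or temperature before a session is started. The sketch below is illustrative only: `validate_gemini_config` is a hypothetical helper, the model and voice sets are copied from the lists above and may go stale, and the temperature range is read as 0.0 to 2.0.

```python
from typing import Any, Dict, List

SUPPORTED_MODELS = {
    "gemini-3.1-flash-live-preview",
    "gemini-2.5-flash-native-audio-preview-12-2025",
}

SUPPORTED_VOICES = {
    "Achernar", "Achird", "Algenib", "Algieba", "Alnilam", "Aoede",
    "Autonoe", "Callirrhoe", "Charon", "Despina", "Enceladus", "Erinome",
    "Fenrir", "Gacrux", "Iapetus", "Kore", "Laomedeia", "Leda", "Orus",
    "Pulcherrima", "Puck", "Rasalgethi", "Sadachbia", "Sadaltager",
    "Schedar", "Sulafat", "Umbriel", "Vindemiatrix", "Zephyr",
    "Zubenelgenubi",
}


def validate_gemini_config(config: Dict[str, Any]) -> List[str]:
    """Return a list of problems; an empty list means the config looks valid."""
    errors: List[str] = []
    if not config.get("secret_id"):
        errors.append("secret_id is required")
    # Fall back to the documented defaults when a field is omitted.
    voice = config.get("voice", "Puck")
    if voice not in SUPPORTED_VOICES:
        errors.append(f"unsupported voice: {voice!r}")
    model = config.get("model", "gemini-3.1-flash-live-preview")
    if model not in SUPPORTED_MODELS:
        errors.append(f"unsupported model: {model!r}")
    temperature = config.get("temperature", 0.8)
    is_number = isinstance(temperature, (int, float)) and not isinstance(temperature, bool)
    if not is_number or not 0.0 <= temperature <= 2.0:
        errors.append("temperature must be a number between 0.0 and 2.0")
    return errors


if __name__ == "__main__":
    print(validate_gemini_config({"secret_id": "<secret_id>", "voice": "Kore"}))
    print(validate_gemini_config({"voice": "Robot", "temperature": 3}))
```

The server remains the authority on what is accepted; a check like this only shortens the feedback loop for obvious mistakes.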

How it works

  • LiveAvatar creates a WebRTC room on your behalf (no WebSocket endpoint is returned)
  • LiveAvatar bridges your session to the Gemini Live API using your credentials
  • User audio is streamed to Gemini Live; the model’s audio drives avatar lip sync
For protocol-level details (turn detection, tool calls, input/output modalities), see the Gemini Live API docs.

Billing

  • LiveAvatar: 1 credit per session minute (avatar video only)
  • Gemini: charged separately against your Google account for Live API usage
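As a rough worked example of the LiveAvatar side of the bill, the sketch below converts a session duration into credits at the stated rate of 1 credit per session minute. The function name is hypothetical, and how partial minutes are rounded is not stated here, so the estimate is returned as a fraction rather than guessing a rounding rule. Gemini usage is billed by Google and is not included.

```python
CREDITS_PER_MINUTE = 1  # LiveAvatar rate stated above (avatar video only)


def estimate_liveavatar_credits(session_seconds: float) -> float:
    """Estimate LiveAvatar credits for a session of the given length.

    Partial minutes are kept fractional; the actual rounding rule is an
    unknown and may differ.
    """
    if session_seconds < 0:
        raise ValueError("session_seconds must be non-negative")
    return session_seconds / 60 * CREDITS_PER_MINUTE


if __name__ == "__main__":
    # A 10-minute session at 1 credit per minute.
    print(estimate_liveavatar_credits(600))
```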