Skip to content

Overview

ArionTalk is a voice AI agent you can embed on any website with a single HTML tag. It reads your page content, sees your images, and responds with voice — scrolling to and highlighting the exact content being discussed. Powered by Gemini Live or a fully offline local engine.

Key Capabilities

  • Page Understanding — Automatically extracts text, images, and structure from any webpage. The AI knows what’s on the page before you even ask.
  • Interactive Highlights — As the AI discusses content, it scrolls to and highlights the exact section or image — powered by Gemini function calling.
  • Natural Voice with Barge-in — Talk naturally, interrupt anytime. The AI stops, listens, and adapts.
  • Offline Mode — The local engine runs entirely on-device via Gemini Nano. No server, no API keys, no internet required.

Packages

PackageDescription
@ariontalk/coreHeadless voice engine — services, types, and session logic with no UI dependency
@ariontalk/widgetDrop-in Web Component that wraps @ariontalk/core with a ready-made UI
@ariontalk/engine-geminiCloud engine add-on using Gemini Live API for real-time voice conversations
@ariontalk/plugin-silero-vadSilero VAD plugin for AI-powered barge-in detection
@ariontalk/token-serverLightweight Hono server that issues ephemeral Gemini API tokens (not on npm — self-hosted)

Engine Comparison

ArionTalk supports two engines. Choose the one that fits your use case:

FeatureLocal (On-Device)Gemini Live (Cloud)
AI modelGemini Nano (Chrome built-in)Gemini 2.5 Flash (cloud)
Interactive highlightsNoYes (via function calling)
Image understandingNoYes
Languages2 (en, es)12
VoicesSystem TTS8 Gemini voices
PrivacyFully on-deviceAudio sent to Google servers
Backend requiredNoYes (token server)
Offline supportYesNo
API key requiredNoYes (Gemini API key on server)

Browser Support

EngineBrowserNotes
Gemini LiveAny modern browserRequires WebSocket + microphone access
LocalChrome 139+Requires Prompt API origin trial

The widget automatically hides itself on unsupported browsers (unless force is set).