rayvoc.ai

One agent. Twenty languages. Zero language configuration.

The caller speaks; the agent answers in their language. Rayvoc agents detect the caller’s language automatically across 20 languages, including when the caller switches mid-call, powered by Grok’s native speech-to-speech voice models.

20 languages, automatic detection
0 per-language configuration
Mid-call language switching supported
~0.78s time-to-first-audio (Grok S2S)

How language detection works

Traditional multilingual IVR means building the call flow N times: one prompt set, one speech model, and one test matrix per language, fronted by a menu asking callers to self-identify. Rayvoc agents skip all of it. The underlying Grok voice models are natively multilingual speech-to-speech — they hear the caller’s speech directly and respond in kind, so there is no per-language STT or TTS to provision and nothing to configure. Detection is just what the model does.

That also makes switching natural. If a caller opens in English and shifts to Hindi when the details get complicated, the agent follows in the same turn, with no transfer and no restart. Because this runs as a single speech-to-speech model rather than a pipeline of separate speech-to-text and text-to-speech stages, responses stay fast: about 0.78 seconds time-to-first-audio in independent testing. For more on why that matters, see low-latency voice AI.

Supported languages

All 20 languages are available on every agent, with no setup per language.

Language                 Code
English                  en
Arabic (Egypt)           ar-EG
Arabic (Saudi Arabia)    ar-SA
Arabic (UAE)             ar-AE
Bengali                  bn
Chinese (Simplified)     zh
French                   fr
German                   de
Hindi                    hi
Indonesian               id
Italian                  it
Japanese                 ja
Korean                   ko
Portuguese (Brazil)      pt-BR
Portuguese (Portugal)    pt-PT
Russian                  ru
Spanish (Mexico)         es-MX
Spanish (Spain)          es-ES
Turkish                  tr
Vietnamese               vi
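
For teams that tag transcripts or build reports by language, the table above can be held as a simple lookup. What follows is a minimal sketch, not part of any Rayvoc API: the names SUPPORTED_LANGUAGES, base_language, and regional_variants are hypothetical, and only the codes and language names come from the table. No agent configuration is implied, since detection needs none.

```python
from collections import defaultdict

# Codes and names copied from the table above.
# The dictionary itself is an illustrative structure, not a Rayvoc API object.
SUPPORTED_LANGUAGES = {
    "en": "English",
    "ar-EG": "Arabic (Egypt)",
    "ar-SA": "Arabic (Saudi Arabia)",
    "ar-AE": "Arabic (UAE)",
    "bn": "Bengali",
    "zh": "Chinese (Simplified)",
    "fr": "French",
    "de": "German",
    "hi": "Hindi",
    "id": "Indonesian",
    "it": "Italian",
    "ja": "Japanese",
    "ko": "Korean",
    "pt-BR": "Portuguese (Brazil)",
    "pt-PT": "Portuguese (Portugal)",
    "ru": "Russian",
    "es-MX": "Spanish (Mexico)",
    "es-ES": "Spanish (Spain)",
    "tr": "Turkish",
    "vi": "Vietnamese",
}


def base_language(code: str) -> str:
    """Return the primary language subtag, e.g. 'pt' for 'pt-BR'."""
    return code.split("-", 1)[0].lower()


def regional_variants() -> dict[str, list[str]]:
    """Group codes by primary subtag, keeping languages with more than one variant."""
    groups: dict[str, list[str]] = defaultdict(list)
    for code in SUPPORTED_LANGUAGES:
        groups[base_language(code)].append(code)
    return {base: codes for base, codes in groups.items() if len(codes) > 1}


if __name__ == "__main__":
    print(len(SUPPORTED_LANGUAGES))  # 20
    print(regional_variants())
```

Grouping by primary subtag shows which entries are regional variants of one language (Arabic, Portuguese, Spanish), which is handy when a report should roll dialects up into a single row.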

Where multilingual agents earn their keep

International support desks

Instead of staffing language queues or routing callers through translation vendors, one agent answers every line in the caller’s language. Pair it with local DID numbers in 100+ countries and a single agent configuration becomes a local-language support desk in every market — a +49 number answered in German, a +81 number answered in Japanese, the same agent behind both.

Hospitality and travel

Hotels, airlines, and booking lines take calls from everywhere by definition. An agent that greets in English and switches to Korean or Portuguese the moment the guest does removes the worst friction in the call.

Multicultural home markets

Even single-country businesses serve multilingual populations — Spanish and English in the US, Arabic variants across the Gulf, Hindi and English in India. Mid-call switching matters most here, because real callers mix languages within one conversation.

Everything else on the platform works identically across languages: tool calling, transcripts, recordings, barge-in, warm transfer to human staff, and outbound campaigns. And if you would rather assemble your own pipeline for a specific language stack, you can; see bring your own models for how.

Frequently asked questions

How do I configure which language my agent speaks?

You don’t. Language detection is automatic: the caller starts speaking, the agent recognizes the language and answers in it. There are no per-language prompts, no language menus, and no “press 2 for Spanish.” One agent configuration covers all 20 supported languages.

Can a caller switch languages mid-call?

Yes. Because detection runs on the live conversation rather than a setting fixed at call start, the agent follows when a caller changes language partway through — common in multilingual households and markets where callers mix languages naturally.

What powers Rayvoc’s multilingual support?

Grok voice models running in native speech-to-speech mode. The model listens and speaks directly — no separate STT and TTS per language — which is what makes zero-configuration detection and mid-call switching possible while keeping latency low (~0.78s time-to-first-audio in independent tests).

Which dialects are distinguished?

The 20 supported languages include regional variants where they matter for a phone call: Egyptian, Saudi, and Emirati Arabic; Brazilian and European Portuguese; and Mexican and European Spanish are each handled distinctly. The full list with language codes is in the table on this page.

Answer every caller in their language

Every account starts with a 14-day free trial — 1 concurrent channel, a real phone number, and full platform access.