KUDO AI Speech translation offers two modalities, One-Way Communication and Interactive Communication, each designed for different use cases and user needs. You select the translation modality while scheduling a session. Once a session has been scheduled, its modality cannot be changed.
One-Way Communication
Use Case: Presentations with one speaker at a time, such as formal presentations, lectures, or other scenarios where translation accuracy and clarity are critical.
Output: Translated audio and/or captions.
Quality: Highest quality with very accurate translations.
Advanced Functionalities:
- Glossaries: Customize how specific words are translated.
- Do-Not-Translate Words: Specify words that should remain untranslated (e.g., product names).
- Sounds_Like Feature: Improve recognition of complex terms pronounced in unexpected ways (e.g., recognize “SSHRC” when the speaker pronounces it as “ʃɜːk”).
- AI Voice Pronunciation Control: Force the AI voice to pronounce complex terms in a specific way (e.g., pronounce “SSHRC” as “ʃɜːk”).
Interactive Communication
Use Case: Back-and-forth conversations.
Output: Captions only; no AI voice.
Quality: Lower translation quality than One-Way Communication, but faster translations.
Advanced Functionalities: Does not support customization tools like glossaries, do-not-translate words, or pronunciation adjustments.
Both modalities support the same number of source and target languages.
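Because the modality cannot be changed after scheduling, it can help to check your requirements before you schedule. The sketch below encodes the comparison above as a small Python lookup. It is only an illustration: the `Modality` class, the `pick_modality` helper, and all field names are hypothetical and are not part of any KUDO API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Modality:
    """Hypothetical summary of one translation modality, taken from the comparison above."""
    name: str
    ai_voice: bool        # translated audio available
    captions: bool        # captions available
    customization: bool   # glossaries, do-not-translate words, pronunciation tools


ONE_WAY = Modality("One-Way Communication", ai_voice=True, captions=True, customization=True)
INTERACTIVE = Modality("Interactive Communication", ai_voice=False, captions=True, customization=False)


def pick_modality(*, conversational: bool,
                  needs_ai_voice: bool = False,
                  needs_customization: bool = False) -> Modality:
    """Suggest a modality; this is a simplifying rule of thumb, not official guidance."""
    # Translated audio and the customization tools are only listed for One-Way,
    # so either requirement rules out Interactive even for a conversation.
    if needs_ai_voice or needs_customization:
        return ONE_WAY
    # Otherwise, back-and-forth conversations map to Interactive.
    return INTERACTIVE if conversational else ONE_WAY


if __name__ == "__main__":
    choice = pick_modality(conversational=True, needs_customization=True)
    print(choice.name)
```

In this sketch, a requirement for AI voice or for glossaries takes priority over the conversational use case, since Interactive Communication offers neither.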