Agent Voice

Agent Voice to text enables users to communicate with agents using speech input, creating more natural and intuitive conversational experiences. This feature transcribes spoken input into text in real time, allowing agents to process and respond to voice messages just like typed input. Agent Voice represents a key step toward creating conversational AI experiences that feel as natural as talking to a human colleague, removing friction from real-world conversations.

Demo

Key Features

Voice Input Functionality

  • Users can click the microphone button to start speaking to their agents
  • Speech is transcribed in real time with visual feedback
  • Transcribed speech is converted to text messages that agents process like typed input

Real-time Transcription

  • Words appear instantly as users speak
  • Powered by Assembly AI for accurate speech recognition
  • Visual indicators show when voice recording is active

Conversation Control

  • Users maintain full control over when messages are sent
  • Transcribed speech can be reviewed and edited before sending
  • Users must press send to confirm their message after speaking
  • Users can interact with the agent continuously without needing to press the mic button repeatedly, leading to a more natural conversational experience
  • Seamless switching between voice input and typing within the same conversation

Visual Feedback System

  • Clear visual indicators show when voice recording is active
  • Animated microphone button provides intuitive user feedback
  • Real-time transcription display keeps users informed of what’s being captured

Use Cases for Your End Users

  • Let your users send rambling voice notes to your AI so there is no friction between an idea, a question, or a challenge, and a helpful answer
  • Your users can have their ideas appear in text format, in real time
  • Your users can ‘chat’ with AI constantly without needing to click buttons

How to Monetize This Feature

  • At the user level, you can control who has access based on certain packages they purchase, upsell, and more!

Current Limitations

Language Support

  • Agent Voice currently supports the English language only

Voice Response

  • This is a voice-to-text solution only

SDK Requirements

  • Exclusively available to customers using SDK 2
  • Requires migration to SDK 2.0 if not already completed

Getting Started

Prerequisites

  • Must be using SDK 2.0 or later

Enabling Agent Voice

  • Follow the setup guide to enable Agent Voice for your agents