Gladia

Every voice message and call recording becomes searchable text with Gladia AI

Customers send voice messages, leave voicemails, or reference recorded calls. Your AI agent sends the audio to Gladia for transcription, retrieves the text in 100+ languages, and responds based on what was said. Support interactions that once required manual listening now resolve automatically.

Chosen by 800+ global brands across industries

Audio intelligence for customer conversations

From uploading recordings to initiating live transcription sessions, your AI agent processes audio through Gladia's speech-to-text engine and delivers text results in under 300 milliseconds.

Transcribe Recorded Audio

A customer submits a voice message about their issue. Your AI agent sends the audio URL to Gladia's pre-recorded transcription API, retrieves the full text with timestamps and speaker labels, and responds to the customer's concern without a human ever pressing play.

Initiate Live Transcription

During a real-time customer call, the agent starts a Gladia live session and receives a WebSocket URL for audio streaming. Transcription happens with sub-300ms latency, enabling real-time conversation analysis and instant note generation.

Upload Audio Files

A support team member wants to transcribe a meeting recording. The agent uploads the audio or video file to Gladia's servers, receives a hosted file URL, and uses it to initiate transcription. No manual file management or cloud storage configuration required.

Track Transcription Jobs

Your operations team asks about the status of a batch transcription. The agent lists all pre-recorded jobs with pagination and status filters, showing which are queued, processing, or completed. Real-time progress visibility without opening the Gladia dashboard.

Monitor Live Sessions

Multiple live transcription sessions are running simultaneously. The agent retrieves the list of active and completed sessions filtered by date range, status, or custom metadata, giving your team a clear overview of ongoing audio processing.

Retrieve Transcription Results

After a transcription completes, the agent fetches the full result including timestamped text, speaker diarization labels, detected language, and optional summary. The transcript is ready for review, search, or automated downstream processing.

Gladia

Use Cases

Voice data, unlocked by AI

Discover how businesses transform voice messages, call recordings, and meeting audio into actionable text that their AI agents can understand and act on.

Voice Messages Resolved Without Human Listening

A customer sends a WhatsApp voice note describing a product defect. Your AI Agent uploads the audio to Gladia, initiates pre-recorded transcription with speaker diarization, retrieves the full text transcript, and identifies the issue described. The agent then responds with troubleshooting steps based on the transcribed complaint. No support agent ever listens to the recording.

Meeting Summaries Generated Automatically

After a client onboarding call, the account manager uploads the recording. Your AI Agent sends it to Gladia with summarization enabled, waits for the job to complete, and retrieves both the full transcript and an AI-generated summary with key action items. The team gets meeting notes delivered in chat within minutes of the call ending.

Multilingual Support from a Single Audio Stream

A global support center receives calls in Spanish, French, and English. Your AI Agent initiates Gladia live sessions with automatic language detection enabled. The transcription engine identifies the language on the fly, transcribes accurately across all three, and the agent responds in the customer's language. No manual language routing needed.

Try
Gladia

Gladia

FAQs

Frequently Asked Questions

How fast is Gladia's real-time transcription through the agent?

Gladia delivers live transcription at sub-300 millisecond latency. The agent initiates a WebSocket-based live session, and as audio streams in, text results arrive almost instantly. This is fast enough for real-time conversation analysis, live captioning, and immediate agent responses.

Which audio and video formats does Gladia support?

Gladia accepts most common audio and video formats including MP3, WAV, MP4, FLAC, OGG, and WebM. You can provide a public URL to the file or upload it directly through the API. The agent handles format detection automatically when submitting files for transcription.

Can the agent transcribe audio in languages other than English?

Yes. Gladia supports transcription in over 100 languages with automatic language detection. The agent can also enable code-switching, which handles conversations where speakers switch between languages mid-sentence. Language configuration is passed as a parameter when initiating transcription.

Does Tars store the audio files or transcription results?

No. Audio files are uploaded directly to Gladia's servers for processing. Transcription results are fetched via the API and used only to formulate the agent's response. Tars does not maintain copies of your audio content or transcripts on its own infrastructure.

Can Gladia identify different speakers in a recording?

Yes. Speaker diarization is available as a configuration option when initiating pre-recorded transcription. The agent passes the diarization_config parameter, and Gladia labels each segment of the transcript with the identified speaker, making it clear who said what during multi-person recordings.

How is this different from using a standalone transcription service?

A standalone service gives you a transcript. Tars gives you an AI agent that reads the transcript, understands the customer's intent, and takes action. If a voice message contains a complaint, the agent does not just transcribe it but also identifies the issue and starts resolving it automatically.

What transcription model does Gladia use?

Gladia uses its Solaria-1 model by default, built on an enhanced and optimized version of OpenAI Whisper called Whisper-Zero. This model eliminates up to 99% of hallucinations from transcripts while maintaining high accuracy across languages and accents. The model parameter can be configured per request.

Can the agent generate subtitles from audio content?

Yes. When initiating pre-recorded transcription, the agent can enable the subtitles_config parameter. Gladia generates properly timed subtitle output alongside the transcript, suitable for embedding in video content or displaying as captions. SRT and VTT formats are supported.

How to add Tools to your AI Agent

Supercharge your AI Agent with Tool Integrations

Don't limit your AI Agent to basic conversations. Watch how to configure and add powerful tools making your agent smarter and more functional.

Real results, real customers, real stories

“We're saving an average of 4,000+ calls a month.”
Implementing an Agent revolutionized our customer service channels and our service to Indiana business owners. We're saving an average of 4,000+ calls a month and can now provide 24x7x365 customer service️ along with our business services.
Lindsey Roark Mayes
Ex-Director of SOS IT (State of Indiana)
“Cutting down on staff needing to email back.”
Since our launch of Tars Agents, we've had more than 5k interactions with them from individuals on the website. We saw prospects interacting with the Agent regarding application timelines, tuition, curriculum, and other items that may come through an email. This provides another avenue of access to our team while cutting down on staff needing to email back.
Levi Eastwood
Marketing Director of UCI Merage School
“Powerful tool - and there's so much more still to explore!”
I love the flexibility of the TARS chatbot design tool. The ability to drop in a new gambit and easily de-link and re-link gambits before and after it saves HOURS of time lost if I had to regenerate from scratch. TARS stands out for its flexibility, consistent performance, and outstanding support.
Aaron Rittmaster
Founder
“I like the product.”
Takes those boring forms and allows you to make the collecting of customer information enjoyable for the user. It has a lot of value and I see the company constantly working on improving it.
Pierre Rattini
Director of Marketing
“Very responsive and supportive Team”
I love how supportive and responsive the team members are. The chatbots are not difficult to build once you have an idea of what you are doing, and this is so based on the backend work Tars has completed. Also, they are flexible to the clients' requests.
Keisha Cameron
Product Manager
“Easy for cooperation and open to agreement.”
One of the biggest qualities of TARS is their ability to truly understand their clients' needs. They took the time to thoroughly assess our requirements, offering valuable insights and recommendations that we hadn't even considered. This level of personalized attention made a significant difference in the success of our project.
Milica Petrovic
Customer Care Project Associate
“Great service”
The guys at Tars are so passionate about bots and maximum engagement. Tars is overall a great product with really nice people there to help me engage and automate conversations I would have otherwise missed.
Nigel Gosling
Health Coach
“Best in class for lead generation”
TARS is an essential tool within our lead gen strategy. I started to use it a couple years ago, and the platform evolved to focus on results. You can integrate your Google Analytics, Facebook Pixel, and AdWords conversion tag to track results and measure your campaign. You can also integrate with Zapier, which opens a lot of possibilities. Compared with static landing pages, we have seen an increase in conversions and more engagement.
Leonardo Wolff
Founder
“Very professional and cooperative”
Building an automated chat that our customers actually used and reduced calls amount.
Maryam Alhaddad
Customer Journey Designer
“Flexibility and good service”
Tars platform is very flexible, so you can do pretty much any flux you desire. Also integrates with any third party through APIs with a very simple and easy-to-use interface. Also tars team is great! Always at disposal and brings suggestions and solutions for any issue encountered during the process of building the chatbot.
Lucas Von Lachmann
Process Manager
“Deserved The title of the Best Chatbot of the Year”
If you are building chatbots, you cannot go wrong with TARS. I have been with this company since their beginning. Seen how they have grown. I have tested a lot of bots, but this one is still the best out there. We use WhatsApp a lot for our business. And what I like best is that I can have 1 chatbot for website and for my WhatsApp. I especially appreciate their support team as well. Whenever I am stuck, they quickly reply to my questions.
MJ Felix
Chairman
“Advanced integration, easy to use, and very customizable.”
This chatbot does everything from AI integration to complete customization and is excellent for lead conversion in most industries. I think the pre-formatted templates are an excellent boost to get started quickly. The team is also excellent.
Rachel Rowling
Chief Operating Officer
“The chatbot implementation has exceeded expectations!”
The implementation has delivered 24/7 customer support and is proving its value by reducing Contact center calls by around 5% in just four months of operation. Beyond enhancing the e-care experience, the chatbot is driving impressive business results, achieving a remarkable 20% month-on-month growth.
Victor Pereira
Customer Care and CX Manager
“We're saving an average of 4,000+ calls a month.”
Implementing an Agent revolutionized our customer service channels and our service to Indiana business owners. We're saving an average of 4,000+ calls a month and can now provide 24x7x365 customer service️ along with our business services.
Lindsey Roark Mayes
Ex-Director of SOS IT (State of Indiana)
“Cutting down on staff needing to email back.”
Since our launch of Tars Agents, we've had more than 5k interactions with them from individuals on the website. We saw prospects interacting with the Agent regarding application timelines, tuition, curriculum, and other items that may come through an email. This provides another avenue of access to our team while cutting down on staff needing to email back.
Levi Eastwood
Marketing Director of UCI Merage School
“Powerful tool - and there's so much more still to explore!”
I love the flexibility of the TARS chatbot design tool. The ability to drop in a new gambit and easily de-link and re-link gambits before and after it saves HOURS of time lost if I had to regenerate from scratch. TARS stands out for its flexibility, consistent performance, and outstanding support.
Aaron Rittmaster
Founder
“I like the product.”
Takes those boring forms and allows you to make the collecting of customer information enjoyable for the user. It has a lot of value and I see the company constantly working on improving it.
Pierre Rattini
Director of Marketing
“Very responsive and supportive Team”
I love how supportive and responsive the team members are. The chatbots are not difficult to build once you have an idea of what you are doing, and this is so based on the backend work Tars has completed. Also, they are flexible to the clients' requests.
Keisha Cameron
Product Manager
“Easy for cooperation and open to agreement.”
One of the biggest qualities of TARS is their ability to truly understand their clients' needs. They took the time to thoroughly assess our requirements, offering valuable insights and recommendations that we hadn't even considered. This level of personalized attention made a significant difference in the success of our project.
Milica Petrovic
Customer Care Project Associate
“Great service”
The guys at Tars are so passionate about bots and maximum engagement. Tars is overall a great product with really nice people there to help me engage and automate conversations I would have otherwise missed.
Nigel Gosling
Health Coach
“Best in class for lead generation”
TARS is an essential tool within our lead gen strategy. I started to use it a couple years ago, and the platform evolved to focus on results. You can integrate your Google Analytics, Facebook Pixel, and AdWords conversion tag to track results and measure your campaign. You can also integrate with Zapier, which opens a lot of possibilities. Compared with static landing pages, we have seen an increase in conversions and more engagement.
Leonardo Wolff
Founder
“Very professional and cooperative”
Building an automated chat that our customers actually used and reduced calls amount.
Maryam Alhaddad
Customer Journey Designer
“Flexibility and good service”
Tars platform is very flexible, so you can do pretty much any flux you desire. Also integrates with any third party through APIs with a very simple and easy-to-use interface. Also tars team is great! Always at disposal and brings suggestions and solutions for any issue encountered during the process of building the chatbot.
Lucas Von Lachmann
Process Manager
“Deserved The title of the Best Chatbot of the Year”
If you are building chatbots, you cannot go wrong with TARS. I have been with this company since their beginning. Seen how they have grown. I have tested a lot of bots, but this one is still the best out there. We use WhatsApp a lot for our business. And what I like best is that I can have 1 chatbot for website and for my WhatsApp. I especially appreciate their support team as well. Whenever I am stuck, they quickly reply to my questions.
MJ Felix
Chairman
“Advanced integration, easy to use, and very customizable.”
This chatbot does everything from AI integration to complete customization and is excellent for lead conversion in most industries. I think the pre-formatted templates are an excellent boost to get started quickly. The team is also excellent.
Rachel Rowling
Chief Operating Officer
“The chatbot implementation has exceeded expectations!”
The implementation has delivered 24/7 customer support and is proving its value by reducing Contact center calls by around 5% in just four months of operation. Beyond enhancing the e-care experience, the chatbot is driving impressive business results, achieving a remarkable 20% month-on-month growth.
Victor Pereira
Customer Care and CX Manager

Privacy & Security

We’ll never let you lose sleep over privacy and security concerns

At Tars, we take privacy and security very seriously. We are compliant with GDPR, ISO, SOC 2, and HIPAA.

GDPR
ISO
SOC 2
HIPAA

Still scrolling? We both know you're interested.

Let's chat about AI Agents the old-fashioned way. Get a demo tailored to your requirements.

Schedule a Demo