Rev AI

Add speech-to-text intelligence to your AI agent with Rev AI

Your AI agent submits audio for transcription, retrieves completed transcripts in multiple formats, generates captions, and manages custom vocabularies through Rev AI. Voice recordings become searchable text, meetings become documentation, and customer calls become actionable data.

Chosen by 800+ global brands across industries

Transcription power inside your conversations

Your AI agent leverages Rev AI's speech recognition engine to turn audio into accurate text, generate captions, and manage transcription workflows through natural conversation.

Use Cases

Audio becomes text, automatically

How teams use AI agents with Rev AI to transcribe meetings, generate captions, and extract insights from audio content through simple conversational requests.

Transcribing Customer Call Recordings on Demand

A support manager shares a call recording URL and asks for a transcript. The AI Agent submits the audio to Rev AI with speaker diarization enabled, monitors the job, and returns the completed transcript with labeled speakers and timestamps. The manager reviews the call without listening to the entire recording. Key moments are instantly searchable.

Auto-Captioning Webinar Recordings for Accessibility

After a product webinar, the marketing team needs captions. They send the video URL to the AI Agent, which submits it to Rev AI and retrieves WebVTT captions once processing completes. The captions are ready to upload to their video platform. Accessibility compliance is handled through a single chat interaction.

Building Custom Vocabulary for Medical Transcription

A healthcare company uses many specialized terms that standard speech recognition misses. Their admin asks the AI Agent to add medical terminology to Rev AI's custom vocabulary. The agent submits the vocabulary list and confirms it is ready for use. Future transcription jobs recognize drug names, procedures, and clinical terms accurately.

FAQs

Frequently Asked Questions

How does the AI agent submit audio for transcription to Rev AI?

The agent calls Rev AI's Submit Transcription Job endpoint with either a media URL or an uploaded audio file. You can configure language, speaker diarization, profanity filtering, custom vocabularies, and a callback URL for completion notifications. The job processes asynchronously and returns a job ID for tracking.
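
As a rough illustration of what such a submission might look like, the sketch below only builds the request and sends nothing. The endpoint URL and the JSON field names (source_config, skip_diarization, filter_profanity, custom_vocabulary_id, notification_config) are assumptions based on Rev AI's public asynchronous API and should be checked against the current documentation; the function name build_job_request is hypothetical.

```python
import json

# Assumed endpoint for Rev AI's asynchronous API; verify before use.
REV_AI_JOBS_URL = "https://api.rev.ai/speechtotext/v1/jobs"


def build_job_request(access_token, media_url, language="en", diarize=True,
                      filter_profanity=False, vocabulary_id=None,
                      callback_url=None):
    """Return (url, headers, body) for a job submission. Nothing is sent."""
    # Field names below are assumptions, not confirmed by this page.
    payload = {
        "source_config": {"url": media_url},
        "language": language,
        "skip_diarization": not diarize,
        "filter_profanity": filter_profanity,
    }
    # Optional settings are included only when the caller provides them.
    if vocabulary_id is not None:
        payload["custom_vocabulary_id"] = vocabulary_id
    if callback_url is not None:
        payload["notification_config"] = {"url": callback_url}
    headers = {
        "Authorization": f"Bearer {access_token}",
        "Content-Type": "application/json",
    }
    return REV_AI_JOBS_URL, headers, json.dumps(payload)
```

The returned tuple can be handed to any HTTP client. Keeping request construction separate from sending makes the settings easy to inspect before a job is created.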

What transcript formats does the agent support?

The agent retrieves transcripts in four formats: JSON with word-level timestamps and speaker labels, plain text, WebVTT for web video captions, and SRT for standard subtitle files. You specify the desired format, and the agent returns the content ready for use.
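
One way to picture the four formats is as a lookup from a format name to a resource path and an Accept header. The sketch below assumes that JSON and plain text come from a transcript resource and that WebVTT and SRT come from a captions resource, with the media types shown; these paths and media types are assumptions to confirm against Rev AI's API reference, and transcript_request is a hypothetical helper.

```python
# Format name -> (resource, Accept header). Values are assumptions.
TRANSCRIPT_FORMATS = {
    "json": ("transcript", "application/vnd.rev.transcript.v1.0+json"),
    "text": ("transcript", "text/plain"),
    "vtt": ("captions", "text/vtt"),
    "srt": ("captions", "application/x-subrip"),
}


def transcript_request(job_id, fmt):
    """Return (path, headers) for fetching a finished job in the given format."""
    try:
        resource, accept = TRANSCRIPT_FORMATS[fmt.lower()]
    except KeyError:
        supported = ", ".join(sorted(TRANSCRIPT_FORMATS))
        raise ValueError(f"Unsupported format {fmt!r}; choose one of: {supported}")
    return f"/jobs/{job_id}/{resource}", {"Accept": accept}
```

For example, transcript_request("abc123", "srt") yields the captions path with a SubRip Accept header, while an unknown format raises a ValueError that lists the supported names.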

What authentication does the Rev AI integration require?

Rev AI uses an access token (bearer token) generated from your Rev AI account settings. Enter this token in your Tars dashboard, and the agent authenticates all API requests automatically. No OAuth flow is needed.

Does Tars store transcripts or audio data from Rev AI?

No. Transcripts and audio data are fetched live from Rev AI when requested. Tars does not store, cache, or replicate your transcription content. Job data remains in your Rev AI account and follows Rev AI's data retention policies.

Can the agent handle multi-speaker audio with speaker labels?

Yes. When speaker diarization is enabled in the transcription job, Rev AI identifies and labels different speakers in the transcript. The agent returns the transcript with speaker tags so you can see who said what. You can also specify the number of audio channels for multi-channel recordings.
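
To show what "who said what" could look like in practice, here is a minimal sketch that turns a JSON transcript into labeled lines. It assumes the transcript is shaped as a list of monologues, each with a speaker number and a list of elements carrying a value field; that shape is an assumption about Rev AI's JSON format, and render_speakers is a hypothetical helper.

```python
def render_speakers(transcript):
    """Return one 'Speaker N: text' line per monologue in the transcript."""
    lines = []
    for monologue in transcript.get("monologues", []):
        # Elements are assumed to include both words and punctuation/spacing,
        # so joining their values reproduces the spoken sentence.
        text = "".join(el.get("value", "") for el in monologue.get("elements", []))
        lines.append(f"Speaker {monologue.get('speaker')}: {text.strip()}")
    return lines
```

Each returned line pairs a speaker label with the joined text of that speaker's turn, which is the form a chat reply would typically show.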

How does custom vocabulary improve transcription accuracy?

Custom vocabulary lets you submit domain-specific terms, brand names, or technical jargon to Rev AI. The speech recognition engine then prioritizes these terms during transcription, reducing misrecognition. The agent submits vocabulary lists and tracks their processing status before they are applied to jobs.
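
A vocabulary submission can be sketched as a small payload builder that cleans the term list before it is sent. The custom_vocabularies and phrases field names are assumptions based on Rev AI's public API, and build_vocabulary_payload is a hypothetical helper; the cleaning rules (trim whitespace, drop blanks, remove case-insensitive duplicates) are illustrative choices, not documented behavior.

```python
def build_vocabulary_payload(terms):
    """Return a request body containing the cleaned, de-duplicated phrases."""
    seen = set()
    phrases = []
    for term in terms:
        cleaned = term.strip()
        key = cleaned.lower()
        # Skip blanks and repeats, keeping the first spelling encountered.
        if cleaned and key not in seen:
            seen.add(key)
            phrases.append(cleaned)
    if not phrases:
        raise ValueError("At least one non-empty term is required")
    # Field names are assumptions; check the current API reference.
    return {"custom_vocabularies": [{"phrases": phrases}]}
```

Cleaning the list up front avoids submitting empty or repeated phrases, which keeps the vocabulary small and easier to review.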

What happens if a transcription job fails?

The agent queries the job status and relays the failure reason from Rev AI. Common causes include unsupported audio formats or inaccessible media URLs. The agent suggests corrective actions like verifying the URL, converting the audio format, or resubmitting with different settings.
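
The failure-handling flow described above might be sketched as a function that turns a job status response into a short message with a suggested next step. The field names (status, failure, failure_detail) and the failure codes used as keys are assumptions about Rev AI's job object; summarize_job and the suggestion texts are hypothetical.

```python
# Failure code -> suggested corrective action. Codes are assumptions.
SUGGESTIONS = {
    "download_failure": "Check that the media URL is publicly reachable, then resubmit.",
    "invalid_media": "Convert the audio to a supported format and resubmit.",
}


def summarize_job(job):
    """Return a one-line, human-readable summary of a job status response."""
    job_id = job.get("id")
    status = job.get("status")
    if status != "failed":
        return f"Job {job_id} is {status}."
    reason = job.get("failure", "unknown")
    detail = job.get("failure_detail", "No detail provided.")
    # Fall back to a generic suggestion for codes not listed above.
    hint = SUGGESTIONS.get(reason, "Review the job settings and resubmit.")
    return f"Job {job_id} failed ({reason}): {detail} {hint}"
```

Unknown failure codes fall through to a generic suggestion, so the message stays useful even when the code is not in the lookup table.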

Can the agent transcribe audio in languages other than English?

Yes. Rev AI supports 36 languages. The agent specifies the language code (ISO 639-1 or BCP-47) when submitting a transcription job. If no language is specified, Rev AI defaults to English. Language identification is also available as a separate NLP feature.

How to add Tools to your AI Agent

Supercharge your AI Agent with Tool Integrations

Don't limit your AI Agent to basic conversations. Watch how to configure and add powerful tools that make your agent smarter and more functional.

Privacy & Security

We’ll never let you lose sleep over privacy and security concerns

At Tars, we take privacy and security very seriously. We are compliant with GDPR, ISO, SOC 2, and HIPAA.

Still scrolling? We both know you're interested.

Let's chat about AI Agents the old-fashioned way. Get a demo tailored to your requirements.

Schedule a Demo