voice-ai tools

Discover essential tools for building voice applications. Find the perfect tools for speech recognition, text-to-speech, and more.

Featured

All tools (57)

Rasa

agentfreemium
4.2

Open-source framework for building contextual AI assistants with sophisticated dialogue management.

1 Language
API

Play.ht

ttsfreemium
4.6

AI Voice Generator and voice cloning tool, focusing on high-quality synthetic media and audio articles.

1 Language
API

Speechmatics

asrpaid
4.4

Global ASR platform known for high accuracy across a vast array of accents and challenging audio.

1 Language
API

Murf AI

ttsfreemium
4.3

A popular text-to-speech studio used for professional voiceovers, e-learning, and commercial content.

5 Languages
API

Deepdub

dubbingpaid
4.7

AI-powered localization service for film, TV, and gaming, maintaining the original actor's voice tone and cadence.

5 Languages

Lyrebird (Descript)

cloningfreemium
4.5

Voice cloning technology integrated within the Descript editor, allowing text-to-speech editing of recorded audio.

1 Language

Amazon Polly

ttspaid
4.1

AWS's Text-to-Speech service with high-quality neural voices and SSML control.

1 Language
API

Vonage Voice API

agentpaid
4.3

A versatile communications API for making and receiving voice calls, focusing on call control and webhooks.

1 Language
API

LiveKit

agentfreemium
4.7

Open-source WebRTC platform for real-time video/audio, often used to build custom low-latency voice agents.

1 Language
API

Gladia

asrfreemium
4.2

Fast ASR service focused on real-time transcription and multilingual low-latency use cases.

1 Language
API

iSpeech

ttspaid
3.9

A simple, clean API for text-to-speech, popular for accessibility and content reading apps.

4 Languages
API

Voicify

agentpaid
4.0

Platform for creating and deploying voice experiences across platforms like Alexa, Google Assistant, and custom apps.

3 Languages

Meta Voicebox

cloningfree
4.9

State-of-the-art Generative AI model for speech synthesis and voice editing (research tool, not fully API'd).

1 Language

Lovo AI (Genny)

ttsfreemium
4.0

AI voice generator specializing in realistic voiceovers for marketing, video, and e-learning.

1 Language
API

Acapela Group

ttspaid
3.8

Long-standing TTS provider focusing on accessibility, branded voices, and embedded solutions.

1 Language
API

Twilio Voice API

agentpaid
4.8

Programmable telephony API that connects software to the PSTN (phone lines) and SIP endpoints, essential for voice agents.

1 Language
API

Coqui TTS

ttsfree
4.1

Open-source toolkit for Text-to-Speech (TTS) and voice cloning, focused on research and customization.

1 Language

Google Dialogflow CX

agentpaid
4.4

Google's advanced conversational AI platform for designing complex, multi-turn virtual agents.

1 Language
API

Dubbing AI

dubbingfreemium
4.0

Simple, fast AI dubbing tool for content creators and small businesses, focused on quick turnaround.

4 Languages

Amazon Transcribe

asrpaid
4.1

AWS's scalable, managed transcription service for both real-time and batch processing of audio/video.

1 Language
API

Voicemaker

ttsfreemium
3.7

Web-based TTS tool with a focus on speed and ease of use for quick voice generation.

4 Languages
API

IBM Watson Speech to Text

asrpaid
3.9

IBM's cognitive service for converting speech to text, with strong customization for domain-specific vocabulary.

1 Language
API

Pindrop

agentpaid
4.5

Voice biometrics and fraud detection platform, crucial for securing voice transactions in contact centers.

1 Language
API

Loqui.tech

dubbingfreemium
3.9

AI-powered dubbing service focused on e-learning and internal corporate communication video content.

3 Languages
API

Krisp

asrfreemium
4.6

AI-powered noise, voice, and echo cancellation technology, often integrated into real-time voice apps to improve ASR input.

1 Language
API

Synapse AI

agentpaid
3.8

Agent platform for building virtual assistants optimized for field service and technical support via voice.

1 Language
API

Voicera

asrfreemium
4.0

Meeting transcription and note-taking platform, focused on extracting insights and action items from conversations.

1 Language

Voice AI (Custom)

agentpaid
4.0

Represents the option of building your own full stack voice agent using foundational open-source models (e.g., Llama, Mistral, VAD).

1 Language

Synthesia

ttspaid
4.4

AI video generation platform where TTS voices are used with human avatars, often for corporate training and explainer videos.

1 Language
API

Vidnoz AI Voice Changer

cloningfreemium
3.5

Tool for voice cloning and celebrity/character voice conversion, often used for content creation and fun projects.

1 Language

OpenAI TTS

ttspaid
4.6

OpenAI's high-quality Text-to-Speech API, offering natural and expressive voices optimized for conversation.

4 Languages
API

AWS Lex

agentpaid
4.1

Service for building conversational interfaces for voice and text, using the same technology as Alexa.

4 Languages
API

Speechify

ttsfreemium
4.0

Leading text-to-speech platform focused on consumption, education, and accessibility.

1 Language
API

Respeecher

cloningpaid
4.8

Studio-grade voice cloning and voice-to-voice conversion, used in major film productions and for deepfake prevention.

2 Languages

CereVoice

ttspaid
3.6

Text-to-speech technology with a focus on custom voice creation and embedding TTS into various devices and apps.

1 Language
API

Google Translate API

dubbingpaid
4.3

Google's core translation engine, widely used as the translation step in any basic dubbing pipeline.

1 Language
API

Microsoft Translator

dubbingpaid
4.0

Microsoft's translation service, offering real-time translation capabilities for integration into communication apps.

1 Language
API

iCloner (Third Party)

cloningfreemium
3.8

A hypothetical third-party tool focusing purely on low-cost, quick voice cloning via a simple API.

2 Languages
API

Speechelo

ttspaid
3.5

Popular online TTS tool marketed to video creators for generating voiceovers easily without API integration.

1 Language

Amazon Connect

agentpaid
4.5

AWS's contact center service, which provides a framework for integrating AI services (Lex, Polly, Transcribe) into customer calls.

1 Language
API

Custom Fine-Tuned Whisper

asrpaid
4.6

Represents using open-source Whisper and fine-tuning it with domain-specific data for superior ASR accuracy in niche areas.

2 Languages

Microsoft Azure Speech to Text

asrpaid
4.3

Microsoft's cloud ASR service, offering strong real-time performance and deep integration with Azure enterprise tools.

1 Language
API

Voicely

ttspaid
3.7

TTS tool focused on creating audio for books, articles, and blog content with an emphasis on natural, reading-friendly tones.

3 Languages

Dubverse

dubbingfreemium
4.1

Collaborative video dubbing platform that makes the translation and voiceover process simple for teams.

4 Languages
API

AssemblyAI

asrfreemium
4.5

Transcription and Intelligence API, providing sentiment, summarization, and topic detection alongside ASR.

4 Languages
API

Coqui Studio

ttsfreemium
4.3

A web-based studio offering high-quality text-to-speech generation and voice conversion services.

7 Languages
API

Google Cloud Speech-to-Text

asrpaid
4.5

Google's powerful, scalable cloud ASR service, integrated with the wider GCP ecosystem.

4 Languages
API

Microsoft Azure Text-to-Speech

ttspaid
4.4

Enterprise-grade TTS with highly natural voices, supporting custom voice creation and wide language support.

4 Languages
API

Resemble AI

cloningpaid
4.8

Hyper-realistic voice cloning and synthesis, capable of 'Resemble Fill' for real-time error correction.

4 Languages
API

Whisper.cpp

asrfree
4.7

High-performance C++ port of OpenAI's Whisper model, optimized for fast, on-device transcription.

1 Language

Vogent

agentpaid
4.3

AI Voice Agents platform specializing in phone interactions, IVR navigation, and outbound calling campaigns.

3 Languages
API