Speech, audio, and AI music

Voice leader

ElevenLabs

Create lifelike speech with our AI voice generator and voice agents platform.

Closed Source / PlatformFree / SubscriptionAPI

Voice & AudioAPITTS

Visit official site View details

AI music

Suno

Create stunning original music for free in seconds using our AI generator.

Closed Source / PlatformFree / SubscriptionNo API

Voice & AudioMusic GenerationAI music

Visit official site View details

music composition

AIVA

AIVA is a Speech, audio, and AI music product from AIVA, focused on music composition with tags such as AI music.

closed-source / platformfree / paidNo API

AI music

Visit official site View details

enterprise speech

Alibaba Cloud Speech

阿里云

Alibaba Cloud Speech is a Speech recognition product from 阿里云, focused on enterprise speech with tags such as ASR, Speech API, China ecosystem.

closed-source / platformfree trial / usage-basedAPI

ASRSpeech APIChina ecosystem

Visit official site View details

enterprise narration

Amazon Polly

AWS

Amazon Polly is a Text to speech product from AWS, focused on enterprise narration with tags such as TTS, Voice API, API.

closed-source / platformfree trial / usage-basedAPI

TTSVoice APIAPI

Visit official site View details

cloud transcription

Amazon Transcribe

AWS

Amazon Transcribe is a Speech recognition product from AWS, focused on cloud transcription with tags such as ASR, Speech API, API.

closed-source / platformfree trial / usage-basedAPI

ASRSpeech APIAPI

Visit official site View details

transcription platform

AssemblyAI

AssemblyAI is a Speech recognition product from AssemblyAI, focused on transcription platform with tags such as ASR, Speech API, API.

closed-source / platformfree trial / usage-basedAPI

ASRSpeech APIAPI

Visit official site View details

AI music

AudioGen Medium

Azure AI Speech

Microsoft

Azure AI Speech is a Speech recognition product from Microsoft, focused on enterprise speech with tags such as ASR, Speech API, API.

closed-source / platformfree trial / usage-basedAPI

ASRSpeech APIAPI

Visit official site View details

enterprise voice

Azure Text to Speech

Microsoft

Azure Text to Speech is a Text to speech product from Microsoft, focused on enterprise voice with tags such as TTS, Voice API, API.

closed-source / platformfree trial / usage-basedAPI

TTSVoice APIAPI

Visit official site View details

cloud ASR

Baidu Speech

百度智能云

Baidu Speech is a Speech recognition product from 百度智能云, focused on cloud ASR with tags such as ASR, Speech API, China ecosystem.

closed-source / platformfree trial / usage-basedAPI

ASRSpeech APIChina ecosystem

Visit official site View details

royalty-friendly music

Beatoven.ai

Beatoven

Beatoven.ai is a Speech, audio, and AI music product from Beatoven, focused on royalty-friendly music with tags such as AI music.

closed-source / platformfree / paidNo API

AI music

Visit official site View details

one-click music generation

Boomy

Boomy is a Speech, audio, and AI music product from Boomy, focused on one-click music generation with tags such as AI music.

closed-source / platformfree / paidNo API

AI music

Visit official site View details

speech recognition

Canary 1B

NVIDIA

Canary 1B is a Speech recognition product from NVIDIA, focused on speech recognition with tags such as ASR, Open source.

open-source / self-hostedopen source / self-hostedAPI

ASROpen source

Visit official site View details

realtime voice

Cartesia

Cartesia is a Text to speech product from Cartesia, focused on realtime voice with tags such as TTS, Voice API, API.

closed-source / platformfree trial / usage-basedAPI

TTSVoice APIAPI

Visit official site View details

open-source TTS

Chatterbox TTS

Resemble AI

Chatterbox TTS is a Text to speech product from Resemble AI, focused on open-source TTS with tags such as TTS, Open source.

open-source / self-hostedopen source / self-hostedAPI

TTSOpen source

Visit official site View details

open-source TTS

Coqui TTS

Coqui

Coqui TTS is a Text to speech product from Coqui, focused on open-source TTS with tags such as TTS, Open source.

open-source / self-hostedopen source / self-hostedAPI

TTSOpen source

Visit official site View details

open-source TTS

CosyVoice

阿里通义

CosyVoice is a Text to speech product from 阿里通义, focused on open-source TTS with tags such as TTS, Open source, China ecosystem.

open-source / self-hostedopen source / self-hostedAPI

TTSOpen sourceChina ecosystem

Visit official site View details

open-source TTS

CosyVoice 2

阿里通义

CosyVoice 2 is a Text to speech product from 阿里通义, focused on open-source TTS with tags such as TTS, Open source, China ecosystem.

open-source / self-hostedopen source / self-hostedAPI

TTSOpen sourceChina ecosystem

Visit official site View details

open-source TTS

CSM-1B

Sesame

CSM-1B is a Text to speech product from Sesame, focused on open-source TTS with tags such as TTS, Open source.

open-source / self-hostedopen source / self-hostedAPI

TTSOpen source

Visit official site View details

speech API

Deepgram

Deepgram is a Speech recognition product from Deepgram, focused on speech API with tags such as ASR, Speech API, Realtime.

closed-source / platformfree trial / usage-basedAPI

ASRSpeech APIRealtime

Visit official site View details

Edit suite

Descript

Descript makes editing video and audio as easy as editing text.

Closed Source / PlatformFree / SubscriptionNo API

Voice & AudioASRAI music

Visit official site View details

voice localization

Dubverse

Dubverse is a Text to speech product from Dubverse, focused on voice localization with tags such as TTS, Translation, Workflow.

closed-source / platformfree trial / usage-basedAPI

TTSTranslationWorkflow

Visit official site View details

open-source TTS

F5-TTS

SWivid

F5-TTS is a Text to speech product from SWivid, focused on open-source TTS with tags such as TTS, Open source.

open-source / self-hostedopen source / self-hostedAPI

TTSOpen source

Visit official site View details

meeting transcription

Fireflies AI

Fireflies

Fireflies AI is a Speech recognition product from Fireflies, focused on meeting transcription with tags such as ASR, Workflow, Knowledge base.

closed-source / platformfree trial / usage-basedAPI

ASRWorkflowKnowledge base

Visit official site View details

voice clone

Fish Audio

Fish Audio is a Text to speech product from Fish Audio, focused on voice clone with tags such as TTS, Voice clone, API.

closed-source / platformfree trial / usage-basedAPI

TTSVoice cloneAPI

Visit official site View details

speech recognition framework

FunASR

阿里通义

FunASR is a Speech recognition product from 阿里通义, focused on speech recognition framework with tags such as ASR, Open source, China ecosystem.

open-source / self-hostedopen source / self-hostedAPI

ASROpen sourceChina ecosystem

Visit official site View details

multilingual transcription

Gladia

Gladia is a Speech recognition product from Gladia, focused on multilingual transcription with tags such as ASR, Speech API, API.

closed-source / platformfree trial / usage-basedAPI

ASRSpeech APIAPI

Visit official site View details

cloud ASR

Google Cloud Speech-to-Text

Google

Google Cloud Speech-to-Text is a Speech recognition product from Google, focused on cloud ASR with tags such as ASR, Speech API, API.

closed-source / platformfree trial / usage-basedAPI

ASRSpeech APIAPI

Visit official site View details

cloud TTS

Google Cloud Text-to-Speech

Google

Google Cloud Text-to-Speech is a Text to speech product from Google, focused on cloud TTS with tags such as TTS, Voice API, API.

closed-source / platformfree trial / usage-basedAPI

TTSVoice APIAPI

Visit official site View details

Chinese ASR

iFlytek Open Platform

科大讯飞

iFlytek Open Platform is a Speech recognition product from 科大讯飞, focused on Chinese ASR with tags such as ASR, Speech API, China ecosystem.

closed-source / platformfree trial / usage-basedAPI

ASRSpeech APIChina ecosystem

Visit official site View details

singing and voice clone

Kits AI

Kits AI is a Text to speech product from Kits AI, focused on singing and voice clone with tags such as TTS, Voice clone, AI music.

closed-source / platformfree trial / usage-basedAPI

TTSVoice cloneAI music

Visit official site View details

open-source TTS

Kokoro 82M

Hexgrad

Kokoro 82M is a Text to speech product from Hexgrad, focused on open-source TTS with tags such as TTS, Open source.

open-source / self-hostedopen source / self-hostedAPI

TTSOpen source

Visit official site View details

speech recognition

Kyutai STT 1B

Kyutai

Kyutai STT 1B is a Speech recognition product from Kyutai, focused on speech recognition with tags such as ASR, Open source.

open-source / self-hostedopen source / self-hostedAPI

ASROpen source

Visit official site View details

commercial music generation

Loudly

Loudly is a Speech, audio, and AI music product from Loudly, focused on commercial music generation with tags such as AI music.

closed-source / platformfree / paidNo API

AI music

Visit official site View details

Brand voice

LOVO

Award-winning AI Voice Generator and text to speech software with 500+ voices in 100 languages.

Closed Source / PlatformFree / SubscriptionNo API

Voice & AudioTTSAI music

Visit official site View details

AI music

Lyria 2

Google

Lyria 2 is a Speech, audio, and AI music product from Google, focused on AI music with tags such as AI music.

closed-source / platformfree / paidNo API

AI music

Visit official site View details

stem separation

Moises

Moises is a Speech, audio, and AI music product from Moises, focused on stem separation with tags such as Audio editing, Workflow.

closed-source / platformfree / paidNo API

Audio editingWorkflowAI music

Visit official site View details

open-source ASR

Moonshine ASR

Moonshine

Moonshine ASR is a Speech recognition product from Moonshine, focused on open-source ASR with tags such as ASR, Open source.

open-source / self-hostedopen source / self-hostedAPI

ASROpen source

Visit official site View details

streaming music generation

Mubert

Mubert is a Speech, audio, and AI music product from Mubert, focused on streaming music generation with tags such as AI music, API.

closed-source / platformfree / paidAPI

AI musicAPI

Visit official site View details

Commercial voice

Murf

Murf is a voice and audio product from Murf, focused on commercial voice workflows and official access.

Closed Source / PlatformFree / SubscriptionNo API

Voice & AudioTTSAI music

Visit official site View details

AI singing

Musicfy

Musicfy is a Speech, audio, and AI music product from Musicfy, focused on AI singing with tags such as AI music, Voice clone.

closed-source / platformfree / paidNo API

AI musicVoice clone

Visit official site View details

AI music

MusicGen Large

MusicLM

Google

MusicLM is a Speech, audio, and AI music product from Google, focused on AI music with tags such as AI music.

closed-source / platformfree / paidNo API

AI music

Visit official site View details

scripted narration

Narakeet

Narakeet is a Text to speech product from Narakeet, focused on scripted narration with tags such as TTS, Workflow.

closed-source / platformfree trial / usage-basedAPI

TTSWorkflow

Visit official site View details

open-source voice clone

OpenVoice

MyShell

OpenVoice is a Text to speech product from MyShell, focused on open-source voice clone with tags such as TTS, Voice clone, Open source.

open-source / self-hostedopen source / self-hostedAPI

TTSVoice cloneOpen source

Visit official site View details

speech recognition

Parakeet RNNT 1.1B

NVIDIA

Parakeet RNNT 1.1B is a Speech recognition product from NVIDIA, focused on speech recognition with tags such as ASR, Open source.

open-source / self-hostedopen source / self-hostedAPI

ASROpen source

Visit official site View details

open-source TTS

Piper TTS

Piper

Piper TTS is a Text to speech product from Piper, focused on open-source TTS with tags such as TTS, Open source.

open-source / self-hostedopen source / self-hostedAPI

TTSOpen source

Visit official site View details

dialogue voice

PlayDialog

PlayDialog is a Text to speech product from PlayDialog, focused on dialogue voice with tags such as TTS, Voice API, API.

closed-source / platformfree trial / usage-basedAPI

TTSVoice APIAPI

Visit official site View details

TTS API

PlayHT

PlayHT is a voice and audio product from PlayHT, focused on tts api workflows and official access.

Closed Source / PlatformFree / SubscriptionAPI

Voice & AudioAPITTS

Visit official site View details

podcast workflow

Podcastle

Podcastle is a Speech, audio, and AI music product from Podcastle, focused on podcast workflow with tags such as Audio editing, Workflow.

closed-source / platformfree / paidNo API

Audio editingWorkflowAI music

Visit official site View details

brand voiceover

ReadSpeaker

ReadSpeaker is a Text to speech product from ReadSpeaker, focused on brand voiceover with tags such as TTS, Voice clone.

closed-source / platformfree trial / usage-basedAPI

TTSVoice clone

Visit official site View details

Voice clone

Resemble AI

Resemble AI helps enterprises generate secure voice AI, verify proper usage, and detect deepfakes instantly.

Closed Source / PlatformFree / SubscriptionAPI

Voice & AudioAPITTS

Visit official site View details

speech recognition API

Rev AI

Rev

Rev AI is a Speech recognition product from Rev, focused on speech recognition API with tags such as ASR, Speech API, API.

closed-source / platformfree trial / usage-basedAPI

ASRSpeech APIAPI

Visit official site View details

experimental music generation

Riffusion

Riffusion is a Speech, audio, and AI music product from Riffusion, focused on experimental music generation with tags such as AI music, Open source.

open-source / self-hostedopen source / self-hostedNo API

AI musicOpen source

Visit official site View details

AI music

Riffusion Fuzz

Riffusion

Riffusion Fuzz is a Speech, audio, and AI music product from Riffusion, focused on AI music with tags such as AI music.

closed-source / platformfree / paidNo API

AI music

Visit official site View details

multilingual speech recognition

SeamlessM4T

SenseVoice

阿里通义

SenseVoice is a Speech recognition product from 阿里通义, focused on open-source speech recognition with tags such as ASR, Open source, China ecosystem.

open-source / self-hostedopen source / self-hostedAPI

ASROpen sourceChina ecosystem

Visit official site View details

open-source ASR

Sherpa ONNX ASR

Sherpa

Sherpa ONNX ASR is a Speech recognition product from Sherpa, focused on open-source ASR with tags such as ASR, Open source.

open-source / self-hostedopen source / self-hostedAPI

ASROpen source

Visit official site View details

low-latency ASR

Soniox

Soniox is a Speech recognition product from Soniox, focused on low-latency ASR with tags such as ASR, Realtime, Speech API.

closed-source / platformfree trial / usage-basedAPI

ASRRealtimeSpeech API

Visit official site View details

background music generation

Soundraw

Soundraw is a Speech, audio, and AI music product from Soundraw, focused on background music generation with tags such as AI music.

closed-source / platformfree / paidNo API

AI music

Visit official site View details

open-source TTS

Spark-TTS

SparkAudio

Spark-TTS is a Text to speech product from SparkAudio, focused on open-source TTS with tags such as TTS, Open source.

open-source / self-hostedopen source / self-hostedAPI

TTSOpen source

Visit official site View details

open-source ASR

SpeechBrain ASR

SpeechBrain

SpeechBrain ASR is a Speech recognition product from SpeechBrain, focused on open-source ASR with tags such as ASR, Open source.

open-source / self-hostedopen source / self-hostedAPI

ASROpen source

Visit official site View details

Read aloud

Speechify

Speechify reads anything aloud to you.

Closed Source / PlatformFree / SubscriptionNo API

Voice & AudioTTSAI music

Visit official site View details

enterprise ASR

Speechmatics

Speechmatics is a Speech recognition product from Speechmatics, focused on enterprise ASR with tags such as ASR, Speech API, API.

closed-source / platformfree trial / usage-basedAPI

ASRSpeech APIAPI

Visit official site View details

AI music

Stable Audio

Stability AI

Stable Audio is a Speech, audio, and AI music product from Stability AI, focused on AI music with tags such as AI music, Audio editing.

closed-source / platformfree / paidNo API

AI musicAudio editing

Visit official site View details

AI audio

Stable Audio Open

Stability AI

Stable Audio Open is a Speech, audio, and AI music product from Stability AI, focused on AI audio with tags such as AI music, Audio editing.

closed-source / platformfree / paidNo API

AI musicAudio editing

Visit official site View details

open-source TTS

StyleTTS2

StyleTTS

StyleTTS2 is a Text to speech product from StyleTTS, focused on open-source TTS with tags such as TTS, Open source.

open-source / self-hostedopen source / self-hostedAPI

TTSOpen source

Visit official site View details

AI music

Suno v4

Suno

Suno v4 is a Speech, audio, and AI music product from Suno, focused on AI music with tags such as AI music.

closed-source / platformfree / paidNo API

AI music

Visit official site View details

AI music

Suno v4.5

Suno

Suno v4.5 is a Speech, audio, and AI music product from Suno, focused on AI music with tags such as AI music.

closed-source / platformfree / paidNo API

AI music

Visit official site View details

speech API

Tencent Cloud ASR

腾讯云

Tencent Cloud ASR is a Speech recognition product from 腾讯云, focused on speech API with tags such as ASR, Speech API, China ecosystem.

closed-source / platformfree trial / usage-basedAPI

ASRSpeech APIChina ecosystem

Visit official site View details

open-source TTS

Tortoise TTS

Tortoise

Tortoise TTS is a Text to speech product from Tortoise, focused on open-source TTS with tags such as TTS, Open source.

open-source / self-hostedopen source / self-hostedAPI

TTSOpen source

Visit official site View details

online voiceover

TTSMaker

TTSMaker is a Text to speech product from TTSMaker, focused on online voiceover with tags such as TTS.

closed-source / platformfree trial / usage-basedAPI

TTS

Visit official site View details

character voiceover

Typecast

Typecast is a Text to speech product from Typecast, focused on character voiceover with tags such as TTS, Voice clone.

closed-source / platformfree trial / usage-basedAPI

TTSVoice clone

Visit official site View details

Music generation

Udio

Discover, create, and share music with the world.

Closed Source / PlatformFree / SubscriptionNo API

Voice & AudioMusic GenerationAI music

Visit official site View details

AI music

Udio 1.3

Udio

Udio 1.3 is a Speech, audio, and AI music product from Udio, focused on AI music with tags such as AI music.

closed-source / platformfree / paidNo API

AI music

Visit official site View details

Enterprise TTS

WellSaid Labs

WellSaid Labs is a voice and audio product from WellSaid Labs, focused on enterprise tts workflows and official access.

Closed Source / PlatformFree / SubscriptionNo API

Voice & AudioTTSAI music

Visit official site View details

open ASR

Whisper

OpenAI

Whisper is a Speech recognition product from OpenAI, focused on open ASR with tags such as ASR, Open source.

open-source / self-hostedopen source / self-hostedAPI

ASROpen source

Visit official site View details

open-source ASR

Whisper Large V3

OpenAI

Whisper Large V3 is a Speech recognition product from OpenAI, focused on open-source ASR with tags such as ASR, Open source.

open-source / self-hostedopen source / self-hostedAPI

ASROpen source

Visit official site View details

open-source ASR

Whisper Large V3 Turbo

OpenAI

Whisper Large V3 Turbo is a Speech recognition product from OpenAI, focused on open-source ASR with tags such as ASR, Open source.

open-source / self-hostedopen source / self-hostedAPI

ASROpen source

Visit official site View details

open-source TTS

XTTS v2

Coqui

XTTS v2 is a Text to speech product from Coqui, focused on open-source TTS with tags such as TTS, Open source.

open-source / self-hostedopen source / self-hostedAPI

TTSOpen source

Visit official site View details

open-source TTS

Zonos TTS

Zyphra

Zonos TTS is a Text to speech product from Zyphra, focused on open-source TTS with tags such as TTS, Open source.

open-source / self-hostedopen source / self-hostedAPI

TTSOpen source

Visit official site View details

Selection guide

How to choose Speech, audio, and AI music

Separate TTS, dubbing, music generation, and voice cloning before comparing tools.
For dubbing, listen for naturalness and phrasing before checking price; ads and narration suffer most from robotic delivery.
Multilingual projects should focus on language coverage, accent quality, and subtitle workflow support.
Before commercial use, verify voice licensing, consent for cloning, and content copyright.

What matters first on Speech, audio, and AI music category pages?

Start with official access, pricing model, API support, open/closed status, and common use cases.