Speech recognition

enterprise speech

Alibaba Cloud Speech

阿里云

Alibaba Cloud Speech is a Speech recognition product from 阿里云, focused on enterprise speech with tags such as ASR, Speech API, China ecosystem.

closed-source / platformfree trial / usage-basedAPI
ASRSpeech APIChina ecosystem
cloud transcription

Amazon Transcribe

AWS

Amazon Transcribe is a Speech recognition product from AWS, focused on cloud transcription with tags such as ASR, Speech API, API.

closed-source / platformfree trial / usage-basedAPI
ASRSpeech APIAPI
transcription platform

AssemblyAI

AssemblyAI

AssemblyAI is a Speech recognition product from AssemblyAI, focused on transcription platform with tags such as ASR, Speech API, API.

closed-source / platformfree trial / usage-basedAPI
ASRSpeech APIAPI
enterprise speech

Azure AI Speech

Microsoft

Azure AI Speech is a Speech recognition product from Microsoft, focused on enterprise speech with tags such as ASR, Speech API, API.

closed-source / platformfree trial / usage-basedAPI
ASRSpeech APIAPI
cloud ASR

Baidu Speech

百度智能云

Baidu Speech is a Speech recognition product from 百度智能云, focused on cloud ASR with tags such as ASR, Speech API, China ecosystem.

closed-source / platformfree trial / usage-basedAPI
ASRSpeech APIChina ecosystem
speech recognition

Canary 1B

NVIDIA

Canary 1B is a Speech recognition product from NVIDIA, focused on speech recognition with tags such as ASR, Open source.

open-source / self-hostedopen source / self-hostedAPI
ASROpen source
official model

Cohere Transcribe

Cohere

Cohere Transcribe is a Foundation models product from Cohere, focused on official model with tags such as Foundation model, API, ASR.

closed-source / platformmodel access / platform distributionAPI
Foundation modelAPIASR
speech API

Deepgram

Deepgram

Deepgram is a Speech recognition product from Deepgram, focused on speech API with tags such as ASR, Speech API, Realtime.

closed-source / platformfree trial / usage-basedAPI
ASRSpeech APIRealtime
Edit suite

Descript

Descript

Descript makes editing video and audio as easy as editing text.

Closed Source / PlatformFree / SubscriptionNo API
Voice & AudioASRAI music
meeting transcription

Fireflies AI

Fireflies

Fireflies AI is a Speech recognition product from Fireflies, focused on meeting transcription with tags such as ASR, Workflow, Knowledge base.

closed-source / platformfree trial / usage-basedAPI
ASRWorkflowKnowledge base
speech recognition framework

FunASR

阿里通义

FunASR is a Speech recognition product from 阿里通义, focused on speech recognition framework with tags such as ASR, Open source, China ecosystem.

open-source / self-hostedopen source / self-hostedAPI
ASROpen sourceChina ecosystem
multilingual transcription

Gladia

Gladia

Gladia is a Speech recognition product from Gladia, focused on multilingual transcription with tags such as ASR, Speech API, API.

closed-source / platformfree trial / usage-basedAPI
ASRSpeech APIAPI
cloud ASR

Google Cloud Speech-to-Text

Google

Google Cloud Speech-to-Text is a Speech recognition product from Google, focused on cloud ASR with tags such as ASR, Speech API, API.

closed-source / platformfree trial / usage-basedAPI
ASRSpeech APIAPI
official model

GPT-4o mini Transcribe

OpenAI

GPT-4o mini Transcribe is a Foundation models product from OpenAI, focused on official model with tags such as Foundation model, API, ASR.

closed-source / platformmodel access / platform distributionAPI
Foundation modelAPIASR
official model

GPT-4o Transcribe

OpenAI

GPT-4o Transcribe is a Foundation models product from OpenAI, focused on official model with tags such as Foundation model, API, ASR.

closed-source / platformmodel access / platform distributionAPI
Foundation modelAPIASR
open model

Granite Speech 3.3

IBM

Granite Speech 3.3 is a Foundation models product from IBM, focused on open model with tags such as Foundation model, Open source, ASR.

open-source / self-hostedopen source / self-hostedNo API
Foundation modelOpen sourceASR
Chinese ASR

iFlytek Open Platform

科大讯飞

iFlytek Open Platform is a Speech recognition product from 科大讯飞, focused on Chinese ASR with tags such as ASR, Speech API, China ecosystem.

closed-source / platformfree trial / usage-basedAPI
ASRSpeech APIChina ecosystem
speech recognition

Kyutai STT 1B

Kyutai

Kyutai STT 1B is a Speech recognition product from Kyutai, focused on speech recognition with tags such as ASR, Open source.

open-source / self-hostedopen source / self-hostedAPI
ASROpen source
open-source ASR

Moonshine ASR

Moonshine

Moonshine ASR is a Speech recognition product from Moonshine, focused on open-source ASR with tags such as ASR, Open source.

open-source / self-hostedopen source / self-hostedAPI
ASROpen source
meeting notes

Otter AI

Otter

Otter AI is a Office and workflows product from Otter, focused on meeting notes with tags such as Workflow, ASR.

closed-source / platformfree / paidNo API
WorkflowASR
official model

Parakeet CTC 1.1B

NVIDIA

Parakeet CTC 1.1B is a Foundation models product from NVIDIA, focused on official model with tags such as Foundation model, API, ASR.

closed-source / platformmodel access / platform distributionAPI
Foundation modelAPIASR
speech recognition

Parakeet RNNT 1.1B

NVIDIA

Parakeet RNNT 1.1B is a Speech recognition product from NVIDIA, focused on speech recognition with tags such as ASR, Open source.

open-source / self-hostedopen source / self-hostedAPI
ASROpen source
meeting productivity

Read AI

Read AI

Read AI is a Office and workflows product from Read AI, focused on meeting productivity with tags such as Workflow, ASR.

closed-source / platformfree / paidNo API
WorkflowASR
speech recognition API

Rev AI

Rev

Rev AI is a Speech recognition product from Rev, focused on speech recognition API with tags such as ASR, Speech API, API.

closed-source / platformfree trial / usage-basedAPI
ASRSpeech APIAPI
multilingual speech recognition

SeamlessM4T

Meta

SeamlessM4T is a Speech recognition product from Meta, focused on multilingual speech recognition with tags such as ASR, Open source.

open-source / self-hostedopen source / self-hostedAPI
ASROpen source
open-source speech recognition

SenseVoice

阿里通义

SenseVoice is a Speech recognition product from 阿里通义, focused on open-source speech recognition with tags such as ASR, Open source, China ecosystem.

open-source / self-hostedopen source / self-hostedAPI
ASROpen sourceChina ecosystem
official model

SenseVoice Large

商汤日日新

SenseVoice Large is a Foundation models product from 商汤日日新, focused on official model with tags such as Foundation model, API, China ecosystem.

closed-source / platformmodel access / platform distributionAPI
Foundation modelAPIChina ecosystem
open-source ASR

Sherpa ONNX ASR

Sherpa

Sherpa ONNX ASR is a Speech recognition product from Sherpa, focused on open-source ASR with tags such as ASR, Open source.

open-source / self-hostedopen source / self-hostedAPI
ASROpen source
low-latency ASR

Soniox

Soniox

Soniox is a Speech recognition product from Soniox, focused on low-latency ASR with tags such as ASR, Realtime, Speech API.

closed-source / platformfree trial / usage-basedAPI
ASRRealtimeSpeech API
open-source ASR

SpeechBrain ASR

SpeechBrain

SpeechBrain ASR is a Speech recognition product from SpeechBrain, focused on open-source ASR with tags such as ASR, Open source.

open-source / self-hostedopen source / self-hostedAPI
ASROpen source
enterprise ASR

Speechmatics

Speechmatics

Speechmatics is a Speech recognition product from Speechmatics, focused on enterprise ASR with tags such as ASR, Speech API, API.

closed-source / platformfree trial / usage-basedAPI
ASRSpeech APIAPI
speech API

Tencent Cloud ASR

腾讯云

Tencent Cloud ASR is a Speech recognition product from 腾讯云, focused on speech API with tags such as ASR, Speech API, China ecosystem.

closed-source / platformfree trial / usage-basedAPI
ASRSpeech APIChina ecosystem
meeting productivity

tl;dv

tl;dv

tl;dv is a Office and workflows product from tl;dv, focused on meeting productivity with tags such as Workflow, ASR.

closed-source / platformfree / paidNo API
WorkflowASR
official model

Voxtral Mini Transcribe

Mistral AI

Voxtral Mini Transcribe is a Foundation models product from Mistral AI, focused on official model with tags such as Foundation model, API, ASR.

closed-source / platformmodel access / platform distributionAPI
Foundation modelAPIASR
open ASR

Whisper

OpenAI

Whisper is a Speech recognition product from OpenAI, focused on open ASR with tags such as ASR, Open source.

open-source / self-hostedopen source / self-hostedAPI
ASROpen source
open-source ASR

Whisper Large V3

OpenAI

Whisper Large V3 is a Speech recognition product from OpenAI, focused on open-source ASR with tags such as ASR, Open source.

open-source / self-hostedopen source / self-hostedAPI
ASROpen source
open-source ASR

Whisper Large V3 Turbo

OpenAI

Whisper Large V3 Turbo is a Speech recognition product from OpenAI, focused on open-source ASR with tags such as ASR, Open source.

open-source / self-hostedopen source / self-hostedAPI
ASROpen source
Selection guide

How to choose Speech recognition

  • Filter by audio environment first: meetings, support calls, phone audio, interviews, and short videos need different robustness.
  • Start with word error rate, speaker diarization, and timestamp accuracy.
  • Production use should check streaming, batch transcription, and hotword support.
  • For sensitive audio, then check on-prem deployment, private hosting, and data-retention policy.

What matters first on Speech recognition category pages?

Start with official access, pricing model, API support, open/closed status, and common use cases.