Speech-to-Text
Generate
Speech-to-Text
Transcribe audio files using Whisper, GPT-4o Transcribe, or ElevenLabs Scribe
POST
Speech-to-Text
Multipart Request
Models
| Slug | Notes | Cost |
|---|---|---|
whisper | Fast, multilingual, 99 languages | 2 tokens |
gpt-4o-transcribe | Highest accuracy | 2 tokens |
elevenlabs-scribe | Best for meetings, supports diarization | 2 tokens |
Response
Diarization (who said what)
Available withelevenlabs-scribe:
[Speaker 1]: Hello... [Speaker 2]: Hi there...
