Getting Started
- Install Whisper locally via pip:
pip install openai-whisper. - Transcribe an audio file from the command line:
whisper audio.mp3 --model medium. - Choose from model sizes (tiny, base, small, medium, large) based on your accuracy and speed needs.
- Alternatively, use the OpenAI API for cloud-based transcription without local GPU requirements.
Key Features
- 99-language support provides accurate transcription across nearly all widely spoken languages worldwide.
- Multiple model sizes from tiny (39M params) to large (1.5B params) for flexible accuracy vs. speed tradeoffs.
- Automatic language detection identifies the spoken language and transcribes without manual language selection.
- Translation capability translates speech from any supported language directly into English text.
- Timestamp generation produces word-level and segment-level timestamps for subtitle and caption creation.
- Fully open-source with MIT license, enabling unrestricted commercial use, modification, and self-hosting.
// related tools
ElevenLabs
AI / Audio & Music
Industry-leading AI voice synthesis and cloning platform
freemium
web
Suno
AI / Audio & Music
AI music generation — create full songs from text prompts
freemium
web
Aider
AI / AI Coding Tools
Terminal-based AI pair programmer that edits code in your git repo
oss
web git