OpenAI API University: Whisper — Speech to Text API

#whisper #speech-to-text #transcription #audio #openai-api

Whisper is OpenAI's speech recognition model — highly accurate transcription in 50+ languages, available via API.

Transcription API

with open("audio.mp3", "rb") as audio_file:
    transcript = client.audio.transcriptions.create(
        model="whisper-1",
        file=audio_file,
        language="en"  # optional, auto-detects if omitted
    )
print(transcript.text)

Translation

# Translate any language to English
translation = client.audio.translations.create(
    model="whisper-1",
    file=audio_file
)

Use Cases

Meeting transcription, podcast transcripts, voice commands, multilingual customer support

▶

YouTube • Top 10

OpenAI API University: Whisper — Speech to Text API

Tap to Watch ›

📸

Google Images • Top 10

OpenAI API University: Whisper — Speech to Text API

Tap to View ›

Reference:

Whisper API documentation

https://en.wikipedia.org/wiki/Special:Search?search=Whisper

📚 OpenAI API University — Full Course Syllabus

📋 Study this course on TaskLoco

← Back to Syllabus 🎓 All Courses

Make Work Feel Like Play

TaskLoco™ takes the simple joy of a sticky note and transforms it into a powerful, intuitive system that helps you organize your entire world—without the stress.

Ideas, tasks, files, links, reminders—everything snaps together like LEGO blocks, instantly and effortlessly.

What used to drain you now feels natural, even fun.

After decades of overcomplicated “productivity” tools, this is the first one that finally works with your mind instead of against it.

Join the TaskLoco™ Community

Instagram TikTok Facebook YouTube Substack Reddit

TaskLoco App • About • Terms • Privacy

“Bring genius to the world free.”