
Whisper is OpenAI's speech recognition model — highly accurate transcription in 50+ languages, available via API.
with open("audio.mp3", "rb") as audio_file:
transcript = client.audio.transcriptions.create(
model="whisper-1",
file=audio_file,
language="en" # optional, auto-detects if omitted
)
print(transcript.text)# Translate any language to English
translation = client.audio.translations.create(
model="whisper-1",
file=audio_file
)Reference:
TaskLoco™ — The Sticky Note GOAT