OpenAI

Whisper

Speech recognition model that supports multilingual transcription and translation

Category: Audio
Model ID: whisper-1
Language Support: 99+ languages
Model Type: Automatic Speech Recognition (ASR)
Input Format: mp3, mp4, wav, m4a, etc.

Pricing and Specifications

💰 Pricing

Price: $0.006 / minute
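
To make the per-minute price concrete, here is a minimal cost-estimation sketch. It assumes billing is pro-rata on audio duration with no rounding or minimum charge, which the listing does not confirm; the function name `estimate_cost_usd` is hypothetical.

```python
PRICE_PER_MINUTE_USD = 0.006  # listed price for whisper-1


def estimate_cost_usd(duration_seconds: float) -> float:
    """Estimate transcription cost for a clip of the given length.

    Assumption: cost scales linearly with duration (no rounding up to
    whole minutes, no minimum charge). Actual billing may differ.
    """
    if duration_seconds < 0:
        raise ValueError("duration_seconds must be non-negative")
    return duration_seconds / 60 * PRICE_PER_MINUTE_USD


if __name__ == "__main__":
    for label, seconds in [("10-minute call", 600), ("1-hour lecture", 3600)]:
        print(f"{label}: ${estimate_cost_usd(seconds):.3f}")
```

Under that assumption, ten minutes of audio comes to about $0.06 and one hour to about $0.36.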

⚙️ Specifications

Language Support: 99+ languages
Model Type: Automatic Speech Recognition (ASR)
Input Format: mp3, mp4, wav, m4a, etc.
Max File Size: 25 MB
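
The size limit and input formats above can be checked locally before uploading. The following is a rough sketch, not part of the API: `check_audio_file` is a hypothetical helper, the 25 MB limit is interpreted as 25 × 1024 × 1024 bytes (an assumption), and the extension set covers only the four formats named explicitly, since the listing's "etc." leaves the full list open.

```python
import os

MAX_FILE_BYTES = 25 * 1024 * 1024  # assumption: "25MB" read as 25 MiB
# Only the formats named in the listing; others may also be accepted.
KNOWN_EXTENSIONS = {".mp3", ".mp4", ".wav", ".m4a"}


def check_audio_file(path: str) -> list[str]:
    """Return a list of problems found; an empty list means no issue was detected."""
    problems = []
    ext = os.path.splitext(path)[1].lower()
    if ext not in KNOWN_EXTENSIONS:
        problems.append(f"extension {ext!r} is not one of the listed formats")
    size = os.path.getsize(path)
    if size > MAX_FILE_BYTES:
        problems.append(f"file is {size} bytes, over the {MAX_FILE_BYTES}-byte limit")
    return problems
```

A caller might run `check_audio_file("audio.mp3")` and skip the upload, or split the recording into smaller pieces, if the returned list is non-empty.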

API Usage Examples

Python

from openai import OpenAI

client = OpenAI(
    base_url="https://api.xairouter.com/v1",
    api_key="your-api-key"
)

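# Open the audio in binary mode; the with-block closes the file afterwards
with open("audio.mp3", "rb") as audio_file:
    transcript = client.audio.transcriptions.create(
        model="whisper-1",
        file=audio_file,
        language="zh",  # Optional: language of the audio (ISO-639-1 code)
    )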

print(transcript.text)

cURL

curl https://api.xairouter.com/v1/audio/transcriptions \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -F file="@audio.mp3" \
  -F model="whisper-1" \
  -F language="zh"