Skip to main content
Represents a streamed chunk of an audio transcriptions response returned by model, based on the provided input.
data: {
  "type": "transcript.text.delta",
  "delta": "The"
}

data: {
  "type": "transcript.text.delta",
  "delta": " quick"
}

...

data: {
  "type": "transcript.text.done",
  "text": "The quick brown fox jumps over the lazy dog.",
  "usage": {
    "type": "tokens",
    "input_tokens": 20,
    "output_tokens": 10,
    "total_tokens": 30,
    "input_audio_length_ms": 18000,
    "processed_audio_length_ms": 24000,
    "input_token_details": {
      "audio_tokens": 10,
      "text_tokens": 10
    }
  }
}

data: [DONE]
type
enum<string>
required
The event type.Available options: transcript.text.delta, transcript.text.done
delta
string | null
The incremental transcript text.
text
string | null
The transcribed text.
usage
object | null