Model detail
GPT-4o Transcribe Diarize
A transcription model that identifies who is speaking when.
speechtranscriptionspeech_to_textvoice_assistanttranscriptionenterprise_api
Model detail, capability tags, source, OpenRouter state, and public reputation.
A transcription model that identifies who is speaking when.
GPT-4o Transcribe Diarize Model | OpenAI API
P0 · official · high
| Input modalities | audio / text |
|---|---|
| Output modalities | text / structured_output |
| Tool use / Function calling | No reliable public source / No reliable public source |
| Structured output / Reasoning | Yes / No reliable public source |
| Vision / Audio / Video | No reliable public source / Yes / No reliable public source |
| Open weights / License | No / Not applicable |
| OpenRouter | No reliable public source |
| Community reputation | No reliable public source |