Import OpenAI Whisper Large V3 Turbo in OCI Generative AI

The OpenAI Whisper Large V3 Turbo model can now be imported into OCI Generative AI.

Whisper Large V3 Turbo is an optimized version of Whisper Large V3 for automatic speech recognition. This audio-to-text model is a fine-tuned version of a pruned Whisper Large V3 model, with the number of decoder layers reduced from 32 to 4 for faster transcription with a minor quality tradeoff. The model supports multilingual transcription and language identification, can translate speech from supported languages into English text, and is designed for latency-sensitive and high-throughput audio transcription workloads.

In OCI Generative AI, use the following details to import this model:

Hugging Face model ID for import:

  • openai/whisper-large-v3-turbo

Recommended dedicated AI cluster unit shapes:

  • H100_X1
  • A100_80G_X1
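
The details above can be collected into a small settings object before you start an import. The following is a minimal Python sketch, not OCI SDK code: ImportPlan and make_import_plan are hypothetical names introduced here for illustration, and the only values taken from this announcement are the model ID and the two recommended unit shapes.

```python
from dataclasses import dataclass

# Values copied from this announcement.
MODEL_ID = "openai/whisper-large-v3-turbo"
RECOMMENDED_UNIT_SHAPES = ("H100_X1", "A100_80G_X1")


@dataclass(frozen=True)
class ImportPlan:
    """Hypothetical container for import settings (not an OCI SDK type)."""

    hugging_face_model_id: str
    unit_shape: str


def make_import_plan(unit_shape: str, model_id: str = MODEL_ID) -> ImportPlan:
    """Build a plan, rejecting unit shapes that are not in the recommended list."""
    if unit_shape not in RECOMMENDED_UNIT_SHAPES:
        allowed = ", ".join(RECOMMENDED_UNIT_SHAPES)
        raise ValueError(
            f"Unit shape {unit_shape!r} is not recommended; choose one of: {allowed}"
        )
    return ImportPlan(hugging_face_model_id=model_id, unit_shape=unit_shape)


plan = make_import_plan("H100_X1")
print(plan.hugging_face_model_id, plan.unit_shape)
```

A check like this only guards against typos in the model ID or unit shape; the actual import is performed through the Console, CLI, or API as described in Managing Imported Models.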

For the complete list of models compatible for import, see Compatible Models for Import. For available hardware unit sizes and deployment steps, see Managing Imported Models. For information about the service, see the Generative AI documentation.