Table of Contents
Preface
Part 1: Introducing OpenAI’s Whisper
1
Unveiling Whisper – Introducing OpenAI’s Whisper
Technical requirements
Deconstructing OpenAI’s Whisper
The marvel of human vocalization – Understanding voice and speech
Understanding the intricacies of speech recognition
OpenAI’s Whisper – A technological parallel
The evolution of speech recognition and the emergence of OpenAI’s Whisper
Exploring key features and capabilities of Whisper
Speech-to-text conversion
Translation capabilities
Support for diverse file formats
Ease of use
Multilingual capabilities
Large input handling
Prompts for specialized vocabularies
Integration with GPT models
Fine-tunability
Voice synthesis
Speech diarization
Setting up Whisper
Using Whisper via ...
Get Learn OpenAI Whisper now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.