top of page
  • Writer's pictureChris Hack


Another use of AI is speech-to-text conversion. Recorded audio can be turned into text. It is possible to embed this functionality in your own apps using the (paid-for) OpenAI APIs?

This seems to have a lot of potential in educational apps.

Talking about the Whisper API, ChatGPT4 says:

"OpenAI's Whisper API is an advanced Automatic Speech Recognition (ASR) system designed to convert spoken language into written text. Built on a vast dataset of 680,000 hours of multilingual and multitask supervised data collected from the web, Whisper demonstrates exceptional performance in a wide range of applications. This powerful ASR system has been trained using the cutting-edge deep learning model, Whisper ASR, which is part of OpenAI's Whisper ASR Collection.

The Whisper API offers valuable features including high accuracy, low latency, and support for multiple languages, making it suitable for various use cases such as transcription services, voice assistants, real-time captioning, and more. To integrate the Whisper API into applications, developers can utilize the OpenAI API, which provides a simple and secure interface to access the ASR system.

While the API is designed for easy integration and scalability, it is essential for users to adhere to OpenAI's usage policies, ensuring ethical utilization of the technology. As the Whisper API is continuously improved and updated, developers can expect enhanced performance and new features in future releases."

1 view0 comments

Recent Posts

See All

ChatGPT 3.5 vs ChatGPT 4

Did you know that the free version of ChatGPT uses something called the ChatGPT-3.5 engine but that an updated, more powerful version is available on paid-for accounts? This is ChatGPT 4 - it shows be

Fine Tuning ChatGPT

Did you know it's possible to 'fine tune' ChatGPT by pre-supplying it with model prompts and ideal responses? You need to do this via the paid for API and the details are necessarily a little technica

Temperature in ChatGPT

"The temperature parameter in ChatGPT completions is a crucial aspect that influences the model's output diversity and creativity. Essentially, it determines how the model selects tokens (words or wor


bottom of page