ChatGPT To Get New Voice And Image Features

September 27, 2023

OpenAI is starting to roll out new voice and image features in ChatGPT. They provide a new, more intuitive interface by allowing you to have a conversation with ChatGPT or show it what you’re talking about. In a similar way, Twin Reality can integrate AI such as ChatGPT with VR simulations through its Virtual Reality Industrial Training, which lets businesses and organisations train their staff in a lifelike, immersive environment.

Over the next two weeks, OpenAI will roll out voice and images in ChatGPT to Plus and Enterprise subscribers. Images will be available on all platforms, while voice will be available on iOS and Android once you opt in through your settings.

Speak with ChatGPT

To get started with voice, head to Settings → New Features on the mobile app and opt into voice conversations. Then tap the headphone button in the top-right corner of the home screen and choose one of the five available voices.

The new voice capability is powered by a new text-to-speech model that can generate human-like audio from only text and a short sample of speech. To develop each voice, OpenAI worked with experienced voice actors. Whisper, OpenAI's open-source speech recognition system, converts your spoken words into text.

Chat about Images with ChatGPT

To get started, tap the photo button to capture or choose an image. If you’re on iOS or Android, tap the plus button first. You can also discuss multiple images or use the app's drawing tool to guide your assistant.

Image understanding is powered by multimodal GPT-3.5 and GPT-4. These models apply their language reasoning abilities to a wide range of images, such as photographs, screenshots, and documents containing both text and images.

ChatGPT can now see, hear, and speak. Rolling out over next two weeks, Plus users will be able to have voice conversations with ChatGPT (iOS & Android) and to include images in conversations (all platforms). https://t.co/uNZjgbR5Bm pic.twitter.com/paG0hMshXb
— OpenAI (@OpenAI) September 25, 2023

Deepshikha Mahapatra

Deepshikha Mahapatra here, blending engineering precision with a fervor for writing. Dive into my blogs and find the magic of an engineer's touch in every word. My mission? To inspire and equip our future world-changers. A huge shoutout to Twin Reality Technologies for empowering my journey and sharpening my pen.