OpenAI is starting to roll out new voice and image features in ChatGPT. They provide a new, more intuitive interface by allowing you to have a conversation with ChatGPT or show it what you’re talking about. Similarly, Twin Reality can also integrate AI like ChatGPT with VR simulations by providing Virtual Reality Industrial Training, which further enables businesses and organisations to train their staff in a remarkably lifelike, immersive environment.
Over the following two weeks, the company will begin rolling out voice and images in ChatGPT to Plus and Enterprise subscribers. Images will be available across all platforms, and voice will soon be available on iOS and Android (opt-in in your settings).
Speak with ChatGPT
To get started with voice, head to Settings → New Features on the mobile app and opt into voice conversations. Then, tap the headphone button located in the top-right corner of the home screen and choose your preferred voice out of five different voices.
The new voice functionality is supported by a new text-to-speech algorithm that can produce human-like sounds using only text and a short sample of speech. To develop each voice, the company worked with experienced voice actors. Whisper, their open-source speech recognition software, is also used to convert your spoken words into text.
Chat about Images with ChatGPT
To get started, tap the photo button to capture or choose an image. If you’re on iOS or Android, tap the plus button first. You can also discuss multiple images or use our drawing tool to guide your assistant.
Image understanding is powered by multimodal GPT-3.5 and GPT-4. These models use screenshots, pictures, and papers with both text and images to apply their language reasoning abilities to a variety of images.