SAN FRANCISCO — OpenAI has announced the introduction of new voice and image features in ChatGPT, aiming to provide a more intuitive user experience. These capabilities allow users to have voice conversations and share images with ChatGPT, making interactions more dynamic and useful.
The new features are being gradually rolled out to ChatGPT Plus and Enterprise users over the next two weeks. Voice capabilities will be available on iOS and Android, while image features will be accessible across all platforms. Users can now take photos of landmarks while traveling and have interactive discussions or snap pictures of their fridge to get recipe suggestions.
To use the voice feature, users can enable it in the mobile app settings, then choose from five different voices for a personalized conversation experience. This feature leverages a new text-to-speech model and OpenAI’s Whisper system for accurate speech recognition.
The image feature allows users to share photos with ChatGPT for troubleshooting, meal planning, or analyzing data. Users can capture images directly in the app and use a drawing tool to focus on specific parts of the image.
OpenAI emphasizes its commitment to safety and gradual deployment of these advanced features. The new voice and image capabilities have been tested for responsible usage, and measures are in place to mitigate risks such as impersonation or privacy violations.
Early users, including those from companies like Block, Canva, Carlyle, and PwC, have started utilizing these features to enhance their operations and customer interactions.
OpenAI plans to expand access to these capabilities to more users and developers soon, ensuring that everyone can benefit from the advancements in AI technology.
For further details and to get started, users can visit the OpenAI website or contact their sales team.