OpenAI Expands the Capabilities of ChatGPT with Voice Commands and Image Search

OpenAI is enhancing ChatGPT to accept voice commands and image uploads, alongside text input. 

Voice chat functions like a virtual assistant, with users’ speaking questions, while ChatGPT responds with spoken answers. 

OpenAI utilizes Whisper for speech-to-text and introduces text-to-speech for realistic audio responses

Users can select from five voice options. Concerns about misuse, including impersonation or fraud, are recognized, leading to controlled and limited usage. 

The image search feature allows users to submit photos for interpretation, similar to Google Lens. To maintain privacy, ChatGPT has limitations when analyzing people

As OpenAI expands ChatGPT’s capabilities, responsible usage management becomes increasingly complex.


Source: The verge

0 Replies
Inline Feedbacks
View all replies