OpenAI, the company behind ChatGPT, is rolling out a voice chat feature, according to its latest blog post. The announcement comes shortly after reports that the AI platform had recorded a third consecutive month of declining traffic.
The AI startup said ChatGPT is gaining voice and image capabilities. These features let users hold a spoken conversation or show ChatGPT what they are talking about, creating a more intuitive interface.
“You can now talk to your assistant using voice,” the blog post says. It suggests talking to ChatGPT on the go, asking for a bedtime story, or settling a dinner-table debate.
The voice feature is powered by a new text-to-speech engine that OpenAI says can generate human-like audio from text and a few seconds of sample speech. Users can choose from pre-recorded voices or record their own, which the system will learn quickly.
“The new voice technology can create realistic synthetic voices from a few seconds of speech. We created each voice with expert performers. We also transcribe your speech into text using Whisper, our open-source speech recognition system,” the blog post says.
The company plans to launch the feature within two weeks. To use it, go to Settings in the mobile app, choose New Features, and enable voice conversations. Then tap the headphone button in the top-right corner of the home screen and select one of five voices.
While the company expects the technology to enable many creative and accessibility-focused applications, it warns that malicious actors could use it to impersonate public figures or commit fraud. For that reason, it is limiting the technology to a specific use case.
“For this reason, we are utilising this technology to power a particular use case, voice chat. We worked with voice actors to build voice chat. We collaborate similarly with others. Spotify is using this technology to pilot their Voice Translation tool, which enables podcasters to reach more listeners by translating podcasts into other languages in their own voices,” the company said.
Other OpenAI features
OpenAI’s chatbot will combine voice and image capabilities with conversation. Users can talk to ChatGPT about their plans or daily routines. For example, you could upload a photo of your closet and ask for outfit ideas.
Voice and image give users more ways to use ChatGPT. While travelling, take a picture of a landmark and have a live conversation about what makes it interesting. At home, take photos of your fridge and pantry to decide what to make for dinner, and ask follow-up questions for a recipe. After dinner, help your child with a maths problem by taking a photo, circling the problem set, and having ChatGPT share hints with you, the blog post suggests.
OpenAI said vision-based models pose significant challenges, from hallucinations about people to over-reliance on the model’s interpretation of images in high-stakes settings. It said it tested the model with red teamers for risks in areas such as extremism and scientific proficiency, and with a diverse group of alpha testers, before broader deployment.
“Our research enabled us to align on a few key details for responsible usage,” it stated.
The company said that because ChatGPT is not always accurate, it has taken technical measures to limit the model’s ability to analyze and make direct statements about people.
Whether these new features will reverse the traffic decline and bring visitors back remains to be seen. They do, however, open a new front in the AI race.