ChatGPT gains multimodal capabilities to better assist users

ChatGPT has gained multimodal capabilities, allowing it to receive and respond to image and voice inputs. These new features will make ChatGPT more helpful across a variety of tasks, such as solving math problems, identifying objects, and providing recipes.

To use the image input feature, users take a picture of what they are looking at and add the question they want answered. ChatGPT then analyzes the image and provides a response. For example, users could identify a plant, look up the nutritional information of a food item, or get help solving a math problem.

The voice input and output feature lets ChatGPT work like a voice assistant. Users can now ask ChatGPT to perform tasks or answer questions by speaking, and ChatGPT processes the request and responds verbally.

The sources for this piece include an article in ZDNET.

Jim Love
