The AI Chatbot Can Now “See, Hear and Speak”
OpenAI has introduced new voice and image capabilities in its ChatGPT chatbot, ChatGPT-Plus. These upgrades are part of GPT Vision (or GPT-V), a multimodal version of GPT-4. The ChatGPT-Plus features voice chat powered by a text-to-speech model that mimics human voices and image discussion through integration with image generation models.
This follows OpenAI's unveiling of DALL-E 3, its most advanced text-to-image generator yet. The integration of DALL-E 3 and conversational voice chat signifies OpenAI's push towards AI assistants that can perceive the world more like humans.
Microsoft Fuels the AI Race with OpenAI Integration
Microsoft is integrating OpenAI's advanced generative AI capabilities into its products, including Windows 11, Office, and Bing search.
The company is leveraging models like DALL-E 3 and Copilot, OpenAI's programming assistant, to make AI help available across Microsoft platforms and devices. Microsoft 365 Chat automates complex work tasks using OpenAI's natural language prowess.
Cautious Steps Towards Responsible AI
OpenAI is developing powerful systems involving vision and voice generation but is aware of potential risks such as impersonation and bias. The company aims to build AGI that is safe and beneficial, gradually making its tools available for improvement.
CEO Sam Altman is lobbying for favorable legislation to prevent harmful consequences due to improper use of AI products. Plus, and Enterprise users will access new functionalities over the next two weeks, with plans to expand availability to developers.