top of page

OpenAI will soon start talking and seeing

OpenAI recently introduced the research preview of DALL-E 3, their latest image generation AI. This exciting development will soon be accessible to ChatGPT Plus and Enterprise users. The integration of DALL-E 3 with ChatGPT simplifies prompt creation with the assistance of the chatbot.


DALL-E 3's capacity to comprehend image inputs relies on GPT-4 Vision (GPT-4V), a multimodal version of the underlying GPT model. Additionally, the voice feature leverages OpenAI's Whisper automatic speech recognition (ASR) model for processing user voice inputs. Furthermore, a new text-to-speech (TTS) model allows ChatGPT to convert its text responses into one of five user-selectable voices.


OpenAI is taking a gradual approach to deploy these features, prioritizing safety. They've conducted beta testing and 'red teaming' exercises to identify and mitigate potential risks. OpenAI's commitment to ensuring secure and efficient AI usage is at the core of these developments.

12 views0 comments

Recent Posts

Beyond ChatBots

An AI-Powered Website is much more than simply deploying a customer service chatbot. It represents a paradigm shift in the way websites interact with users and deliver content. At its core, an AI-Powe

Comments

Rated 0 out of 5 stars.
No ratings yet

Add a rating
bottom of page