Google has introduced its latest and most sophisticated artificial intelligence model, Gemini, a technology capable of processing various types of information, including video, audio, and text.
Reflecting the broad scope of Google’s aspirations, the company announced Gemini as its “largest and most capable AI model,” ushering in a “Gemini era” where the technology is envisioned to be employed across diverse settings, from major corporations to consumer devices like the Google Pixel 8 Pro.
Diverging from existing AI models that typically focus on a single type of user prompt, such as images or text exclusively, Gemini is designed to be “multimodal,” according to Google. The highly anticipated AI model boasts more advanced reasoning capabilities and a nuanced understanding of information compared to its predecessors.
“This new era of models represents one of the biggest science and engineering efforts we’ve undertaken as a company,” stated Alphabet CEO Sundar Pichai in a blog post.
In a race to rival the AI software introduced by Microsoft-backed OpenAI’s ChatGPT a year ago, Google has been striving to develop AI technology that matches the capabilities of its competitor.
Google has integrated a portion of the new Gemini model into its AI assistant Bard and plans to release the most advanced version of Gemini through Bard early next year.
Alphabet has created three versions of Gemini, each optimized for different levels of processing power. The most potent version is designed for data centers, while the smallest is tailored to run efficiently on mobile devices.
We believe in making AI helpful for everyone. That’s why we’re launching Gemini, our most capable model that’s inspired by the way people understand and interact with the world. #GeminiAI pic.twitter.com/gNG9ha9xMO
— Google (@Google) December 6, 2023