Bard Meets Gemini: Google’s AI Leap

The Gist

  • AI revolution accelerates. Google releases OpenAI rival, Gemini, integrated into Bard AI today.
  • Multimodal mastery. Gemini excels in understanding and combining text, images, audio, video and code.
  • Advancing AI applications. Gemini’s varied models set for integration in Google Search, Ads and more.

Days after word got out that Google was postponing the release of its Gemini AI until January 2024, the company announced today, Dec. 6, that it is releasing its OpenAI rival, initially as part of its Bard AI chat application. According to the announcement, Gemini has been “built to be multimodal, can generalize and seamlessly understand, operate across and combine different types of information, including text, images, audio, video and code.”

Gemini: Google’s Multimodal AI

Google has released full details about Gemini in a Google DeepMind report titled Gemini: A Family of Highly Capable Multimodal Models. Gemini comes in three sizes, Ultra, Pro and Nano, enabling it to run on a broad range of platforms, from data centers to mobile devices. Google Bard now uses a version of Gemini Pro that has been specifically tuned for more advanced reasoning, planning and understanding. The announcement stated that in early 2024 Google will introduce Bard Advanced, which will use the most capable model, Gemini Ultra.

Significant Reasoning Advancements

The Gemini Ultra model has achieved impressive benchmark results, including becoming the first model to reach human-expert performance on MMLU (Massive Multitask Language Understanding), a widely used exam benchmark. Google also reports significant advances on multimodal reasoning tasks.

The Gemini models exhibit impressive cross-modal reasoning abilities, allowing them to understand and reason across a sequence of audio, images and text. One example in the Google DeepMind paper features Gemini working through a student's handwritten solution to a physics problem that is accompanied by a drawing, showcasing potential applications in education and other fields. Gemini was able to read the student’s handwriting, interpret the drawing and correct the student’s error.
