Google Bard’s New Image Uploads Feature Boosts AI Power

The Gist

  • Google Bard breakthrough. Images can now significantly enhance prompt creativity.
  • Unleashing innovation. The inclusion of image analysis revolutionizes AI data consumption.
  • Enhancing tasks. Google Bard will allow users to pioneer innovative methods for AI task enhancement.

Headlines about advancements in AI consistently dominate the daily business news, with significant developments usually unveiled at the start or close of the business week. Google Bard’s latest move in the AI race is to allow image uploads alongside your prompt text. The feature gives you a new way to apply imagination and creativity when constructing a prompt.

How Uploading Images in Google Bard Works

To craft a prompt that uses an image in Google Bard, write your prompt as you normally would, then click the plus sign next to the query window to attach the image. The image can be in JPEG or PNG format.
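Because only JPEG and PNG are mentioned as accepted formats, it can help to confirm a file's real format before uploading, since a file extension can be misleading. The following is a minimal sketch that inspects the standard leading bytes of each format. The function names are hypothetical and not part of Bard, which does its own checking on upload.

```python
from typing import Optional

# Standard file signatures ("magic bytes") for the two formats named above.
PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"
JPEG_SIGNATURE = b"\xff\xd8\xff"


def detect_image_format(data: bytes) -> Optional[str]:
    """Return 'png' or 'jpeg' based on the leading bytes, else None."""
    if data.startswith(PNG_SIGNATURE):
        return "png"
    if data.startswith(JPEG_SIGNATURE):
        return "jpeg"
    return None


def is_uploadable(path: str) -> bool:
    """Hypothetical helper: True if the file at `path` looks like a JPEG or PNG."""
    with open(path, "rb") as f:
        header = f.read(len(PNG_SIGNATURE))  # 8 bytes covers both signatures
    return detect_image_format(header) is not None
```

Calling is_uploadable("wine_bottles.jpg") on a local file (the filename here is only an illustration) returns True when the file's contents match either signature, regardless of its extension.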

Bard interprets image details based on its understanding of the prompt and its understanding of image shapes. Bard’s training data incorporates various aspects of Google’s vast image sources, such as Google Lens. Bard will describe what it interprets in the image, answer questions about it, and even recognize specific people’s faces.

Large language model (LLM) output should still be treated with caution, because it is not deterministic. Despite the innovation and LLMs’ ability to create fluent answers to your questions, marketers must remember that models are still offering a statistical calculation of tokens, the word fragments derived from a prompt or prompt chain. So some answers still require verification against the user’s own knowledge and experience with the prompt subject.
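To make the "statistical calculation of tokens" point concrete, here is a toy sketch of how sampling from a probability distribution yields different answers to the same prompt. It is not Bard's implementation: the candidate tokens and scores are made up for illustration, and real models work over far larger vocabularies.

```python
import math
import random


def softmax(scores, temperature=1.0):
    """Convert raw scores into probabilities; lower temperature sharpens them."""
    scaled = [s / temperature for s in scores]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def sample_token(tokens, scores, temperature=1.0, rng=random):
    """Pick one token at random, weighted by its softmax probability."""
    probs = softmax(scores, temperature)
    return rng.choices(tokens, weights=probs, k=1)[0]


# Hypothetical candidates and scores, invented for this example.
tokens = ["Kendall-Jackson", "Coppola", "Chateau Ste. Michelle"]
scores = [2.0, 1.5, 1.8]

# Repeated runs of the same "prompt" can return different tokens.
picks = [sample_token(tokens, scores) for _ in range(5)]
print(picks)
```

Because each pick is a weighted random draw, a plausible but wrong token can be chosen on any given run, which is why fluent output still needs checking.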

For example, I shared an image of wine bottles and asked Bard to identify the brands. The bottles and their labels were turned in various directions but still visible. Bard got a few brands right, such as Kendall-Jackson and Chateau Ste. Michelle, but it also suggested a few that were not in the image. It also overlooked Coppola, whose label uses a large font.

Figure: Google Bard image query with a photo of wine bottles in a store.

Despite glitches like this, I think uploading image files in Bard will serve users well. It is a significant step beyond a May update in which Bard began including images from Google Search in its responses. In my wine image example, Bard did return images of each wine it identified. Bard also offered a slightly different variation of its response.

Figure: Google Bard image query result showing an identified wine brand.

All of this shows how insightful image analysis can be. Image uploads change the way a model consumes data because the image analysis becomes part of the model’s token considerations.

The Other Latest Google Bard Updates

The image upload feature was introduced as part of a series of enhancements to Google Bard that were unveiled in July. The feature that most complements image uploads is the option to tailor Bard’s responses with a simple click. Users can tap to modify the length of the response or calibrate the tone, enabling a shift from a more professional to a more casual demeanor.

Another noteworthy addition is the text-to-speech functionality, which offers a spoken rendition of the response. It reads the response aloud in tandem with the displayed text and supports more than 39 languages besides US English, including Hindi and Spanish.
