Google disabled the Gemini AI chatbot's ability to generate images of people six months ago, after facing backlash from users who found that it could not accurately create images of white people. When asked to depict U.S. founding fathers and Catholic popes, it produced women and men of several races. To end the controversy, Google shut down Gemini's ability to portray humans in February 2024.
But now Google has announced that it is bringing its image generation tool back to Gemini AI, which will resume creating images that include people.
What is Google Gemini AI?
Gemini AI, formerly known as Bard, is a family of generative AI models that powers Google's digital products and services. It can produce text based on user prompts, and users can interact with it through a Q&A-style chatbot interface. Gemini is multimodal, which means it can interpret and respond to various types of content such as text, code, video and audio.
Gemini AI can assist in various activities like:
- It helps in detecting various trends
- It helps in arranging information and recognizing business opportunities
- It helps in designing projects and generating campaign briefs
- It helps in drafting, replying to and outlining emails
- It assists in generating images and various designs
Gemini is billed as Google's most capable AI model. Gemini models have been evaluated on a wide variety of tasks, from natural image, audio and video understanding to mathematical reasoning.
Google Announces It Will Resume Generating AI Images of People With Gemini AI
Google stopped Gemini from generating images of people earlier this year because it was producing diverse but historically inaccurate images. Google now says it has fixed the image generator and that Gemini will resume generating images of people with a better user experience.
Gemini's latest image generator, Imagen 3, is said to be more powerful and will allow users to create pictures of people. The feature is accessible to Gemini Advanced, Business, and Enterprise users. Google says it has made technical improvements as well as improvements to its evaluation sets.
Imagen 3 does not support photorealistic images of identifiable individuals, and it aims to prevent depictions of excessively violent or sexual scenes. In short, Google is bringing back Gemini's ability to generate images of people, albeit with new safety measures.
Dave Citron, a senior director of product on Gemini, wrote in a blog post that generative AI tools may not create every image perfectly, but that Google will continue to listen to user feedback and keep improving the tool.
Google has begun turning the feature back on for users who pay for Gemini Advanced, starting with the English-language version of the chatbot. Imagen 3 provides a better user experience when creating images of individuals. Google has said that Imagen 3 follows the text prompts it renders into images more accurately than its predecessor, Imagen 2, and is more creative in its generation.