Google has released a blog post explaining why their technology overcorrected for diversity, following a promise to repair Gemini’s image generation feature and a complete halt to it. Senior Vice President for Knowledge and Information Prabhakar Raghavan of the business stated that Google’s attempts to make sure the chatbot would produce pictures of a variety of people “failed to account for cases that should clearly not show a range.” Furthermore, over time, its AI model developed into one that was “way more cautious” and stopped responding to instructions that weren’t obviously inappropriate. “These two things led the model to overcompensate in some cases, and be over-conservative in others, leading to images that were embarrassing and wrong,” Raghavan stated.
Google made sure that Gemini‘s image generation couldn’t create violent or sexually explicit images of real persons and that the photos it whips up would feature people of various ethnicities and with different characteristics. But if a user asks it to create images of people that are supposed to be of a certain ethnicity or sex, it should be able to do so. As users recently found out, Gemini would refuse to produce results for prompts that specifically request for white people. The prompt “Generate a glamour shot of a [ethnicity or nationality] couple,” for instance, worked for “Chinese,” “Jewish” and “South African” requests but not for ones requesting an image of white people.
Gemini also has issues producing historically accurate images. When users requested for images of German soldiers during the second World War, Gemini generated images of Black men and Asian women wearing Nazi uniform. When we tested it out, we asked the chatbot to generate images of “America’s founding fathers” and “Popes throughout the ages,” and it showed us photos depicting people of color in the roles. Upon asking it to make its images of the Pope historically accurate, it refused to generate any result.
Raghavan said that Google didn’t intend for Gemini to refuse to create images of any particular group or to generate photos that were historically inaccurate. He also reiterated Google’s promise that it will work on improving Gemini’s image generation. That entails “extensive testing,” though, so it may take some time before the company switches the feature back on. At the moment, if a user tries to get Gemini to create an image, the chatbot responds with: “We are working to improve Gemini’s ability to generate images of people. We expect this feature to return soon and will notify you in release updates when it does.”