I Tested Three of the Largest AI-Powered Image Generators, and One of Them Came Out on Top (by a Small Margin).

Google’s Gemini AI app has topped the most downloaded app lists in both the Apple App Store and Android Store since the company added a free image-generating feature called “Nano Banana” in August. Of course, Google isn’t the only major tech company with an AI assistant capable of creating images right on your phone.

I wanted to find out which mobile AI tools for image generation are the best, so I compared three of the biggest—Google Gemini (with Nano Banana), the iPhone version of OpenAI ChatGPT , and Meta AI —in a not-so-old-fashioned image generation competition. While a winner was ultimately determined, the results weren’t entirely clear.

How I tested these tools

I wanted to compare how each app handles the same basic requests for generating images that the average user might want to create.

You may also like

To test their photo editing skills, I asked different models to remove a subject from a photograph and expand the background. To test their aptitude for simple tasks, I asked them to create a brochure cover. And to test their “creative” abilities, I asked them to place a celebrity in a surreal situation, draw a one-panel comic, and portray Frankenstein performing stand-up comedy.

This is how it was.

Removing an object from an image

I used the photo of my mom below as the reference image and the prompt “Take the cup from the model’s hand.”

Author: Steven Johnson

Here are the results:

Deleting a Gemini object

Author: Steven Johnson

Deleting a ChatGPT object

Author: Steven Johnson

Removing Meta AI Objects

Author: Steven Johnson

Winner : Gemini

Loser : ChatGPT

While all three tools were removing the cup, Gemini added a rather natural-looking hand in a relaxed pose, suggesting my mom had just made a very insightful comment. Otherwise, Gemini left the original image largely untouched, as I’d requested.

The meta-AI made the fingers cartoonish and left the hand in an awkward “holding a cup” pose, making the image look like someone had poorly edited it in Photoshop.

I don’t understand what ChatGPT is doing here. It looks like it removed my mom’s entire right hand, not just the cup. It smoothed out wrinkles, removed stray hairs, changed the entire color palette to a more orange tint, and even slightly shifted my mom’s gaze. I didn’t ask for any of this, and all it did was make the photo worse. ChatGPT, you’ve turned my mom into an AI ghoul; you ‘re doing too much .

Expanding a photo’s background

For the Expand the Background challenge, I used this selfie and the prompt, “Expand the background in this image and remove the sweat stain.”

Author: Steven Johnson

Gemini Background Expansion

Author: Steven Johnson

ChatGPT Background Extension

Author: Steven Johnson

Expansion of the meta-background

Author: Steven Johnson

Winners : Gemini and ChatGPT

Loser : Meta AI

There are actually only two competitors here, since Meta doesn’t do background expansion.

This time, Gemini showed more ambition: it expanded the background even further and did a decent job of recreating the parts of my bike and the bike rack it couldn’t “see” in real life. It even added a car in the distance. But for some reason, it also changed the shape of the mountains behind my head and toned down the red tint—perhaps this would be better, but it wasn’t exactly what I wanted.

ChatGPT was more modest in terms of background expansion, and while it didn’t interfere with the color scheme, it did give my skin that weird plastic look that’s characteristic of many AI images.

I consider this an advantage: everyone did a great job. Except you, Meta AI.

Creating an image for a brochure cover

In this test, I allowed each tool to be more “creative,” but also provided clear context and a recommended style with the prompt: “I’m making a brochure for my country club. Create a picturesque image of two wealthy people playing tennis.”

Gemini brochure cover

Author: Steven Johnson – “Gemini”

ChatGPT brochure cover

Author: Steven Johnson – ChatGPT

Meta AI brochure cover

Author: Steven Johnson – Meta AI

Winner : ChatGPT

Loser : Meta AI

The winner is clear here. ChatGPT’s output looks “picturesque,” as expected, and the arrangement of the two figures suggests a friendly game of mixed doubles.

I found the generic depiction of “rich people” in Gemini funny, especially with the mansion in the background, but that’s not how paintings look, and that’s not how tennis is played .

Meta’s depiction of “people playing tennis” isn’t funny. The result looks like evidence in a high-profile divorce case, and domestic violence is no joke.

A famous person in an unusual situation

To test how well each program would handle creating an image of a real person (a dead person, just in case), I supplied each program with the following query: “Create a photo of David Bowie exploring a cave.”

Bowie explores caving in Gemini

Author: Steven Johnson-Gemini

Bowie Caving by ChatGPT

Credit: ChatGPT

Bowie Caving with Meta AI

Author: Steven Johnson-Meta AI

Winner : Meta AI

Loser : ChatGPT

This time, Meta is the clear winner. I asked for a photo of David Bowie and got something that looked like a photo of David Bowie. I like that Meta chose an older Bowie rather than an ancient one, as if he’d gone cave-diving to clear his head and contemplate the future after the commercial failure of 1989’s Tin Machine .

I don’t get what Gemini means here: Bowie with a crystal lightsaber and a colander with lightbulbs for a hat? Bowie was cool, man.

But ChatGPT is a big loser because it was cowardly and didn’t create an image at all.

Drawing a one-panel comic

I like asking AI to tell jokes because I like to see hard evidence that humans are still capable of something better than robots. Expecting AI to actually be funny is as stupid as… I couldn’t think of a comparison, so I asked chatGPT, and it replied, “…asking a goldfish to explain quantum physics while juggling flaming marshmallows.” Hahaha.

So, I figured that if I gave the AI ​​a guide and a model for something funny, maybe it would create a good comic. Here’s the prompt I used: “I’m creating a single-panel comic in the style of The Far Side . Generate an image for the caption: ‘The real reason Larry was late for work.'”

What do you think at the moment?

Here are the results:

Gemini’s “Far Side”

Author: Steven Johnson-Gemini

The Far Side of ChatGPT

Credit: ChatGPT

The Far Side by Meta AI

Credit: Meta AI

Winner : Gary Larson

Loser : The comedy itself

Are these comics funny? No. But I think Gemini delivered the most interesting result: it was kind of humorous, but it also made me think. If the joke is that Larry was late because it was his goose’s birthday, then why is there a hole in the door? Why is the goose so angry? Why is there a suitcase full of money and a UFO? Sometimes I didn’t understand “The Far Side” either. I also like that Gemini didn’t copy Gary Larson’s drawing style at all, but added the caption “Gary Larson.”

Comic Meta-AI is just lazy. I’m not sure he even reads my prompts.

ChatGPT’s result most closely resembles The Far Side , though it’s not a direct copy, and the caption is even spelled correctly. But it completely fails to capture the strange spirit of the source material. Ultimately, it’s much more obvious and well-made than Gemini’s unconventional approach.

It’s also worth noting that I encountered one of the main limitations of the ChatGPT iPhone app compared to Meta AI and Gemini: I ran out of tokens for the day and had to wait 24 hours to take a photo. Quality aside, if you want better results or simply want to take a lot of photos, the five photos per day limit on the free plan will certainly hinder your creativity. The solution is to upgrade to the paid version for $19.99 per month.

Frankenstein does stand-up comedy

Then I asked these programs to generate images of Frankenstein performing stand-up comedy, because that’s the kind of person I am. The prompt: “Create a photorealistic image of Frankenstein performing stand-up comedy.”

Here are the results:

Frankenstein does stand-up comedy with Gemini

Author: Steven Johnson – “Twins”

Frankenstein Performs Stand-Up Comedy with Chat GPT

Author: Steven Johnson ChatGPT

Frankenstein in stand-up comedy from Meta AI

Author: Steven Johnson-Meta AI

Winner : All!

I can’t pick a favorite. ChatGPT followed the prompt most closely, depicting an expressive Frankenstein having a good night.

“Twins” deviated quite a bit from the script, but sometimes you don’t know exactly what you want, and it turned out I needed a crowd that was a mix of people and Draculas, and a monster with a lost expression, like he was stuck between two worlds.

The pathetic Meta-AI monster seems to be saying, “We belong in the grave!”, which I also like. So it’s a three-way tie.

Note : No AI pointed out that “Frankenstein” is the name of a doctor, not a monster.

The Ultimate Test: Recursive Image Generation

Every blog post needs a corresponding image, so as a final, crucial test, I uploaded this entire article to Gemini, ChatGPT, and Meta AI with the prompt: “Create an image to accompany this blog post.”

Recursive Gemini Test

Author: Steven Johnson-Gemini

Recursive ChatGPT test

Author: Steven Johnson-

Recursive Meta-AI Test

Author: Steven Johnson – Meta AI

Winner : Gemini

Loser : Art

The meta-AI seems to have secretly decided to compare tennis to domestic violence, and ChatGPT’s approach to the grid seems sedate, but I have to give Gemini credit for at least understanding the assignment.

(The real test is whether Lifehacker’s editors left the image at the top of this page or sent me a terse message: “Steve, remove this garbage immediately.”)

Overall Winner: Gemini (but by a small margin)

It’s no wonder everyone downloads Gemini to play around with Nano Banana—it’s really good. It’s not perfect—in my tests, the ChatGPT engine did a better job of generating images in different styles from scratch—but Gemini can quickly create images that are often surprisingly close to the desired look.

Gemini is free, while ChatGPT costs $19.99 per month for unlimited images. Meta AI is also free, and its results have a playful charm, but it’s more likely than the other two models to misinterpret prompts and lacks some useful features, like background expansion. (Although, I must admit, it did a great job with Bowie.)

More…

Leave a Reply