---
title: Image generation gets better at comics
date: "2024-08-26T12:24:05Z"
lastmod: "2024-12-04T05:19:34Z"
categories:
  - llms
wp_id: 3588
description: "New image models are getting much better at comic-style generation, especially with text rendering and prompt expansion, though model quality still matters more than prompt detail alone."
keywords: ["image generation", "comics", "Flux", "DALL-E", "ImageGen 3", "prompting"]
---

![Image generation gets better at comics](/blog/assets/flux.dev_.webp)

I heard a lot about the new image generation models last week, so I tested them to see what has improved.

I gave the prompt below to various image generation models, old and new.

> A Calvin and Hobbes strip. Calvin is boxing Hobbes, with a dialog bubble from Calvin, saying "Bring it on!"

### [Stable Diffusion XL Lightning](https://developers.cloudflare.com/workers-ai/models/stable-diffusion-xl-lightning/)

![](/blog/assets/stable-diffusion-xl-lightning.webp)

### [Stable Diffusion XL Base](https://developers.cloudflare.com/workers-ai/models/stable-diffusion-xl-base-1.0/)

![](/blog/assets/stable-diffusion-xl-base.webp)

### [Dall-E API](https://platform.openai.com/docs/guides/images/usage)

![](/blog/assets/dall-e-3.webp)

### [Runway ML](https://app.runwayml.com/video-tools/teams/rootnode/ai-tools/text-to-image)

![](/blog/assets/runway-1.webp)

### [ImageGen 3](https://aitestkitchen.withgoogle.com/tools/image-fx)

![](/blog/assets/imagegen3.webp)

### [Dall-E 3 API](https://platform.openai.com/docs/guides/images/usage)

![](/blog/assets/dall-e-3-1.webp)

### [Ideogram 2.0](https://ideogram.ai/)

![](/blog/assets/ideogram.webp)

### [Flux.dev via Fal.ai](https://fal.ai/models/fal-ai/flux/dev)

![](/blog/assets/flux.dev_.webp)

### [ChatGPT Plus](https://chatgpt.com/)

![](/blog/assets/chatgpt-dall-e.webp)

A few observations:

1. Text rendering has come a long way. The newer models have little trouble generating clear text.
2. Flux.1 seems to be the best of the newly released models.
3. But OpenAI's ChatGPT seems to produce output as good as Flux.1's.

On the last point, it's noteworthy that Dall-E 3 (the engine behind ChatGPT) gives a poor result when called directly with my short prompt. Clearly, prompting makes a difference. Here's how ChatGPT rewrote my prompt for Dall-E 3.

> A comic strip style image featuring Calvin, a young boy with spiky hair, standing in a playful boxing stance with oversized boxing gloves. He looks determined as he says 'Bring it on!' in a speech bubble. Facing him is Hobbes, a tall and slightly bemused tiger, also in a mock boxing pose with a gentle smile, as if humoring Calvin. The scene is set in Calvin's backyard, typical of a Calvin and Hobbes comic, with a simple and uncluttered backdrop.

But just as clearly, prompting is far from the primary driver. Here's the result of the above prompt on the Dall-E 3 API. The model ChatGPT is using behind the scenes seems to be a significant improvement over Dall-E 3.

![](/blog/assets/dall-e-3-2.webp)

The same **detailed** prompt does extremely well on [ImageGen 3](https://aitestkitchen.withgoogle.com/tools/image-fx), though.

![](/blog/assets/imagegen3-2.webp)

---

Update: 6 Oct 2024. Here's what I got with [meta.ai](https://www.meta.ai/).

![](/blog/assets/calvin-1.webp)

Update: 8 Oct 2024. Here's what I got with [Flux 1.1 Pro](https://flux1.ai/flux1-1) with the short prompt. (The detailed prompt gave me an error: "NSFW content detected in image. Try running it again, or try a different prompt.")

![](/blog/assets/flux1.1.webp)

Update: 4 Dec 2024. With [Amazon Nova Canvas](https://www.aboutamazon.com/news/aws/amazon-nova-artificial-intelligence-bedrock-aws), here's what the detailed prompt gave me.

![](/blog/assets/nova-calvin-2.webp)

---

## Comments

- **[RK](https://mvark.blogspot.com/)** _27 Aug 2024 12:12 am_:
  The Calvin cartoon generated by ChatGPT is eerily close to the original! It's amazing to see what these bots can create when you ask them to bring a funny idea to life.
  My last experiment brainstorming with the bots - https://mvark.blogspot.com/2024/08/let-ai-handle-overthinking.html
- **[My Year in 2024 - S Anand](/blog/my-year-in-2024/)** _30 Dec 2024 10:02 pm_ _(pingback)_:
  […] Creating my own Calvin & Hobbes strips […]