Rules to Better AI-Generated Media

Imagine a scenario where your project requires unique artwork, but you're working with a tight budget or limited time. You don't have much bandwith to brainstorm artwork or search for what you need. In situations like this, experimenting with AI image generation can be a quick and productive solution.

Some of the benefits are:

Lowered Artwork Costs
Fast Experimentation
Accessibility

Video: A first look at Midjourney 5 and the ethics of AI image tech (3 min)

There are many cases when AI-generated images can come in handy:

Web Design - You can use AI to create unique background images, icons, and UI elements
Marketing Campaigns - AI can be a useful tool to help with social media graphics, newsletter images, and event promotions
Content Creation - You can use AI to add interesting visuals to blog posts, social media updates, and more.

Imagine you’re browsing through an art exhibition online and you come across some strikingly unique images. Upon reading the descriptions, you realize they were all generated using Artificial Intelligence (AI). This piques your interest, and you find yourself wondering, "Could I create something like this?" The answer is yes. Here is how you can do it yourself.

Video: How To Generate INSANE AI Art For Beginners (2 min)

Note: Midjourney is no longer free as it was in this video demo.

How AI Image Generation Works

AI image generators are tools that utilize trained AI models to create new images. The models are trained on databases of existing artwork. The most popular AI image generators are Midjourney, DALLE-2, and StableDiffusion.

These image generators work by reading your text and using it to create an image. The text you input is called a ‘prompt’, and it guides the model towards the type of image that you describe.

Each AI image generator works slightly differently, but the basic idea is the same: text ➡️ image.

Generating Your First AI Image

The exact process will vary depending on the generator you're using, but generally, you can follow the following steps:

Navigate to the generator’s interface.
Write a short description of the image you want (optional: include a negative prompt (specifying what you DON’T want in the image)
Generate the image: The tool will create an image based on your specifications.

prompt figure — Figure: Good example – Writing a prompt into an AI image generator

Practice makes perfect

The more you experiment, the better you'll understand how different inputs affect your results. Try new things and make mistakes – it's all part of the learning process!

Current rankings suggest Midjourney as the top tool, followed by ChatGPT Image Generation (GPT-4o), Google's Imagen, Adobe Firefly, Stable Diffusion, Leonardo AI, and Flux. Beware, other tools may produce noticeably less natural images.

Midjourney

Midjourney generates high-quality images via Discord (/imagine command) or its new web interface.

Cost: Starts at $10 USD/month ($8/month annually)
Offers subscription tiers with GPU hours and commercial usage options
Supports detailed prompts for customized images

ChatGPT Image Generation (GPT-4o)

Integrated into ChatGPT for detailed, realistic image generation from text prompts.

Direct access via ChatGPT
Handles complex prompts seamlessly

chatgpt4o — Figure: "A red tree in a valley. Hi res" - by ChatGPT4o

Ideogram

Ideogram is a freemium image generation tool known for its excellent handling of text within images, a common weakness in other models.

Supports various artistic and design styles, including 3D, anime, and realistic
Ideal for creating posters, logos, and text-rich compositions

Available via web and mobile apps

idiogram3 — Figure: " A red tree in a valley. Hi res" - by Ideogram 3

Google Imagen

Offers exceptional photorealism and language comprehension.

Accessible via Vertex AI or ImageFX
Produces highly detailed, realistic images

imagen3 — Figure: "A red tree in a valley. Hi res" - by Imagen 3

Reve

Reve is an emerging image generator designed to offer creativity and control, with a focus on aesthetic quality and prompt flexibility.

Aimed at creators seeking artistic and dream-like visuals
Supports stylised compositions and illustrative looks
Gaining attention as a new player in the image generation space

Adobe Firefly

Known for detailed control over image generation and editing, ideal for customized outputs.

Advanced editing and generation features
Integrated into Adobe Creative Cloud

firefly — Figure: "A red tree in a valley. Hi res" - by Firefly

Stable Diffusion (via DreamStudio)

Highly customizable, open source tool offering fine-tuned image generation.

Free and available for local use
Highly customizable with flexible output adjustments
Supports inpainting and outpainting with Stable Diffusion XL (SDXL)

stablediffusion — Figure: "A red tree in a valley. Hi res" - by StableDiffusion via DreamStudio

Leonardo AI

Known for imaginative visuals and concept art. Free account available at Leonardo.ai.

User-friendly website
Highly customizable with built-in image editing and prompt generation

Flux

Developed by Black Forest Labs, Flux generates high-quality images from text prompts.

Multiple model variants (Schnell, Dev, Pro)
Supports various artistic styles and precise color control
Capable of rendering text within images

Adobe Photoshop’s AI-powered and prompt-based generative tool can turn a simple picture into anything you imagine.

Video: Generative Fill for Beginners (4 min)

Generative Fill is easy to use

Select an object or area with any selection tool (lasso, marquee, magic wand, etc)
Enter a prompt into the contextual taskbar that appears after your selection
Click generate!
Use the arrows in the taskbar to see some alternate variations, or press generate again until you see a result you like.

Tips and tricks

Leave the prompt blank and Generative Fill will use the area surrounding your selection to mask it away. Removing unwanted objects is that easy.
Be intentional with the size and shape of the selections you make. The tool will use this as extra information to interpret your prompts.
Generative Fill will create a new generative layer with the same name as your prompt. You can come back and alter the generated content at any time by selecting that layer.

Although there are still discussions around AI images and copyright, Generative Fill is powered by Adobe Firefly and designed for commercial use.

Prompts are the instructions that you input. They can be as simple or as complex as you like.

Video: Advanced Midjourney V5.1 Guide (11 min) - Prompting

Prompting basics

A general prompt might be "an image of a sunset over the ocean," which tells the AI exactly what you're looking for.

Tip: When sharing an AI image, add the prompt to the image caption so others can see how you made it.

“A brown dog on a skateboard”

Figure: Good example - A basic prompt

A well-structured prompt often has more details, in the format “A {{ TYPE OF PICTURE }} of a {{ MAIN SUBJECT }}, {{ STYLE CUES }}”.

“A photograph of a robot, cartoon style”

Figure: Good example - A basic prompt with style cues.

You can add more even more detail to make a descriptive prompt by following this template:

{{ ADJECTIVE }}, {{ EMOTION }}, {{ SUBJECT }}, {{ STYLE }}, {{ COLOR }}

Figure: Good example - Use this prompt template!

Prompt length

Prompts that are too short do not give the AI enough information to create an image matching your idea. It is a good idea to be detailed with your prompts, but too much detail is equally likely to give the AI too many instructions, which "confuses" it and can reduce the acurracy.

"A landscape"

Figure: Bad example - A vague prompt like this gives an ambiguous image

"A snowy mountain landscape at sunset with warm hues in the style of a photograph"

Figure: Good example - A detailed and concise description, but not too long. This will provide the AI with specific elements to incorporate, resulting in an accurate image.

"A snowy mountain landscape at sunset... Majestic peaks rise high into the sky, their rugged outlines etched against the fading golden light. Adorned with a pristine layer of glistening snow, they stand as silent sentinels, towering over the vast expanse below. The snow-covered slopes cascade down in gentle curves, inviting adventurers and nature lovers alike. In the foreground, a small creek meanders through the snow-covered landscape, flowing steadily beneath a delicate layer of ice. Over the creek, a footbridge stretches, connecting the two banks. Two travelers cross the bridge, their footprints leaving a mark on the pristine snow. The frozen trees, adorned with icicles, stand like witnesses to the passage of time."

Figure: Bad example - This prompt is too long. It will probably just look like the one above!

Parameters allow you to control different aspects of the generated image via settings on the image generator. Most AI image generators have parameter options, and they can significantly affect the result.

Resolution: This defines the quality of the generated image. Higher resolution values will result in higher-quality images.
Randomness: This parameter influences the amount of random variation in the generated image. Higher randomness values may result in more unique or creative images, but they can also lead to images that deviate more from the initial prompt.
Aspect Ratio: Aspect ratio dictates the proportions of the image. For instance, you might choose a square (1:1) aspect ratio for social media posts, a landscape (16:9) ratio for video thumbnails, or a portrait (3:4) ratio for smartphone screens.
Style: Style refers to specifying a particular visual style for the image. This could be a certain artistic style (like "impressionistic" or "cubist"). The AI uses this information to guide the stylistic aspects of image generation.
Uploading: Most AI image generators allow you to upload an existing image so that the AI will create different variations of it.

Traditionally, UI wireframes have been made with tools like Figma, Excalidraw, Balsamiq, or pen and paper. Generative AI tools now streamline the process. We can use Prompt Engineering to create a Proof of Concept (PoC) or translate a design to code with the click of a button!

Video: Build a fullstack app in 7 minutes with v0 (Figma to code)! (7 min)

Here is a longer example of someone using v0.dev to generate a calculator UI and then trying to replicate the same UI by hand: V0 coded a calculator 30x faster than me (17 min)

Boilerplate

When a web developer is handed design images, it can be a drag to spend the next few hours (or days/weeks) building the baseline components, HTML, and CSS before getting to the fun stuff.

Thankfully, AI-assisted tools are beginning to help with the more mundane aspects of this process, and sites such as v0.dev can produce React components (with the code) you can drop into your project to give you a huge head start on the work!

Upload your designers' images and tweak the output with a few prompts to get ready-made components for your app. Just be careful - these tools will often take "creative liberties" with the designs, so keep an eye out.

Proof of concepts

Generative UI tools can be a great way to collaborate with stakeholders in realtime, as you can generate ideas and iterate over them using natural language prompts. This can give developers or designers a useful starting point and frame of reference to build a highly polished design.

If you're a dev that likes (or is forced) to moonlight as a designer these tools can be a great timesaver in creating professional looking web components in a short amount of time. These tools aren't a substitute for a layout created by a qualified designer but they can be great for generating boilerplate code or proof of concept designs that you can build on in the future.

Example

Here is what you can achieve using v0.dev and a few prompts:

matrix blog — Figure: A matrix-style blog site

🤖 AI prompts excamples:

Create a form to input customer data like: first name, last name, DoB, email, phone, etc..
Do it using material style component. With a primary color red
Use this red instead : #CC4141, and add icons next to labels, like a phone icon next to the phone number, etc

Tools

v0.dev
builder.io
- Generate Figma Designs with AI
- Design to Code
UIMagic (waitlist)
Pineapple Builder

Sometimes you might be trying to create a specific type of image, and the AI keeps including pieces that you don't want. A straightforward way to get around this problem is by including what you DON'T want in your prompt.

What is negative prompting?

Negative prompting is specifying what you don't want in your image. It guides the AI away from certain features that you're not interested in. Some AI image generators (e.g. Midjourney and Dreamstudio) have this option as a parameter. In others (e.g. DALLE-2), you can include it in your prompt.

Imagine you are using Midjourney to generate a photo of an emplty highway in the mountains.

“A highway in the mountains”

Figure: Bad example - When you use this prompt, Midjourney keeps putting cars on the highway!

This is when it can be helpful to include a negative prompt. In Midjourney, this is done by using two dashes. (see Midjourney parameter list)

“A highway in the mountains --no cars”

Figure: Good example - The prompt with '--no cars' is more likely to eliminate cars in the picture.

AI tools like Rask, Voice.ai, and ElevenLabs are revolutionizing how we generate voices and translations. They can even create transcripts, making our lives a lot easier. But remember, with great power comes great responsibility!

rask bad v2 — Figure: Bad example - Relying solely on AI without any oversight can lead to errors

rask good v2 — Figure: Good example - Using Rask to translate an English voiceover into French in the speakers own voice

Video: Fix Your Website Chatbot! - English version (1min)

Video: Fix Your Website Chatbot! - French translation (1min)

The Need for Oversight 👀

While these tools are incredibly helpful, they're not perfect. It's important to double-check their output to ensure accuracy. After all, we wouldn't want any miscommunication, would we?

The Ever-Changing AI Landscape 🌅

The AI landscape is constantly changing and improving. Today's best tool might be tomorrow's old news. So, keep an open mind and be ready to adapt to the latest and greatest!

Rules to Better AI-Generated Media - 9 Rules

Do you know when to use AI generated images?

Do you know how to generate an AI image?