SSW Foursquare

Rules to Better AI Generated Media - 8 Rules

  1. Do you know when to use AI generated images?

    Imagine a scenario where your project requires unique artwork, but you're working with a tight budget or limited time. You don't have much bandwith to brainstorm artwork or search for what you need. In situations like this, experimenting with AI image generation can be a quick and productive solution.

    Some of the benefits are:

    • Lowered Artwork Costs
    • Fast Experimentation
    • Accessibility

    Video: A first look at Midjourney 5 and the ethics of AI image tech (3 min)

    There are many cases when AI-generated images can come in handy:

    • Web Design - You can use AI to create unique background images, icons, and UI elements
    • Marketing Campaigns - AI can be a useful tool to help with social media graphics, newsletter images, and event promotions
    • Content Creation - You can use AI to add interesting visuals to blog posts, social media updates, and more.
  2. Do you know how to generate an AI image?

    Imagine you’re browsing through an art exhibition online and you come across some strikingly unique images. Upon reading the descriptions, you realize they were all generated using Artificial Intelligence (AI). This piques your interest, and you find yourself wondering, "Could I create something like this?" The answer is yes. Here is how you can do it yourself.

    Video: How To Generate INSANE AI Art For Beginners (2 min)

    Note: Midjourney is no longer free as it was in this video demo.

    How AI Image Generation Works

    AI image generators are tools that utilize trained AI models to create new images. The models are trained on databases of existing artwork. The most popular AI image generators are Midjourney, DALLE-2, and StableDiffusion.

    These image generators work by reading your text and using it to create an image. The text you input is called a ‘prompt’, and it guides the model towards the type of image that you describe.

    Each AI image generator works slightly differently, but the basic idea is the same: text ➡️ image.

    Generating Your First AI Image

    The exact process will vary depending on the generator you're using, but generally, you can follow the following steps:

    1. Navigate to the generator’s interface.
    2. Write a short description of the image you want (optional: include a negative prompt (specifying what you DON’T want in the image)
    3. Generate the image: The tool will create an image based on your specifications.

    prompt figure
    Figure: Good example – Writing a prompt into an AI image generator

    Practice makes perfect

    The more you experiment, the better you'll understand how different inputs affect your results. Try new things and make mistakes – it's all part of the learning process!

  3. Do you know the best AI image generators?

    There is general consensus that Midjourney is the best tool. Behind Midjourney are DALLE-2 and StableDiffusion. Beware, if you are using any tool besides Midjourney, your images are likely to be noticeably less natural.

    Midjourney

    Midjourney is used on Discord, where users interact with the bot by typing /imagine.

    Note: A Discord account is required first.

    • Cost: $8USD/month
    • Images can be reiterated on
    • Many parameters: Midjourney Parameter List e.g. “--aspect”
    • Prompting in Midjourney:

      • Even short prompts can produce beautiful images
      • Basic - /imagine cat
      • Specify an artistic medium – /imagine {{{ ANY ART STYLE }} style cat
      • Get specific – /imagine {{ STYLE }} sketch of a cat
      • Time travel – /imagine {{ DECADE }} cat illustration
      • Emote – /imagine {{ EMOTION }} cat
      • Be colorful – /imagine {{ COLOR WORD }} colored cat
      • Explore environments – /imagine {{ LOCATION }} cat

    red tree midjourney2
    Figure: "A red tree in a valley. Hi res" - by Midjourney

    DALL-E 2

    DALL-E is an AI system capable of creating realistic images from a natural language description.

    • Uses a credit system where users purchase credits to use the model
    • Some OpenAI users get free credits each month

    red tree dalle
    Figure: "A red tree in a valley. Hi res" - by DALL-E2

    Leonardo AI

    Leonardo AI gets compared to Midjourney for its focus on concept art and imagination. You can use it by creating a free account at Leonardo.ai. Some of Leonardo's unique features are:

    • Great website UI
    • Incredibly customiseable, with many different models and parameters avaliable
    • Built-in image editor
    • Built-in prompt generator to help you make the best prompt

    red tree leonardo
    Figure: "A red tree in a valley. Hi res" - by Leonardo.ai

    DreamStudio

    DreamStudio is made by StabilityAI and is used, like DALLE2, on a web interface. It is based on the Stable Diffusion model of image generation.

    You can use the demo here for free Stable Diffusion Web, or you can use it through the DreamStudio interface (starting with a free trial).

    • You can use the web demo without signing up
    • Easy customization of parameters on the interface (e.g., style, aspect ratio)

    red tree dreamstudio
    Figure: "A red tree in a valley. Hi res" - by DreamStudio

  4. Do you know how to write an image prompt?

    Prompts are the instructions that you input. They can be as simple or as complex as you like.

    Video: Advanced Midjourney V5.1 Guide (11 min) - Prompting

    Prompting basics

    A general prompt might be "an image of a sunset over the ocean," which tells the AI exactly what you're looking for.

    “A brown dog on a skateboard”

    Figure: Good example - A basic prompt

    A well-structured prompt often has more details, in the format “A {{ TYPE OF PICTURE }} of a {{ MAIN SUBJECT }}, {{ STYLE CUES }}”.

    “A photograph of a robot, cartoon style”

    Figure: Good example - A basic prompt with style cues.

    You can add more even more detail to make a descriptive prompt by following this template:

    {{ ADJECTIVE }}, {{ EMOTION }}, {{ SUBJECT }}, {{ STYLE }}, {{ COLOR }}

    Figure: Good example - Use this prompt template!

    Prompt length

    Prompts that are too short do not give the AI enough information to create an image matching your idea. It is a good idea to be detailed with your prompts, but too much detail is equally likely to give the AI too many instructions, which "confuses" it and can reduce the acurracy.

    "A landscape"

    Figure: Bad example - A vague prompt like this gives an ambiguous image

    "A snowy mountain landscape at sunset with warm hues in the style of a photograph"

    Figure: Good example - A detailed and concise description, but not too long. This will provide the AI with specific elements to incorporate, resulting in an accurate image.

    "A snowy mountain landscape at sunset... Majestic peaks rise high into the sky, their rugged outlines etched against the fading golden light. Adorned with a pristine layer of glistening snow, they stand as silent sentinels, towering over the vast expanse below. The snow-covered slopes cascade down in gentle curves, inviting adventurers and nature lovers alike. In the foreground, a small creek meanders through the snow-covered landscape, flowing steadily beneath a delicate layer of ice. Over the creek, a footbridge stretches, connecting the two banks. Two travelers cross the bridge, their footprints leaving a mark on the pristine snow. The frozen trees, adorned with icicles, stand like witnesses to the passage of time."

    Figure: Bad example - This prompt is too long. It will probably just look like the one above!

  5. Do you use negative prompting?

    Sometimes you might be trying to create a specific type of image, and the AI keeps including pieces that you don't want. A straightforward way to get around this problem is by including what you DON'T want in your prompt.

    What is negative prompting?

    Negative prompting is specifying what you don't want in your image. It guides the AI away from certain features that you're not interested in. Some AI image generators (e.g. Midjourney and Dreamstudio) have this option as a parameter. In others (e.g. DALLE-2), you can include it in your prompt.

    Imagine you are using Midjourney to generate a photo of an emplty highway in the mountains.

    “A highway in the mountains”

    Figure: Bad example - When you use this prompt, Midjourney keeps putting cars on the highway!

    This is when it can be helpful to include a negative prompt. In Midjourney, this is done by using two dashes. (see Midjourney parameter list)

    “A highway in the mountains --no cars”

    Figure: Good example - The prompt with '--no cars' is more likely to eliminate cars in the picture.

  6. Do you use parameters in your image prompts?

    Parameters allow you to control different aspects of the generated image via settings on the image generator. Most AI image generators have parameter options, and they can significantly affect the result.

    • Resolution: This defines the quality of the generated image. Higher resolution values will result in higher-quality images.
    • Randomness: This parameter influences the amount of random variation in the generated image. Higher randomness values may result in more unique or creative images, but they can also lead to images that deviate more from the initial prompt.
    • Aspect Ratio: Aspect ratio dictates the proportions of the image. For instance, you might choose a square (1:1) aspect ratio for social media posts, a landscape (16:9) ratio for video thumbnails, or a portrait (3:4) ratio for smartphone screens.
    • Style: Style refers to specifying a particular visual style for the image. This could be a certain artistic style (like "impressionistic" or "cubist"). The AI uses this information to guide the stylistic aspects of image generation.
    • Uploading: Most AI image generators allow you to upload an existing image so that the AI will create different variations of it.
  7. Do you know how to transform images with Photoshop Generative Fill?

    Adobe Photoshop’s AI-powered and prompt-based generative tool can turn a simple picture into anything you imagine.

    Video: Generative Fill for Beginners (4 min)

    Generative Fill is easy to use

    1. Select an object or area with any selection tool (lasso, marquee, magic wand, etc)
    2. Enter a prompt into the contextual taskbar that appears after your selection
    3. Click generate!
    4. Use the arrows in the taskbar to see some alternate variations, or press generate again until you see a result you like.

    Tips and tricks

    • Leave the prompt blank and Generative Fill will use the area surrounding your selection to mask it away. Removing unwanted objects is that easy.
    • Be intentional with the size and shape of the selections you make. The tool will use this as extra information to interpret your prompts.
    • Generative Fill will create a new generative layer with the same name as your prompt. You can come back and alter the generated content at any time by selecting that layer.

    Although there are still discussions around AI images and copyright, Generative Fill is powered by Adobe Firefly and designed for commercial use.

  8. Do you know how to use AI tools to generate voices and translations?

    AI tools like Rask, Voice.ai, and ElevenLabs are revolutionizing how we generate voices and translations. They can even create transcripts, making our lives a lot easier. But remember, with great power comes great responsibility!

    rask bad v2
    Figure: Bad example - Relying solely on AI without any oversight can lead to errors

    rask good v2
    Figure: Good example - Using Rask to translate an English voiceover into French in the speakers own voice

    Video: Fix Your Website Chatbot! - English version (1min)

    Video: Fix Your Website Chatbot! - French translation (1min)

    The Need for Oversight 👀

    While these tools are incredibly helpful, they're not perfect. It's important to double-check their output to ensure accuracy. After all, we wouldn't want any miscommunication, would we?

    The Ever-Changing AI Landscape 🌅

    The AI landscape is constantly changing and improving. Today's best tool might be tomorrow's old news. So, keep an open mind and be ready to adapt to the latest and greatest!

We open source. Powered by GitHub