MidJourney

Introduction

Introduction.

Midjourney is a generative artificial intelligence program and service created and hosted by San Francisco-based independent research lab Midjourney, Inc. It was founded by David Holz, previously co-founder of Leap Motion.

Midjourney generates images from natural language descriptions, called “prompts”.

Users create artwork with Midjourney using Discord bot commands.

What sets Midjourney apart from other text-to-image models is its ability to create highly detailed, precise, and defined images. These images can have dimensions of up to 1,792 x 1,024 pixels. Descriptive text is needed to instruct the AI to create the image. The more information provided in the description, the more accurate the resulting image will be.

Model versions.

The company has been working on improving its algorithms, releasing new model versions every few months. Version 2 of their algorithm was launched in April 2022 and version 3 on July 25. On November 5, 2022, the alpha iteration of version 4 was released to users and on March 15, 2023, the alpha iteration of version 5 was released. The 5.1 model is more ‘opinionated’ than version 5, applying more of its own stylization to images, while the 5.1 RAW model adds improvement while working better with more literal prompts. After version 5.2 is released with an increasingly better image quality.

Midjourney is currently only accessible through a Discord bot on their official Discord server, by direct messaging the bot, or by inviting the bot to a third party server. To generate images, users use the /imagine command and type in a prompt; the bot then returns a set of four images. Users may then choose which images they want to Upscale or make Variations.

How it works.

Midjourney is an example of generative AI that specializes in creating images based on textual prompts. It is a product of the evolving field of Diffusion models.

Diffusion models are a type of generative model that have gained significant popularity in recent years due to their ability to generate high-quality data, such as images. They are fundamentally different from other generative methods and are based on the idea of decomposing the image generation process into many small “denoising” steps.

Here’s a step-by-step explanation of how they work:

Forward Process (Diffusion Process): This process involves gradually adding Gaussian noise to the input data through a series of steps12. This is also known as the diffusion process. The input data is progressively noised, transforming it into a latent variable2.

Reverse Process (Reverse Diffusion Process): After the forward process, a neural network is trained to recover the original data by reversing the noising process. This is also known as the reverse diffusion process. By being able to model this reverse process, we can generate new data.

Midjourney Guide

Midjourney guide 07

Created by Midjourney. Image by jcmm.art

Midjourney Guide.

Midjourney is currently only accessible through a Discord bot on their official Discord server, by directly messaging the bot, or by inviting the bot to a third party server.

To access the Midjourney server you need to have a Discord account.

L35 already has a Discord account and a Midjourney Standard Subscription.

To access the L35 server on Discord, go to the Midjourney website https://www.midjourney.com/home/

This screen may show to you.

Midjourney guide 07

A new window will open for you to Sign in.

Midjourney guide 07

If you don’t have a Discord account you must click on the Register link and create one.

If you are already registered, just fill in the data in the pop-up window and you will access to your Discord account.

Midjourney guide 07

After clicking on Authorize, you will enter Discord, where the L35 server will appear.

From now on you have access to the Midjourney bot inside your Server (it is recommended to create a bookmark in your browser for later access).

Inside Discord server

What Is a Discord Server? A Discord server is a home for your personal community you’re involved in. Within this server are multiple channels for different topics your community members like to talk about. Just click on one of those channels and the corresponding conversations will appear (in this case the prompts and images generated by Midjourney). When you access your server you will see a page similar to the one below. The Discord page is divided into three areas:

On the left are the servers you have access to.
In the adjacent column are the characteristics of your server, with the channels created on it.
The main area. This is where you will write your prompts and where Midjourney will generate the images.

Channels. These are separate spaces for text-based conversation (Text Channel) or video and audio conversations (Voice Channel). They keep conversations organized. You can create separate channels for all the topics your group likes. Midjourney bot only works on Text type Channels. Click below to see a video about:

Your browser does not support the video tag.

Midjourney bot

Discord Commands.

To interact with the Midjourney bot you must use one of the slash commands.

Commands are used to create images, change default settings, monitor user info, and perform other helpful tasks.

The list of available slash commands pop up when you type ‘/‘.

Generate images with the Midjourney bot.

All prompts to generate images with Midjourney bot should start by typing “/imagine” + “Enter” in the message field.

This will cause the request for your prompt to appear: ‘/imagine prompt:‘

You can also select the /imagine command from the list of available slash commands that pop up when you type ‘/’.

Type a description of the image you want to create in the prompt field. Send your message (known as a Prompt).

After submitting your text prompt, the Midjourney Bot processes your request, creating a grid of four image options.

Below the generated images, buttons will appear to modify or redo the generation.

Upscale Buttons

U1 U2 U3 U4 buttons upscale an image generating a larger version of the selected image and adding more details.

Redo Button

The redo (re-roll) button reruns a job. In this case, it would rerun the original prompt producing a new grid of images. If “Remix mode” mode is active, you can re-write the prompt.

Variation Buttons

V1 V2 V3 V4 buttons create incremental variations of the selected grid image. Creating a variation generates a new image grid similar to the chosen image’s overall style and composition.

jcmm.art

Introduction

Midjourney Guide

Midjourney's Prompt Helper

prompt area:

Copy Prompt

Clear Prompt

Prompts examples

Images Gallery

FOLLOW US

Pinterest

Behance

Facebook

YouTube

Dribbble

Twitter

CONTACT

jcmm.art Madrid 28005 - Spain

+34 616913979

josecarlosmartinmateos.ai@gmail.com

jcmm.art
Madrid 28005 - Spain