Midjourney is an independent research lab exploring AI applications, and it recently launched a bot that produces art from text inputs.
Others have built AI models that generate images from text, such as DALL-E and Wombo. But until now, Midjourney has been the best experience, at least for me.
My AI paintings 🎨
I'll explain how to use the Midjourney algorithm on Discord further down, but first check out my paintings, all generated from text prompts!
Using Midjourney
The AI bot is still in beta, but you can try it on Discord. You can access their Discord server through Midjourney's website.
Once you're on Midjourney's Discord, you'll find the instructions for using it. Basically, you just need to join a "newbie" channel and enter:
/imagine <your quote>
The output will be 4 images like this:
You can then decide to get a high-resolution version of one of the 4 images, or get more variations of one of them.
Note that you don't have many tries as a free user (around 25 queries), so make sure to prompt the best ideas that come to your mind! (or not, just have fun 😄)
Under the hood 💻
The technical background behind the magic gets complex to explain very quickly, but it mainly comes down to 3 pieces: the training data, the deep learning model, and the latent space.
The training data passes through the deep learning model (a CNN), which encodes it into a low-dimensional latent space before classification. The latent space basically works as data compression.
At the end of training, the last layer of the model captures the patterns of the input that are needed for image classification. In the latent space, images labelled as the same object have very close representations.
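To make the latent space idea a bit more concrete, here is a minimal Python sketch. It is a toy, not Midjourney's actual model: the 4x4 "images", the function names, and the hand-written `encode` function are all assumptions made for illustration. In a real system the encoder is a network that learns this mapping from the training data.

```python
import math

SIZE = 4  # toy "images" are SIZE x SIZE grids of pixel values in [0, 1]


def vertical_bar(col):
    """Toy image labelled 'vertical': one bright column."""
    return [[1.0 if c == col else 0.0 for c in range(SIZE)] for r in range(SIZE)]


def horizontal_bar(row):
    """Toy image labelled 'horizontal': one bright row."""
    return [[1.0 if r == row else 0.0 for c in range(SIZE)] for r in range(SIZE)]


def flatten(image):
    """Raw pixel representation: 16 numbers."""
    return [pixel for row in image for pixel in row]


def encode(image):
    """Hypothetical hand-written encoder: 16 pixels -> 2 latent numbers.

    It keeps only the brightest column average and the brightest row average.
    A trained model would learn its own features instead.
    """
    row_means = [sum(row) / SIZE for row in image]
    col_means = [sum(image[r][c] for r in range(SIZE)) / SIZE for c in range(SIZE)]
    return (max(col_means), max(row_means))


v_left, v_right = vertical_bar(0), vertical_bar(3)
h_top = horizontal_bar(0)

# In pixel space, the two vertical bars look *less* alike than a vertical
# and a horizontal bar, because they share no bright pixels at all.
print(round(math.dist(flatten(v_left), flatten(v_right)), 2))  # 2.83
print(round(math.dist(flatten(v_left), flatten(h_top)), 2))    # 2.45

# In the 2-number latent space, same-label images land on the same point,
# while the differently labelled image sits farther away.
print(round(math.dist(encode(v_left), encode(v_right)), 2))    # 0.0
print(round(math.dist(encode(v_left), encode(h_top)), 2))      # 1.06
```

Each image goes from 16 numbers down to 2 (the compression part), and the two vertical bars end up with identical latent representations even though their pixels don't overlap at all.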
For more technical information, I found this video very enlightening: