BEST AI Art Generator? Dall E 2 vs Midjourney vs Stable Diffusion

Wade McMaster - Creator Impact
22 Dec 202207:04

TLDRThis video compares three leading AI art generators: Dall-E 2, Midjourney, and Stable Diffusion. The comparison is based on the outputs generated from basic prompts to evaluate their performance and styles. Dall-E 2 is noted for its photorealistic images, while Midjourney stands out for its artistic and well-composed visuals. Stable Diffusion, though capable of creating photorealistic images, is considered second to the other two in this aspect. The video also discusses the user interfaces and complexity of each platform, with Dall-E 2 having a more user-friendly interface and Midjourney offering more artistic results despite its complexity. Stable Diffusion is mentioned as being free but more challenging to set up. The video concludes by inviting viewers to share their preferences and thoughts on the platforms.

Takeaways

  • 🎨 Dall-E 2, Midjourney, and Stable Diffusion are three prominent AI art platforms being compared for their ability to generate images from basic prompts.
  • 🔍 Dall-E 2 creates almost photorealistic images, although some details like teeth may appear off.
  • 🌟 Midjourney's images are described as stunning and artistic, with a style that stands out compared to the other platforms.
  • 📈 Stable Diffusion tends to produce more standard-looking images, which, while decent, are often considered less impressive than the other two.
  • 🖼️ When comparing the platforms, Vision Prep (Midjourney) is noted to have the best-looking image, while Dall-E 2 has the most realistic photorealistic output.
  • 🖌️ Dall-E 2's interface is praised for being user-friendly with features like in-painting and out-painting, enhancing the user experience.
  • 🤖 Midjourney is acknowledged for its artistic and better-composed images, although it has a more complex setup process.
  • 📷 Dall-E 2 excels in photorealism, making it the preferred choice for users seeking realistic images.
  • 🎭 Midjourney is favored for its artistic style, which is more appealing to the reviewer's taste.
  • 🆓 Stable Diffusion is noted for being free to use, but it is considered the most complex to set up.
  • 📱 The reviewer suggests that the choice of platform may depend on the user's preference for photorealism or artistic composition.
  • 💬 The script invites viewers to share their preferences and thoughts on the platforms, encouraging engagement and discussion.

Q & A

  • What are the three main AI art platforms mentioned in the transcript?

    -The three main AI art platforms mentioned are Dall-E 2, Midjourney, and Stable Diffusion.

  • According to the transcript, which platform produced the most photorealistic image of a beautiful woman with blue eyes?

    -Dall-E 2 produced the most photorealistic image of a beautiful woman with blue eyes.

  • What style of image did Midjourney create for the oil painting of a Shaolin monk?

    -Midjourney created an image that was sharp, fantastic, and had an exciting style for the oil painting of a Shaolin monk.

  • Which platform's output for the sunny outdoor scene was described as an artistic masterpiece?

    -Midjourney's output for the sunny outdoor scene was described as an artistic masterpiece.

  • How did the transcript describe the interface of Dall-E 2?

    -The transcript described the interface of Dall-E 2 as nice, easy to use, and featuring great functionalities like in-painting and out-painting.

  • What is the main advantage of Stable Diffusion mentioned in the transcript?

    -The main advantage of Stable Diffusion mentioned is that it can be obtained for free.

  • Which platform was considered the weakest for the busy city street scene?

    -Dall-E 2 was considered the weakest for the busy city street scene.

  • What is the distinctive feature of the image created by Midjourney for the cyborg with glowing eyes?

    -The distinctive feature of the image created by Midjourney for the cyborg with glowing eyes is that it has a really impressive and video game-style look.

  • How does the transcript describe the photorealism of Dall-E 2's images compared to the other platforms?

    -The transcript describes Dall-E 2's images as having the most photorealistic quality compared to the other platforms.

  • What is the general preference of the speaker regarding the artistic style of the platforms?

    -The speaker generally prefers the artistic style of Midjourney for its better composition and more artistic output.

  • What is the complexity of using Midjourney according to the transcript?

    -According to the transcript, Midjourney is a bit more complex to use, especially when compared to Dall-E 2.

  • What is the main consideration when choosing between these platforms based on the transcript?

    -The main considerations when choosing between these platforms are the style of the output (photorealistic, artistic, etc.), the ease of use of the interface, and the cost (free or not).

Outlines

00:00

🎨 Comparing AI Art Platforms: Dolly 2, Mid-Journey, and Stable Diffusion

This paragraph discusses the comparison of three prominent AI art platforms: Dolly 2, Mid-Journey, and Stable Diffusion. The author uses basic prompts to evaluate the outputs of each platform, noting that while there is skill involved in refining the results, the initial outputs provide a good indication of each platform's capabilities. Dolly 2 is noted for its photorealistic results, although with some quirks like funny-looking teeth. Mid-Journey is praised for its stunning and artistic outputs, while Stable Diffusion's results are considered decent but not the best among the three. The author concludes that Vision Prep (Dolly 2) has the most realistic image, and Mid-Journey has the most appealing style. The platforms also differ in their approaches to creating images, with Dolly 2 and Stable Diffusion leaning more towards photorealism, and Mid-Journey offering more artistic and stylized results. The paragraph also touches on the user interfaces and complexity of using each platform.

05:01

📷 Photorealism and Artistic Composition in AI Art Platforms

The second paragraph focuses on the photorealistic capabilities of the AI art platforms and their artistic composition. Dolly 2 is highlighted for its strong performance in creating photorealistic images, while Mid-Journey is recognized for producing more artistic and well-composed imagery. Stable Diffusion is noted to be better at photorealism than Mid-Journey but still falls short compared to Dolly 2. The author also discusses the user interfaces of the platforms, with Dolly 2 having a more user-friendly interface with features like in-painting and out-painting, making it easier to add AI art to specific areas. Mid-Journey, while offering better imagery, is described as more complex to use, and Stable Diffusion, despite being free, is considered the most complex to set up. The paragraph concludes with the author's preference for Mid-Journey and an invitation for viewers to share their thoughts and preferences in the comments.

Mindmap

Keywords

AI art platforms

AI art platforms are digital tools that use artificial intelligence to generate artwork based on user prompts. In the video, three such platforms are discussed: Dall E 2, Midjourney, and Stable Diffusion. These platforms are central to the video's theme as they are compared for their ability to produce various styles of artwork.

Photorealistic

Photorealistic refers to artwork that closely resembles a photograph in terms of visual detail and realism. In the context of the video, Dall E 2 is noted for producing photorealistic images, which is a significant aspect of the comparison between the AI art platforms.

Artistic composition

Artistic composition involves the arrangement of visual elements in a work of art to create a coherent and aesthetically pleasing image. Midjourney is highlighted in the video for its artistically composed images, which are more stylized and less focused on photorealism.

Prompts

Prompts are the textual descriptions or ideas provided by users to guide the AI in generating artwork. The video script mentions using basic prompts to compare the performance of the AI art platforms, emphasizing the importance of prompts in determining the final output.

Oil painting

An oil painting is a traditional art form where pigments are bound with a medium of drying oil, usually linseed oil, to create a painting. The video compares how each AI platform interprets and generates images resembling oil paintings, showcasing different styles and levels of realism.

Cyborg

A cyborg is a fictional or hypothetical being with both organic and biomechatronic body parts. In the video, the creation of a cyborg with glowing eyes is one of the prompts used to evaluate the AI platforms' ability to generate complex and imaginative artwork.

3D render

A 3D render is a two-dimensional representation of a three-dimensional object or scene, created using computer graphics. The video discusses the platforms' ability to generate 3D-rendered images, such as a turtle, and how they differ in style and quality.

Ink sketch

An ink sketch is a drawing made using ink, often characterized by bold lines and minimal color. The video includes an ink sketch of a dragon as one of the prompts to assess the AI platforms' capability to produce detailed and stylistic drawings.

Photograph

A photograph is an image created by capturing light on a light-sensitive surface, such as with a camera. The video script contrasts the AI platforms' ability to mimic the look of a photograph, with Dall E 2 being noted for its photorealistic qualities in this context.

Interface

The interface in the context of software refers to the point of interaction between the user and the system. The video mentions the user interfaces of the AI platforms, discussing their ease of use, features like in-painting and out-painting, and the complexity of setup.

Pixel resolution

Pixel resolution is the number of pixels in a digital image or display, which determines its clarity and detail. The video notes the difference in pixel resolution among the platforms, with Dall E 2 and Midjourney producing higher resolution images of 1024 by 1024 pixels.

Highlights

Dall-E 2, Midjourney, and Stable Diffusion are three main AI art platforms used for creating art.

Dall-E 2 creates almost photorealistic images, although sometimes with minor imperfections.

Midjourney produces stunning, artistic images that are not as photorealistic but have a unique style.

Stable Diffusion generates decent images but is considered the least impressive among the three platforms.

Vision prep is noted as the best-looking image, while Dall-E 2 has the most realistic photorealistic image.

Midjourney's image of a Shaolin monk is favored for its cooler style and artistic appeal.

Dall-E 2's outdoor scene looks very much like a photo, although not the most appealing.

Midjourney's artistic interpretation of the outdoor scene is more impressive and preferred.

Stable Diffusion opts for a more photorealistic look, positioning it between Midjourney and Dall-E 2 in terms of style.

Dall-E 2's depiction of a cyborg with glowing eyes is simple but not exactly what was envisioned.

Midjourney's cyborg image is described as impressive with a video game style appearance.

Stable Diffusion's cyborg image is cool but lacks the glowing eyes effect.

Midjourney is preferred for its artistic and more impressive style across different prompts.

Dall-E 2 offers a reasonably photographic look for a cute puppy wearing sunglasses and headphones.

Stable Diffusion's image of the puppy is more photorealistic but lacks the artistic depth of Midjourney's image.

Midjourney's 3D render of a turtle is considered superior to Dall-E 2's and Stable Diffusion's versions.

Dall-E 2's ink sketch of a dragon is rough but captures a cool style.

Midjourney's ink sketch is described as next level, offering more detail and a fun style.

Stable Diffusion's ink sketch is neater and more cohesive, but may not be as detailed as Midjourney's.

Dall-E 2 is noted for its superior interface with features like in-painting and out-painting.

Midjourney, while more complex to use, provides better imagery.

Stable Diffusion is free but can be complex to set up without using an online interface.

Dall-E 2 is recognized for its photorealistic capabilities, Midjourney for artistic composition, and Stable Diffusion for a balance between the two.