Was NOT Expecting This! Midjourney V6 Competes with DALL-E 3 | Comparison & Review

MattVidPro AI
21 Dec 202319:33

TLDRMidjourney V6, a significant update to the AI art generator, is reviewed in comparison to DALL-E 3. The video discusses the impressive advancements in V6, highlighting its ability to generate more realistic and coherent images, particularly in text generation, which has seen notable improvement. Despite initial skepticism, the reviewer is pleasantly surprised by Midjourney V6's performance, especially considering it's still in the alpha stage. The video compares various outputs from both AIs, noting that while DALL-E 3 excels in certain areas, Midjourney V6 offers a strong alternative with its own unique strengths, such as better control over image manipulation and less censorship. The reviewer also touches on the accessibility and cost of using these platforms, with DALL-E 3 being available for free on certain platforms, whereas Midjourney V6 requires a subscription. The summary concludes with the potential for Midjourney V6 to compete effectively in the AI art space.

Takeaways

  • πŸš€ Midjourney V6 has made significant advancements and is now competing with DALL-E 3 in the AI art landscape.
  • 🎨 The development time for Midjourney V6 was nearly twice as long as the previous longest development cycle, indicating substantial improvements.
  • πŸ†• Midjourney V6 is currently in its Alpha version, suggesting that its capabilities will continue to improve over time.
  • πŸ“ˆ Community reactions show that Midjourney V6 can generate more photorealistic and cinematic images compared to DALL-E 3.
  • πŸ“œ Text generation in Midjourney V6 has improved, but there are still instances where accuracy lags behind DALL-E 3.
  • πŸ’¬ Aesthetics in Midjourney V6 are considered superior by some users, compensating for slightly less text accuracy.
  • πŸ“± Midjourney V6 excels in photorealism, creating images that closely resemble real-life photographs or Instagram posts.
  • πŸ“ˆ The prompt understanding in Midjourney V6 is better than its V5 predecessor but still does not match DALL-E 3's level of detail and accuracy.
  • πŸ’° Access to Midjourney V6 requires a subscription, whereas DALL-E 3 can be accessed for free on certain platforms, giving it a competitive edge.
  • πŸ“ Midjourney V6 offers more control and less censorship, which may appeal to users looking for greater creative freedom.
  • βœ… Midjourney V6 has successfully passed the 'cigarette test,' demonstrating its ability to generate detailed and complex images.

Q & A

  • How long did it take for Midjourney to develop from version 5 to version 6?

    -The development of Midjourney version 6 took nearly twice as long as the previous longest development cycle, which was the time it took to go from the first version of Midjourney to the third version.

  • What is the current status of Midjourney V6?

    -Midjourney V6 is currently in its Alpha version, which means it is still under development and is expected to improve over time.

  • How does Midjourney V6 compare to DALL-E 3 in terms of text generation?

    -Midjourney V6 has shown the ability to generate text that is competitive with DALL-E 3, although it sometimes requires more attempts to get the text right and can occasionally appear less natural.

  • What are some of the strengths of Midjourney V6?

    -Midjourney V6 excels in photorealism, has a better understanding of prompts compared to its previous version, offers more control with less censorship, and has a better understanding of pop culture characters.

  • How does the user feel about the community's initial reactions to Midjourney V6?

    -The user is impressed by the community's initial reactions, noting that the images generated by Midjourney V6 are good-looking, realistic, and in some cases, superior to DALL-E 3.

  • What are the user's thoughts on the comparison between Midjourney V6 and DALL-E 3?

    -The user is surprised by how well Midjourney V6 is competing with DALL-E 3, despite their initial doubts. They acknowledge that both have their strengths and that Midjourney V6 has made significant improvements.

  • What is the user's opinion on the photorealism of Midjourney V6?

    -The user is very impressed by the photorealism of Midjourney V6, stating that it has maintained the strengths of version 5 and made significant improvements, making it more competitive with DALL-E 3.

  • How does the user feel about the text accuracy in images generated by Midjourney V6?

    -The user notes that while Midjourney V6 can produce good text, it sometimes requires multiple attempts to get the text correct and can occasionally produce text that looks unnatural or 'Photoshop-esque'.

  • What are the user's thoughts on the pricing and accessibility of Midjourney V6 compared to DALL-E 3?

    -The user mentions that Midjourney V6 requires a subscription, with a minimum cost of $10 per month for access to V6, whereas DALL-E 3 can be accessed for free on certain platforms, making it more accessible.

  • What is the user's theory about the training differences between Midjourney V6 and DALL-E 3?

    -The user theorizes that Midjourney V6 might be synthetically trained to produce text, while DALL-E 3 could be naturally trained, which could explain the differences in text quality and character generation between the two.

  • What does the user suggest for improving the user experience with Midjourney V6?

    -The user suggests that Midjourney should develop a better web interface for easier image manipulation and to reduce reliance on Discord, which they find inconvenient.

Outlines

00:00

🎨 Mid Journey V6's Impressive AI Art Development

The video discusses the significant advancements in AI art generation with the release of Mid Journey V6. It highlights the extended development time for V6, which nearly doubled the previous longest development period, reflecting the rapid changes in AI art landscape prompted by competitors like SDXL and Dolly 3. The video praises Mid Journey V6's ability to generate highly realistic images, competitive with Dolly 3, and notes that it's still in its Alpha version, promising further improvements. Community reactions and comparisons with Dolly 3 are shared, showcasing the quality of text and image generation, with examples including a man standing alone, an anime movie poster, and a Coca-Cola advertisement. The video also touches on the versatility of SDXL and the photorealistic capabilities of Mid Journey V6.

05:02

πŸ“ˆ Mid Journey V6's Text Generation and Photorealism Compared to Dolly 3

This paragraph focuses on the text generation capabilities of Mid Journey V6 and compares them with Dolly 3. It presents community-generated examples that demonstrate V6's ability to produce accurate text in images, such as a standup pouch product photo and a movie poster. The video also shows a side-by-side comparison with Dolly 3, noting that while Dolly 3 has strong character generation, Mid Journey V6 excels in photorealism and text accuracy. The video discusses the need for specific prompting techniques to achieve the best results with V6 and mentions the requirement of a Mid Journey subscription to access V6. It concludes by reiterating the competitive edge V6 has in photorealism and the potential for future improvements.

10:03

🧐 Dolly 3 vs. Mid Journey V6: A Head-to-Head Test on Text and Character Generation

The video script presents a detailed comparison between Dolly 3 and Mid Journey V6, focusing on their ability to generate text and characters. It discusses the results of specific prompts, such as generating images of a lemon character and a Disney logo, where Mid Journey V6 outperformed Dolly 3 in terms of text accuracy. The video also presents a conspiracy theory suggesting that Dolly 3 might be naturally trained to produce text, while Mid Journey V6 is synthetically trained, leading to differences in output quality. The script further compares the two AIs on generating realistic photos, with Mid Journey V6 appearing to be more successful in creating photorealistic images that resemble Instagram posts. However, it notes that Dolly 3 has a better understanding of complex concepts like a pirate outfit on a dog.

15:04

πŸš€ Mid Journey V6's Competitive Edge and Future Prospects

The final paragraph discusses the current standing of Mid Journey V6 in comparison to Dolly 3. While acknowledging Dolly 3's leading position in image generation, the video argues that Mid Journey V6 is a close second, with impressive text generation capabilities and better prompt understanding than its predecessor, V5. It also notes Mid Journey V6's strengths in less censorship, better control, and the inclusion of in-painting, a feature lacking in Dolly 3. The video mentions the cost of accessing Mid Journey V6, which requires a subscription, versus the free access to Dolly 3 through certain platforms. It concludes by stating that Mid Journey V6's development has been impressive, especially considering the competition with well-funded entities like Open AI, and that it has the potential to be a solid competitor in the AI image generation market.

Mindmap

Keywords

Midjourney V6

Midjourney V6 refers to the sixth version of the AI art generation software, Midjourney. It is a significant update that has taken nearly twice as long to develop as the previous longest development cycle. The video discusses how this version competes with DALL-E 3, another AI art generator, in terms of text generation and photorealism. An example from the script is the comparison of text generation where Midjourney V6 successfully creates a man standing alone in a dark area staring at a neon sign that says 'empty,' similar to the quality of DALL-E 3.

DALL-E 3

DALL-E 3 is an advanced AI image generation model developed by OpenAI. It is known for its high level of coherence, prompt understanding, and the ability to generate images at an impressive scale and price level. In the video, it is compared with Midjourney V6, where both are evaluated on their ability to generate text and images. An example from the script is the comparison of an anime movie poster where DALL-E 3's text generation is noted to be slightly less accurate than Midjourney V6.

Photorealism

Photorealism in the context of the video refers to the ability of an AI to generate images that closely resemble real photographs. It is a key aspect being compared between Midjourney V6 and DALL-E 3. The video discusses how Midjourney V6 has improved in this area, particularly in generating images that could trick the viewer into thinking they are real Instagram photos. An example is the image of a Shih Tzu puppy wearing a pirate outfit, which looks very realistic and professional.

Text Generation

Text generation is the process by which AI creates textual content within an image. It is a feature that both Midjourney V6 and DALL-E 3 are evaluated on in the video. The script discusses how Midjourney V6 has made strides in text generation, making it competitive with DALL-E 3, although there are instances where the text appears less natural compared to DALL-E 3's output. An example from the script is the successful generation of the words 'Matt vidpro' in a movie poster style image.

AI Art Landscape

The AI art landscape refers to the current state and advancements in the field of AI-generated art. The video discusses how this landscape has changed significantly with the introduction of competitors like Midjourney and DALL-E. It is important as it sets the stage for the comparison between Midjourney V6 and DALL-E 3 and their respective capabilities in AI art generation. An example from the script is the mention of how the release of DALL-E 3 and the emergence of a free competitor, SDXL, influenced the development of Midjourney V6.

SDXL

SDXL is mentioned as a versatile, free, and open-source AI model that is compared to Midjourney and DALL-E in terms of its specialized use cases. It is described as more suitable for certain specific applications rather than general AI art generation. The script notes that while SDXL has its benefits, it is not the primary focus of the comparison in the video, which is between Midjourney V6 and DALL-E 3.

Anime Movie Poster

An 'Anime Movie Poster' is used as an example in the video to illustrate the text generation capabilities of Midjourney V6 and DALL-E 3. It is a specific type of image that includes text and design elements typical of promotional materials for anime films. The script mentions how Midjourney V6 correctly spelled 'tomorrow' in the context of an anime movie poster, showcasing its ability to generate coherent text within a complex and thematic image.

Coca-Cola Ad

The 'Coca-Cola Ad' is another example used in the video to demonstrate the capabilities of Midjourney V6. It involves generating an image that includes the Coca-Cola logo and traditional Hawaiian patterns on the Coca-Cola itself. The discussion around this example highlights the challenges AI faces in accurately generating well-known logos and patterns, and how Midjourney V6 performs in this regard.

In-Painting

In-painting is a feature that allows AI to fill in missing or selected areas of an image with new content that matches the surrounding context. The video mentions that Midjourney V6 has an in-painting feature, which is notably absent in DALL-E 3. This feature is significant as it provides users with more control over the final output of their AI-generated images, allowing for more creative and less constrained results.

Discord

Discord is a communication platform where the user interface for Midjourney V6 is currently operating. The video script expresses frustration with the use of Discord for this purpose, indicating a preference for a web interface. The mention of Discord highlights the user experience aspect of interacting with AI art generation tools and the need for more convenient and user-friendly interfaces.

Bing Image Creator

Bing Image Creator is one of the platforms mentioned in the video where users can access DALL-E 3 for free. It is an example of how Microsoft is using DALL-E 3's API to offer free image generation services, which is a strategic move to gain market share and user adoption. The script discusses how this compares to Midjourney V6, which requires a subscription to access.

Highlights

Midjourney V6 has been developed for nearly twice as long as the previous longest development cycle, showcasing significant advancements.

Midjourney V6 is now capable of competing with DALL-E 3 in terms of AI art generation.

The Alpha version of Midjourney V6 has impressed with its capabilities, hinting at even better performance in future updates.

Community reactions suggest that Midjourney V6 generates more beautiful and realistic words compared to DALL-E 3.

Midjourney V6's cinematic and realistic approach to image generation is noted to be superior to the more Photoshop-esque vibe of DALL-E 3.

In comparison to SDXL, Midjourney V6 is favored for its specialization in certain use cases, while SDXL is more versatile and open-source.

Midjourney V6 has shown a higher level of creativity and prompt accuracy, especially in generating photorealistic advertisements.

DALL-E 3 sometimes struggles with text accuracy in its generated images, unlike Midjourney V6 which has a better grasp of text.

Midjourney V6's text generation is speculated to be synthetically trained, whereas DALL-E 3 might be naturally trained, affecting their respective outputs.

DALL-E 3's market shift has influenced Midjourney to improve its text generation to remain competitive.

Midjourney V6 excels in photorealism, producing images that closely resemble professional photography.

DALL-E 3 has shown better understanding and generation of copyrighted characters, unlike Midjourney V6 which faced challenges.

Midjourney V6 still faces some issues with generating multiple characters in a scene, sometimes leading to blending or inaccurate representation.

Despite being an Alpha version, Midjourney V6 has made significant strides in photorealism and text generation, positioning it as a strong contender.

Midjourney V6 offers more control, less censorship, and better understanding of pop culture characters compared to DALL-E 3.

The subscription model for Midjourney V6 starts at $10/month, whereas DALL-E 3 can be accessed for free through certain platforms due to Microsoft's backing.

Midjourney V6's ability to upscale images and pass the 'cigarette test' for AI image generators demonstrates its advanced capabilities.

The reviewer has resumed their Midjourney subscription due to the impressive performance of V6, signaling its competitive edge in the AI art space.