AI News: Amazing New Tools You Can Use NOW!

Matt Wolfe
14 Jun 2024, 33:20

TLDR: This week in AI brought new tools for creators, including Luma AI's 'Dream Machine' for video generation, the release of 'Stable Diffusion 3', and Apple's AI integration across devices. Despite initial frustrations with wait times and errors, 'Dream Machine' shows promise in image-to-video conversion. 'Stable Diffusion 3' is now publicly available, with downloadable weights for local image generation. Apple's WWDC event highlighted AI enhancements across iOS, iPadOS, and macOS, focusing on privacy and on-device intelligence. The video also covers Adobe's terms of service update, the introduction of 'GenType' by Google Labs, and Suno's song creation feature.

Takeaways

  • Luma AI released 'Dream Machine', an AI video generator that competes with tools like Sora, Veo, and Runway; its output still shows limitations and inconsistencies.
  • High demand caused long wait times for video generation on Luma AI's platform at launch, but the service appears to have scaled up and reduced the delays.
  • Luma AI's 'Dream Machine' shows promise in image-to-video generation, excelling in scenarios like flyover shots and time-lapses, despite some morphing and animation issues.
  • Other AI video tools like Pika have also updated their models, with improvements in image-to-video capabilities, pointing to growing competition in AI video generation.
  • Stable Diffusion 3 by Stability AI has been released, offering improved text-to-image generation, with weights available for download on Hugging Face for local use.
  • Detailed prompts produce noticeably better results from AI image generation models like Stable Diffusion 3.
  • Leonardo AI introduced the 'Phoenix' model, a custom foundational model not based on Stable Diffusion, with enhanced features and prompt adherence.
  • Midjourney launched a 'Model Personalization' feature, which tailors generated images to a user's preferences based on how they rank images.
  • Suno unveiled a song generation feature that extends and builds upon user-uploaded audio clips, offering a creative tool for music creation.
  • Adobe is revising its terms of service regarding AI training on customer work, clarifying that it won't use customer content for AI training without permission.
  • Apple's WWDC event showcased the integration of AI across all its platforms, with new features for iOS, iPadOS, and macOS, emphasizing privacy and on-device intelligence.

Q & A

  • What is the main topic of the AI News video?

    -The main topic of the AI News video is the introduction and discussion of various new AI tools that are available for use, including video generators, image generation models, and music creation features.

  • What is Luma AI's Dream Machine and how does it compare to its competitors?

    -Luma AI's Dream Machine is an AI video generator that competes with tools like Sora, Veo, Kling, Pika, and Runway. While it has some impressive results in certain scenarios, it still has room for improvement, particularly in text-to-video generation where it sometimes fails to accurately represent the prompt.

  • What issues did the user face when first using Luma AI's Dream Machine?

    -The user faced long wait times for video generation, with one request taking 7 hours to start. Additionally, there were instances of video generation failure, resulting in error messages without any output.

  • How has Luma AI addressed the initial scaling issues of Dream Machine?

    -Luma AI appears to have scaled up their service to eliminate the long wait times, making the video generation process faster and more efficient.

  • What is the current state of image-to-video generation in Luma AI's Dream Machine?

    -Image-to-video generation is where Dream Machine shines, with the user finding it to be more effective and producing more realistic and consistent results compared to text-to-video generation.

  • What is the cost associated with using Luma AI's Dream Machine during its research preview phase?

    -During the research preview phase, users get 30 free generations per month. After that, the cost is approximately 25 cents per video generated.

  • What updates did Pika, a competitor to Dream Machine, make to their image-to-video model?

    -Pika made updates to their image-to-video model, although the specific changes were not detailed. The updates seem to have improved the model's performance, making it a viable alternative to Dream Machine.

  • What is the significance of the release of Stable Diffusion 3 by Stability AI?

    -The release of Stable Diffusion 3 is significant as it makes the model's weights available for public use. This allows users to download and run the model locally, enabling more personalized and customizable image generation.

  • What are some of the features of Leonardo's new Phoenix model?

    -Leonardo's Phoenix model features enhanced prompt adherence, coherent text in images, superior image quality, and more creative control. However, some advanced features like image guidance and photoreal versions are still in development.

  • What is the new feature introduced by Midjourney called, and how does it work?

    -Midjourney introduced a feature called Model Personalization. It works by learning the user's preferences based on their past voting on images. Once the user has voted on enough images, they can generate images that are tailored to their preferences.

  • What is the purpose of the new tool GenType by Google Labs, and how does it function?

    -GenType by Google Labs is a tool that generates letters in a specified style. Users describe the style they want, and the tool generates each letter of the alphabet in that style, for example 'colorful electronic circuitry'.

  • What is the new song creation feature in Suno, and how does it extend existing music?

    -Suno's new song creation feature allows users to upload or record audio and then extend it into a full song. The tool can add elements like beats and lyrics, generating music that builds on the initial audio input and expands the user's original creation.

  • What updates did Adobe announce regarding their terms of service and AI training?

    -Adobe clarified that they will not train AI on their customers' work, contrary to earlier terms of service updates that implied otherwise. They are revising the terms to assure customers that their work will not be used for AI training without consent.

  • What are some of the AI features announced by Apple during their WWDC event?

    -Apple announced several AI features across their devices, including AI-powered text summarization, smart replies, and image generation in Notes and Calculator apps. They also introduced Image Playground for creating images and Genmoji for custom emojis, as well as updates to Siri and the Photos app.

  • What is the collaboration between Apple and OpenAI, and how will it function?

    -Apple is partnering with OpenAI to integrate ChatGPT into Siri. When Siri encounters a question it can't answer as effectively, it will ask the user if it can send the question to ChatGPT. This integration is optional and controlled by the user, with data anonymization ensuring privacy.

  • What was Elon Musk's response to Apple's integration of OpenAI, and what clarification did Apple provide?

    -Elon Musk tweeted that if Apple integrates OpenAI at the OS level, it would be a security violation and Apple devices would be banned at his companies. Apple clarified that OpenAI operates separately and user permission is required before any data is shared with OpenAI.

  • What recent changes did OpenAI make to their executive team?

    -OpenAI brought on Sarah Friar as CFO, who was previously the CEO of Nextdoor and CFO of Square, and Kevin Weil as CPO, who was recently the president of product and business at Planet Labs and has worked at Facebook, Instagram, and Twitter.

  • What is the significance of the new Qwen 2 model, and how does it compare to other models like Llama 3?

    -Qwen 2 is a new open-source model that outperforms Llama 3 and other models in various benchmarks. Despite having fewer parameters than the previous Qwen model, it achieves higher scores, indicating its efficiency and effectiveness.

  • What incident occurred with a photographer at an AI image contest, and what was the outcome?

    -A photographer was disqualified from an AI image contest after winning with a real photo. The incident highlights the ongoing debate about the value of human creativity versus AI-generated art in contests.

Outlines

00:00

AI Video Tools and Creative Experiments

The first paragraph discusses the excitement in the AI community due to the release of various creative AI tools, including Luma AI's 'Dream Machine', which is a competitor to other AI video generators like Sora and Veo. The speaker shares their experience with Luma AI's platform, highlighting the initial frustrations with long wait times and generation errors, but also showcasing successful video outputs. The paragraph emphasizes the tool's potential in image-to-video generation and its current limitations in text-to-video scenarios.

05:01

Advancements in Image-to-Video AI and Stable Diffusion 3

This paragraph focuses on the advancements in AI image generation, particularly the transition from text-to-video to image-to-video as the more effective method with Luma. It also covers the release of Stable Diffusion 3 by Stability AI, which has been highly anticipated and is now available for public use. The speaker provides examples of generated images, noting the need for detailed prompts to achieve better results with Stable Diffusion 3, and compares it to other models like Leonardo Phoenix for image quality and prompt adherence.

10:02

Suno's Music Generation Feature and Adobe's AI Integration

The third paragraph introduces Suno's new music generation feature that allows users to create songs by uploading or recording audio and then extending it with AI-generated melodies and lyrics. It also touches on Adobe's overhaul of their terms of service regarding AI training on customer work, which caused concern but was later clarified to not involve using customer content for AI training. Additionally, the paragraph mentions Apple's WWDC event, where they announced the integration of AI across all their devices and services.

15:03

Apple's AI Integration and Siri Updates

This paragraph delves into the details of Apple's AI integration across their devices, as announced at their WWDC event. It covers the new capabilities of Siri, including the ability to type directly to Siri, on-device AI processing, and context-awareness across apps, emails, and messages. The speaker also mentions new features like Image Playground for image generation, personalized emojis, and photo editing enhancements, emphasizing Apple's focus on privacy and secure AI usage.

20:03

OpenAI's Partnership with Apple and Industry Reactions

The fifth paragraph discusses the partnership between Apple and OpenAI, where Apple will use ChatGPT to enhance Siri's capabilities. It addresses the misconceptions about the partnership, clarifying that user data will not be shared without permission. The paragraph also includes reactions from industry figures like Elon Musk, who expressed concerns about the integration, and updates on OpenAI's legal disputes and executive changes.

25:05

Qwen 2 Model and AI Image Contest Controversy

The final paragraph highlights the release of the Qwen 2 model, which has outperformed other AI models in benchmark testing. It also discusses an incident where a photographer was disqualified from an AI image contest for entering a real photo, a stunt intended to highlight the value of human creativity over AI-generated art. The speaker wraps up by encouraging viewers to stay updated with AI news and tools through their resources.

Keywords

Luma AI

Luma AI is the company behind 'Dream Machine', an AI video generator introduced in the video. It competes with other AI video tools like Sora, Veo, and Runway. The tool can generate videos, though it has faced issues with long wait times and imperfect outputs.

Stable Diffusion 3

Stable Diffusion 3 is Stability AI's updated image generation model, which is now publicly available. It generates images from text prompts, and its weights can be downloaded for free from Hugging Face. The model is noted for its improved ability to include text in images.

Image to Video

Image to Video is a feature in Luma AI that allows users to generate videos from static images. This method is highlighted as a strength of Luma AI, producing more realistic and consistent results compared to text-to-video generation.

Midjourney

Midjourney is an AI image generation tool that recently introduced a feature called Model Personalization. This feature tailors image generation based on user preferences, which are learned through user rankings of different images.

Leonardo Phoenix

Leonardo Phoenix is a new AI model released by Leonardo AI, designed from the ground up to enhance prompt adherence and image quality. It is a foundational model not based on Stable Diffusion, offering superior creative control and the ability to generate coherent text in images.

Apple AI Integration

Apple announced the integration of AI across its devices during the WWDC event. Features include AI-powered proofreading, summarization, image generation, and personalized notifications. This integration aims to enhance user experience by embedding AI capabilities directly into Apple's ecosystem.

GenType

GenType is a tool from Google Labs that generates letters in various styles based on user descriptions. It is highlighted for its ability to create visually appealing text art, which can be used for creative projects.

Suno

Suno is an AI music creation tool that allows users to generate songs by uploading audio or recording directly. The tool extends and enhances user-created music with features like random lyrics and added beats, making it a favorite for AI-driven music production.

Adobe AI Terms of Service

Adobe updated its terms of service to clarify that it will not use customer-created content to train its AI models. This response came after concerns were raised about potential privacy issues, reassuring users that their work would not be exploited for AI training purposes.

OpenAI and Apple Partnership

Apple and OpenAI have partnered to integrate ChatGPT into Apple's ecosystem. This integration allows Siri to use ChatGPT for answering user queries, enhancing Siri's capabilities with advanced AI while ensuring user data privacy through explicit permission controls.

Highlights

Introduction of Luma AI's Dream Machine, a competitor to other AI video tools.

Dream Machine's initial high demand causing long wait times for video generation.

Luma AI's Dream Machine's performance in text-to-video generation and its limitations.

Improved results with image-to-video generation in Dream Machine.

Luma Labs offering 30 free generations per month during research preview.

Pika's update to its image-to-video model and comparison with Dream Machine.

Release of Stable Diffusion 3 by Stability AI, making the weights available for download.

Examples of images generated by Stable Diffusion 3 and its prompt requirements.

Introduction of Leonardo Phoenix, a custom model by Leonardo with enhanced features.

Demonstration of text addition in images using Leonardo Phoenix.

Comparison between Leonardo Phoenix and other AI models like SDXL.

Midjourney's new feature 'Model Personalization' based on user preferences.

Google Labs' 'GenType' tool for creating letters in custom styles.

Suno's new song generation feature from user-uploaded audio clips.

Adobe's clarification on terms of service regarding AI training on customer work.

Apple's WWDC event focusing on integrating AI across all their devices and services.

Details of Apple's new features like Image Playground, Genmoji, and Siri updates.

Elon Musk's tweets about Apple's integration with OpenAI and privacy concerns.

OpenAI's addition of new executives and updates on their partnership with Microsoft.

Introduction of Qwen 2, a new open-source AI model outperforming others in benchmarks.

Photographer disqualified from AI image contest for using a real photo to make a point about human creativity.