My Top 4 Favorite Ai Models - Civitai / A1111 / Stable Diffusion

Olivio Sarikas
19 Aug 202318:01

TLDRIn this video, the presenter shares their top four favorite AI models for creating digital art: Ref Animated, Realistic Vision, Magic Mix, and Photon. They discuss the strengths of each model, such as Ref Animated's ability to produce dynamic and detailed images with excellent color choices, and Realistic Vision's modern photographic vibe and realistic scenes. Magic Mix is praised for its authentic and expressive style, while Photon is highlighted for its exceptional facial detail and skin texture in Laura face training. The presenter also provides tips on using these models effectively, including suggestions for prompts, negative prompts, and settings for optimal image quality.

Takeaways

  • ๐ŸŽจ **Ref Animated Model**: The speaker's all-time favorite for its ease of prompting and ability to produce high-quality digital art images with attention to detail and idealized features.
  • ๐Ÿ–ผ๏ธ **Artistic Decisions**: The model's rendering includes artistic choices like color contrasts and highlights that enhance the subject's visibility and add to the overall aesthetic.
  • ๐ŸŒ„ **Landscapes**: While the speaker doesn't use Ref Animated for landscapes often, they acknowledge its potential for creating realistic images, despite their personal limited experience.
  • ๐ŸŒˆ **Color Choices**: The model is praised for its excellent color combinations and the way it harmonizes colors to create visually appealing images.
  • ๐Ÿ‘— **Fantasy and Clothing**: The model's versatility is highlighted in its ability to generate fantasy-themed images with various types of clothing and decorations.
  • ๐Ÿ“ธ **Realistic Vision**: Ideal for a modern photographic look, this model is noted for its realism, expressiveness, and attention to materials and ethnicities.
  • ๐ŸŒฟ **Foliage and Nature**: The model adeptly handles foliage and natural backgrounds, contributing to a realistic scene that could easily be mistaken for a professional photograph.
  • ๐Ÿค– **Magic Mix**: A recently discovered favorite, this model is recognized for its authentic and somewhat eerie style, with a focus on realistic skin tones and expressiveness.
  • ๐ŸŒ **Backgrounds and Bokeh**: Magic Mix is also commended for its ability to create soft, realistic backgrounds with a nice progression of bokeh effects.
  • ๐Ÿ“ท **Photon Model**: Used primarily for Laura face training, Photon is highly detailed and capable of recreating faces with remarkable accuracy, making it suitable for creating convincing 'fake' photos.
  • ๐ŸŒŸ **Additional Models**: The speaker suggests trying out the Dream Shaper XL for beautiful results and the Ray Liberate model for its playful take on colors and poses, inspired by digital art.

Q & A

  • What is the first AI model discussed in the transcript and why is it the speaker's favorite?

    -The first AI model discussed is 'ref animated'. It is the speaker's favorite due to its ease of prompting and the ability to generate amazing digital art images with great attention to detail and dynamic, expressive poses.

  • How does the 'ref animated' model handle details and artistic decisions?

    -The 'ref animated' model is very good at handling details, such as exaggerated body shapes and idealized images. It also makes artistic decisions like color choices and lighting that enhance the overall composition, for example, by adding highlights around the hair to make it stand out against a dark background.

  • What is the use of 'clip skip' in the context of the AI models discussed?

    -Clip skip is a feature used with the AI models to improve the quality of the generated images. It allows for adjustments within the model to better refine the output, and it is particularly important for the 'automatic 1111' model mentioned in the transcript.

  • What is the second favorite model of the speaker and what makes it stand out?

    -The second favorite model of the speaker is 'Realistic Vision'. It stands out for its modern photographic vibe, professional photo wipe, and the ability to create very realistic looking scenes with attention to materials, expressions, and details.

  • How does the 'Realistic Vision' model handle different ethnicities and backgrounds?

    -The 'Realistic Vision' model can handle different ethnicities well, as it can render a variety of skin tones, facial features, and hair types in a realistic manner. It also does a good job with different kinds of backgrounds, including foliage and realistic-looking tattoos.

  • What is the 'Magic Mix' model and what is its strength?

    -The 'Magic Mix' model is a newer discovery of the speaker and is praised for its very realistic style with an authentic vibe. It is particularly good with fabric, hair, and skin color, and how light is reflected from the skin, making it ideal for creating images that feel very real.

  • What is the 'Photon' model used for and what are its capabilities?

    -The 'Photon' model is used for Laura face training. It is capable of recreating faces in great detail, including the shape of lips, nose, eyes, and even tiny details like skin texture and little hairs on the face. It can be used to put a person into different costumes and create realistic fake photos.

  • What are the two additional models suggested for experimentation with SDXL?

    -The two additional models suggested for experimentation with SDXL are the 'Dream Shaper XL' model, known for its beautiful results, and the 'Ray Liberate' model, which is more playful with colors, posing, and style, drawing inspiration from digital art.

  • How does the speaker suggest improving the quality of images generated by the models?

    -The speaker suggests using high-res fix with a 4X upscaler and a denoise strength of 0.2, or alternatively, sending the image to an image-to-image upscaler with a denoise between 0.2 and 0.35 and upscaling it to a size of two times the original.

  • What is the importance of negative prompts and embeddings in the context of AI models?

    -Negative prompts and embeddings are crucial for refining the output of the AI models. They help to avoid unwanted elements in the generated images and improve the overall quality by guiding the model to focus on desired characteristics.

  • How does the speaker recommend using the information provided on the Civitai model page?

    -The speaker recommends scrolling down on the Civitai model page to review the model information, which includes what the model is good for, the sizes it works with, the recommended VAEs, and negative embeddings. They also suggest watching linked videos, clicking on images for detailed settings, and using the provided information to understand the prompt, negative prompt, sampler, and other settings.

  • What is the significance of the 'One Girl' prompt in the context of the Magic Mix model?

    -The 'One Girl' prompt is traditionally associated with anime images and is used with the Magic Mix model to create a single character image. Interestingly, it suggests that the model may prefer this prompt due to its ability to create expressive and authentic looking characters, regardless of age.

Outlines

00:00

๐ŸŽจ Introduction to Favorite Models for Digital Art Creation

The speaker introduces their preferred models for creating stable and evocative digital art. They highlight the ease of prompting and the high-quality results from the 'ref animated' model, which is praised for its attention to detail and ability to generate dynamic, expressive images with excellent color choices and artistic decisions. The model's effectiveness in rendering details and idealizing images is demonstrated through various examples, including the contrast in color choices and the strategic use of light to enhance the subject's features. The speaker also briefly touches on the model's application in creating landscapes and realistic images, emphasizing the artistic decisions that contribute to the model's appeal.

05:03

๐ŸŒŸ Discussing Realistic Vision and High-Resolution Image Enhancement

The speaker expresses their admiration for the 'Realistic Vision' model, noting its modern photographic vibe and professional photo finish. They discuss the model's proficiency in handling various elements such as different ethnicities, clothing, and backgrounds, with a particular emphasis on the realistic portrayal of materials and expressions. The model's ability to create close-ups, macros, and foliage is also mentioned. The speaker recommends using high-res fix with a 4X upscaler and denoise strength for image quality improvement. They also provide advice on navigating the model page for essential information and settings to optimize the model's performance.

10:05

๐Ÿง™โ€โ™‚๏ธ Exploring Magic Mix for Authentic and Realistic Imagery

The speaker introduces 'Magic Mix' as a recently discovered favorite for its authentic and somewhat eerie style. They appreciate the model's effectiveness in creating images with a realistic feel, particularly noting the quality of skin rendering and the expressive nature of the images produced. The model's versatility in handling various scenarios, including landscapes and eerie shots, is highlighted. The speaker also discusses the importance of negative prompts and the impact they have on image quality, urging the audience to explore the provided information and settings for the best results.

15:05

๐Ÿ“ธ Utilizing Photon for Detailed Laura Face Training

The speaker discusses the use of 'Photon,' a photo-based model, for detailed Laura face training. They demonstrate how the model accurately recreates facial features and skin texture, even capturing minute details like tiny hairs on the face. The versatility of the model in placing the trained face onto different costumes and creating realistic fake photos is emphasized. The speaker also provides guidance on using the model for Laura training and integrating the Laura into various scenes, suggesting that the original model used for training often works best. They briefly mention other models like 'Dream Shaper XL' and 'Ray Liberate' as alternatives for different styles and invite viewers to share their favorite models and uses in the comments.

Mindmap

Keywords

Stable Diffusion

Stable Diffusion refers to a type of AI model used for generating images from textual descriptions. It is characterized by its stability in producing high-quality images. In the video, the speaker discusses their favorite AI models for Stable Diffusion and how they utilize them for creating digital art, emphasizing the model's ability to handle details and idealize images.

Ref Animated

Ref Animated is mentioned as the speaker's all-time favorite model for playing with in the context of Stable Diffusion. It is appreciated for its ease of prompting and its capability to produce digital art images with amazing details and dynamic poses. The video highlights the model's effectiveness in rendering color choices and artistic decisions, such as the strategic use of light and contrast to enhance the subject's silhouette and hair details.

Clip Skip

Clip Skip is a feature within the AI model that allows for the adjustment of how closely the generated image adheres to the textual prompt. The speaker suggests using Clip Skip in conjunction with the models discussed, indicating its importance in fine-tuning the image generation process. It is used to achieve a balance between creativity and adherence to the input prompt.

Realistic Vision

Realistic Vision is another AI model highlighted in the video, known for its modern photographic vibe and the ability to create images with professional photo-like quality. It is praised for its realism, especially in rendering materials, expressions, and the interplay of light and color. The model is also noted for its versatility in handling different ethnicities, clothing, and backgrounds.

High-Res Fix

High-Res Fix is a technique suggested for improving the quality of the generated images. The speaker recommends using it with a specific model and denoise strength to achieve ultra-sharp images. It is part of the post-processing steps that can significantly enhance the clarity and detail of the final output, contributing to the overall professional look of the images.

Magic Mix

Magic Mix is a model that the speaker discovered recently and found to be very good for creating realistic images with an authentic vibe. It is appreciated for its handling of fabric, hair, and skin color, as well as its ability to produce soft and progressive bokeh effects in the background. The model is also noted for its effectiveness in generating landscapes and eerie, realistic shots.

Photon

Photon is not an AI model but a photo used by the speaker for Laura face training within AI models. It is highlighted for its detailed capture of facial features, skin texture, and reflectiveness, which allows for high-quality recreation of a person's face in various costumes and settings. The video emphasizes the model's utility in creating convincing fake photos by integrating Laura faces into different scenes.

Lora Training

Lora Training refers to the process of training AI models to recognize and recreate specific details, such as a person's face, with high accuracy. The speaker uses the Photon photo for this purpose and then integrates the trained 'Loras' into various scenes, which is a technique to achieve highly personalized and detailed results in image generation.

Dream Shaper XL

Dream Shaper XL is an AI model mentioned as an alternative for those interested in exploring SDXL (Stable Diffusion eXtreme Large) models. It is noted for producing beautiful results, suggesting its capability for generating high-quality images that may appeal to users looking for diverse options in image creation.

Ray Liberate

Ray Liberate is an AI model that the speaker suggests as an alternative for realistic models. It is developed by the same creators as the Day Liberate model and is characterized by its playful approach to colors, posing, and style, drawing inspiration from digital art. This model is recommended for users who prefer a more artistic and less conventional approach to image generation.

Negative Embeddings

Negative Embeddings are used in AI image generation to guide the model away from producing certain undesired features or elements in the generated images. The speaker discusses downloading and using specific negative embeddings to improve the quality of images, emphasizing their significant impact on steering the creative process towards the desired outcome.

Highlights

The speaker's all-time favorite model for creating digital art is 'ref animated' due to its ease of prompting and the ability to generate detailed and dynamic images.

Ref animated is particularly good at rendering exaggerated body shapes and poses with dramatic and expressive results.

The model's color choices and artistic decisions contribute to the quality of the images, with attention to light highlights and color contrasts.

Ref animated can also be used for creating realistic images, showcasing beautiful lighting and artistic decisions that enhance emotion and depth.

The model is capable of generating fantasy-themed images with various types of clothing and backgrounds, including idealized head shapes.

Clip skip and VAE choice are important settings to adjust within Automatic 1111 for optimal results with ref animated.

High-Res Fix with a 4X Ultra Sharp model and a denoise strength of 0.2 is suggested for image quality improvement.

Realistic Vision is praised for its modern photographic vibe and professional photo wipe, suitable for images with less clothing on the model.

The model can handle different ethnicities, backgrounds, and materials with high realism and attention to detail.

Realistic Vision is also good for creating close-ups, macros, and foliage with a realistic and aesthetic appeal.

The model is capable of creating authentic-looking amateur vibe images with cooler lighting, resembling daylight captures.

Magic Mix is a newly discovered favorite for its realistic style and authenticity, with good handling of fabric, hair, and skin color.

The model can create beautiful bokeh effects and is suitable for landscape and scenery shots with a drone view.

Photon is used for Laura face training, providing detailed and accurate recreations of faces with skin texture and reflectiveness.

The speaker suggests trying out the Dream Shaper XL and Ray Liberate models for SDXL, offering beautiful results and a playful style inspired by digital art.

The speaker invites viewers to share their favorite models and how they use them in the comments section.