How To Make Photorealistic Images In Fooocus
TLDRIn this tutorial, the presenter shares a straightforward method to create photorealistic images using the software Focus. The process involves ensuring the Advanced tab is selected and choosing between Focus V2, Focus enhance sharp, and photograph under Styles. The presenter uses the Juggernaut XL as the base model and Realistic Vision version 6 as the refiner, which is a stable diffusion 1.5 model. The video demonstrates the difference between using the base model with an SDXL refiner and using Realistic Vision as the refiner, highlighting a more photorealistic outcome with the latter. The presenter also discusses the benefits of this method, such as avoiding common issues like deformed limbs and double heads at higher resolutions. Additionally, the presenter provides information on where to obtain the necessary models, such as on Civit AI, and suggests experimenting with different base and refiner models to achieve desired results. The video concludes with a demonstration of the method's effectiveness on various subjects, including people and cars, and a brief mention of using face swap technology.
Takeaways
- πΌοΈ To achieve photorealistic images in Focus, ensure the Advanced tab is checked for additional options.
- π¨ Under Styles, select Focus V2, Focus enhance sharp, or photograph for different looks. You can also experiment with cinematic.
- π€ For the base model, Juggernaut XL is used, which is crucial for the photorealistic outcome.
- π The refiner should be set to Realistic Vision version 6, a stable diffusion 1.5 model, for better photorealism.
- π Using SD 1.5 models as a refiner allows for higher resolution images without common issues like deformed limbs or double heads.
- π¦ Download and use additional models like Realistic Vision version 6 or Epic Photogasm from Civit AI to enhance your results.
- π Turbo models may not work well on Extreme Speed, so it's recommended to keep the speed setting as is until further optimization.
- βοΈ Experiment with different base models and refiner models to find the best combination for your desired outcome.
- π« Keep negative prompts minimal, focusing on avoiding ugly, deformed, text, and watermarks.
- π This method works well for half body and full body shots, producing coherent and realistic results.
- π§ββοΈ It can also be used for face swapping, getting closer to the user's likeness with fine-tuning of settings.
- π There are multiple combinations possible with different models, offering flexibility for various photorealistic applications.
Q & A
What is the easiest way to get photorealistic images in Focus?
-The easiest way to get photorealistic images in Focus is by ensuring the Advanced tab is checked, selecting a suitable style under the Styles option, choosing the right model, and using a refiner such as Realistic Vision version 6 with a stable diffusion 1.5 model.
How does the method described in the transcript differ from using a stable diffusion 1.5 model as the base model?
-Using the method described allows for higher resolution images while still benefiting from fine-tuned SD 1.5 models. Placing the stable diffusion 1.5 model directly as the base model doesn't work because the pipeline is an SDXL pipeline, which requires a different approach.
What are some of the styles available under the Styles option in Focus?
-Some of the styles available include Focus V2, Focus enhance sharp, and photograph. Occasionally, cinematic style is also used, offering a slightly different look.
What is the role of the refiner in the process of generating photorealistic images?
-The refiner, such as Realistic Vision version 6, is crucial as it enhances the image to achieve a more photorealistic look. It works in conjunction with the base model to produce the desired outcome.
How does the use of SDXL offset Laura with a weight of 0.5 contribute to the image generation process?
-The SDXL offset Laura with a weight of 0.5 is used to fine-tune the image generation process, providing a balance that works best for achieving photorealistic results according to the transcript.
What are some of the advantages of using this method over generating a stable diffusion 1.5 model on a platform like Auto1111?
-This method allows for higher resolution images without the risk of deformations such as double heads or limbs, which can occur when generating stable diffusion 1.5 models at higher resolutions on some platforms.
Where can one find and download the models needed for this process?
-The models, such as Realistic Vision version 6, can be found and downloaded from Civit AI.
What are some other photorealistic models mentioned in the transcript?
-Other photorealistic models mentioned include Epic Photo Gasm and Epic Realism, both developed by the same developer.
How does the Dream Shaper XL Turbo model compare to the Juggernaut XL in terms of performance?
-The Dream Shaper XL Turbo model is not recommended to be used on Extreme Speed, and as of the knowledge in the transcript, it is not yet fully optimized, but it is expected to improve soon.
What are the negative prompts used in the process and why are they minimal?
-The negative prompts used are 'ugly', 'deformed', 'text', and 'Watermark'. They are minimal because too many negative prompts can interfere with the generation of a high-quality, photorealistic image.
How effective is this method for generating images of people and objects?
-The method is highly effective for generating images of people, as evidenced by the examples provided. It also works well for objects like cars, achieving a general photorealistic quality.
How close did the method get in replicating the presenter's likeness using face swap?
-While not an exact match, the method provided a closer representation of the presenter's likeness, particularly resembling a younger version of the presenter, although not as muscular as depicted.
Outlines
πΌοΈ Photorealistic Image Generation with Focus
The video script introduces a method for creating photorealistic images using the Focus platform. The presenter begins by ensuring the Advanced tab is checked for options and suggests selecting 'Focus V2', 'Focus enhance sharp', or 'photograph' under Styles. The use of the 'Juggernaut XL' as a base model is highlighted, with 'realistic Vision version 6' as the refiner, which is a stable diffusion 1.5 model. The script explains that using this refiner allows for higher resolution images without the common issues associated with direct use of stable diffusion 1.5 models. The presenter also demonstrates the difference between using the base model with the refiner set to 8 versus using realistic Vision as the refiner for a more photorealistic look. The benefits of this method are showcased through examples, and the presenter provides guidance on where to obtain the necessary models from Civit AI. Additionally, the script touches on the use of negative prompts and the compatibility of the method with half-body and full-body shots.
π Photorealism in Various Subjects Including Cars
The second paragraph discusses the versatility of the photorealistic method introduced, highlighting its effectiveness not only for human subjects but also for objects such as cars. The presenter shares their personal experience with face swapping using Focus and how the method has improved the results, making them closer to their actual appearance, albeit with some differences. The paragraph also mentions the potential for further refining the settings to achieve an even closer likeness. The presenter encourages viewers to watch a previous video on face swapping for additional context and concludes with a friendly sign-off, promising to see the audience in the next video.
Mindmap
Keywords
Photorealistic Images
Focus
Advanced Tab
Styles
Juggernaut XL
Realistic Vision
Stable Diffusion 1.5
SDXL Offset Laura
Resolution
Civit AI
Negative Prompts
Highlights
The easiest way to get photorealistic images every time in Focus is by using a specific method involving the Advanced tab and certain styles.
Under Styles, choose between Focus V2, Focus enhance sharp, and photograph for different looks.
Occasionally, switching between cinematic and photograph can yield slightly different results.
Using Juggernaut XL as the base model is key to achieving the desired photorealistic effect.
Realistic Vision version 6 is a stable diffusion 1.5 model used as the refiner for enhanced photorealism.
Focus is primarily an SDXL platform, but this method allows for the results of a stable diffusion 1.5 model.
SDXL offset Laura with a weight of 0.5 is recommended for optimal results, though other luras can be experimented with.
Unchecking random ensures the same seed is used for consistent results.
Comparing the results of the current setting with SDXL base and refiner at 8 reveals different styles.
The base SDXL and refiner tend to produce a hyper-realistic look, while Realistic Vision as the refiner offers a more photorealistic appearance.
Generating a stable diffusion 1.5 model on automatic UI may result in deformations, but this method avoids such issues.
Higher resolution images are possible with this method while still benefiting from fine-tuned SD 1.5 models.
Civit AI is a source for obtaining models like Realistic Vision version 6 and other photorealistic models.
Experimenting with different base models and refiner models can yield a variety of photorealistic results.
The method works well for half body and full body shots, producing great results for eyes and teeth.
Negative prompts should be minimal, focusing on avoiding ugly, deformed, text, and watermark.
The method can handle a wide range of subjects, including people and cars, with high photorealism.
Using face swap with this method can help achieve a closer likeness to the original subject.
The Dream Shaper XL Turbo is used as the base model for another example, with the refiner set to Epic Realism at 4.
Extreme Speed may not work well with the turbo models, so it's recommended to leave it on speed.