Want Consistent Characters in Midjourney? Do THIS Instead…

4 Oct 202316:41

TLDRIn this video, Globetry shares a method for consistently generating the same character in any scene using Midjourney, an AI art tool. The process begins with a simple description of the character and a scene prompt. Using the 'very region' feature, the creator can erase unwanted parts of the generated image and replace them with their character, Elara, ensuring consistency across different scenes and styles. The video demonstrates how to refine the character's features, such as eyes and hair, through an iterative process of generating and selecting the best images. The method is applicable to any character and style, making it a powerful tool for artists looking to create a unique and recognizable character in various contexts.


  • 🎨 **Consistency in Character Generation**: The video discusses a method to consistently generate the same character in different scenes, regardless of the pose, lighting, or composition.
  • 🖌️ **No External Software Needed**: The entire process takes place within Midjourney, without the need for Photoshop or additional training of models.
  • 📸 **Photorealistic Style**: The character, Elara, can be inserted into any scene in a photorealistic style while maintaining her consistent appearance.
  • 📝 **Simple Prompts**: A simple description is used to generate the character, avoiding the need for complex prompts detailing every physical trait.
  • 🔍 **Iterative Process**: The character becomes more consistent with each iteration, as the image prompt is refined and updated with better representations of the character.
  • 👗 **Consistent Costume Design**: The workflow begins with generating a character design sheet to establish a consistent costume design and basic features.
  • 🧞‍♂️ **Character Design Sheet**: A series of poses in a cartoon watercolor style are used to create a versatile character design sheet for generating various character images.
  • 🌟 **Feature-By-Feature Refinement**: The process involves refining the character feature by feature to achieve consistency, starting broad and then focusing on specific details like eyes or mouth.
  • 🔗 **Using Image URLs**: Image URLs from preferred character poses are used in the Midjourney bot to guide the generation process towards consistency.
  • 🔧 **Very Region Feature**: The 'Very Region' feature is used to erase unwanted parts of the generated image and replace them with the desired character features.
  • 🔄 **Refined Iterations**: Through multiple iterations and adjustments using 'Very Region', the character's features are gradually aligned to achieve a consistent look across different images.

Q & A

  • What is the main challenge the video aims to address?

    -The main challenge is to generate a consistent character in any scene using mid-journey AI, without needing additional tools like Photoshop or training a Dream Booth model.

  • Who is the character Elara in the video?

    -Elara is the example character used in the video to demonstrate how to consistently generate the same character in different scenes and poses.

  • What is the 'very region' feature mentioned in the video?

    -The 'very region' feature is a tool that allows users to erase parts of an image they want to change, which is used in the process of generating a consistent character.

  • How does the video suggest to create a consistent character?

    -The video suggests using a simple description prompt, generating an image, and then using the 'very region' feature to replace certain features with those of the desired character.

  • What is the purpose of using a 'slash prefer option' in the video?

    -The 'slash prefer option' is used to store a set of reference images and text prompts that define the character, which helps in generating consistent character images.

  • How does the video approach the generation of a character in a new style?

    -The video suggests starting with a broad stroke, like generating a character design sheet in a cartoon watercolor style, and then iteratively refining the character's features to achieve consistency in a new, desired style.

  • What is the iterative process mentioned for refining a character?

    -The iterative process involves generating images of the character, selecting the best features from those images, and then using those as references to improve the character's consistency in subsequent generations.

  • Why is it important to have a fleshed-out image prompt to start generating a character?

    -A fleshed-out image prompt provides a clear guide for the AI to understand and replicate the character's appearance, ensuring consistency across different scenes and styles.

  • How does the video suggest to handle the creation of a character when you don't have pre-existing images?

    -The video suggests starting with a broad description and generating a series of poses in a cartoon style to create a character design sheet, which then serves as the initial reference for further generations.

  • What role does the community play in the process described in the video?

    -The community, such as the one on Discord mentioned in the video, can provide support, share insights, and help solve problems that may arise during the character generation process.

  • How can one ensure that their character's features, like eyes or nose, are consistent across different images?

    -By using the 'very region' feature to focus on specific features, generating images of those features separately, and then updating the 'slash prefer option' with these improved images to guide future generations.



🎨 Generating Consistent Characters in Mid-Journey

The video introduces a method for generating a consistent character named Elara in any scene using Mid-Journey, an AI tool. The process involves creating a simple description prompt and then using a 'very region' feature to modify the generated character to match the desired look. The speaker, Globetry, demonstrates how to replace features of a generated character with those of Elara, using reference images and text prompts to guide the AI. The video emphasizes the ease of the process and invites viewers to like the video for support.


🖌️ Refining Character Design with Mid-Journey's 'Very Region'

The speaker discusses the iterative process of refining a character's design using Mid-Journey. They share their approach to creating a consistent character by starting with a broad design sheet in a cartoon watercolor style. The character, a purple alien named Purple, is used as an example. The process involves generating a series of poses, selecting the best ones, and using them to create a 'slash prefer option' that guides the AI in generating consistent images. The video also covers how to upscale a chosen design and how to use the 'very region' feature to replace and refine specific features, such as eyes, to achieve a more consistent look across different images.


🚀 Enhancing Character Consistency with Iterative Generation

The video continues with the process of enhancing character consistency by iterating over different features. It demonstrates how to take a photorealistic version of the character, Purple, and further refine the eyes using 'very region' and specific prompts. The speaker shows how to replace the eyes in a generated image with a screenshot of preferred eyes, thus guiding the AI to improve the feature. The updated images are then used to update the 'slash prefer option', allowing for more consistent generation across various scenes and styles. The video emphasizes the importance of patience and iteration in developing a well-defined character.


🌟 Finalizing Character Design for Versatile Scene Integration

The final paragraph outlines the process of finalizing a character design to the point where it can be seamlessly integrated into any scene. The speaker encourages viewers to iterate over new traits and features, using 'very region' and trial and error to refine the character. The goal is to create a set of high-quality images that showcase the character from various angles and poses. The video concludes with an invitation to share created characters in the speaker's Discord community and offers additional resources for learning more about using Mid-Journey and improving character generation.




Mid-journey refers to the process of creating and consistently generating a character within an AI-driven environment, such as an AI art generator. In the context of the video, it is the stage where the character is being developed and refined through iterative processes to maintain consistency across different scenes and styles.

💡Character Generation

Character Generation is the process of creating a character's appearance and attributes within a digital or AI medium. The video focuses on generating a character named Elara and later a purple alien, ensuring that they are consistently portrayed across various scenes and styles.

💡Photorealistic Style

Photorealistic Style denotes an artistic approach that aims to replicate the visual details and depth of real-life photographs. The video script discusses generating characters in a photorealistic style, which means the characters should look as if they were actual photographs.


Expression in the context of the video refers to the character's facial features and emotions. The challenge is to maintain consistency in the character's expression even when the character's pose or the scene changes.


Region, as used in the video, refers to a specific part of the generated image that the user can select and manipulate. The 'very region' feature allows for precise editing of the character's attributes within the generated scene.


A Prompt is a text description used to guide the AI in generating a specific image or scene. In the video, prompts are crucial for instructing the AI to generate characters and scenes with desired attributes and styles.


Consistency in the video script refers to the ability to generate a character that looks the same across different images and scenes. It is a key goal of the character generation process described in the video.


Elara is a character used as an example in the video to demonstrate the process of character generation. The video shows how to ensure Elara's consistent appearance in various scenes, which is a central theme of the tutorial.


Discord, in this context, refers to a communication platform where the video creator and the community interact. It is mentioned as a resource for viewers to share their generated characters and get help with any issues they encounter in the character generation process.

💡AI Experiences

AI Experiences refer to the interactive and creative processes involving artificial intelligence, such as generating images or characters. The video is part of a series that helps viewers navigate and enhance their experiences with AI tools.

💡迭代过程 (Iterative Process)

迭代过程 is a Chinese term used in the video to describe the repetitive and incremental approach to refining the character's design. It involves making small changes and improvements over time until the desired level of consistency and detail is achieved.


The video discusses a method to generate consistent characters in any scene using mid-journey AI without additional software like Photoshop.

The character Elara can be placed in any photorealistic scene, maintaining her appearance regardless of pose, lighting, or composition.

The process involves a simple description prompt, avoiding the need for complex prompts detailing every physical trait of the character.

The 'very region' feature is used to erase parts of an image that need to be changed, allowing for adjustments to better match the desired character.

A combination of image and text prompts helps ensure the character's consistency, with the text prompt ensuring no conflict with the image prompt.

The character's features, such as pointed chin, bluish eyes, and high cheekbones, are emphasized to maintain consistency across different generations.

An iterative process is suggested for refining the character, updating the image prompt with better representations each time.

A character design sheet with multiple poses is used as a starting point for generating a consistent character design.

The 'slash prefer option' is a tool for setting up a consistent character, which can be iteratively improved with each generation.

The video provides a step-by-step guide for generating a character in a cartoon watercolor style, which can then be adapted to a photorealistic style.

Features such as eyes, mouth, and nose can be individually refined for consistency by using the 'very region' feature and updating the image prompt.

The character 'Purple' is used as an example to demonstrate the process of generating and refining a character's appearance in different styles and scenes.

The importance of iteration and fine-tuning is emphasized to achieve a high-quality, consistent character across various images and scenes.

The video offers additional resources, including Discord, for further assistance with character generation and troubleshooting mid-journey issues.

The presenter shares other videos for learning how to use 'very region' and crafting impactful, photorealistic characters.

The method is applicable to any character or style, allowing for a wide range of creative possibilities within the mid-journey AI platform.