Midjourney: Consistent Characters & Kaiber 3.0

Theoretically Media
12 Mar 2024 13:37

TLDR: In this video, the host dives into the new consistent-character feature in Midjourney, a tool for generating images. The host demonstrates how to use character references to maintain a character's appearance across different scenes without complex prompts or third-party tools. He also explores Kaiber 3.0, the latest update from Kaiber, which offers impressive video generation capabilities, including longer video durations and a variety of motion settings. The host provides tips and tricks for getting the most out of character consistency and shows how style references can be combined with character references. He also highlights the limitations, such as the inability to use photographs of real people as character references. The video concludes with a look at Kaiber's unique and surreal video generation, emphasizing the importance of maintaining diversity in visual styles.


  • 🚀 Midjourney has introduced the ability to create consistent characters across different scenes without needing complex prompt formulas or third-party tools.
  • 👔 To start, a character reference is needed; the video uses a man in a blue business suit to demonstrate the process.
  • 🖼️ A character reference is added by appending the '--cref' (character reference) parameter, followed by the image URL, to the prompt.
  • 🔍 Midjourney may need a few rerolls before it produces the exact character you're looking for; this is part of getting the system to understand the character.
  • 📐 Changing the aspect ratio in the prompt helps control the composition of the generated image, for example to get a full-body shot.
  • 🧩 Multiple images of the same character can be used as references to reinforce the character's look in the generated images.
  • 🖌 Character generation is occasionally inconsistent, but inpainting edits can correct these errors.
  • 🎨 Character weight ('--cw') is a scale from 0 to 100 that controls the influence of the character reference, allowing for variations.
  • 🌟 Midjourney's character referencing works best with humanoid characters, but there are ways to use it creatively for other applications.
  • 📹 Kaiber 3.0 has been updated with impressive video generation capabilities, including longer durations and motion control.
  • 🎬 Style references can be combined with character references for a powerful combination, influencing the aesthetic and color grade of the generated images.
  • 🚀 Kaiber's video transform feature has improved significantly, offering a unique and surreal look that stands out from other video generators.
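
As a rough illustration of the prompt structure described in these takeaways, the sketch below assembles a prompt string with an aspect ratio and a character reference. The helper name `build_cref_prompt` and the example URL are hypothetical placeholders, and the portrait ratio is only an assumption about what suits a full-body shot; the resulting text is still pasted into Midjourney by hand.

```python
from typing import Optional


def build_cref_prompt(description: str, cref_url: str,
                      aspect_ratio: Optional[str] = None) -> str:
    """Assemble a prompt string that carries a character reference."""
    parts = [description.strip()]
    if aspect_ratio:
        # --ar sets the aspect ratio, which influences the composition.
        parts.append(f"--ar {aspect_ratio}")
    # --cref is followed by the URL of the character reference image.
    parts.append(f"--cref {cref_url}")
    return " ".join(parts)


# Hypothetical placeholder; in practice this is the image URL copied from Discord.
REFERENCE_URL = "https://example.com/man-in-blue-suit.png"

prompt = build_cref_prompt(
    "cinematic still, a man in a blue business suit standing on a rooftop",
    REFERENCE_URL,
    aspect_ratio="9:16",  # assumed portrait ratio for a full-body framing
)
print(prompt)
```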

Q & A

  • What new feature has Midjourney released that was discussed in the transcript?

    -Midjourney has released the ability to create consistent characters, which allows users to achieve a consistent character across different scenes without the need for complex prompt formulas or third-party plugins.

  • How does the character reference feature work in Midjourney?

    -The character reference feature in Midjourney works by enabling users to upload an image of a character and use it as a reference for generating images. This ensures that the character maintains consistency in appearance across various scenes.

  • What is the purpose of the 'character weight' or '--cw' parameter in Midjourney?

    -The 'character weight' or '--cw' parameter in Midjourney lets users adjust how strongly the character reference influences the generated image. It ranges from 0 to 100. At 100, the default, the generated character stays closest to the reference image, including face, hair, and clothing. Lower values give a more varied or less similar character, and at 0 Midjourney focuses on the face only.

  • How can users maximize the use of character references in Midjourney?

    -Users can maximize the use of character references by uploading multiple images of the same character in different poses or styles, which helps reinforce the character's overall look. Additionally, they can use the 'character weight' command to adjust the character's influence on the generated image.
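
A minimal sketch of that advice, assuming several reference URLs are passed after a single '--cref' separated by spaces and that '--cw' accepts 0 to 100. The function name and the URLs are hypothetical placeholders.

```python
from typing import Sequence


def build_multi_ref_prompt(description: str, cref_urls: Sequence[str],
                           character_weight: int = 100) -> str:
    """Build a prompt that reinforces one character with several reference images."""
    if not cref_urls:
        raise ValueError("at least one character reference URL is required")
    if not 0 <= character_weight <= 100:
        raise ValueError("character weight must be between 0 and 100")
    # All reference URLs follow one --cref parameter, separated by spaces.
    urls = " ".join(cref_urls)
    return f"{description.strip()} --cref {urls} --cw {character_weight}"


# Hypothetical placeholder URLs showing the same character in different poses.
references = [
    "https://example.com/suit-front.png",
    "https://example.com/suit-profile.png",
]
print(build_multi_ref_prompt("a man in a blue business suit in a diner", references, 80))
```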

  • What is the latest update from Kaiber, as mentioned in the transcript?

    -The latest update from Kaiber is the Kaiber 3.0 model, which includes impressive features such as video transformation and motion capabilities. The update allows for video generation of up to 16 seconds, with adjustable motion and evolve sliders for more control over the generated content.

  • What is the significance of the 'evolve' slider in Kaiber 3.0?

    -The 'evolve' slider in Kaiber 3.0 gives the generated content a unique, warpy, hallucinogenic look. By adjusting this slider, users can control the intensity of the surreal effects in the generated videos.

  • How can users ensure character consistency in generated videos using Kaiber 3.0?

    -Users can help preserve character consistency by starting from a still image generated in Midjourney with the character reference feature and a suitable 'character weight'. Avoiding excessive camera movement also helps keep the character's appearance stable.

  • What is the role of style references in conjunction with character references in Midjourney?

    -Style references, when used in conjunction with character references, allow users to generate images that not only maintain character consistency but also adopt the aesthetic and color grade of a chosen style, such as a specific movie or art style.
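
As a hedged sketch of this combination, the snippet below puts a character reference ('--cref') and a style reference ('--sref') in the same prompt. The function name and both URLs are hypothetical placeholders; the style image stands in for a frame with the desired aesthetic and color grade.

```python
def build_styled_character_prompt(description: str, cref_url: str, sref_url: str) -> str:
    """Combine a character reference and a style reference in one prompt."""
    return " ".join([
        description.strip(),
        f"--cref {cref_url}",  # who appears in the image
        f"--sref {sref_url}",  # which aesthetic and color grade to borrow
    ])


# Hypothetical placeholder URLs.
CHARACTER_URL = "https://example.com/man-in-blue-suit.png"
STYLE_URL = "https://example.com/film-still-style.png"

print(build_styled_character_prompt(
    "cinematic still, a man in a blue business suit in a neon-lit alley",
    CHARACTER_URL,
    STYLE_URL,
))
```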

  • Why is it important to continuously reroll, explore, and swap out prompts when using Midjourney?

    -Continuously rerolling, exploring, and swapping out prompts is important because it helps users discover a wider range of possibilities and achieve more varied and interesting results. It also prevents the generation process from becoming too predictable or homogenized.

  • What is the limitation regarding the use of photographs of actual people as character references in Midjourney?

    -Midjourney states that the character reference feature can only be used for fictional, humanoid characters. It should not be used with photographs of actual people, as it is not intended to function as a face-swapping tool.

  • How does the transcript suggest using style references to enhance the generated content?

    -The transcript suggests using style references by combining them with character references to create a powerful combination that allows for the generation of images with a specific aesthetic and color grade, such as those from a particular movie or art style.



🎨 Character Consistency in Midjourney

The video begins with an introduction to the new Midjourney feature that allows for the creation of consistent characters across various scenes. The host shares tips and tricks for getting the most out of character references. They demonstrate the process with an example character, a man in a blue business suit, and guide viewers through finding the character, using rerolls, and applying character references to different prompts. The host also covers using multiple images as references to reinforce a character's look and inpainting out unwanted elements. Additionally, they touch on a trick for generating a character model turnaround sheet to help with character consistency.


🖼️ Styling and Weighting Character References

The second segment covers changing a character's style using references and how the character reference sheet can be used to create a more photorealistic look. The host explains the use of 'character weight' ('--cw') to adjust how strongly the character's traits from the reference image come through, allowing for flexibility in character portrayal. They also discuss combining style references with character references to achieve specific aesthetics, such as those of iconic films. The video highlights the restriction on using photographs of real people as character references and the potential for using the feature beyond humanoid characters.


📹 Exploring Kaiber 3.0 for Video Generation

The final segment focuses on the updates to Kaiber, a video generation tool, which has been upgraded to a 3.0 model. The host showcases the improved video transform feature and the motion capabilities of the new model. They demonstrate by taking a still image of a character and generating a video with various settings, such as the duration, the evolve slider for a hallucinogenic look, and the motion slider for controlling the amount of movement. The video also addresses the challenges of maintaining character consistency and the lack of camera movement controls. The host concludes by appreciating Kaiber's unique and surreal output and expresses excitement about exploring the tool further.



💡Consistent Characters

Character consistency is the ability to maintain the same visual identity of a character across different scenes or contexts within a creative work. In the video, this concept is central: it allows artists to create a character once and then reuse it in various settings without recreating it from scratch. This is achieved through character references in Midjourney, a significant update that simplifies the creation process.


💡Midjourney

Midjourney is an AI tool for generating images. It is discussed in the context of its new character reference feature, which makes it possible to keep a character consistent across different scenes, something that was previously a challenge. The video explains how to use this feature effectively for character creation.

💡Character References

Character references are images or descriptions that define the visual aspects of a character to ensure that they remain consistent throughout various scenes. In the video, the author demonstrates how to use character references in Midjourney to maintain a character's appearance, which is a new feature that streamlines the creative process.

💡Kaiber 3.0

Kaiber 3.0 is the updated model from Kaiber, a tool for video generation and transformation. The video highlights its impressive features, such as the ability to create longer videos with consistent character animations and the option to adjust motion and evolve settings for a unique look. Kaiber 3.0 is presented as a significant advance in video generation capabilities.


💡Discord

Discord is a communication platform used in the video to facilitate the process of image transfer for character creation. The script mentions copying an image URL from a website and pasting it into Discord, which is then used to generate character images through a prompt. It serves as a practical tool for the workflow described in the video.

💡Cinematic Still

A cinematic still refers to a static image that is created to have the aesthetic and emotional impact of a frame from a movie. In the context of the video, the term is used when generating images with a cinematic quality, such as a man in a blue business suit in various scenarios, contributing to the narrative and visual storytelling.

💡Character Weight (--cw)

Character weight, set with the '--cw' parameter, allows users to adjust the influence of the character reference on the final image. It ranges from 0 to 100, with 100 meaning the generated character will closely resemble the original reference. The video explains how adjusting this weight can help in changing the character's style or appearance to fit different scenes.
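
One way to explore this parameter is to generate the same prompt at several weights and compare the results side by side. The sketch below only builds the prompt strings; the function name, the chosen weights, and the URL are illustrative assumptions, and the 0 to 100 range follows Midjourney's description of '--cw'.

```python
from typing import List, Sequence


def character_weight_sweep(description: str, cref_url: str,
                           weights: Sequence[int] = (100, 50, 0)) -> List[str]:
    """Return one prompt per character weight so the outputs can be compared."""
    prompts = []
    for weight in weights:
        if not 0 <= weight <= 100:
            raise ValueError(f"character weight {weight} is outside 0-100")
        prompts.append(f"{description.strip()} --cref {cref_url} --cw {weight}")
    return prompts


# Hypothetical placeholder URL for the character reference image.
for line in character_weight_sweep(
    "a man in a leather jacket riding a motorcycle",
    "https://example.com/man-in-blue-suit.png",
):
    print(line)
```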

💡Style References

Style references are used to apply a specific visual style or aesthetic to the generated content. In the video, the author discusses using style references in conjunction with character references to create images that not only have consistent characters but also match a desired artistic style, such as the look of a scene from a James Cameron action film.


💡Inpainting

Inpainting is a technique used in image editing where parts of an image are removed or altered to create a seamless result as if the removed parts never existed. The video script mentions using inpainting to remove unwanted elements, such as background characters, to maintain the focus on the main character.

💡Model Turnaround Sheet

A model turnaround sheet is a collection of images that show a character from different angles, often used in animation and game development to establish the character's design and appearance. In the video, it is suggested as a method to generate a character reference without needing to create multiple images of that character, which aids in reinforcing the character's look in the creative process.

💡Video Duration

Video duration refers to the length of the video generated by a tool or software. In the context of Kaiber 3.0, the video script discusses the ability to select a video duration of up to 16 seconds, which is significantly longer than many other video generators offer, allowing for more detailed and extended character animations.


Highlights

Midjourney has released the ability to create consistent characters, eliminating the need for complex prompt formulas or third-party plugins.

Character references can now be used to maintain a consistent character across different scenes.

The process involves finding a character, such as 'the man in the blue business suit,' and using it as a starting point.

A character reference can be upscaled subtly without losing its essence.

Discord can be used to copy and paste the image URL for character reference in the Midjourney prompt.

The aspect ratio can be adjusted in the prompt to achieve full body shots of the character.

Multiple images of a character can be used as references to reinforce the character's look.

Inpainting can be used to remove unwanted elements and replace them with the original character.

A model turnaround sheet can serve as a character reference, removing the need for multiple images of the character.

Character reference sheets can blend multiple characters into a cohesive look.

The character reference ('--cref') can carry across style changes, allowing for a variety of character poses and appearances.

The character weight ('--cw') parameter can adjust the emphasis on the character's original traits in a scene.

Style references can be combined with character references to create unique aesthetic combinations.

Background characters should be used carefully as they can dominate the scene.

Actual photographs of people cannot be used as character references for face swapping.

Kaiber 3.0 has been updated with impressive video transform features and longer video durations.

The motion slider in Kaiber 3.0 allows for control over the amount of movement in generated videos.

Kaiber 3.0 maintains character consistency even with surreal and weird video generations.

There are plans to explore more features of Kaiber 3.0, including beat matching from music and text-to-video options.