3D Optimism | Midjourney Office Hours Recap April 3rd 2024 | Midjourney News

Future Tech Pilot
3 Apr 2024 | 03:42

TLDR: The Midjourney Office Hours recap from April 3rd, 2024 had no major announcements, since progress slowed while team members were on vacation. The team is focusing on website enhancements, including new social features that will first be tested with guides and moderators. The eventual goal is to let all users create both public and private spaces. Personalization is a work in progress and has been slowed by the team working across different time zones. An algorithm to improve hands, bodies, and text accuracy is being developed, which should reduce how often bad images occur. A speed update is also anticipated, potentially making processes 25-50% faster and cheaper. A 'Caption Party' is planned to help the version 7 model learn the connection between images and language, with potential rewards for participants in the future. The team is considering a new class of trusted users for rating and captioning. The team is not yet satisfied with its video features, but there is optimism about a strong version 7 model and about 3D, where the focus is on high-quality rather than exportable models. The feedback leaderboard on the Midjourney website will be updated with more ideas, and demographic data may be added to better understand who is asking for each feature. Multiple consistent characters might be introduced in version 7. The recap concludes with a prompt example for creating a serene double exposure image.

Takeaways

  • 📈 **Medium Recommendation**: Creative professionals are advised to check out Medium for customizable prompts to save time at work.
  • 🏖️ **Vacation Impact**: Progress has been slower due to people being on vacation.
  • 🌐 **Website Updates**: The team is working on the website, including new social features, which will initially be tested with guides and mods.
  • 🔒 **Social Spaces**: There will be a limited number of social spaces at the start to stress test the system before allowing everyone to create public and private spaces.
  • 🤖 **Personalization Efforts**: Personalization is a work in progress, with people in multiple time zones contributing, which has slowed the pace.
  • 🎨 **Style Random Return**: The 'style random' feature is set to return, likely from dial tuning, though users won't have access to the tuning part.
  • 🤝 **Algorithm Development**: An algorithm to improve hands, bodies, and text accuracy is in the works, aiming to reduce the occurrence of bad images.
  • 📸 **Image Quality Improvements**: Efforts are being made to enhance image quality and address small pixel artifacts.
  • ⚙️ **Speed Update**: A potential speed update could make processes 25-50% faster and cheaper, but it's on hold until other updates are completed.
  • 🎉 **Caption Party**: An upcoming event aims to teach the version 7 model about the connection between images and language, with possible future rewards for participation.
  • 👥 **New User Class**: There's mention of a new class of trusted users for rating and captioning, potentially leading to larger rewards.
  • 📹 **Video Updates**: While not entirely satisfied with video features, the team is confident in a version 7 model, focusing on high-quality 3D over exportable 3D.
  • 📊 **Feedback Leaderboard**: The team plans to add more ideas to the feedback leaderboard and may incorporate demographic data to understand feature requests better.
  • 🚫 **Content Policies**: There are no plans to expand not-safe-for-work (NSFW) features, and the team is not yet ready to let users manipulate images with the Midjourney model.
  • 🧍 **Consistent Characters**: Multiple consistent characters in generation are not available in V6 but may be possible in version 7.
  • 🎨 **Art Appreciation**: The art in the video is appreciated, and viewers are encouraged to follow the creator on Instagram and Twitter for more content.

Q & A

  • What is the main focus of the Midjourney office hours recap from April 3rd, 2024?

    -The main focus of the recap is an update on the progress of Midjourney's website development, including new social features, personalization, and improvements in image quality and text accuracy. They also discuss plans for a caption party to help teach the version 7 model and mention potential new classes of users.

  • What is the current status of the website development at Midjourney?

    -The website development is progressing, though slower than usual due to people being on vacation. They are working on new social features and personalization, but the pace is affected by having team members in multiple time zones.

  • What is the purpose of the upcoming caption party?

    -The caption party aims to help teach the version 7 model the connection between images and language. If successful, it might be implemented as an official activity where participants could earn rewards in the future.

  • What is the current state of the 'style random' feature?

    -The 'style random' feature will show up again, although the specifics are not clear. It is expected to come from dial tuning, but users will not have access to the tuning part.

  • How does Midjourney plan to improve image quality?

    -Midjourney is working on an algorithm to improve hands and bodies as well as text accuracy. They are also focusing on reducing small pixel artifacts to enhance image quality.

  • What is the potential speed update mentioned in the recap?

    -There might be a small speed update that could make processes 25-50% faster and cheaper. However, this update will be released after other updates are completed.

  • What is the feedback mechanism that Midjourney uses to gather user opinions?

    -Midjourney uses a feedback leaderboard on their website where they add ideas and ask people to rate them. They are considering adding demographics to better understand who is asking for each feature.

  • What is the current stance on implementing not safe for work (NSFW) features?

    -Midjourney is not ready to implement NSFW features from a moderation standpoint. However, they jokingly mentioned that they could be swayed if more than half of all women wanted nudity.

  • What was mentioned about the possibility of multiple consistent characters in a generation?

    -Multiple consistent characters are not planned for version 6 but might be considered for version 7.

  • What is the current progress on the 3D model development?

    -David Holz, Midjourney's founder, is optimistic about having a really good 3D model thanks to the team's progress on hardware capture. The focus is on producing high-quality 3D models rather than just exportable ones.

  • How can one follow the speaker for more updates and prompts?

    -The speaker can be followed on Instagram at 'Futuretechpilot' for a better look at pictures and on Twitter for updates on prompts.

  • What is the general sentiment towards the progress and updates discussed in the recap?

    -The general sentiment is one of optimism and anticipation for the upcoming features and improvements, despite the slower than usual progress.

Outlines

00:00

📝 Midjourney Office Hours Recap

This paragraph provides a recap of the Midjourney office hours held on April 3rd. It begins with a recommendation for creative professionals to check out Medium, a website offering customizable prompts. The speaker notes a lack of exciting announcements and slower progress because people are on vacation. The main focus is on the website's development, including new social features, which will initially be tested with guides and mods. The speaker mentions that personalization is a work in progress and is moving slower than desired because the team is spread across multiple time zones. An upcoming feature called 'style random' is teased, along with an algorithm to improve hand, body, and text accuracy. There's also mention of potential improvements to image quality and speed. A caption party is planned to help teach the version 7 model about the connection between images and language. The possibility of a new class of trusted users for rating and captioning is briefly mentioned. Lastly, the speaker discusses the feedback leaderboard on the Midjourney website and the potential for adding more features based on user requests.

Keywords

Midjourney Office Hours

Midjourney Office Hours refers to a scheduled time where the Midjourney team discusses updates, progress, and answers questions from their community. In the context of the video, it is the main event being recapped, providing insights into the company's recent activities and future plans.

Customizable Prompts

Customizable prompts are user-defined inputs that guide the output of a generative system, such as an AI. In the script, it is mentioned as a feature on Medium, which could potentially save time for creative professionals by allowing them to tailor the prompts to their needs.

Social Features

Social features in the context of the video refer to new functionalities being developed for the Midjourney website that will allow for more interaction between users. The script mentions that these features will initially be tested with guides and mods, indicating a focus on community engagement and collaboration.

Personalization

Personalization involves tailoring a product or service to individual preferences. In the video, it is mentioned that the Midjourney team is working on personalization aspects of their platform, which is crucial for enhancing user experience by making it more relevant and efficient for each user.

Style Random

Style Random is a term that likely refers to a feature or setting within the Midjourney system that allows for varied and unpredictable stylistic outputs. The script suggests that while it will make a comeback, users won't have direct control over the tuning, which affects the style of the generated content.

Algorithm

An algorithm in this context is a set of rules or procedures for solving problems or accomplishing tasks. The script discusses an algorithm under development aimed at improving the depiction of hands and bodies in the generated images, as well as text accuracy, which is vital for the quality of the AI's output.

Image Quality

Image quality refers to the clarity, detail, and overall visual appeal of a digital image. The script mentions that the team is working on enhancing image quality, particularly focusing on reducing small pixel artifacts, which are minor errors or imperfections in the image.

Speed Update

A speed update implies an improvement in the processing speed of the system, making it faster and potentially more cost-effective. The video mentions a possible 25-50% increase in speed, which would be significant for users looking for quicker results.
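The recap does not say whether "25-50% faster" means higher throughput or shorter job duration, and the two readings give different numbers. The following is a minimal sketch of that arithmetic only. The 60-second baseline and both function names are hypothetical, chosen for illustration, and do not come from the video or from Midjourney.

```python
def time_if_speed_increases(baseline_s: float, pct: float) -> float:
    """Throughput reading: speed rises by pct, so time = baseline / (1 + pct/100)."""
    return baseline_s / (1 + pct / 100)


def time_if_duration_drops(baseline_s: float, pct: float) -> float:
    """Duration reading: the job itself takes pct less time."""
    return baseline_s * (1 - pct / 100)


# Hypothetical baseline generation time in seconds (an assumption, not a quoted figure).
baseline = 60.0

for pct in (25, 50):
    by_speed = time_if_speed_increases(baseline, pct)
    by_duration = time_if_duration_drops(baseline, pct)
    print(f"{pct}% faster: {by_speed:.1f}s (speed reading), {by_duration:.1f}s (duration reading)")
```

Under these assumptions, a 60-second job would land somewhere between 30 and 48 seconds depending on the percentage and the reading, so the quoted range is best treated as a rough indication.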

Caption Party

A caption party, as mentioned in the script, is an upcoming event where participants will help teach the version 7 model of Midjourney the connection between images and language. If it goes well, it may become an official activity in which participants can earn rewards.

3D Model

A 3D model in this context refers to a three-dimensional representation of an object or scene, created using computer graphics. The script discusses optimism about the development of a high-quality 3D model for Midjourney, which is expected to be a significant advancement in their technology.

Feedback Leaderboard

The feedback leaderboard is a tool used by the Midjourney team to gauge user opinions and prioritize features or improvements. It allows the community to vote on ideas, helping the developers understand which aspects are most valued by users.

Highlights

Medium is a website selling customizable prompts that can save creative professionals time at work.

Progress has been slower due to vacations, with the main focus on website improvements and new social features.

Initial social spaces will be limited in number, with a focus on stress testing the system.

Users will eventually be able to create both public and private social spaces.

Personalization features are in development but are progressing slower than desired.

The 'style random' feature will reappear, likely from dial tuning, without user access to the tuning.

An algorithm is being developed to improve hand and body representation, as well as text accuracy.

Bad images will still occur, but less frequently with the new algorithm.

Enhancements are being made to image quality, addressing small pixel artifacts.

A potential speed update could make processes 25-50% faster and cheaper.

A caption party is planned to teach the version 7 model the connection between images and language.

A new class of users may be introduced, trusted with rating and captioning, potentially leading to larger rewards.

David is optimistic about the development of a high-quality 3D model, thanks to progress in hardware capture.

The focus is on producing high-quality 3D models rather than exportable ones, although plans may change.

The feedback leaderboard on the Midjourney website will be updated with more ideas periodically.

The team is not ready to allow users to manipulate images with the Midjourney model.

There are no plans to expand not-safe-for-work (NSFW) features.

Demographic data could be added to feedback to understand user preferences better.

Multiple consistent characters in a generation may be possible in version 7.

A serene double exposure image prompt is shared, showcasing the art style.

The presenter shares their social media handles for further engagement.