Stable Diffusion 3 is HERE! MASSIVE Improvements, Turbo, 3D, Can Stability AI Survive?
TLDRStability AI has recently announced the release of Stable Diffusion 3 and Stable Diffusion 3 Turbo on their developer platform API, in partnership with Fireworks AI. This comes after a period of uncertainty for the company, including CEO departure and restructuring. The new models are said to be equal to or surpass state-of-the-art text-image generation systems. However, access to the model weights will require a Stability AI membership, indicating a potential new revenue stream for the company. The pricing for using the API is detailed, with costs varying based on the type of task, such as image generation, upscaling, inpainting, and video generation. The community's response to the membership model and the impact on model fine-tuning and modifications remain to be seen. Stability AI is positioning itself for a more robust and reliable service, aiming to address past API performance issues.
Takeaways
- 🚀 Stability AI has released Stable Diffusion 3 and Stable Diffusion 3 Turbo on their developer platform API, marking a significant update in their generative AI capabilities.
- 🔄 The company has partnered with Fireworks AI for API orchestration, aiming to improve service reliability with a 99.9% service availability.
- 💸 Stability AI introduces a new membership model that requires a subscription for access to model weights, potentially as a strategy to generate revenue and attract investors.
- 📈 The new models are claimed to be equal to or surpass state-of-the-art text-image generation systems in terms of prompt adherence and human preference evaluations.
- 💡 The multimodal diffusion Transformer architecture uses separate sets of weights for image and language, enhancing text understanding and spelling capabilities.
- 📊 The pricing for using Stable Diffusion 3 through the API is roughly 10 times the cost of SDXL, with variations for different types of tasks like upscaling, inpainting, and video generation.
- 🤔 There is a notable absence of free credits for new accounts, indicating a shift towards a more monetized approach for accessing Stability AI's services.
- 📉 The company has faced recent challenges, including CEO departure, corporate restructuring, and issues with paying for GPU services, casting doubt on its financial stability.
- 🌐 The release of Stable Diffusion 3 comes with a sparse release page and omitted features from initial previews, raising questions about the company's transparency and commitment to its community.
- 📚 Stability AI plans to make the model weights available for self-hosting with a membership in the near future, which could lead to a new licensing model for generative AI.
- 🔗 The community's reaction to the membership model and the potential impact on model fine-tuning and sharing on platforms like Hugging Face remains to be seen.
Q & A
What has been the recent situation with Stability AI?
-Stability AI has faced some challenges recently, including the departure of their CEO to work on a crypto project and corporate restructuring. They have also been reportedly struggling with paying their GPU bills to Amazon and Coro weave, casting doubts on the company's future.
What is the significance of the announcement of Stable Diffusion 3 and Stable Diffusion 3 Turbo?
-The announcement signifies a major update to Stability AI's generative AI capabilities. It includes a new model and a turbo version, which are now available on their developer platform API, representing a significant step forward despite recent struggles.
How does Stability AI plan to make the model weights of Stable Diffusion 3 available?
-Stability AI plans to make the model weights available for self-hosting to those with a Stability AI membership in the near future. This is a new approach that requires a membership, possibly as a way to generate revenue.
What is the role of Fireworks AI in the release of Stable Diffusion 3?
-Fireworks AI is partnered with Stability AI to deliver the Stable Diffusion 3 models. They provide the API platform and are responsible for the API orchestration, aiming to offer an enterprise-grade solution with high service availability.
What are the improvements in the new multimodal diffusion Transformer architecture?
-The new architecture uses separate sets of weights for image and language representations, which enhances text understanding and spelling capabilities compared to older versions of Stable Diffusion.
What are the different tiers of Stability AI membership and what do they offer?
-Stability AI offers various membership tiers that provide access to different models hosted online, including image, video, language, and 3D models. The tiers range from free, which does not allow commercial use, to professional and enterprise levels that offer faster GPU response times and commercial access.
How does the pricing for Stable Diffusion 3 compare to its predecessor?
-The efficiency and relative cost for Stable Diffusion 3 is roughly ten times that of SDXL when used through the same API. This means it's more computationally intensive or potentially reflects changes in GPU availability.
What are the costs associated with generating different types of content using Stable Diffusion 3?
-The costs vary based on the type of content generated. For images, it's around 7 cents per image with Stable Diffusion 3 and 4 cents with Stable Diffusion 3 Turbo. Upscaling to 4K costs 25 cents per image, in-painting is about 3 cents, out-painting about 4 cents, and video generation is around 20 cents.
What is the current status of the availability of the 3D model endpoint for Stable Diffusion 3?
-As of the time of the transcript, there is no current API endpoint available for Stable Diffusion 3 that does 3D models, although it is mentioned as a feature of the membership.
How does Stability AI's partnership with Fireworks AI address the past API performance issues?
-The partnership with Fireworks AI is aimed at delivering a more reliable and robust service. Fireworks AI provides an enterprise-grade API solution with 99.9% service availability, which should help improve the past API performance issues Stability AI has faced.
What is the potential impact of the new licensing model on the community's use and modification of the models?
-The new licensing model, which requires a Stability AI membership to access the model weights, may affect how the community fine-tunes and modifies the models. It could potentially reduce the need for quantizations and modifications, as the base model is said to be more efficient.
How does the community perceive the new Stability AI membership and its pricing?
-The community's perception is not detailed in the transcript, but it raises questions about the acceptance of a membership model for access. It also brings up considerations about the potential costs and willingness to pay for such memberships, which may influence the community's engagement with Stability AI's offerings.
Outlines
🚀 Stability AI's New Release and Challenges
Stability AI, known for its open-source generative AI, has faced recent challenges including the departure of their CEO, corporate restructuring, and issues with unpaid bills. Despite these hurdles, they've announced the release of Stable Diffusion 3 and Stable Diffusion 3 Turbo on their developer platform API, in partnership with Fireworks AI. This move aims to improve API performance and reliability. The company also plans to make model weights available for self-hosting to Stability AI members, which is a new approach requiring membership. The announcement teases impressive capabilities of the new models in creating detailed and cohesive scenes from text. However, there are concerns about the pricing and the omission of certain features initially promised. The release is seen as a strategic move to secure revenue and investor confidence, and to address past financial difficulties.
💡 Stability AI Membership and Pricing Structure
Stability AI has introduced a new membership model, which offers access to various models including image, video, language, and 3D models hosted online. The membership is likened to Adobe's Creative Cloud, with different tiers offering varying levels of access and response times. Notably, commercial use is restricted for the free tier, while professional membership allows it. The enterprise tier's features are not explicitly detailed but imply enhanced performance. The membership also introduces a new API called Stable Image Core for accessing Stable Diffusion 3. The pricing for using the models is detailed, with costs associated with image generation and other tasks such as upscaling and video generation. There's an emphasis on the efficiency and cost reduction of the new model, with a comparison to the previous version. The community's reception to the membership model and its impact on model fine-tuning and modifications remains to be seen.
Mindmap
Keywords
Stable Diffusion 3
Turbo
3D
Open Source
Corporate Restructuring
API
Fireworks AI
Membership
Pricing
Benchmarks
Multimodal Diffusion Transformer
Highlights
Stable Diffusion 3 and Stable Diffusion 3 Turbo have been released on Stability AI's developer platform API.
Stability AI has partnered with Fireworks AI for API orchestration and delivery of these models.
The company aims to make model weights available for self-hosting with a Stability AI membership in the near future.
Stable Diffusion 3 is capable of creating massive, cohesive scenes with text, as demonstrated in the provided artwork.
The release includes demos showcasing the model's ability to generate complex scenes, such as a couch on top of a Brooklyn apartment.
Stability AI's website provides sparse information on the release, with a focus on the availability through the API platform.
The research paper reveals that Stable Diffusion 3 equals or outperforms state-of-the-art text-image generation systems.
The new multimodal diffusion Transformer architecture uses separate sets of weights for image and language representations.
Stability AI is working on an advanced release for an open release of the model.
The Stability AI membership offers access to various models hosted online, including image, video, language, and 3D models.
The membership has different tiers, with commercial access available at the professional level.
The pricing for using Stable Diffusion 3 through the API is significantly lower than the previous version, SDXL.
Generating non-image outputs such as outpainting and inpainting is priced differently, with Turbo offering a lower cost.
Upscaling to 4K, video generation, and other advanced features have their own pricing structures.
Stability AI's new licensing model may impact how the community fine-tunes and modifies the models.
The community's reaction to the membership model for access to the model weights is yet to be seen.
Stability AI's potential move away from Hugging Face could have significant implications, given Amazon's involvement with both entities.