Flux AI Image Generator 💀 RIP Midjourney! 🚀

3 min readAug 11, 2024

On Thursday, the AI startup Black Forest Labs announced its official launch and the release of its first suite of text-to-image AI models, known as FLUX.1. Based in Germany, Black Forest Labs was founded by a group of researchers who previously developed the technology behind Stable Diffusion and pioneered the latent diffusion technique. Their mission is to push the boundaries of generative AI for creating images and videos.

The Rise of FLUX.1: A New Contender in Generative AI

The launch of FLUX.1 comes shortly after Stability AI’s release of Stable Diffusion 3, which faced criticism for its poor handling of human anatomy, leading to distorted images that frustrated users. The issues with Stable Diffusion 3 were compounded by the departure of key engineers from Stability AI, including Robin Rombach, Andreas Blattmann, and Dominik Lorenz. These engineers, along with latent diffusion co-developer Patrick Esser, went on to establish Black Forest Labs.

With FLUX.1, Black Forest Labs introduces three distinct text-to-image models:

FLUX.1 Pro: A high-end commercial version designed for professionals.
FLUX.1 Dev: A mid-range model with open weights available for non-commercial use.
FLUX.1 Schnell: A faster version with open weights, aimed at users needing quick results.

These models are claimed to outperform current industry leaders like Midjourney and DALL-E, especially in terms of image quality and adherence to text prompts.

Performance and Innovation: FLUX.1’s Edge

FLUX.1 models utilize a “hybrid architecture” that combines transformer and diffusion techniques, scaled up to 12 billion parameters. This approach, coupled with innovations like flow matching, represents a significant leap forward in the quality and accuracy of AI-generated images.

In practice, FLUX.1’s higher-end models deliver results that are comparable to OpenAI’s DALL-E 3 in terms of prompt fidelity, with a photorealism that rivals Midjourney 6. The models also show marked improvement over the last major release from Stability AI, Stable Diffusion XL.

A notable advancement in FLUX.1 is its ability to generate human hands accurately, addressing a longstanding challenge in AI image synthesis. While models like Midjourney have already mastered this, FLUX.1’s success is particularly impressive for an open-weights model.

Experimenting with FLUX.1: Accessibility and Challenges

For those interested in trying FLUX.1, the models are available on cloud-hosting platforms like Fal and Replicate, which offer access for a fee. However, running FLUX.1 locally presents some challenges, particularly with the “Dev” model’s 23GB weights file, which exceeds the capacity of standard GPUs like the RTX 3060. Some users on Reddit have reported success with quantization, a technique that reduces the model’s size to fit into smaller VRAM.

Looking Ahead

The release of FLUX.1 marks a significant moment in the evolution of text-to-image AI. With the backing of a team that has already made major contributions to the field, Black Forest Labs is poised to be a key player in the rapidly advancing world of generative AI. As the technology continues to develop, FLUX.1 may set new standards for what is possible in AI-driven image creatio

Flux AI Image Generator 💀 RIP Midjourney! 🚀

The Rise of FLUX.1: A New Contender in Generative AI

Performance and Innovation: FLUX.1’s Edge

Experimenting with FLUX.1: Accessibility and Challenges

Looking Ahead

Written by Saeed Ai

No responses yet