MIT researchers develop method for lightning-fast image generation based on Stable Diffusion



Until now, diffusion models could only generate high-quality images with many iterations. A team at MIT has now succeeded in compressing the process into a single step – with a quality comparable to multistep Stable Diffusion.

Scientists at MIT’s Computer Science and Artificial Intelligence Laboratory (CSAIL) have developed a new method that can dramatically speed up image generation using diffusion models such as Stable Diffusion or DALL-E.

Instead of the previously required 20 or more iteration steps, the new method called Distribution Matching Distillation (DMD) only needs a single step.

Similar experiments have already been conducted, including directly by Stability AI, the company behind Stable Diffusion. However, the quality of the images generated with the MIT method is said to be comparable to the more computationally intensive methods.



“This advancement not only significantly reduces computational time but also retains, if not surpasses, the quality of the generated visual content,” says Tianwei Yin, a PhD student in electrical engineering and computer science at MIT and lead author of the study.

Diffusion models generate images by gradually adding structure to a noisy initial state until a clear image emerges. This process typically requires hundreds of iterations to perfect the image.

MIT’s new approach is based on a “teacher-student” model: A new AI model learns to mimic the behavior of more complex original models for image generation. DMD combines the scoring principles (real, fake) of Generative Adversarial Networks (GANs) with those of diffusion models.

Overview of the DMD method. | Image: Yin et al.

For the new student model, the researchers used pre-trained networks, which simplified the process. By copying and refining the parameters of the original models, they achieved fast training convergence of the new model. The architectural basis is preserved.

“This enables combining with other system optimizations based on the original architecture to further accelerate the creation process,” says Yin.


Leave a Comment

Your email address will not be published. Required fields are marked *

Scroll to Top