Papers
arxiv:2311.16567

MobileDiffusion: Subsecond Text-to-Image Generation on Mobile Devices

Published on Nov 28, 2023
Authors:
,
,
,

Abstract

MobileDiffusion optimizes text-to-image diffusion models for mobile devices, achieving sub-second inference through architecture redesign and distillation techniques.

The deployment of large-scale text-to-image diffusion models on mobile devices is impeded by their substantial model size and slow inference speed. In this paper, we propose MobileDiffusion, a highly efficient text-to-image diffusion model obtained through extensive optimizations in both architecture and sampling techniques. We conduct a comprehensive examination of model architecture design to reduce redundancy, enhance computational efficiency, and minimize model's parameter count, while preserving image generation quality. Additionally, we employ distillation and diffusion-GAN finetuning techniques on MobileDiffusion to achieve 8-step and 1-step inference respectively. Empirical studies, conducted both quantitatively and qualitatively, demonstrate the effectiveness of our proposed techniques. MobileDiffusion achieves a remarkable sub-second inference speed for generating a 512times512 image on mobile devices, establishing a new state of the art.

Community

This comment has been hidden

ๅฐๆกฅๆตๆฐดไบบๅฎถ

Printed scarves

How can I make a photo

This comment has been hidden
This comment has been hidden

Can't wait for this model or its code to never, ever get released ๐Ÿค—

This comment has been hidden
This comment has been hidden

MobileDiffusion: Instant Text-to-Image on Your Phone!

Links ๐Ÿ”—:

๐Ÿ‘‰ Subscribe: https://www.youtube.com/@Arxflix
๐Ÿ‘‰ Twitter: https://x.com/arxflix
๐Ÿ‘‰ LMNT (Partner): https://lmnt.com/

By Arxflix
9t4iCUHx_400x400-1.jpg

Sign up or log in to comment

Get this paper in your agent:

hf papers read 2311.16567
Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2311.16567 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2311.16567 in a dataset README.md to link it from this page.

Spaces citing this paper 1

Collections including this paper 8