Stable Diffusion

4320 readers

5 users here now

Discuss matters related to our favourite AI Art generation technology

Also see

Other communities

founded 1 year ago

MODERATORS

[email protected]

Squeezing Large-Scale Diffusion Models for Mobile (arxiv.org)

submitted 1 year ago by [email protected] to c/[email protected]

2 comments fedilink hide all child comments

A test post linking to papers with a short generated summary I generated. If there's anything wrong with the summary, let me know.

This paper is about a new way to make Stable Diffusion run faster on mobile devices. The paper talks about the problems and solutions for making Stable Diffusion work better on mobile GPUs using TensorFlow Lite framework. TensorFlow Lite is a framework that helps run AI models on mobile devices. The paper says that their Mobile Stable Diffusion can make a 512 x 512 image in less than 7 seconds on Android devices, which is faster than other methods or platforms. The paper also shows some images made by their method.

Some of the problems and solutions are:

Changing the model's graph to use the GPU more efficiently and avoid slow communication with the CPU. The model's graph is like a blueprint of how the model does its calculations.

Using a better approximation of GELU to avoid errors and inconsistencies on different devices. GELU is a function that helps the model learn complex patterns from the data.

Reducing the model's size and memory usage by quantizing and pruning the weights, and loading and unloading some modules when they are not needed. Quantizing means using fewer bits to represent the numbers in the model, and pruning means removing some of the numbers that are not important. Modules are parts of the model that do specific tasks.

It can be slow to generate images compared to other types of generative models, such as GANs or VAEs, which can produce images in one shot.

It can produce some artifacts or errors in the images, such as blurry edges, unnatural colors, or missing details.

you are viewing a single comment's thread
view the rest of the comments

[–] [email protected] 2 points 1 year ago

LESS THAN 7 SECONDS??? Holy cow, my 3060 takes 4... this is optimization sorcery.