Stable Diffusion is an image generation model that's been released to the public at large. If you have a decent GPU, you can run the model yourself. (Even without a decent GPU you can technically still do it, though it's much slower.)
What sort of setup do you need to be able to fine-tune Stable Diffusion models? Are there good tutorials out there for fine-tuning with cloud or non-cloud GPUs?
Many old consumer gaming GPUs will run an implementation of Stable Diffusion. But this page seems to be about getting access to H100s and A100s, such as you might want for running or training decent-sized LLMs.
It might require dedicated hardware. That only really becomes possible once you've proven the idea, but ASICs for cryptomining, TensorFlow (TPUs), etc. are quite real. There's no reason dedicated hardware for training Stable Diffusion couldn't happen.
A lot of HN has been having fun with Stable Diffusion. Do we really need one GPU with 10GB of RAM? How do you distribute or "shard" a model you're training? Could we get this running on the Raspberry Pi clusters we all have? Hook it up to OpenFaaS too.
You can run Stable Diffusion on an MBP and produce images in under a minute. It's training these models that takes the crazy GPU power; running them is quite reasonable.
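Not the parent, but for anyone who wants to try this, here's a minimal sketch using Hugging Face's diffusers library on the Mac's MPS backend (the checkpoint ID and prompt are just placeholders for whatever you're using locally):

    # Rough sketch: Stable Diffusion via diffusers on an Apple Silicon Mac.
    # Checkpoint ID and prompt are illustrative, not a recommendation.
    from diffusers import StableDiffusionPipeline

    pipe = StableDiffusionPipeline.from_pretrained("runwayml/stable-diffusion-v1-5")
    pipe = pipe.to("mps")            # run on the Metal GPU instead of the CPU
    pipe.enable_attention_slicing()  # lowers peak memory, helps on Macs

    image = pipe("a photo of an astronaut riding a horse").images[0]
    image.save("astronaut.png")

Whether it actually lands under a minute per image depends on which Mac you have and how many inference steps you run.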
I do the training with a beefy GPU from vast.ai (an RTX 3090 with 24GB VRAM) and generate the images with a GTX 1080 with 4GB VRAM, so from my testing there's no need for 6 or even 10GB of VRAM.
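For anyone on a similarly small card, the usual way to squeeze generation into a few GB of VRAM with the diffusers library is half-precision weights plus attention slicing. A sketch (checkpoint ID and prompt are just placeholders):

    # Rough sketch of low-VRAM generation: load the weights in fp16 and
    # slice the attention computation so peak memory stays low.
    import torch
    from diffusers import StableDiffusionPipeline

    pipe = StableDiffusionPipeline.from_pretrained(
        "runwayml/stable-diffusion-v1-5",
        torch_dtype=torch.float16,   # roughly halves memory for the weights
    ).to("cuda")
    pipe.enable_attention_slicing()  # trades a little speed for less VRAM

    image = pipe("a watercolor painting of a fox").images[0]
    image.save("fox.png")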
Stable Diffusion has a smaller text encoder than DALL-E 2 and other models (Imagen, Parti, Craiyon) so that it can fit into consumer GPUs. I believe Stability AI will train models based on a larger text encoder; since the text encoder is frozen and does not require training, scaling it up is essentially free.
For now the text encoder is the biggest bottleneck with Stable Diffusion; the generator is really good and the image quality alone is incredible (managing to outperform DALL-E 2 most of the time).
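To make the "frozen text encoder" point concrete: in a typical fine-tuning loop only the UNet gets gradients, while the text encoder (and VAE) are left untouched, which is why swapping in a bigger text encoder adds no training cost. A rough sketch (checkpoint ID and optimizer settings are just illustrative):

    # Sketch: freeze the text encoder and VAE, train only the UNet.
    import torch
    from diffusers import StableDiffusionPipeline

    pipe = StableDiffusionPipeline.from_pretrained("runwayml/stable-diffusion-v1-5")

    pipe.text_encoder.requires_grad_(False)  # frozen: no gradients, no optimizer state
    pipe.vae.requires_grad_(False)           # also frozen in most fine-tuning recipes
    pipe.unet.requires_grad_(True)           # only the denoising UNet is trained

    optimizer = torch.optim.AdamW(pipe.unet.parameters(), lr=1e-5)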
I agree, I've definitely seen way more information about running image synthesis models like Stable Diffusion locally than I have about LLMs. It's counterintuitive to me that Stable Diffusion takes less RAM than an LLM, especially considering it still needs the word vectors. Goes to show I know nothing.
I guess it comes down to the requirement of a very high-end GPU (or multiple GPUs), which makes it impractical for most people vs. just running it in Colab or something.