The biggest problem for diffusion models was performance (you need to iterate even at inference). But I'm not up to date with the newest architectures; maybe it's already solved :P
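To make the cost concrete: a diffusion sampler calls the network once per denoising step, so a 50-step sample costs roughly 50x what a one-shot generator pays. Toy sketch (pure Python, all names made up, no real model involved):

```python
# Toy illustration: diffusion inference cost scales with the number of
# denoising steps, while a one-shot generator pays for a single forward pass.

def fake_forward(x):
    """Stand-in for one network forward pass (hypothetical)."""
    return x * 0.9  # pretend this removes a bit of noise

def diffusion_sample(x, steps):
    """Iterative denoising: one forward pass per step."""
    calls = 0
    for _ in range(steps):
        x = fake_forward(x)
        calls += 1
    return x, calls

def one_shot_sample(x):
    """Single forward pass, e.g. a GAN-style generator."""
    return fake_forward(x), 1

_, diffusion_calls = diffusion_sample(1.0, steps=50)
_, gan_calls = one_shot_sample(1.0)
print(diffusion_calls, gan_calls)  # 50 vs 1 forward passes
```

This is why step-reduction work (fewer sampling steps, distillation) matters so much for making diffusion cheap.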
Has anyone used any of the generative models mentioned in the article? I didn't see any images or direct comparisons of the outputs with current diffusion models.
Unlike Stable Diffusion, I don't stumble upon people who actually use it. Are there examples of the output this can generate? What happens once you manage to run the model?
Any pointers on getting up to speed on diffusion models? I haven't encountered them in my corner of the ML world, and googling around for a review paper didn't turn anything up.
There are already open source LLMs with comparable parameter counts (Facebook's OPT-175B, BLOOM), but you'll need ~10x A100 GPUs to run them (which would cost ~$100K+).
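Back-of-the-envelope for the ~10 A100 figure (weights only, fp16; real deployments also need room for activations and the KV cache, which is where the extra headroom goes):

```python
# Rough VRAM math for a 175B-parameter model, weights only.
params = 175e9
bytes_per_param_fp16 = 2
weights_gb = params * bytes_per_param_fp16 / 1e9  # 350 GB of weights

a100_gb = 80  # the 80GB A100 variant
min_gpus = -(-weights_gb // a100_gb)  # ceil division: 5 GPUs just for weights
print(weights_gb, min_gpus)
# In practice you provision ~8-10 GPUs for activation memory, KV cache,
# and framework overhead on top of the raw weights.
```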
I suspect a big part of why stable diffusion managed to consume so much mindshare is that it can run on ordinary consumer hardware. On that point, I would be excited about an open-source RETRO (https://arxiv.org/pdf/2112.04426.pdf) model with comparable performance to GPT-3 that could run on consumer hardware with an NVMe SSD.
Diffusion is relatively compute-intensive compared to transformer LLMs, and (in current implementations) doesn't quantize as well.
A 70B-parameter model would be very slow and VRAM-hungry, hence very expensive to run.
Also, image generation relies more on the tooling surrounding the models than on pure text prompting. I don't think even a 300B model would get things quite right through text prompting alone.
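For scale on the "quantizes as well" point, here's the weight-memory arithmetic for a 70B model at different precisions (weights only, illustrative numbers):

```python
# Why quantization matters for a 70B model: weight memory by precision.
params = 70e9
sizes = {name: params * bits / 8 / 1e9
         for name, bits in [("fp16", 16), ("int8", 8), ("int4", 4)]}
for name, gb in sizes.items():
    print(f"{name}: {gb:.0f} GB")
# fp16 needs ~140 GB (multiple datacenter GPUs); int4 needs ~35 GB,
# which is within reach of much more modest hardware -- if quality holds up.
```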
What sort of setup do you need to be able to fine tune Stable Diffusion models? Are there good tutorials out there for fine tuning with cloud or non-cloud GPUs?
A lot of HN has been having fun with Stable Diffusion. Do we really need a GPU with 10GB of VRAM? How do you distribute or "shard" a model you're training? Could we get this running on the Raspberry Pi clusters we all have? Hook it up to OpenFaaS too.
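On the "shard" question: one common approach is pipeline parallelism, where contiguous groups of layers live on different devices and activations get passed down the chain. A toy sketch of just the layer-to-device assignment (pure Python, no real GPUs; frameworks like DeepSpeed or PyTorch FSDP do this plus weight sharding and the actual communication):

```python
# Toy pipeline-parallel placement: split N layers into contiguous chunks,
# one chunk per device. Activations flow device 0 -> 1 -> ... -> k.

def assign_layers(num_layers, num_devices):
    """Map each layer index to a device id, in contiguous chunks."""
    per_device = -(-num_layers // num_devices)  # ceil division
    return {layer: layer // per_device for layer in range(num_layers)}

placement = assign_layers(num_layers=32, num_devices=4)
print(placement[0], placement[31])  # first layer on device 0, last on device 3
```

The catch for a Pi cluster is interconnect: every generation step has to ship activations between devices, so Ethernet-class bandwidth between tiny boards tends to be the bottleneck long before compute is.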
Yes. I created a course which uses implementing Stable Diffusion from scratch as the project, and goes through lots of architecture choices, hyperparam selection, and debugging. (But note that this isn't something that's fast or easy to learn - it'll take around a month full-time intensive study.)
https://course.fast.ai/Lessons/part2.html
Then you recall incorrectly. Stable Diffusion models are capable of running on laptops with even modest discrete GPUs. I'm running it via the AUTOMATIC1111 GitHub repo on a laptop that's over four years old, and a 50-step generation takes only about 15 seconds.
You need zero technical acumen to be able to install it, just the ability to follow basic instructions. Maybe you should ask ChatGPT to help you.
Rest assured someone is working on a self-hosted (distilled) model. Stable Diffusion has shown there's a viable market for open models you can run inference on with consumer hardware.