Hacker Read

22c · 2023-09-30 21:59:03

Presumably the 32GB of VRAM is what makes it compelling, as you could cram some fairly substantial AI models on there.

jandrese | karma 30121 | avg karma 3.36 · | 2023-04-03 11:02:12

24GB of VRAM is awesome for running AI models though.

dale_glass | karma 7333 | avg karma 4.33 · | 2023-10-31 06:45:11

What's the usage pattern like? Is all of the VRAM used extensively in AI workloads, or one could hope to augment things a bit with system RAM with little performance impact?

Pesthuf | karma 465 | avg karma 4.84 · | 2024-06-25 16:38:29

I feel like these huge graphics cards with insane amounts of RAM are the moat that AI companies have been hoping for.

We can't possibly hope to run the kinds of models that run on 192GB of VRAM at home.

reply

samplatt | karma 1145 | avg karma 2.4 · | 2024-03-19 03:43:43

You're comparing RAM amounts to other RAM amounts without considering requirements. 24GB is more than (most) current games would ever require, but is considered a uncomfortably-constrictive minimum for most industrial work.

Traditional CPU-bound physics/simulation models have typically wanted all the RAM they could get; the more RAM the more accurate the model. The same is true for AI models.

I can max out 24GB just using spreadsheets and databases, let alone my 3D work or anything computational.

reply

syntaxing | karma 4590 | avg karma 2.72 · | 2023-11-14 12:50:53

Maybe I missed it but does anyone know what it will take to run this model? Seems something fun to try out but not sure if 24GB of VRAM is suffice.

smcleod | karma 8476 | avg karma 6.43 · | 2023-12-17 06:35:29

32GB memory only leaves about 24-26GB for the GPU by default which is quite low for a larger model like that. For comparison it runs great on a M2 Max 96GB.

dygd | karma 56 | avg karma 1.7 · | 2024-01-16 04:58:00

The RTX 4060Ti is the most affordable nVIDIA card with 16GB VRAM from the current generation, making it a good option for AI experimentation. So that might contribute.

capableweb | karma 37790 | avg karma 4.15 · | 2022-09-20 11:12:07

24GB is enough for some serious AI work. 48GB would be better, of course. But high end GPUs are still used for other things than gaming, from ML/AI stuff to creative work like video editing, animation renders and more.

barbariangrunge | karma 2807 | avg karma 3.27 · | 2023-04-09 16:40:38

I thought it needed 64gb of vram. 64gb of ram is easy to obtain

WanderPanda | karma 1374 | avg karma 1.59 · | 2020-12-06 12:32:22+00:00

You are probably right, but I guess there are plenty of interesting use cases << 8GB VRAM :D

f38zf5vdt | karma 1223 | avg karma 2.49 · | 2022-02-02 11:08:00

Right on, they're closing in on "Open"AI's best models. Can this still be run on a GPU, or does it require a lot more VRAM?

abledon | karma 2682 | avg karma 1.76 · | 2020-10-28 16:47:34+00:00

hmm. 16 GB VRAM vs 3070 8 GB... looks better for fitting TF models in memory

nickwalton00 | karma 372 | avg karma 4.89 · | 2019-12-05 19:14:30

Hmmm... I haven't seen that before there should be enough memory on the GPU to hold the model.

teaearlgraycold | karma 2842 | avg karma 1.71 · | 2023-05-22 01:33:48

2GB VRAM means you can run things comparable to GPT2, a glorified Markov chain. On your CPU you could run much larger models at far from real time speeds.

dev_throw | karma 336 | avg karma 3.03 · | 2023-09-30 21:48:33

16 GB is a really nice offering at that price point for AI workloads. I'm keeping my fingers crossed for a higher end Battlemage offering and some real competition for Nvidia.

atonse | karma 11460 | avg karma 4.51 · | 2020-02-04 15:03:33

This is so exciting but the biggest question in my mind is what hardware you'll need to drive all this stuff.

Are they using 3 Nvidia cards in SLI for the demo, or something similarly insane? The 32 GB memory isn't that crazy given how cheap memory is these days.

reply

easeout | karma 340 | avg karma 2.88 · | 2023-06-06 02:48:00

The wild thing about that 192GB of memory: it's all potentially VRAM.

zimpenfish | karma 8997 | avg karma 1.63 · | 2023-04-26 13:51:10

16GB VRAM minimum is a bit steep. Sadly excludes my 3080 which is annoying because I'd like something better than Stable Diffusion locally.

risho | karma 813 | avg karma 4.37 · | 2024-03-15 19:04:17

it doesn't matter how much compute you have if you don't have enough vram to run the model.