
It's quite possible that they make heavy use of instancing, which might reduce the memory footprint by a lot. Not every pebble or rock needs an individual geometry.



But in total it's only using as many mappings as the memory would have occupied had it not been compacted.

I wonder if they would run into fragmentation problems with this much memory, and if so, how they plan to deal with that?

But with instancing they could have a few dozen grain types at 5K tris each, with the total count coming from instances; no need to load 5 billion triangles into memory.
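Back-of-the-envelope (per-vertex sizes and counts below are assumed, purely for illustration, not numbers from the game):

    # Rough memory estimate with and without instancing (all sizes assumed).
    BYTES_PER_VERTEX = 32   # position + normal + UV in a typical interleaved layout
    VERTS_PER_TRI = 3       # worst case; indexed meshes share vertices

    def naive_mb(total_tris):
        """Every triangle stored with its own geometry."""
        return total_tris * VERTS_PER_TRI * BYTES_PER_VERTEX / 1e6

    def instanced_mb(mesh_types, tris_per_mesh, instances):
        """A few unique meshes plus one 4x4 float transform (64 bytes) per instance."""
        meshes = mesh_types * tris_per_mesh * VERTS_PER_TRI * BYTES_PER_VERTEX
        transforms = instances * 64
        return (meshes + transforms) / 1e6

    print(naive_mb(5_000_000_000))              # ~480,000 MB if every triangle is unique
    print(instanced_mb(36, 5_000, 1_000_000))   # ~81 MB for 36 grain meshes + 1M instances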

Good point; I wonder what kind of memory usage you could squeeze that into.

Ah, good point, memory consumption might indeed be a factor here. If that's the case, I'd perhaps expect higher variance in the measurements; looking for that in the raw data could be interesting.

It's worth noting that this is a comparatively small model (1.6B params from memory).

It'll be interesting to see what capabilities emerge as they grow the model capacity.
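For scale, the weights alone at common precisions look roughly like this (a sketch that ignores activations and KV cache):

    # Raw weight memory for a 1.6B-parameter model at common precisions (weights only).
    params = 1.6e9
    for name, bytes_per_param in [("fp32", 4), ("fp16/bf16", 2), ("int8", 1), ("int4", 0.5)]:
        print(f"{name:>9}: ~{params * bytes_per_param / 1e9:.1f} GB")
    # fp32: ~6.4 GB, fp16/bf16: ~3.2 GB, int8: ~1.6 GB, int4: ~0.8 GB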


Wouldn't high memory usage suggest it will need to bring more from disk into memory, not less?

We’re running 43 clusterio-like connected worlds here, with 8 GB of RAM allocated to each world; less if we can get away with it by deleting unused chunks.

Memory footprint?

I think that the memory savings would be very limited, unfortunately.

For most current hardware the bottleneck is RAM capacity first, then memory bandwidth, and last FLOPS/TOPS.

The Coral has 8 MB of SRAM, which, uh, won't fit the 2 GB+ that nearly any decent LLM requires even after being quantized.

LLMs are mostly memory and memory bandwidth limited right now.
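A crude way to see the bandwidth limit during decoding: each generated token has to stream roughly the whole set of weights through memory once, so bandwidth caps tokens/s. The numbers below are assumed, not measured:

    # Rough single-stream decode ceiling: bandwidth / model size.
    def max_tokens_per_second(model_bytes, bandwidth_bytes_per_s):
        """Ignores compute, KV-cache traffic, and batching; just the weight-streaming limit."""
        return bandwidth_bytes_per_s / model_bytes

    # Hypothetical: a 2 GB quantized model on ~50 GB/s of DRAM bandwidth.
    print(max_tokens_per_second(2e9, 50e9))   # ~25 tokens/s upper bound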


Yea, but that doesn't look like what they're describing here. They seem to be describing having the entire dataset in memory.

I'm aware of it, and it's a nice project, but 2.6 MB plus in-memory-only storage? No thanks.

Your data still has to fit in memory though.

They're using specialized hardware to accelerate their development feedback loop. Without a doubt, researchers and hackers will soon find ways to cut down model sizes and complexity so they run on consumer hardware. Take Stable Diffusion as an example: 4 GB for the whole model. Even if text models end up at 16 GB, that'd be great.

It's infinite but not free. Larger context still means more VRAM used and longer compute times.
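Roughly, the KV cache grows linearly with context length; here's a sketch with assumed model dimensions (not any particular model):

    # Approximate KV-cache size: 2 (K and V) * layers * heads * head_dim * bytes, per token.
    def kv_cache_gb(seq_len, layers=32, heads=32, head_dim=128, bytes_per_value=2):
        per_token = 2 * layers * heads * head_dim * bytes_per_value
        return seq_len * per_token / 1e9

    print(kv_cache_gb(4_000))    # ~2.1 GB at a 4K context
    print(kv_cache_gb(32_000))   # ~16.8 GB at a 32K context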

I don't think of the 4000 tokens as its memory as such. It's more like the size of its thinking workspace.

> TSDF memory isn’t an issue since Niessner et al. (2013).

I would strongly disagree. This paper uses TSDF and runs into memory issues. And ATLAS is using TSDF and running into memory issues. So for practical applications, TSDF is still too memory hungry.
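For a sense of the scaling: a dense (un-hashed) TSDF volume grows cubically with resolution, which is why even sparse/hashed variants struggle once scenes get large. The voxel layout below is assumed:

    # Dense TSDF volume: one signed distance + one weight per voxel (assume 4 + 4 bytes).
    def dense_tsdf_gb(extent_m, voxel_size_m, bytes_per_voxel=8):
        n = round(extent_m / voxel_size_m)   # voxels per axis
        return n ** 3 * bytes_per_voxel / 1e9

    print(dense_tsdf_gb(5.0, 0.01))     # 5 m cube at 1 cm voxels  -> ~1 GB
    print(dense_tsdf_gb(10.0, 0.005))   # 10 m cube at 5 mm voxels -> ~64 GB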


Perhaps a better response would be to outline how much memory it actually takes? That way people can decide for themselves (if they care deeply about the memory footprint).