Yup! We observed the same thing back before we built Depot. The act of saving/loading cache over the GHA network pretty much negated any performance gain from layer caching. So we built a solution that persists the cache to NVMe disks and orchestrates it across builds so it's immediately available for the next build. All the performance of layer caching without any network transfer.
The registry cache is a neat idea, but in practice it suffers from the same problem.
Hopefully Depot will reply, but from my perspective it's mostly laid out on their homepage: they're comparing against builds in other CI products that use network-backed disks and virtualized hardware and don't keep a layer cache around. Depot provides fast hardware and disks and is good at making the layer cache available for subsequent builds.
You could likely get very similar performance by provisioning a single host with good hardware and simply leveraging the on-host cache.
Yeah, that still holds true to some extent today with the GHA cache. Blacksmith colocates its cache with our CI runners and ensures they're on the same local network, which lets us saturate the NIC and provide much faster cache reads/writes. We're also thinking of clever ways to avoid downloading from a cache entirely and instead bind mount cache volumes over the network into the CI runner. Still early days, but stay tuned!
An application that is aware of the half dozen or so caching layers from register to platter can perform dramatically better than a naive program. Two wrinkles:
1) It needs to be told the various sizes, speeds, and quirks of each server to make the best use of them (just some work; see the sketch after this list).
2) It needs to coordinate with the other processes running on the system to divide up the resources. This is hard. Generally people bail, assign some share of RAM, and hope for the best with the other layers.
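As a rough illustration of wrinkle 1, here's a minimal sketch in Go of the classic trick: block a memory-bound loop so the working set fits in cache. The matrix size and block size below are assumptions picked for illustration; in a real program the block size is exactly the kind of thing you'd want to be told (or measure) per machine.

```go
package main

import (
	"fmt"
	"time"
)

const (
	n = 4096 // matrix dimension, chosen only for illustration
	// blockSize is an assumption: pick it so one tile of src and one of dst
	// (blockSize*blockSize*8 bytes each) fit comfortably in L1/L2 cache on
	// the target machine -- the "needs to be told the sizes" part.
	blockSize = 64
)

// naiveTranspose reads src sequentially but writes dst with a stride of n,
// so nearly every write touches a different cache line and the working set
// blows straight past every cache level.
func naiveTranspose(dst, src []float64) {
	for i := 0; i < n; i++ {
		for j := 0; j < n; j++ {
			dst[j*n+i] = src[i*n+j]
		}
	}
}

// blockedTranspose copies the matrix tile by tile, so each tile of src and
// dst stays resident in cache while it is being worked on.
func blockedTranspose(dst, src []float64) {
	for ii := 0; ii < n; ii += blockSize {
		for jj := 0; jj < n; jj += blockSize {
			for i := ii; i < ii+blockSize && i < n; i++ {
				for j := jj; j < jj+blockSize && j < n; j++ {
					dst[j*n+i] = src[i*n+j]
				}
			}
		}
	}
}

func main() {
	src := make([]float64, n*n)
	dst := make([]float64, n*n)
	for i := range src {
		src[i] = float64(i)
	}

	start := time.Now()
	naiveTranspose(dst, src)
	fmt.Println("naive:  ", time.Since(start))

	start = time.Now()
	blockedTranspose(dst, src)
	fmt.Println("blocked:", time.Since(start))
}
```

The blocked version typically wins by a wide margin for the exact same amount of work, which is the "dramatically better" part; choosing blockSize per machine is the "just some work" part.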
I got it working, with intermediate layers too, only to find that there wasn't a material performance benefit once I took into account how long it takes to pull from and push to the cache.
Cache is insanely fast, orders of magnitude faster than RAM, and basically instant compared to going to disk or another machine on the network. I find it unlikely that they could overcome the added network latency such a system introduces.
Right, I wasn't saying don't cache: given the network topology, IO is slow, so cache it. That's why I wondered why they don't have LRU or some other eviction policy to deal with a full cache, stale entries, etc.
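For what that could look like, here's a minimal LRU sketch in Go; the names (lruCache, the "layer-*" keys) and the byte-slice values are purely illustrative, not anything from an actual build-cache implementation.

```go
package main

import (
	"container/list"
	"fmt"
)

// lruCache is a minimal least-recently-used cache: when it is full,
// the entry that was touched longest ago is evicted to make room.
type lruCache struct {
	capacity int
	order    *list.List               // front = most recently used
	items    map[string]*list.Element // key -> element in order
}

type entry struct {
	key   string
	value []byte
}

func newLRUCache(capacity int) *lruCache {
	return &lruCache{
		capacity: capacity,
		order:    list.New(),
		items:    make(map[string]*list.Element),
	}
}

// Get returns the cached value and marks it as most recently used.
func (c *lruCache) Get(key string) ([]byte, bool) {
	el, ok := c.items[key]
	if !ok {
		return nil, false
	}
	c.order.MoveToFront(el)
	return el.Value.(*entry).value, true
}

// Put inserts or updates a value, evicting the least recently used
// entry if the cache is already at capacity.
func (c *lruCache) Put(key string, value []byte) {
	if el, ok := c.items[key]; ok {
		el.Value.(*entry).value = value
		c.order.MoveToFront(el)
		return
	}
	if c.order.Len() >= c.capacity {
		oldest := c.order.Back()
		c.order.Remove(oldest)
		delete(c.items, oldest.Value.(*entry).key)
	}
	c.items[key] = c.order.PushFront(&entry{key: key, value: value})
}

func main() {
	cache := newLRUCache(2)
	cache.Put("layer-a", []byte("a"))
	cache.Put("layer-b", []byte("b"))
	cache.Get("layer-a")              // touch a, so b becomes the eviction candidate
	cache.Put("layer-c", []byte("c")) // evicts layer-b
	_, ok := cache.Get("layer-b")
	fmt.Println("layer-b still cached:", ok) // false
}
```

A real build cache would likely also want TTLs for stale entries and size-based accounting, since layers vary wildly in size, but the core eviction idea is the same: when the cache is full, drop whatever was touched least recently.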