Hacker Read top | best | new | newcomments | leaders | about | bookmarklet login

Is this basically Archive.org becoming a customer of Cloudflare CDN to reduce load off their servers?


view as:

It's Archive.org being provided URL telemetry for archiving public sites they have not yet found through traditional means (crawling or users submitting requests through the Wayback Save page) by a Cloudflare product.

The next step would be for Cloudflare to point to Archive.org Wayback links when an origin isn't available (similar to browser extensions that point to Archive.org when sites 404 or are down, but in Cloudflare's core).

Cool stuff. Thanks Cloudflare folks.


I really doubt their customers would want that. Usually when a page is 404 it's because the company in question wants to forget about it :)

You would return the archived page for a 5xx error, not a 4xx error.

Ah I see. But this is precisely a usecase for cloudflare's own caching service.

It wouldn't be fair to use archive.org's community-sponsored resources for propping up businesses which are too cheap to pay for proper IT :)


While it's not explicitly mentioned, I think Cloudflare is providing financial support to the Internet Archive.

One would hope so. Considering the timing relative to the IA's potentially very expensive legal battle, I full expect this to be the case. Still, considering CF's anti-privacy/anti-TOR stance this is a deal with the devil. Guess I should give money directly to the IA. Considering how much value they provide, I'll do this immediately after updating this post.

In what ways is CF anti-privacy?

Legal | privacy