What software does the Internet Archive run?

Avid Amoeba@lemmy.ca · edit-2 1 year ago

What software does the Internet Archive run?

max@lemmy.blahaj.zone · 1 year ago

afaik, archive.org isn't open source. I'd recommend something like archivebox.io

Possibly linux@lemmy.zip · 1 year ago

ArchiveBox is a piece of software, and the Internet Archive is an organization that is focused on predicting the content on the internet.

The Internet Archive has PBs worth of data. I doubt any home user could manage that.

z00s@lemmy.world · edit-2 1 year ago

archive

predicting

?

mosiacmango@lemm.ee · 1 year ago

Protecting

recapitated@lemmy.world · 1 year ago

They’re beating the algorithm

max@lemmy.blahaj.zone · 1 year ago

I don't think OP is looking to mirror archive.org. My take was that they wanted something like archive.org, but self-hosted and for personal / small-scale use.

Avid Amoeba@lemmy.ca · 1 year ago

Exactly. I’m already running a local wiki, but I don’t want stuff I link to in my wiki to result in 404 in a few years. Or worse, to some AI-ridden ad-infested dumpster fire.
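As a rough sketch of the first step toward that, assuming the wiki pages are available as plain text or Markdown: pull out the external links so they can be handed to whatever archiver ends up being used. The function name, the regex, and the `wiki.local` hostname are all made up for illustration, and the pattern is deliberately simple rather than a full URL parser.

```python
import re
from urllib.parse import urlsplit

# Hypothetical pattern: matches bare http(s) URLs in wiki or Markdown text.
# It stops at whitespace, quotes, angle brackets, and closing ) or ].
URL_PATTERN = re.compile(r"https?://[^\s<>\"')\]]+")


def extract_external_links(text, local_hosts=()):
    """Return unique external URLs from text, in first-seen order."""
    seen = set()
    links = []
    for match in URL_PATTERN.finditer(text):
        # Drop trailing sentence punctuation that the regex picked up.
        url = match.group(0).rstrip(".,;:!?")
        host = urlsplit(url).hostname or ""
        # Skip links back into the wiki itself and anything already collected.
        if host in local_hosts or url in seen:
            continue
        seen.add(url)
        links.append(url)
    return links


if __name__ == "__main__":
    page = "See [the docs](https://example.com/docs) and https://example.org/post."
    for url in extract_external_links(page, local_hosts={"wiki.local"}):
        print(url)
```

The printed list, one URL per line, could then be saved to a file and fed to an archiving tool on a schedule.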

layzerjeyt@lemmy.dbzer0.com · 1 year ago

You can use something as simple as a browser extension like SingleFile, which can automatically download complete, self-contained copies of everything you bookmark, or only of certain URLs.

Avid Amoeba@lemmy.ca · edit-2 1 year ago

Oh yes, this looks like a winner. Thanks!

It seems like it’s written in Python too, which means I can maintain it if need be.

Oh boy I wish I had set this up many years ago. I wouldn’t have to resort to scouring [email protected] for the top quality memes of the past when I need them…

On a far side of the moon note, I wonder if ActivityPub could be used to federate multiple ArchiveBox instances to create a more resilient Internet Archive alternative. 🤔 Then integrate that with Lemmy to auto-archive links from posts. Aaand lemmy.world ran out of disk space. 🤣

density@kbin.social · 1 year ago

A network between networks to make them more resilient? I think you've just invented the ARPANET.

Dehydrated@lemmy.world · 1 year ago

+1 for ArchiveBox