3

I've found that all the web archiving software I've encountered are either manual (you have to archive everything individually in a separate application) or crawler-based (which can end up putting a lot of extra load on smaller web server, and could even get your ip blocked).

Are there any solutions that simply automatically archive web pages as you load them in your browser? If not, why aren't there?

I could also see something like that being useful as a self-hosted web indexer, where if you ever go "I think I've seen this name before", you can click on it, and your computer will say something like "this name appeared in a news headline you scrolled past two weeks ago"

OQB @kayzeekayzee@lemmy.blahaj.zone

top 2 comments
sorted by: hot top new old
[-] lunchbox2287@lemmy.world 2 points 5 days ago

Have you looked at hunchly - https://hunch.ly/ ? It indexes sites as you browse and let's you search for content within them - and only the sites you hit, so its not crawling the rest of a domain. It's built for investigations, really, but it sounds like it does what you're describing. Its not free, unfortunately, and uses an installed program coupled with a browser extension to do what it does.

[-] jaredj@infosec.pub 1 points 5 days ago

haven't tried it but i think the thing i've read about before is https://archivebox.io/

this post was submitted on 17 Feb 2026
3 points (100.0% liked)

Data Hoarder

315 readers
1 users here now

We are digital librarians. Among us are represented the various reasons to keep data -- legal requirements, competitive requirements, uncertainty of permanence of cloud services, distaste for transmitting your data externally (e.g. government or corporate espionage), cultural and familial archivists, internet collapse preppers, and people who do it themselves so they're sure it's done right. Everyone has their reasons for curating the data they have decided to keep (either forever or For A Damn Long Time (tm) ). Along the way we have sought out like-minded individuals to exchange strategies, war stories, and cautionary tales of failures.

founded 2 years ago
MODERATORS