ArchiveBox
archivebox.ioPreserve the web. On infrastructure you control.
Utilitiesweb-archivingself-hostedopen-sourcedockerbookmarksdata-preservationcli

About
ArchiveBox is an open-source, self-hosted web archiving tool that saves websites, bookmarks, RSS feeds, social posts, and media into durable formats including HTML, PDF, PNG, WARC, and SQLite. It can be run via Docker, CLI, or a self-hosted web UI, giving individuals and organizations full control over their archived data. It is designed for personal archivists, journalists, researchers, and institutions who want portable, long-lasting captures without relying on third-party services.
Problem
Link rot, platform churn, censorship, and disappearing media cause valuable web content to be permanently lost without a reliable local archiving solution.
For
Individuals, professionals (lawyers, journalists), and institutions (researchers, libraries, governments) who want to self-host web archives
How it works
Users install ArchiveBox via Docker Compose or other methods, then feed it URLs or bookmark exports; it captures pages using tools like Chrome, wget, and yt-dlp and stores outputs as ordinary files organized in a local data directory.
Business model
open-source
Status
launched