web-archive: A Free Web Archiving and Sharing Service on Cloudflare

Summary
web-archive is an open-source project offering a free web archiving and sharing service built entirely on Cloudflare's serverless ecosystem, including Workers, D1, and R2. It comprises a browser plugin for saving pages, a server to manage uploads, and a web client for viewing archived content. This self-hosted solution provides robust features for preserving and accessing web pages.
Repository Info
Tags
Click on any tag to explore related repositories
Introduction
web-archive is a powerful, free, and open-source web archiving and sharing service designed to help users preserve web content. Built entirely on Cloudflare's serverless platform, it leverages Cloudflare Workers, D1 database, and R2 storage bucket for a robust and scalable solution. The project consists of three main components: a browser plugin for capturing web pages, a server to process and store the archived data, and a web client for searching, viewing, and sharing your saved content. It offers features like folder classification, mobile adaptation, AI-generated tag classification, and a reading mode to enhance the archiving experience.
Installation
To get started with web-archive, you will need to deploy the server component and install the browser plugin. The project provides comprehensive documentation to guide you through the deployment process.
You can refer to the deploy document for detailed instructions on setting up your instance.
Once your server is deployed, you can download and install the browser plugin:
After installation, configure the plugin with your deployed service address and key to begin archiving.
Examples
Using web-archive is straightforward. Once the browser plugin is installed and configured with your self-hosted server, you can simply click the plugin icon to save the current web page. The plugin captures the page as a single HTML file and uploads it to your Cloudflare-based server.
After a page is archived, you can access it through the web client. The client allows you to browse your saved pages, organize them into folders, search for specific content, and even share archived pages with others. The reading mode provides a clean, distraction-free view of your saved articles, while AI-generated tags help in automatic classification and discovery.
Why Use web-archive?
web-archive offers several compelling reasons for users looking for a web archiving solution:
- Free and Open Source: It's completely free to use and modify, backed by an active open-source community.
- Self-Hosted Control: Deploying it on your own Cloudflare account gives you full control over your archived data and infrastructure.
- Cloudflare Ecosystem: Leverages the power and cost-effectiveness of Cloudflare Workers, D1, and R2, providing a highly scalable and performant solution.
- Rich Features: Beyond basic archiving, it includes advanced features like folder organization, AI-powered tagging, mobile responsiveness, and a dedicated reading mode.
- Privacy: By self-hosting, you maintain greater control over your data privacy compared to third-party services.
Links
- GitHub Repository: https://github.com/Ray-D-Song/web-archive
- Deployment Documentation: https://web-archive-docs.pages.dev/en/deploy.html
- Chrome Plugin: https://chromewebstore.google.com/detail/web-archive/dfigobdhnhkkdniegjdagofhhhopjajb?hl=zh-CN&utm_source=ext_sidebar
- Firefox Plugin: https://addons.mozilla.org/zh-CN/firefox/addon/web-archive-ray-banzhe/