
WebScrapBook by Danny Lin
Capture web pages to local device or backend server for future retrieval, organization, annotation, and edit.
Extension Metadata




***This extension is under development. Every feature could change in the future. Use in production carefully and be sure to make a backup frqeuently.***
WebScrapBook is a browser extension that captures the web page faithfully with various archive formats and customizable configurations, for future retrieval, organization, annotation, and editing. This project inherits from legacy Firefox add-on ScrapBook X.
Features:
1. Capture faithfully: A web page shown in the browser can be captured without losing any subtle detail. Metadata such as source URL and timestamp are also recorded.
2. Customizable capture: WebScrapBook can save selected area in a page, save source page (before processed by scripts), or save page as a bookmark. How to capture images, audio, video, fonts, frames, styles, scripts, etc. are also customizable. A web page can be saved as a folder, a ZIP-based archive file (HTZ or MAFF), or a single HTML file.
3. Organizable collections: Captured pages can be organized in the browser sidebar using one or more "scrapbooks". A scrapbook holds a hierarchical tree structure to organize data items, and can be further indexed for a rich-feature search (using a combination of title, fulltext keywords, custom comment, source URL, or other metadata). (*)
4. Page editing: A web page can be highlighted, annotated, or edited before or after a capture. You can additionally create and manage notes using HTML or markdown format. (*)
5. Remote access: Captured data can be hosted with a central backend server and be read or edited from other devices. Alternatively, a static site index can be generated for a scrapbook, which can therefore be hosted on a shared web server that doesn't support dynamic web hosting. (*)
6. Mobile support: WebScrapBook supports mobile browsers such as Firefox for Android and Kiwi browser. You can capture and edit the web page from a mobile phone or tablet.
7. Legacy ScrapBook support: Scrapbooks created from legacy ScrapBook or ScrapBook X can be converted into WebScrapBook-compliant format for usage. (*)
* All or partial functionality of a starred feature above requires a running collaborating backend server, which can be easily set up using PyWebScrapBook.
* An HTZ or MAFF archive file can be viewed using the built-in archive page viewer, with PyWebScrapBook or other assistant tools, or by opening the index page after unzipping.
See Also:
* For further information and frequently asked questions, visit the documentation wiki.
* For better discussion, please report an issue to the source repository.
WebScrapBook is a browser extension that captures the web page faithfully with various archive formats and customizable configurations, for future retrieval, organization, annotation, and editing. This project inherits from legacy Firefox add-on ScrapBook X.
Features:
1. Capture faithfully: A web page shown in the browser can be captured without losing any subtle detail. Metadata such as source URL and timestamp are also recorded.
2. Customizable capture: WebScrapBook can save selected area in a page, save source page (before processed by scripts), or save page as a bookmark. How to capture images, audio, video, fonts, frames, styles, scripts, etc. are also customizable. A web page can be saved as a folder, a ZIP-based archive file (HTZ or MAFF), or a single HTML file.
3. Organizable collections: Captured pages can be organized in the browser sidebar using one or more "scrapbooks". A scrapbook holds a hierarchical tree structure to organize data items, and can be further indexed for a rich-feature search (using a combination of title, fulltext keywords, custom comment, source URL, or other metadata). (*)
4. Page editing: A web page can be highlighted, annotated, or edited before or after a capture. You can additionally create and manage notes using HTML or markdown format. (*)
5. Remote access: Captured data can be hosted with a central backend server and be read or edited from other devices. Alternatively, a static site index can be generated for a scrapbook, which can therefore be hosted on a shared web server that doesn't support dynamic web hosting. (*)
6. Mobile support: WebScrapBook supports mobile browsers such as Firefox for Android and Kiwi browser. You can capture and edit the web page from a mobile phone or tablet.
7. Legacy ScrapBook support: Scrapbooks created from legacy ScrapBook or ScrapBook X can be converted into WebScrapBook-compliant format for usage. (*)
* All or partial functionality of a starred feature above requires a running collaborating backend server, which can be easily set up using PyWebScrapBook.
* An HTZ or MAFF archive file can be viewed using the built-in archive page viewer, with PyWebScrapBook or other assistant tools, or by opening the index page after unzipping.
See Also:
* For further information and frequently asked questions, visit the documentation wiki.
* For better discussion, please report an issue to the source repository.
Report this add-on for abuse
If you think this add-on violates Mozilla's add-on policies or has security or privacy issues, please report these issues to Mozilla using this form.
Please don't use this form to report bugs or request add-on features; this report will be sent to Mozilla and not to the add-on developer.
The developer of this extension asks that you help support its continued development by making a small contribution.
This add-on needs to:
- Download files and read and modify the browser’s download history
- Access browser tabs
- Store unlimited amount of client-side data
- Access browser activity during navigation
- Access your data for all websites
- Add-on Links
- Version
- 0.97.1
- Size
- 418.53 KB
- Last updated
- 8 hours ago (Jan 21, 2021)
- License
- Mozilla Public License, version 2.0
- Version History
- There are no ratings yet
- There are no ratings yet
- There are no ratings yet
- There are no ratings yet
- There are no ratings yet
- There are no ratings yet