Reviews of WebScrapBook
WebScrapBook by Danny Lin
Reviewed by Firefox user 13523201
Rated 1 out of 5
by Firefox user 13523201, 7 years ago
Twice I tried to use the indexer to index my ScrapBook X archive, and the resulting tree doesn't display any of my saved pages. It is basically empty. I tried opening tree/frame.html and tree/map.html, but it made no difference.
To answer Danny Lin's questions:
I copied the original folder to a new location. The first time, I chose to open a folder and selected it; the second time, I dragged and dropped it onto the indexer page.
I picked to index the ScrapBook folder, the one that contains the data folder and the rdf files.
Output of indexer:
Got directory 'ScrapBook'. Inspecting files...
Found 279110 files. Found 'scrapbook.rdf' for legacy ScrapBook. Importing...
Inspecting data files...
Inspecting metadata...
Inspecting TOC...
Adding new items to TOC...
Inspecting favicons...
Checking for created and updated files...
Generating fulltext cache...
Creating cache for '20120308204342'...
Creating cache for '20160405214414'...
Creating cache for '20140803112931'...
Creating cache for '20150301111333'...
Creating cache for '20140607214937'...
Creating cache for '20120408181737'...
Creating cache for '20150610205548'...
Creating cache for '20150804222048'...
...
Creating cache for '20151213141303'...
Creating cache for '20151109181126'...
Creating cache for '20140919193509'...
Creating cache for '20170116200240'...
Generating zip file...
If the download doesn't start, click me.
Done.
The ScrapBook.zip contains the following:
Archive:  ScrapBook.zip
   Length  Date        Time   Name
---------  ----------  -----  ----
        0  2017-12-30  04:15  tree/
  2651584  2017-12-29  22:15  tree/meta.js
   132501  2017-12-29  22:15  tree/toc.js
     9949  2017-12-29  22:15  tree/map.html
      526  2017-12-29  22:15  tree/frame.html
    21474  2017-12-29  22:15  tree/search.html
        0  2017-12-30  04:15  tree/icon/
      807  2017-12-29  22:15  tree/icon/toggle.png
      661  2017-12-29  22:15  tree/icon/search.png
      281  2017-12-29  22:15  tree/icon/collapse.png
      279  2017-12-29  22:15  tree/icon/expand.png
      523  2017-12-29  22:15  tree/icon/external.png
      502  2017-12-29  22:15  tree/icon/item.png
      752  2017-12-29  22:15  tree/icon/fclose.png
      790  2017-12-29  22:15  tree/icon/fopen.png
      445  2017-12-29  22:15  tree/icon/note.png
      515  2017-12-29  22:15  tree/icon/postit.png
 60927503  2017-12-29  22:22  tree/fulltext.js
---------                     -------
 63749092                     18 files
Developer response
posted 7 years ago
Please provide more detail: Where do you save web pages to? How do you run the indexer (by dropping or by selecting a folder or a file)? Which folder do you pick to index? What is the output of the indexer?
Update:
Do you see a list of your captured pages when you open tree/frame.html or tree/map.html? If yes, the indexing seems successful. Just note that the zip only contains the "list" part, and you need to extract them into your ScrapBook folder (so that your ScrapBook folder has data/, tree/, etc.), or each link to the page will show nothing.
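The extraction step described above could be scripted roughly as follows. This is only a sketch, not part of WebScrapBook: the function name `extract_index` and the check for a `data/` folder are assumptions made for illustration, and only Python's standard `zipfile` module is used.

```python
import zipfile
from pathlib import Path


def extract_index(zip_path: Path, scrapbook_dir: Path) -> list[str]:
    """Extract the indexer's zip into the ScrapBook folder.

    Hypothetical helper: after extraction, scrapbook_dir should hold
    both the original data/ folder and the generated tree/ folder.
    Returns the list of entry names found in the zip.
    """
    # Assumed sanity check: the target should be the folder that holds data/.
    if not (scrapbook_dir / "data").is_dir():
        raise FileNotFoundError(
            f"{scrapbook_dir} has no data/ folder; is this the ScrapBook folder?"
        )
    with zipfile.ZipFile(zip_path) as archive:
        names = archive.namelist()
        # Entries such as tree/toc.js are written relative to scrapbook_dir.
        archive.extractall(scrapbook_dir)
    return names
```

Calling `extract_index(Path("ScrapBook.zip"), Path("ScrapBook"))` with your own paths should leave `tree/` next to `data/`, after which the links in tree/map.html can resolve to the saved pages.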
135 reviews
- Rated 1 out of 5 by ehobby, 2 months ago
Unable to get this to work on Firefox 133 and Fedora 40. I installed the backend and the browser extension but could not find any configuration for the backend that would work. There needs to be a more specific set of instructions written by someone who has successfully installed this extension on Linux, unless this only works on Windows. Perhaps this is a great extension, but it is worthless if it cannot be installed.
Developer response
posted 2 months ago
Have you read the documentation? https://github.com/danny0838/webscrapbook/wiki/Basic#3-browser-sidebar-approach
- Rated 5 out of 5 by OM_RA, 2 months ago
- Rated 5 out of 5 by Silopolis, 4 months ago
- Rated 5 out of 5 by Firefox user 18235051, a year ago
I needed to back up a website that used a form login, which made simple scraping impossible. This extension worked like a charm once I figured out some of the configuration options.
- Rated 5 out of 5 by Avater, a year ago
- Rated 5 out of 5 by Firefox user 14643647, a year ago
- Rated 4 out of 5 by Supriyadi, 2 years ago
- Rated 5 out of 5 by Yaliang, 2 years ago
Thanks for developing this plugin. It makes it extremely easy to archive and save web pages.
- Rated 5 out of 5 by texsd, 2 years ago
- Rated 1 out of 5 by Firefox user 13058149, 2 years ago
I chose the WebScrapBook/data option, but it wants me to configure a backend server... ? ? ?
Useless.
I miss the old ScrapBook extension.
Developer response
posted 2 years ago
Please consult the documentation about the different approaches to capturing a page: https://github.com/danny0838/webscrapbook/wiki/Basic. If you still can't get it to work, raise an issue in the source repository with more details (e.g. a screenshot illustrating where you are asked for the backend server configuration).
- Rated 5 out of 5 by Alexander, 3 years ago
- Rated 5 out of 5 by Firefox user 12472805, 3 years ago
- Rated 5 out of 5 by azzone, 3 years ago
- Rated 1 out of 5 by Firefox user 13474132, 3 years ago
So useful before, now completely unusable because it is excessively cumbersome.
- Rated 5 out of 5 by mike1985, 3 years ago
Amazing tool. It would be nice to also be able to set a minimum size for pictures...
- Rated 5 out of 5 by Firefox user 15902721, 3 years ago
- Rated 3 out of 5 by Hann, 3 years ago
- Rated 5 out of 5 by Firefox user 13637550, 3 years ago
This is incredible, and the work that is being put into it is admirable. The documentation isn't the best, but the creator was very quick to answer my questions, and I was able to create a JSON batch capture that captured an entire site for me. It's not the easiest thing to use, but read the documentation, look at the examples, and ask questions on GitHub if you need help. This is a very powerful plugin and I hope the developer continues to develop it.
- Rated 5 out of 5 by DrakeFromFrance, 4 years ago
Thank you for your work, Danny. I use WebScrapBook in Firefox, and Firefox is my default browser on Windows 10. I already set up a new task using the .pyw extension to start WSB with the console hidden; however, it starts Firefox and opens a new tab. Is there a way to start WebScrapBook without opening a new tab?
Just a remark: when I click the WSB icon in the Firefox toolbar, it always takes me a few seconds to find "Open Scrapbook". It would be nice if you emphasised it, or at least put it at the top of the menu. Thank you.
Where do I set server.browse to false??
Developer response
posted 4 years ago
1. Configure PyWebScrapBook and set server.browse to false. We'll consider using false as the default in the future.
2. The primary feature of WebScrapBook is capturing, not management, so putting "Open ScrapBook" as the first command doesn't make sense. Which command to emphasize is likely controversial for the same reason. We need a thorough evaluation and discussion before doing this. FYI: you can configure a hotkey using the browser extension hotkey manager for a faster way to open the sidebar.
- Rated 4 out of 5 by noone, 4 years ago
I just wanted to know how I can stop auto-capture from making a new folder every time for the same URL and instead skip or overwrite the same files. I'm sorry, I searched a lot through the settings and wiki but couldn't find anything (also, I'm not that knowledgeable about this stuff, so I didn't understand a lot). It's not just auto-capture: capturing a page twice still creates two folders.
Developer response
posted 4 years ago
This is probably something not yet implemented. If you'd like to make a feature request, it'd be better to raise an issue in the source repo (https://github.com/danny0838/webscrapbook/issues) or send an email, and provide more details (e.g. Do you use a backend server? What is your use case and intention?), as it's not easy to discuss on this add-on site (we don't receive any notification when a comment is updated).
cf. thread of the original request: https://github.com/danny0838/webscrapbook/issues/8