A suggestion for making WACZ and WARC-requests #663

hamoudak · 2024-08-03T02:29:35Z

I would like to see a wacz-requests section, as there is already zim-requests, because I've seen many people who lack the ability (for instance, system requirements or enough disk space) or the knowledge to crawl a website with Browsertrix. Manual archiving with ArchiveWeb.page is good and easy to handle, but archiving a whole website that way means going through many links. So I think this idea will help many.

ikreymer · 2024-08-29T20:34:34Z

We offer a service, https://browsertrix.com/, where you can sign up and run crawls via a UI. The crawls are run via Browsertrix Crawler. Unfortunately, we don't have the resources to offer WACZ files of sites on demand, like the Zimit service does. One idea, perhaps, is that Zimit could offer WACZ files alongside ZIM as part of the same crawl. That's a question for @benoit74, @rgaudin, and others.

hamoudak · 2024-08-29T22:44:10Z

Thank you for clearing this up. I have known about the website for a long time, but all I see is [log in] and the premium offers. Will it be free or something?

tw4l · 2024-08-29T22:59:24Z

> Thank you for clearing this up. I have known about the website for a long time, but all I see is [log in] and the premium offers. Will it be free or something?

Our hosted service is and will remain a paid service, but the software is FOSS and it is possible to self-host if you're comfortable with Kubernetes: https://github.com/webrecorder/browsertrix. That is probably more than you want to do given the system limitations mentioned in the issue description, but it is an option.

benoit74 · 2024-09-02T12:46:46Z

> One idea, perhaps, is that Zimit could offer WACZ files alongside ZIM as part of the same crawl

Definitely not something "light" to implement: we assume we are dealing with ZIM files in multiple places ^^ I'm pretty sure we will never do it unless there is a stronger reason than someone wishing to have this feature.

github-project-automation bot added this to Webrecorder Projects Aug 3, 2024

github-project-automation bot moved this to Triage in Webrecorder Projects Aug 3, 2024

ikreymer added the "question" label Aug 29, 2024

ikreymer closed this as completed Sep 2, 2024

github-project-automation bot moved this from Triage to Done! in Webrecorder Projects Sep 2, 2024

ikreymer closed this as not planned Sep 2, 2024
