Skip to content
View Butters3388214's full-sized avatar

Block or report Butters3388214

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

CLI

13 repositories

The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns

Python 1,395 135 Updated Jul 7, 2024

The successor of GNU Wget. Contributions preferred at https://gitlab.com/gnuwget/wget2. But accepted here as well 😍

C 562 76 Updated Nov 1, 2024

Wget Git mirror

C 393 132 Updated Sep 25, 2024

Wget-compatible web downloader and crawler.

HTML 555 77 Updated Apr 29, 2024

Google Drive Public File Downloader when Curl/Wget Fails

Python 4,300 350 Updated Aug 12, 2024

Google Drive direct download of big files

Perl 937 196 Updated May 12, 2023

Got: Simple golang package and CLI tool to download large files faster 🏃 than cURL and Wget!

Go 723 46 Updated Jan 16, 2024

Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON do…

Pascal 681 42 Updated Apr 20, 2024

tget is wget for torrents

JavaScript 622 51 Updated Dec 11, 2020

🐋 Web Archiving Integration Layer: One-Click User Instigated Preservation

Roff 350 35 Updated Oct 4, 2024

Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.

Java 2,829 763 Updated Nov 7, 2024

simple script to convert web resources to a single warc file

Python 18 2 Updated May 11, 2023

A file management automation tool with SQL-like syntax.

TypeScript 60 2 Updated Mar 26, 2024