wep

Scrape HTTP responses with CSS selectors using goquery, optionally fetching pages through playwright. wep can also spider through the URLs found in each page, using a separate CSS selector and attribute option to choose which links to follow.

install with: go install github.com/reallygoodprogrammer/wep@latest

usage:

Usage of wep: wep [OPTIONS] <CSS SELECTOR>
-u, --url <URL>                 set site url for initial request
-a, --attr <ATTR>               extract from ATTR attribute instead of HTML
-H, --display-url               display the url of the page with each match

-n, --headless                  run the program in chromium headless mode
-p, --playwright                use playwright instead of net/http lib for requests
-c, --concurrency <LEVEL>       set concurrency level for requests (def=1)
-t, --timeout <LEVEL>           set timeout for requests in sec (def=10)
-b, --cookie <COOKIE>           set 'Cookie' header for each request
-i, --inner                     display only the inner content of matching element

-l, --local <FILENAME>          search through local HTML file instead
-s, --stdin-urls                read urls from stdin instead of html data

-T, --traverse <CSS SELECTOR>   find new urls to spider by matching css selector
-A, --traverse-attr <ATTR>      find spider urls in ATTR of matching -T arg
-L, --leave-domain              allow spidering urls outside original domain
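
The spidering options (-T, -A, -L) imply two steps for every link found on a page: resolve it against the page's URL, then decide whether it stays inside the original domain. The sketch below shows one way that could work using only the Go standard library. It is not taken from wep's source: the helper names `resolveLink` and `shouldVisit` are hypothetical, and treating "same domain" as an exact, case-insensitive hostname match is an assumption.

```go
package main

import (
	"fmt"
	"net/url"
	"strings"
)

// resolveLink (hypothetical helper) resolves a possibly relative link found
// on a page against the URL of that page. It returns nil if either value
// cannot be parsed.
func resolveLink(pageURL, link string) *url.URL {
	base, err := url.Parse(pageURL)
	if err != nil {
		return nil
	}
	ref, err := url.Parse(strings.TrimSpace(link))
	if err != nil {
		return nil
	}
	return base.ResolveReference(ref)
}

// shouldVisit (hypothetical helper) reports whether a resolved link should
// be spidered. Only http and https links are followed. Unless leaveDomain is
// true (the assumed meaning of -L), the link's hostname must match the
// hostname of the start URL.
func shouldVisit(start, target *url.URL, leaveDomain bool) bool {
	if start == nil || target == nil {
		return false
	}
	if target.Scheme != "http" && target.Scheme != "https" {
		return false
	}
	if leaveDomain {
		return true
	}
	return strings.EqualFold(start.Hostname(), target.Hostname())
}

func main() {
	// example.com and the links below are placeholder values.
	start := resolveLink("https://example.com/docs/", "/docs/")
	links := []string{
		"guide.html",
		"/about",
		"https://other.example.org/x",
		"mailto:me@example.com",
	}
	for _, l := range links {
		u := resolveLink(start.String(), l)
		fmt.Printf("%-30s visit=%v\n", l, shouldVisit(start, u, false))
	}
}
```

Comparing hostnames exactly means a subdomain such as `blog.example.com` counts as a different domain from `example.com`; a spider that wants to include subdomains would need a looser comparison.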
