Skip to content

Commit

Permalink
Add TG23 crawl config
Browse files Browse the repository at this point in the history
  • Loading branch information
niccofyren committed Oct 1, 2023
1 parent 8df0257 commit 81ade95
Showing 1 changed file with 20 additions and 0 deletions.
20 changes: 20 additions & 0 deletions browsertrix-crawler/configs/tg23.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,20 @@
seeds:
- url: https://www.gathering.org/tg23/
include: www.gathering.org/(tg23|api)

collection: tg23

behaviors: autoscroll,autoplay,autofetch,siteSpecific
waitUntil: load,networkidle0
generateCDX: true
combineWARCs: true
saveState: always
workers: 2

# Make "live" crawling view available at 9037
newContext: window
screencastPort: 9037

warcinfo:
operator: The Gathering
hostname: gathering.org

0 comments on commit 81ade95

Please sign in to comment.