Where is the data saved? #585

Answered by ato
CubeBeveled asked this question in Q&A

Heritrix saves data to WARC files in the jobs/{jobname}/{timestamp}/warcs subdirectory.
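To illustrate the layout described above, here is a minimal Python sketch that lists the WARC files for one job. The function name `find_warcs`, the `heritrix_home` argument (wherever your `jobs` directory lives), and the `.warc` / `.warc.gz` extensions are assumptions made for illustration; they are not part of Heritrix itself.

```python
from pathlib import Path


def find_warcs(heritrix_home, job_name):
    """Return the WARC files of one job, sorted by path.

    Hypothetical helper: assumes the layout
    <heritrix_home>/jobs/<job_name>/<timestamp>/warcs/ described above.
    """
    job_dir = Path(heritrix_home) / "jobs" / job_name
    # Each launch of the job has its own {timestamp} directory,
    # so match any directory name at that level.
    return sorted(
        path
        for path in job_dir.glob("*/warcs/*")
        # Assumed extensions: plain or gzip-compressed WARC files.
        if path.name.endswith((".warc", ".warc.gz"))
    )
```

Calling `find_warcs("/path/to/heritrix", "myjob")` (both values are placeholders) returns a list of `Path` objects, or an empty list if the job has not written any WARC files yet.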

Most web archives only accept WARC files from trusted sources, not from the general public, as there isn't a way to guarantee the records are unaltered.

Answer selected by ato
Category: Q&A
Labels: archive.org (archive.org services, not (just) Heritrix)
2 participants