-
Notifications
You must be signed in to change notification settings - Fork 33
Issues: archivesunleashed/aut
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Use Tika's detected MIME type instead of ArchiveRecord getMimeType?
DataFrames
enhancement
Scala
#342
by jrwiebe
was closed Aug 14, 2019
Add crawl_date to binary DataFrames and imageLinks
DataFrames
enhancement
Scala
#413
by ruebot
was closed Jan 18, 2020
Remove http headers, and html on webpages()
bug
DataFrames
enhancement
#538
by ruebot
was closed May 30, 2022
feature request: log when loadArchives opens and closes warc files in a dir
enhancement
RA-Task
#156
by dportabella
was closed Jan 31, 2019
Better approach to ids in WriteGraphML & WriteGEXF
enhancement
#168
by greebie
was closed Feb 17, 2018
Integration of neural network models for image analysis
enhancement
RA-Task
#240
by lintool
was closed Jul 8, 2019
Migration of all RDD functionality over to DataFrames
DataFrames
enhancement
#223
by lintool
was closed Apr 21, 2020
More complete Twitter Ingestion
enhancement
feature
#194
by greebie
was closed Jul 15, 2019
10 of 16 tasks
Add filter/keep by http status to RecordLoader class
enhancement
resolve before 0.18.0
Scala
#315
by greebie
was closed Aug 17, 2019
Plain Text UDF that combines RemoveHTML + RemoveHttpHeader
enhancement
rdd
wontfix
#270
by ianmilligan1
was closed Oct 1, 2018
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.