-
Notifications
You must be signed in to change notification settings - Fork 33
Issues: archivesunleashed/aut
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
feature request: log when loadArchives opens and closes warc files in a dir
enhancement
RA-Task
#156
by dportabella
was closed Jan 31, 2019
Integration of neural network models for image analysis
enhancement
RA-Task
#240
by lintool
was closed Jul 8, 2019
Migration of all RDD functionality over to DataFrames
DataFrames
enhancement
#223
by lintool
was closed Apr 21, 2020
Plain Text UDF that combines RemoveHTML + RemoveHttpHeader
enhancement
rdd
wontfix
#270
by ianmilligan1
was closed Oct 1, 2018
Use Tika's detected MIME type instead of ArchiveRecord getMimeType?
DataFrames
enhancement
Scala
#342
by jrwiebe
was closed Aug 14, 2019
Add method for unknown extensions in binary extractions
DataFrames
enhancement
resolve before 0.18.0
Scala
#343
by ruebot
was closed Aug 18, 2019
PDF binary object extraction
DataFrames
enhancement
feature
Scala
#302
by ruebot
was closed Aug 12, 2019
Method to perform finer-grained selection of ARCs and WARCs
enhancement
in progress
RA-Task
#247
by lintool
was closed May 24, 2022
Discussion: Restyle UDFs in the context of DataFrames
DataFrames
enhancement
rdd
Scala
#425
by lintool
was closed Mar 18, 2020
Replace Java ARC/WARC record processing library
enhancement
Java
#494
by ruebot
was closed May 24, 2022
Replace scala-uri library from ExtractDomain and just parse public_suffix_list.dat
clean-up
enhancement
#521
by ruebot
was closed Nov 1, 2021
Adding getCrawlYear in ArchiveRecords, resolves #104
enhancement
#105
by ianmilligan1
was merged Oct 26, 2017
Loading…
Changing keepDate to allow multiple dates, would close #108
enhancement
#161
by ianmilligan1
was merged Jan 8, 2018
Loading…
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.