Project Page: http://georgetown-university-libraries.github.io/File-Analyzer/
The File Analyzer and Metadata Harvester is a general purpose desktop (and command line) tool designed to automate simple, file-based operations. The File Analyzer assembles a toolkit of tasks a user can perform.
The tasks that have been written into the File Analyzer code base have been optimized for use by libraries, archives, and other cultural heritage institutions.
File Analyzer Wiki: https://github.com/Georgetown-University-Libraries/File-Analyzer/wiki
This code has been derived from the NARA File Analyzer and Metadata Harvester which is available at https://github.com/usnationalarchives/File-Analyzer.
- JDK 1.8 or higher (for build)
- JRE 1.8 or higher (for runtime)
- (If you need to run with Java 6, see Releases for an older version)
- Maven (or you will need to compile the modules manually)
- Clone this code to your computer
- Run mvn install
- Detailed Installation Instructions: https://github.com/Georgetown-University-Libraries/File-Analyzer/wiki/Installation-instructions
This code will build 3 flavors of the File Analyzer.
- All code runs from a self-extracting jar file
- This version of the file analyzer is a self-extracting jar file that references the core file analyzer jar file.
- It contains tools for automating the creation of DSpace ingestion folders
- This version contains extensions illustrating various capabilities of the File Analyzer.
- This version of the file analyzer is a self-extracting jar file that references both the core and dspace file analyzer jar files.
- This version of the application uses features of Apache Tika, BagIt, and Marc4j