This code reads Excel data submitted by project participants, stores it in a PostgreSQL database, and exports information into defined reports. This project was developed in IntelliJ IDEA and has configuration files checked in so it runs easily in that environment.
If you're running OS X, use Postgres.app.
If you're running Windows, use the official PostgreSQL installer.
If you're running some flavor of Linux, you probably know how to get a PostgreSQL instance up and running, so that's left as an exercise for the reader.
Update hibernate.cfg.xml to make sure it reflects your database settings, especially hostname, user, and password.
Run the DDL script to set up database tables, indexes, and constraints.
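As a rough sketch of that setup step, the function below creates the database and applies the DDL script. It assumes the database is named icpc (the name used in the pg_dump example later in this README) and that it does not exist yet. The DDL script's filename is not stated here, so the caller passes its path; setup_db is a hypothetical helper name.

```shell
# Sketch: create the database and apply the DDL script.
# Assumptions: the database is called "icpc" (taken from the pg_dump
# example in this README); the DDL script path is supplied by the caller.
setup_db() {
  db_user="$1"     # PostgreSQL user name
  ddl_script="$2"  # path to the DDL script in this codebase
  createdb -U "$db_user" icpc &&
    psql -U "$db_user" -d icpc -f "$ddl_script"
}

# Example (placeholder path): setup_db myuser path/to/ddl.sql
```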
Data is expected to come in as .xlsx files and to follow the default template included in this codebase.
You can load data with the cl.TemplateParser class. Run it with the -d flag and a value to load a whole directory of .xlsx files, or with the -f flag and a value to load a single .xlsx file. While loading, watch the console and the log for data warnings and errors. Fix these as directed.
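Outside of IntelliJ, a load could be launched roughly as follows. This is only a sketch: the CP variable and the load_templates helper are hypothetical, and the real classpath depends on how you build the project and where its dependency jars live.

```shell
# Sketch: run cl.TemplateParser from the command line.
# Assumption: CP holds the project classpath (compiled classes plus
# dependency jars); this README does not say what that path is.
load_templates() {
  mode="$1"    # -d for a whole directory, -f for a single file
  target="$2"  # directory or .xlsx file to load
  java -cp "${CP:?set CP to the project classpath}" cl.TemplateParser "$mode" "$target"
}

# Examples (placeholder paths):
#   load_templates -d submissions/
#   load_templates -f submissions/team1.xlsx
```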
You might also need to load extra DNA information into the system with the cl.DnaParser class. This only takes a single file with the -f flag and value.
You can dump all processed data as one big, combined file using the cl.DnaParser class. Set the -f flag to the filepath you want the report written to (don't forget to end the filename with .xlsx).
Tip: You might need a fairly large heap to run this; -Xmx3g is recommended.
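Putting the report step and the heap tip together, a command-line run might look like the sketch below. The class name is used exactly as given above; the CP variable and the export_report helper are hypothetical, and the .xlsx check is just a convenience so the filename reminder is enforced before the JVM starts.

```shell
# Sketch: write the combined report with the recommended 3 GB heap.
# Assumption: CP holds the project classpath, which this README
# does not specify.
export_report() {
  report="$1"  # output path; must end with .xlsx
  case "$report" in
    *.xlsx) ;;
    *) echo "report path must end with .xlsx" >&2; return 1 ;;
  esac
  java -Xmx3g -cp "${CP:?set CP to the project classpath}" cl.DnaParser -f "$report"
}

# Example (placeholder path): export_report reports/combined.xlsx
```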
To export (and compress) a copy of the db, replacing <username> with your username:
pg_dump -U <username> -F p icpc | gzip > icpc.sql.gz
To import a copy of the db, first decompress the dump (gunzip icpc.sql.gz), then run:
psql -U <username> -d icpc -a -f icpc.sql
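The export command writes a gzip-compressed file, while the import command reads plain icpc.sql. As an alternative sketch, the compressed dump can be streamed straight into psql without unpacking it on disk. This assumes the target database icpc already exists; import_dump is a hypothetical helper name.

```shell
# Sketch: import a gzip-compressed dump without a temporary .sql file.
# Assumption: the target database "icpc" already exists.
import_dump() {
  db_user="$1"                  # PostgreSQL user name
  dump_gz="${2:-icpc.sql.gz}"   # compressed dump, defaults to icpc.sql.gz
  gunzip -c "$dump_gz" | psql -U "$db_user" -d icpc
}

# Example: import_dump myuser icpc.sql.gz
```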