ETL process to build research database for the BOPRA study. This repository does not contain any data, just the process to create the database from raw data.
- Read source csv files.
- Apply proper data types.
- Remove irrelevant and sensitive fields.
- Convert all timestamps to nanosecond intervals relative to the alert time of the physician-staffed EMS unit.
- Remove periods marked as artifacts in manual validation of physiological signals.
- Write the database on disk in SQLite and Parquet formats.
Please refer to the source code for more detailed insight.