This is a private repository to set up and showcase a data pipeline and architecture design.
- CI/CD process
- CI with GitHub Actions
- Coupled with unit testing
- With pytest and pylint
- SQL dump into mariadb
- Hosting of mariadb-server on local linux env
- File upload to AWS s3
- File upload from local linux env to cloud storage
- Airflow schedule
- Hosting of airflow-server on local linux env
- Error logging