BMAT Library

BMAT Library Structure

The library is divided into three modules so that new code and functionality can be added simply and cleanly in the future.

data_persistence: module for CRUD functions, designed so that other databases can be added later.
- mongodb.py
  - MongoDB
  - insert_many(data)
  - get_by_iswc(data)
data_sources: module for extracting and transforming different data sources.
- csv_source.py (for CSV files)
  - CsvSourceType1 (db_works_test.csv type)
    - transform_to_list_dict()
    - save_to_file_as_dict(path_output)
    - check_consistency()
- aux_functions.py (auxiliary functions for all data sources)
  - clean_string_iswc(iswc)
tests (only a few examples):
- unit
- integrity
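
As a rough illustration of what the data_sources helpers listed above might do, here is a minimal sketch. It is not the library's actual code: the cleaning rule (keep only letters and digits, upper-cased), the free-function form of transform_to_list_dict, and the "ISWC" column name are all assumptions made for the example.

```python
import csv
import io
import re


def clean_string_iswc(iswc):
    """Normalize an ISWC by dropping separators and upper-casing it.

    Assumption: cleaning means keeping only letters and digits, so that
    "T-034.524.680-1" and "t0345246801" compare equal.
    """
    if iswc is None:
        return ""
    return re.sub(r"[^0-9A-Za-z]", "", iswc).upper()


def transform_to_list_dict(csv_file, iswc_column="ISWC"):
    """Read CSV rows into a list of dicts, cleaning the ISWC column.

    `iswc_column` is a hypothetical header name; adjust it to the real file.
    """
    rows = []
    for row in csv.DictReader(csv_file):
        record = dict(row)
        record[iswc_column] = clean_string_iswc(record.get(iswc_column))
        rows.append(record)
    return rows


# Usage with an in-memory file standing in for db_works_test.csv.
sample = io.StringIO("ISWC,title\nT-034.524.680-1,Some Song\n,Untitled\n")
records = transform_to_list_dict(sample)
print(records)
```

Normalizing the ISWC before loading means later lookups can match on a single canonical form, whatever punctuation the source file used.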

Also, three main files have been created:

main_etl: extracts, transforms, and loads the db_works_test.csv data into a MongoDB database (db=bmat, collection=musicalworks, location=localhost:27017), as required.
main_get_owners: gets right owners by ISWC (from the bmat db and musicalworks collection).
main_flask_server: starts a server (127.0.0.1:5001) for querying right owners data with HTTP GET requests (from the bmat db and musicalworks collection).
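
The sketch below shows one way the persistence class and the Flask server could fit together. It is written under stated assumptions and is not the repository's real code: the "iswc" field name, the /right_owners route, the connect, create_app and run_server helpers, and the idea that get_by_iswc receives a list of ISWC strings are all hypothetical. Only the method names, database, collection, host, and ports come from the description above.

```python
class MongoDB:
    """Thin wrapper around a collection object.

    Any object exposing pymongo's `insert_many` and `find` works here,
    which keeps the class usable without a running database.
    """

    def __init__(self, collection):
        self.collection = collection

    @classmethod
    def connect(cls, host="localhost", port=27017,
                db="bmat", collection="musicalworks"):
        # Imported lazily so the sketch loads even without pymongo installed.
        from pymongo import MongoClient
        return cls(MongoClient(host, port)[db][collection])

    def insert_many(self, data):
        if data:  # pymongo rejects an empty document list
            self.collection.insert_many(data)

    def get_by_iswc(self, data):
        # Assumption: `data` is a list of ISWC strings and documents
        # store them under a hypothetical "iswc" field.
        query = {"iswc": {"$in": list(data)}}
        return list(self.collection.find(query, {"_id": 0}))


def create_app(store):
    """Build a Flask app exposing GET /right_owners?iswc=... (hypothetical route)."""
    from flask import Flask, jsonify, request

    app = Flask(__name__)

    @app.route("/right_owners", methods=["GET"])
    def right_owners():
        # Repeated ?iswc= parameters are collected into one list.
        iswcs = request.args.getlist("iswc")
        return jsonify(store.get_by_iswc(iswcs))

    return app


def run_server():
    """Start the server on the address given in the README."""
    create_app(MongoDB.connect()).run(host="127.0.0.1", port=5001)
```

Passing the collection into the class, rather than creating the client inside it, lets the tests substitute an in-memory fake and would make it easier to add another database backend later.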

Pipenv has been used to create the environment.

0. To install pipenv: pip install pipenv
