Big Data Analytics on Healthcare is currently in top demand in the software industries as the health data (EHR, PHR, EMR, etc.) increased exponentially over the past decade, with growing data repositories across healthcare systems. The leverage of the rising Big Data to discover the knowledge and insights of data required prior skills and expertise in data pre-processing, statistics, data mining, machine learning, and programming languages as well. Applications of big data analytics on health can improve the patient-based service, detect spreading diseases earlier, generate new insights into disease mechanisms, monitor the quality of the medical and healthcare institutions as well as provide better treatment methods.
- JavaScript
- Python
- PostgreSQL
- Apache Superset
- Tableau
During the MySOFT HDA vInternship program, I will learn/develop the following expertise/skills by solving real-life problems using anonymous data:
- Structure of clinical laboratory, environmental, and census data
- Data pre-processing, noise reduction, missing value imputation, normalization, and standardization
- Multidimensional data modeling (e.g. star schema, snowflake schema)
- Online Analytical Processing (OLAP)
- Aggregation, correlation, association, clustering, prediction, and Data visualization
The root three folder will contain your Jupyter Notebook, Flask API, and Angular Visualization codebase respectively as follow:
notebook- Your Jupyter Notebook code. Don't include any large file e.g. data.xlxs
api - Your Flask API codebase.
viz - Your angular frontend for Visualization codebase
- clone the master branch on you local machine:
git clone https://github.com/VIS-eSRD-Lab/HDA-Tech-Learning.git
- create a new branch using your intern id:
git checkout -b i22b0006s01
- push your codebase into your branch:
git push origin i22b0006s01