Life is made up of goals and metrics. Passionate about data science, MLOps and technology.
PROGRAMMING LANGUAGE AND DATABASE • Python focused on data analysis • Web Scraping with python • SQL for data extraction • SQLite and MySQL database (Postgres, Oracle and Mongo DB)
STATISTICS AND MACHINE LEARNING • Descriptive statistics (Scatter, location, asymmetry, kurtosis and density) • Regression, classification, Natural Language Processing, clustering and “learn to rank” algorithms. • Data balancing techniques (SMOTE), attribute selection (Feature engineer) and Dimensionality reduction (PCA) • Algorithm performance metrics (RMSE, MAE, MAPE, Confusion Matrix, Precision, Recall, ROC Curve, Lift Curve, AUC, Silhouette Score, DB-index). • Machine Learning Packages: TensorFlow, ScikitLearn and Spacy (NLP)
DATA VIEW • Matplolib, Seaborn and Plotly • Power BI and Data Studio.
SOFTWARE ENGINEERING • Git, GitHub, Cookiecutter, Poetry, Virtual Environment, Dotenv, Docker and Linux. • StreamLit, FastAPI, Flask, Python API's. • Cloud Heroku, AWS Amazon, Google Cloud Platform (GCP) and Azure.
DATA ENGINEERING • Jenkins, Airflow, Apache Beam • Kubernetes • ETL and ELT (Python) • Dataform and BigQuery (Data warehouse for Big Data)
- Project themes (financial market, customer acquisition and public policies)
- Python
- Machine Learning
- Big Data
- SQL
- Artificial intelligence