Skip to content
View mbarbierif's full-sized avatar

Block or report mbarbierif

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
mbarbierif/README.md


PROFILE


Detail-oriented Data Engineer & Python Developer, interested in developing and maintaining highly scalable, secure and reliable data processes and architecture. Possesses strong analytical and communication skills, excellent problem-solving abilities, and a deep understanding of technology, leaning towards individual contributor roles. A systems-thinking approach allows him to anticipate technical debt and possible roadblocks during the data project's life cycle, applying technical writing skills to transform any messy data source or data process into a well-documented data catalog or codebase, ensuring that both current and new team members have access to the results of the collective learning process.


SKILLS


  • Python Programming: numpy, pandas, matplotlib, flake, Selenium, pytest, requests, beautifulsoup, flask, FastAPI
  • Database Design and Management: PostgreSQL, MySQL, ClickHouse, MongoDB, Redis, Neo4j, BigQuery
  • Google Cloud Platform: Compute Engine, Cloud Composer (Airflow), BigQuery, Cloud Storage, Looker BI
  • Data Extraction, Transformation and Loading (ETL)
  • Web Crawler & Scraper Design and Implementation
  • Complex Problem-Solving
  • Training Junior Team Members
  • Technical Writing & High Quality Documentation

EXPERIENCE


DATA ENGINEER (Apr 2023 - Jun 2023)
Eclypsium. Portland, OR, USA & Córdoba, Argentina (Remote)

  • Collaborated on ETL (Extract, Transform, Load) tasks, maintaining data integrity and verifying pipeline stability.
  • Collaborated on web scraping tasks related to cybersecurity.
  • Designed and implemented effective database solutions and models to store and retrieve cybersecurity data.
  • Prepared documentation and analytic reports, delivering summarized results, analysis and conclusions to stakeholders.

Skills: Python · SQL · ETL · Selenium · PostgreSQL · BigQuery · MongoDB · Looker · Google Cloud Platform (GCP) · Web Scraping

BACK END ENGINEER (Nov 2022 - Jan 2023)
The Climate Corporation. San Francisco, CA, USA (Remote)

  • Built APIs and data clients to consume APIs.
  • Troubleshooted and tested software and debugged to clean up code and improve efficiency.
  • Managed efficient SQL queries and data transport.
  • Worked in Agile Scrum team environment with high-tempo production cadence.
  • Developed server-side logic in Python and Scala.

Skills: Python · SQL · Scala · REST APIs · Docker · Amazon Web Services (AWS)

DATA ENGINEERING LEAD & TRAINING SPECIALIST (Sep 2021 - Aug 2022)
Datawheel. Cambridge, MA, USA & Concepción, Chile (Hybrid)

  • Compiled, cleaned and manipulated data for proper handling.
  • Skilled at working independently and collaboratively in a team environment.
  • Collaborated on ETL (Extract, Transform, Load) tasks, maintaining data integrity and verifying pipeline stability.
  • Designed and implemented effective database solutions and models to store and retrieve data.
  • Built databases and table structures for web applications.
  • Developed, implemented and maintained data analytics protocols, standards, and documentation.
  • Contributed to internal activities for overall process improvements, efficiencies and innovation.

Skills: Python · pandas · SQL · PostgreSQL · MySQL · ClickHouse · Redis · MongoDB · Neo4j · Rust · FastAPI · Flask · Google Cloud Platform (GCP) · Amazon Web Services (AWS) · REST APIs · Data Warehousing · Big Data Analytics

DATA ENGINEER (Oct 2018 - Sep 2021)
Datawheel. Cambridge, MA, USA & Concepción, Chile (Hybrid)

  • Compiled, cleaned and manipulated data for proper handling.
  • Skilled at working independently and collaboratively in a team environment.
  • Collaborated on ETL (Extract, Transform, Load) tasks, maintaining data integrity and verifying pipeline stability.
  • Designed and implemented effective database solutions and models to store and retrieve data.
  • Built databases and table structures for web applications.

Skills: Python · pandas · SQL · PostgreSQL · ClickHouse · Rust · Google Cloud Platform (GCP) · Amazon Web Services (AWS) · REST APIs · Data Warehousing · Big Data Analytics

OPERATIONS RESEARCH CONSULTANT (Aug 2016 - Oct 2016)
Freelance. Concepción, Chile (On-Site)

Developed a Python application that solved a special assignment problem, optimizing the earnings of a consulting firm taking an Excel spreadsheet as input.

Skills: Python · NumPy · Microsoft Excel · Operations Research · Optimization


EDUCATION


MITx MicroMasters Program in Statistics & Data Science (May 2019 - Dec 2020)
Massachusetts Institute of Technology (MIT). Cambridge, MA, USA (Online)

  • Relevant Coursework:
    • Machine Learning and Deep Learning, 2020
    • Fundamentals of Statistics, 2020
    • Data Analysis for Social Science, 2019

Bachelor of Science in Industrial Engineering (Mar 2010 - Mar 2018)
Universidad de Concepción. Concepción, Chile (On-Site)

  • Relevant Coursework:
    • Machine Learning and Business Intelligence, 2018
    • Vehicle Routing Problems in Smart Cities, 2017
  • Professional Development Studies:
    • Gender & Women's Studies, 2017

Popular repositories Loading

  1. dw-localenv-etl dw-localenv-etl Public

    Python 1

  2. rust-problems rust-problems Public

    A set of 89 problems that helped me learn C, but solved in Rust instead

    Rust

  3. cbp_scripts cbp_scripts Public archive

    Python

  4. fictitious-dataset fictitious-dataset Public

    A fictitious dataset for Datawheel training.

    Python

  5. dw-localenv-tesseract dw-localenv-tesseract Public

    Shell

  6. bamboo-resources-api bamboo-resources-api Public archive

    A Flask API designed to return sample data for Bamboo resources.

    Python