Skip to content

Latest commit

 

History

History
25 lines (20 loc) · 2.41 KB

intermediate-challenge-readme.md

File metadata and controls

25 lines (20 loc) · 2.41 KB

Intermediate Questions

Excel

  • How do you use Excel to clean and preprocess large datasets, including techniques like filtering, sorting, and conditional formatting?
  • Can you explain the concept of pivot tables in Excel, and describe some use cases for summarizing and analyzing data?
  • Can you describe some advanced Excel functions for statistical analysis, including tools for hypothesis testing, regression analysis, and Monte Carlo simulation?
  • How do you use Excel to build and analyze data visualizations, including techniques for charting, graphing, and mapping data?

SQL

  • Can you describe some common database design patterns, including techniques for normalization, denormalization, and schema optimization?
  • How do you use SQL to aggregate and summarize data from multiple tables, using techniques like joins, subqueries, and window functions?

Python

  • How do you use Python to build and optimize machine learning models, including techniques like hyperparameter tuning, model selection, and ensembling?
  • How do you use Python to preprocess and clean messy data, including techniques like data imputation, outlier detection, and feature scaling?

Hands-on Project (Select One)

  • Churn Prediction for Telecommunications: Develop a churn prediction model to identify customers who are at risk of canceling their contracts, using data from customer interactions, billing, and usage patterns. The model should help the business proactively address customer concerns and improve retention rates.
  • Sentiment Analysis for Social Media: Develop a sentiment analysis model to analyze social media posts and comments and classify them into positive, negative, or neutral categories. The model should help businesses monitor their brand reputation, understand customer feedback, and identify opportunities for improvement.
  • Predictive Maintenance for Industrial Equipment: Develop a predictive maintenance model to detect potential equipment failures before they occur, using sensor data from industrial machinery. The model should be able to identify the specific components that are likely to fail and provide recommendations for maintenance actions to prevent unplanned downtime.

How to submit

  • Create a new branch or fork this repo, your answers to the theory questions should be in a readme or txt file.
  • The solutions to the hands-on can be created in the repository too.
  • Create a pull request when you are ready to submit.