SHAP-based validation for linear and tree-based models. Applied to binary, multiclass and regression problems.
-
Updated
Apr 10, 2025 - Python
SHAP-based validation for linear and tree-based models. Applied to binary, multiclass and regression problems.
A Linear Regression model to predict the car prices for the U.S market to help a new entrant understand important pricing variables in the U.S automobile industry. A highly comprehensive analysis with detailed explanation of all steps; data cleaning, exploration, visualization, feature selection, model building, evaluation & MLR assumptions vali…
Engaged in research to help improve to boost text sentiment analysis using facial features from video using machine learning.
Feature selection package of the mlr3 ecosystem.
using Drebin dataset to distinguish between malwares and not malwares
HR Analytics Dataset
This project showcases a Network Intrusion Detection System (NIDS) designed to bolster cybersecurity defenses against evolving threats
Tumor prediction from microarray data using 10 machine learning classifiers. Feature extraction from microarray data using various feature extraction algorithms.
Evaluating machine learning methods for detecting sleep arousal, bachelor thesis by Jacob Stachowicz and Anton Ivarsson (2019)
King County House Sales
The classification goal is to predict whether the client will subscribe (1/0) to a term deposit (variable y).
A Jupyter Notebook with the analysis and prediction of Final Grades (Pass/Fail) for students of mechatronics engineering in several mechanic courses.
Through this research, we are able to model a student’s final grade in a particular subject and link it directly to certain relevant features that influence the outcome. We use the C5.0 decision tree technique to model the data.
To model the demand for shared bikes with the available independent variables
Feature-Engg
Developed a predictive real estate model leveraging XG Boost Regressor, integrating web-scraped market data with existing datasets to forecast daily store visits, achieving a MAPE of 13.3%, enabling strategic retail location decisions
A Chinese automobile company Geely Auto aspires to enter the US market by setting up their manufacturing unit there and producing cars locally to give competition to their US and European counterparts. They have contracted an automobile consulting company to understand the factors on which the pricing of cars depends. Specifically, they want to …
This repository contains the notebook used for the Spring 2021 Kaggle Dengue Fever Prediction Competition. Placement was in the top 10% with a MAE of 24.86. Our best approach involved Random Forest Regression on a reduced featureset selected with Recursive Feature Elimination in combination with correlation with the target (number of dengue cases).
Add a description, image, and links to the recursive-feature-elimination topic page so that developers can more easily learn about it.
To associate your repository with the recursive-feature-elimination topic, visit your repo's landing page and select "manage topics."