GE-461-Data-Stream-Mining

This repository contains my work for the fifth project of the GE-461: Introduction to Data Science course.

In this project, the objective is to explore the classification of data streams. We generate data streams with varying noise and drift features. We then implement K-Nearest Neighbors (KNN), Hoeffding Tree (HT), and Naïve Bayes (NB) with varying batch sizes to analyze online learners, and we implement Majority Voting (MV) and Weighted Majority Voting (WMV) to observe the performance of ensemble methods. Finally, we also look at another way to increase the performance of our models. My report can be accessed here: https://xmassmx.github.io/GE-461-Data-Stream-Mining/
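
To illustrate how the two ensemble rules differ, here is a minimal sketch in plain Python. It is not taken from the project code: the function names, the tie-breaking rule, and the multiplicative penalty `beta` (in the style of the classic weighted majority algorithm) are assumptions made for illustration, and the report may weight the classifiers differently, for example by their running accuracy.

```python
from collections import defaultdict


def majority_vote(predictions):
    """Return the label predicted by the most classifiers.

    Ties are broken in favour of the smallest label (an arbitrary choice).
    """
    counts = defaultdict(float)
    for label in predictions:
        counts[label] += 1.0
    return max(sorted(counts), key=lambda label: counts[label])


def weighted_majority_vote(predictions, weights):
    """Return the label with the largest total classifier weight."""
    scores = defaultdict(float)
    for label, weight in zip(predictions, weights):
        scores[label] += weight
    return max(sorted(scores), key=lambda label: scores[label])


def update_weights(weights, predictions, true_label, beta=0.5):
    """Shrink the weight of every classifier that was wrong by a factor beta.

    beta is a hypothetical penalty with 0 < beta < 1; correct classifiers
    keep their current weight.
    """
    return [
        w * beta if p != true_label else w
        for w, p in zip(weights, predictions)
    ]


# Example: predictions from three base learners (e.g. KNN, HT, NB) for one sample.
preds = [1, 0, 0]
print(majority_vote(preds))                            # two of three vote for 0
print(weighted_majority_vote(preds, [3.0, 1.0, 1.0]))  # the heavier learner wins
print(update_weights([1.0, 1.0, 1.0], preds, true_label=1))
```

In a stream setting, `update_weights` would be called after the true label of each sample (or batch) arrives, so that learners that keep making mistakes gradually lose influence on the ensemble decision.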

Note: Please do not copy this work; avoid plagiarism. The work in this repository is my own solution and is meant to be used only as a guide.

Repository files:

.gitignore
GE5-func.ipynb - Colaboratory.pdf
Hyperplane Dataset 10_2.csv
Hyperplane Dataset 10_5.csv
Hyperplane Dataset 30_2.csv
Hyperplane Dataset 30_5.csv
README.md
index.html
streamMiningMuhammadAbdullahMulkana.pdf
streamMiningMuhammadAbdullahMulkana.py
