This site supports coding examples and homework for Introduction to Data Mining (CIS 3902) at Catawba College.
Getting Started:
- Create and verify a Github Account.
- Sign in to Github and access Catawba Data Mining on Github.
- Follow instructions in Blackboard for the weekly assignments for class on https://github.com/catawba-data-mining/CIS-3902-Data-Mining/blob/main/README.md.
Markdown Tutorial Markdown Tutorial
### Add tocolab after github in the URL to open in Colab - Open Souce Google Colab Jupyter Notebook ### You will need to sign in to a google account for Colab to open ### if input is accessed via URL, no changes are needed to pd.read
- Assignment 1 Mining the Classics
- Assignment 2 Exploring Visualization NYC Fire Data
- Assignment 3 Customer Segmentation Using Clustering: Mall Data
- K-nearest neighbor clustering
- Project
- K Nearest Neighbor Example
- Pycaret Multiclass Classification
OPTIONAL TEXTBOOKS: These reference texts for the class are provided by UC Berkeley:
Data 8 Text: https://www.inferentialthinking.com/chapters/intro Computataional and Inferential Thinking (also available through creative commons license, UC Berkeley)
Data 100 Text: Principles and Techniques of Data Science http://www.textbook.ds100.org/intro.html
- Chapter 1 Lab and HW 1
- Chapter 2 Lab and HW 2
- Chapter 7 Lab and HW 3
- Chapter 11 Lab and HW 4
- Chapter 15 & 16 Lab and HW 5
- Titanic Lab HW 6
- Decision Trees Random Forests Confusion Matrix HW 7
- Project Deliverable 1 Predicting Shark Presence
- Project Deliverable 2 Association Rules
- Project Deliverable 3 Clustering