This repository is a complete, beginner-to-intermediate Data Science learning collection containing math & statistics notes, plotting examples, and hands-on Python notebooks. The goal is to gather small, focused examples and datasets you can run locally to learn common data science workflows: exploratory data analysis, visualization, statistics, and basic machine learning.
- Overview
- Repository structure
- Quick start
- How to use the notebooks
- Learning roadmap and recommended order
- Contributing
- License
- Author / Contact
This repository organizes short, focused notebooks and datasets for learning Data Science concepts. It's intentionally lightweight and notebook-first so you can open examples, run cells, and tweak code. It includes:
- Math and Statistics lecture notebooks and notes
- Plotting examples (bar, box, line) with included datasets
- Python notebooks covering language basics and common patterns
Use this repo as a self-study kit or a teaching resource.
Top-level layout (files and folders you can expect):
Readme.md— this file (project overview and instructions)Math/Statistics/— lecture notebooks for statistics (e.g.Stats_Lec1_&_2.ipynb)Ploting/— plotting examples organized into subfolders:Bar/— bar charts andShoppings.csvBox/— box plots and datasets likeHousing.csv,Products.csvLine/— line plots andHousing.csv
Python/— short Python notebooks covering variables, control flow, functions, arrays, dictionaries and basics of packaging
Files you'll likely open first:
Math/Statistics/Lec1/Stats_Lec1_&_2.ipynb— introductory statistics notes and codePloting/Bar/script.ipynb— quick bar plot examples usingShoppings.csvPython/01-varable-if.ipynb— Python basics notebook
If you add new notebooks or datasets, update this README with a short line describing them.
Prerequisites: Python 3.8+ and Git. Recommended to use a virtual environment.
- Clone the repository:
git clone https://github.com/zeeshanali90233/Data-Science.git
cd "Data-Science"- Create and activate a virtual environment (Windows PowerShell):
python -m venv .venv
.\.venv\Scripts\Activate.ps1- Install recommended packages (this repository includes a minimal
requirements.txt):
pip install -r requirements.txt- Start Jupyter Notebook or Jupyter Lab and open the notebooks:
jupyter notebook
# or
jupyter lab- Open a notebook in Jupyter, run the cells from top-to-bottom, and experiment by changing small pieces of code.
- Notebooks contain short explanations and runnable examples. Look for
# TODOcells where exercises may be placed. - Datasets are small CSV files included in the
Ploting/subfolders. Load them with pandas:
import pandas as pd
df = pd.read_csv('Ploting/Bar/Shoppings.csv')For newcomers, a suggested path through this repo:
Python/01-varable-if.ipynb— Python basicsPython/02-logical-loops-func.ipynb— control flow and functionsPython/03-arrays-dict-oop.ipynb— data structures and basic OOPMath/Statistics/Lec1/Stats_Lec1_&_2.ipynb— introductory statisticsPloting/*notebooks — visualization practice with real CSVs
After that, try combining what you learned: load a dataset, compute summary statistics, and produce plots.
- Compute summary statistics (mean, median, quartiles) for one dataset.
- Create a simple exploratory data analysis (EDA) notebook: show distributions, correlations, and a short write-up of findings.
- Transform an example into a small ML task: predict a numeric column with a simple regression model using
scikit-learn.
Contributions are welcome. A few guidelines:
- Add one small, focused notebook or dataset per PR.
- Include a short description at the top of the notebook explaining what the example demonstrates.
- Keep datasets small (under 1–2 MB) or link to external sources instead of storing very large data here.
- If you add a new folder, update
Readme.mdwith a one-line description.
Steps to contribute:
- Fork the repository on GitHub.
- Create a branch for your change.
- Add your notebook or dataset and commit.
- Open a pull request describing the change.
This repository does not include a license file by default. If you want to make the repository open-source, consider adding an LICENSE (for example, the MIT License). If you want me to add a standard license file, say which license you prefer and I will add it.
If you want changes to the README (different sections, badges, or a different tone), tell me what style you prefer and I'll update it.