This project simulates purchase behavior across different age groups using two probabilistic models: a weighted probability model and a uniform probability model.
To visualize and compare how weighted vs uniform probability distributions affect purchasing behavior, using simulated data grouped by age.
- Python 3.x
- Pandas
- NumPy
- Seaborn
- Matplotlib
- Jupyter Notebook
- Simulated Dataset of 40,000 users, categorized into age groups (20, 30, ..., 70)
- Weighted Model: Each age group is assigned a purchase probability (e.g., 70-year-olds have 90%)
- Uniform Model: All users have the same purchase probability equal to the weighted average
- Visualization: Line graph comparing actual probabilities between models
- Older age groups showed significantly higher purchase rates in the weighted model
- The uniform model flattens out the variation between age groups
- Graph clearly shows divergence between assumptions
Project_Analysis.ipynb: Main notebook with code and plotsConditional_Probability_Graph.png: Graph comparing both modelsReport.txt: 200-word results summary (won't be uploaded here)
Using a uniform distribution can oversimplify customer behavior. Weighted models more accurately reflect real-world differences across demographics. Detailed analysis will be found in the expected report.
- Add gender/income factors
- Fit logistic regression models
- Create an interactive Streamlit dashboard
