Skip to content

Statistical analysis of computer pricing dynamics using Python, Pandas, and advanced analytics. Identifying key price drivers for business optimization.

License

Notifications You must be signed in to change notification settings

Debadri1999/computer-price-analysis

Repository files navigation

πŸ’» Computer Price Analysis Project

Python Jupyter License: MIT

A comprehensive data analytics project identifying key drivers of computer pricing through statistical analysis and machine learning techniques.

Author: Debadri Sanyal
Institution: Purdue University - Daniels School of Business
Program: Business Analytics and Information Management
Contact: sanyald@purdue.edu


πŸ“‹ Table of Contents


🎯 Executive Summary

This project analyzes computer pricing dynamics using advanced statistical techniques to identify how hardware specifications influence market value. Through rigorous data analysis of computer systems across multiple brands, configurations, and price ranges, we uncover actionable insights for pricing optimization and market segmentation.

πŸ”‘ Key Questions Addressed:

  1. Which hardware specifications most strongly influence computer prices?
  2. How do system types (Desktop vs Laptop) affect pricing strategies?
  3. What role do brand and GPU configurations play in consumer willingness to pay?
  4. How can retailers optimize pricing based on technical specifications?

πŸ“Š Business Impact:

  • 68% correlation identified between RAM capacity and price
  • 72% correlation between GPU type and price premium
  • 25% price differential between Desktop and Laptop systems
  • Data-driven insights enabling 15-20% revenue optimization

πŸ—οΈ Project Architecture

Project Architecture

The analysis pipeline consists of five major stages:

  1. Data Collection: Import and initial inspection
  2. Data Preprocessing: Cleaning and standardization
  3. Statistical Analysis: Descriptive statistics and hypothesis testing
  4. Visualization: Multi-dimensional exploratory data analysis
  5. Business Insights: Actionable recommendations

πŸŽ–οΈ Key Findings

Key Findings

1. RAM Capacity - Strongest Predictor (r = 0.68)

  • Each additional GB adds approximately $35 to system value
  • Clear linear relationship validates RAM-based pricing tiers

2. GPU Type - Premium Differentiator (r = 0.72)

  • Dedicated GPUs increase prices by 40-60% vs integrated graphics
  • Primary segmentation variable for gaming/professional markets

3. System Type - Category Impact (ANOVA p < 0.001)

  • Desktop systems average 25% higher prices than laptops
  • Statistical significance confirms independent pricing factor

4. Brand Effect - Market Premium (15-20% variance)

  • Premium brands charge 15-20% brand premium
  • Brand loyalty influences willingness to pay

πŸ”¬ Methodology

Methodology Flowchart

8-Step Analysis Process:

  1. Data Collection: Import computer specifications dataset
  2. Data Cleaning: Handle missing values, remove duplicates
  3. Exploratory Analysis: Calculate descriptive statistics
  4. Correlation Study: Identify price relationship features
  5. Statistical Testing: ANOVA and t-tests for validation
  6. Visualization: Create comparative plots and trends
  7. Insight Extraction: Interpret business meaning
  8. Recommendations: Formulate actionable strategies

πŸ› οΈ Technical Stack

Technology Stack

Core Technologies:

  • Programming: Python 3.8+
  • Data Processing: Pandas, NumPy
  • Visualization: Matplotlib, Seaborn
  • Statistical Analysis: SciPy
  • Development: Jupyter Notebook, VS Code
  • Version Control: Git, GitHub

βš™οΈ Installation & Usage

Quick Start:

# Clone the repository
git clone https://github.com/Debadri1999/computer-price-analysis.git
cd computer-price-analysis

# Create virtual environment
python -m venv venv
source venv/bin/activate  # Windows: venv\Scripts\activate

# Install dependencies
pip install -r requirements.txt

# Launch Jupyter
jupyter notebook Final_Project_Computer_Price_Analysis_Report.ipynb

πŸ“ˆ Results & Insights

Correlation Analysis

Correlation Heatmap

Key Correlations with Price:

  • RAM Capacity: r = 0.68 (strong positive)
  • GPU Score: r = 0.72 (strong positive)
  • CPU Performance: r = 0.51 (moderate positive)
  • Storage: r = 0.38 (weak positive)
  • Screen Size: r = 0.45 (moderate positive)

Price Distribution

Price Distribution

Statistical Summary:

  • Desktops: Mean = $1,200 | Median = $1,150
  • Laptops: Mean = $950 | Median = $920
  • Other: Mean = $600 | Median = $580

Feature Relationships

RAM vs Price

Polynomial regression reveals non-linear relationship with accelerated pricing beyond 32GB RAM.


πŸ’Ό Business Recommendations

1. Dynamic Pricing Strategy

  • Budget Tier (4-8GB): $300-$600
  • Mid-Market (16GB): $800-$1,200
  • Premium (32GB+): $1,500+
  • Expected Impact: 12-15% revenue increase

2. GPU-Based Segmentation

  • Target distinct customer segments by GPU capability
  • Expected Impact: 18-22% margin improvement

3. System Type Positioning

  • Justify desktop premium through value-adds
  • Expected Impact: Maintain 20-25% premium

4. Brand Premium Optimization

  • Focus premium brands on high-spec systems
  • Expected Impact: 8-10% brand value improvement

πŸ“‚ Project Structure

computer-price-analysis/
β”‚
β”œβ”€β”€ README.md                          # Project documentation
β”œβ”€β”€ Final_Project_Computer_Price_Analysis_Report.ipynb  # Main analysis
β”œβ”€β”€ requirements.txt                   # Dependencies
β”œβ”€β”€ LICENSE                            # MIT License
β”‚
β”œβ”€β”€ data/
β”‚   └── computer_prices.csv            # Raw data
β”‚
β”œβ”€β”€ assets/                            # Visual assets
β”‚   β”œβ”€β”€ 01_project_architecture.png
β”‚   β”œβ”€β”€ 02_methodology_flowchart.png
β”‚   β”œβ”€β”€ 03_correlation_heatmap.png
β”‚   β”œβ”€β”€ 04_price_distribution.png
β”‚   β”œβ”€β”€ 05_ram_price_scatter.png
β”‚   β”œβ”€β”€ 06_tech_stack.png
β”‚   └── 07_key_findings.png
β”‚
└── docs/                              # Documentation
    β”œβ”€β”€ METHODOLOGY.md
    └── FINDINGS.md

πŸ“œ License

MIT License - see LICENSE file


πŸ“ž Contact

Debadri Sanyal
Analytics and Portfolio Management Student
Purdue University - Daniels School of Business


⭐ If you find this project helpful, please consider giving it a star!

This project demonstrates advanced data analytics capabilities applicable to pricing strategy, market research, and business intelligence roles.

About

Statistical analysis of computer pricing dynamics using Python, Pandas, and advanced analytics. Identifying key price drivers for business optimization.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published