This repository contains my personal learning notes and summaries as I study the book "Pattern Recognition and Machine Learning" by Christopher M. Bishop. The aim is to document key concepts, mathematical foundations, and reflections on each topic for future reference and sharing with fellow learners.
- Applications of Machine Learning and Pattern Recognition
- Polynomial curve fitting as a way to demonstrate basic regression
- Core Concepts Covered:
- Generalization
- Overfitting
- Polynomial functions as models
- Cost function: Sum-of-squares error
- Root Mean Square Error (Erms)
- Derivation of sum and product rules using probability axioms
- Bayes' Theorem for conditional probability, including matrix-based understanding
- Introduction to Probability Density Functions (PDFs)
- Key Takeaways:
- Importance of marginal, joint, and conditional probability
- Visualizing dependencies with matrix representations
- Understanding the shift from discrete to continuous probability using densities
- Change of variable and Jacobian in probability densities
- Expectation, variance, and covariance as statistical summaries
- Introduction to Bayesian probability: prior, likelihood, posterior
- Normal (Gaussian) distribution and its importance
- Reformulated polynomial curve fitting as a probabilistic model using Gaussian noise
- Learned Maximum Likelihood Estimation (MLE) via log-likelihood derivation
- Identified MLE's limitations (overfitting, no regularization)
- Introduced Bayesian reasoning with priors → Maximum A Posteriori (MAP) estimation
- Hyperparameters like α (prior precision) and β (noise precision) introduced for better generalization
- Studied the "curse of dimensionality": high-dimensional spaces lead to sparse data and unreliable distance metrics
- Introduced decision theory as a framework for optimal predictions under uncertainty
- Explored minimizing misclassification rate by choosing class with highest posterior probability
- Covered expected loss minimization using a loss matrix ( L_{kj} )
- Learned about the reject option — deferring decisions when confidence is low
- Distinguished between inference (estimating posteriors) and decision-making (choosing action)
- Reviewed loss functions for regression, including squared loss and absolute loss
- Title: Pattern Recognition and Machine Learning
- Author: Christopher M. Bishop
- Publisher: Springer
- ISBN: 978-0-387-31073-2
This is a foundational text in machine learning, known for its mathematical rigor and comprehensive coverage of probabilistic models.
- Browse important topics to learn in ML theory
- Use the content as a reference for understanding core ML concepts.
- Clone/fork the repository to maintain your own notes.
git clone https://github.com/yourusername/prml-study-notes.git