An evergrowing, professionally curated list of resources on everything decision-making.
👍Do you like the project and want to help spread the word?👍
Here's what you can do:
- give it a star⭐
- add it to your watchlist👁️
- contribute. Please check the contribution guidelines first.
❗Found a broken or missing link, a newer version of a resource, or a duplicate in the list? Please file an issue or make a pull request!❗
🍔Click on the hamburger next to the file name for a better browsing experience:
- Engelbrecht, Andries P. Computational intelligence: an introduction. John Wiley & Sons, 2007. [Link]
- Deisenroth, Marc Peter, A. Aldo Faisal, and Cheng Soon Ong. Mathematics for machine learning. Cambridge University Press, 2020. [Link]
- Goodfellow, Ian, Yoshua Bengio, and Aaron Courville. Deep learning. MIT Press, 2016. [Link]
- Simon Prince. Understanding Deep Learning. MIT Press, to appear. [Link]
- Aston Zhang, Zack Lipton, Mu Li, Alex Smola. Dive into Deep Learning. [Link]
- Philipp Grohs, Gitta Kutyniok. Mathematical Aspects of Deep Learning. [Link]
- Molnar, Christoph. Interpretable machine learning. Lulu. com, 2020. [Link]
- Bishop, Christopher M., and Nasser M. Nasrabadi. Pattern recognition and machine learning. Vol. 4. No. 4. New York: Springer, 2006. [Link]
- Deisenroth, Marc Peter, A. Aldo Faisal, and Cheng Soon Ong. Mathematics for machine learning. Cambridge University Press, 2020. [Link]
- James, G., Witten, D., Hastie, T., Tibshirani, R., Taylor, J. An Introduction to Statistical Learning: With Applications in Python; Springer: Berlin/Heidelberg, Germany, 2023. [Link]
- Efron, Bradley, and Trevor Hastie. Computer age statistical inference, student edition: algorithms, evidence, and data science. Vol. 6. Cambridge University Press, 2021. [Link]
- Hastie, Trevor, Robert Tibshirani, and Martin Wainwright. Statistical learning with sparsity: the lasso and generalizations. CRC press, 2015. [Link]
- Katsov, Ilya. Introduction to algorithmic marketing: Artificial intelligence for marketing operations. Grid Dynamics, 2017. [Link]
- Murphy, Kevin P. Probabilistic machine learning: Advanced topics. MIT Press, 2023. [Link]
- Murphy, Kevin P. Probabilistic machine learning: an introduction. MIT Press, 2022. [Link]
- Siddiqi, Naeem. Intelligent credit scoring: Building and implementing better credit risk scorecards. John Wiley & Sons, 2017. [Link]
- Lippe, Phillip. UvA Deep Learning Tutorials. 2022. [Link]
- Ollion, Charles, and Olivier Grisel. Deep Learning course: lecture slides and lab notebooks. Institut Polytechnique de Paris, 2017. [Link]
- Lakkaraju, Hima, et al. Explainable Artificial Intelligence: From Simple Predictors to Complex Generative Models. Harvard University, 2023. [Link]
- MLU-Explain Team. MLU-Explain. Amazon (2021). [Link]
- Dimitry Bertsekas. Reinforcement Learning and Optimal Control. [Link]
- Elad Hazan, Karan Singh. Introduction to Online Nonstochastic Control. [Link]
- Andreas Luttens, et al. Large-scale Docking Datasets for Machine Learning. 2, Zenodo, 8 May 2023. [Link]
- Catboost. A fast, scalable, high-performance Gradient Boosting on Decision Trees library used for ranking, classification, regression, and other machine learning tasks for Python, R, Java, and C++. Supports computation on CPU and GPU. [Link]
- Khuat, Thanh Tung, and Bogdan Gabrys. "hyperbox-brain: A Toolbox for Hyperbox-based Machine Learning Algorithms." arXiv preprint arXiv:2210.02704 (2022). [Link]
- Arbel, Julyan, et al. A Primer on Bayesian Neural Networks: Review and Debates. arXiv preprint arXiv:2309.16314 (2023). [Link]
- Hellström, Fredrik, et al. Generalization bounds: perspectives from information theory and PAC-Bayes. arXiv preprint arXiv:2309.04381 (2023). [Link]
- Kingma, Diederik P., and Max Welling. "Auto-encoding variational bayes." arXiv preprint arXiv:1312.6114 (2013). [Link]
- Nalisnick, Eric, and Padhraic Smyth. "Stick-breaking variational autoencoders." arXiv preprint arXiv:1605.06197 (2016). [Link]
- Coste, Simon. Diffusion. University of Paris, 2023. [Link]
- Galerne, Bruno, and Valentin De Bortoli. Generative Modelling. ENS Paris-Saclay, 2023. [Link]
- Bartlett, Peter L., Andrea Montanari, and Alexander Rakhlin. Deep learning: a statistical viewpoint. Acta numerica 30 (2021): 87-201. [Link]
- Berner, Julius, et al. The modern mathematics of deep learning. arXiv preprint arXiv:2105.04026 (2021): 86-114. [Link]
- DeVore, Ronald, Boris Hanin, and Guergana Petrova. Neural network approximation. Acta Numerica 30 (2021): 327-444. [Link]
- Jacot, Arthur, Franck Gabriel, and Clément Hongler. "Neural tangent kernel: Convergence and generalization in neural networks." Advances in neural information processing systems 31 (2018). [Link]
- Hornik, Kurt. "Approximation capabilities of multilayer feedforward networks." Neural networks 4.2 (1991): 251-257. [Link]
- Hornik, Kurt, Maxwell Stinchcombe, and Halbert White. "Multilayer feedforward networks are universal approximators." Neural networks 2.5 (1989): 359-366. [Link]
- Petersen, Philipp Christian. Neural network theory. University of Vienna 535 (2020). [Link]
- Khaled, Ahmed, and Peter Richtárik. "Better theory for SGD in the nonconvex world." arXiv preprint arXiv:2002.03329 (2020). [Link]
- Sun, Ruoyu. Optimization for deep learning: theory and algorithms. arXiv preprint arXiv:1912.08957 (2019). [Link]
- Angelopoulos, Anastasios N., and Stephen Bates. "A gentle introduction to conformal prediction and distribution-free uncertainty quantification." arXiv preprint arXiv:2107.07511 (2021). [Link]
- Fontana, Matteo, Gianluca Zeni, and Simone Vantini. "Conformal prediction: a unified review of theory and new challenges." arXiv preprint arXiv:2005.07972 (2020). [Link]
- Manokhin, Valery. (2022). Awesome Conformal Prediction (v1.0.0). Zenodo. [Link]
- Shapash. Shapash: User-friendly Explainability and Interpretability to Develop Reliable and Transparent Machine Learning Models. MAIF, 2021.[Link]
- Sudjianto, Agus, et al. "PiML Toolbox for Interpretable Machine Learning Model Development and Validation." arXiv preprint arXiv:2305.04214 (2023).
- Khuat, Thanh Tung, Dymitr Ruta, and Bogdan Gabrys. "Hyperbox-based machine learning algorithms: a comprehensive survey." Soft Computing 25.2 (2021): 1325-1363. [Link]
- Mirzasoleiman, Baharan, Jeff Bilmes, and Jure Leskovec. "Coresets for data-efficient training of machine learning models." International Conference on Machine Learning. PMLR, 2020. [Link]
- Dieng, Adji B. Learning From Data: The Two Cultures. Association for Computing Machinery, 2021. [Link]
- Rich, DJ. Mutual Information. True Theta LLC, 2020. [Link]
- Duarte, Nancy. Resonate: Present visual stories that transform audiences. John Wiley & Sons, 2013. [Link]
- Duarte, Nancy. Slide: ology: The art and science of creating great presentations. Vol. 1. Sebastapol: O'Reilly Media, 2008. [Link]
- Knaflic, Cole Nussbaumer. Storytelling with data: A data visualization guide for business professionals. John Wiley & Sons, 2015. [Link]
- Knaflic, Cole Nussbaumer. Storytelling with data: let's practice!. John Wiley & Sons, 2019. [Link]
- Wexler, Steve, Jeffrey Shaffer, and Andy Cotgreave. The big book of dashboards: visualizing your data using real-world business scenarios. John Wiley & Sons, 2017. [Link]
- Wilke, Claus O. Fundamentals of data visualization: a primer on making informative and compelling figures. O'Reilly Media, 2019. [Link]
- Polars. Dataframes powered by a multithreaded, vectorized query engine, written in Rust. [Link]
- PyGWalker. Turn your pandas dataframe into an interactive UI for visual analysis. [Link]
- Streamlit. A faster way to build and share data apps. [Link]
- Vizro. Vizro is a toolkit for creating modular data visualization applications. [Link]
- Genie. 🧞The highly productive Julia web framework. [Link]
- Downey, Allen. Think complexity: complexity science and computational modeling. " O'Reilly Media, Inc.", 2018. [Link]
- Downey, Allen. Think data structures: algorithms and information retrieval in Java. " O'Reilly Media, Inc.", 2017. [Link]
- Downey, Allen. Think Python. " O'Reilly Media, Inc.", 2012. [Link]
- Johnston, Nathaniel, and Dave Greene. Conway's Game of Life: Mathematics and Construction. Self-published, 2022. [Link]
- Miller, Brad, and David Ranum. Problem-solving with algorithms and data structures. University of Auckland, 2013. [Link] [Website]
- Chacon, Scott, and Ben Straub. Pro git. Springer Nature, 2014. [Link]
- Petrov, Alex. Database Internals: A deep dive into how distributed data systems work. O'Reilly Media, 2019. [Link]
- Shvets, Alexander. Refactoring Guru. 2014. [Link]
- Lovelace, Robin, Jakub Nowosad, and Jannes Muenchow. Geocomputation with R. CRC Press, 2019. [Link]
- Moraga, Paula. Geospatial health data: Modeling and visualization with R-INLA and shiny. CRC Press, 2019. [Link]
- Moraga, Paula. Spatial Statistics for Data Science: Theory and Practice with R. CRC Press, 2023. [Link]
- Axler, Sheldon. Linear algebra done right. Springer Nature, 2023. [Link]
- Isoz, Vincent. Opera Magistris (Elements of Applied Mathematics). Sciences.ch, 2016. [Link]
- Downey, Allen B. Modeling and Simulation in Python: An Introduction for Scientists and Engineers. No Starch Press, 2023. [Link]
- McNulty, Keith. Handbook of graphs and networks in people analytics: with examples in R and Python. CRC Press, 2022. [Link]
- Sargent, Thomas J., and John Stachurski. Economic Networks: Theory and Computation. QuantEcon, 2022. [Link]
- Boumal, Nicolas. An Introduction to Optimization on Smooth Manifolds. Cambridge University Press, 2023. [Link]
- Boyd, Stephen P., and Lieven Vandenberghe. Convex optimization. Cambridge University Press, 2004. [Link]
- Kwon, Changhyun. Julia Programming for Operations Research. Changhyun Kwon, 2019. [Link]
- Martins, J. R. R. A. and Ning, A., Engineering Design Optimization, Cambridge University Press, 2022. [Link]
- Nesterov, Yurii. Lectures on convex optimization. Vol. 137. Berlin: Springer, 2018. [Link]
- Sargent, Thomas J., and John Stachurski. Dynamic Programming Volume 1. QuantEcon, 2023. [Link]
- Powell, Warren B. Sequential decision analytics and modeling: modeling with Python. Now, 2022. [Link]
- Arya, Nisha. Learn Probability in Computer Science with Stanford University for FREE. KDNuggets, 2023. [Link]
- MATLAB. Why Padé Approximations Are Great! | Control Systems in Practice. YouTube, 2022. [Link]
- Jaccard, James, and Jacob Jacoby. Theory construction and model-building skills: A practical guide for social scientists. Guilford publications, 2019. [Link] [Website]
- Judd, Kenneth. The Potential Partnership Between Economics and Computational Science. PyData Chicago, 2021. [Link]
- Breiman, Leo. "Statistical modeling: The two cultures (with comments and a rejoinder by the author)." Statistical science 16.3 (2001): 199-231. [Link]
- Polya, George. How to solve it: A new aspect of mathematical method. Vol. 85. Princeton university press, 2004. [Link]
- Wolfram, Stephen. A new kind of science. Vol. 5. Champaign, IL: Wolfram media, 2002. [Link]
- Govaert, Gérard, and Mohamed Nadif. Co-clustering: models, algorithms and applications. John Wiley & Sons, 2013. [Link]
- Scrucca, Luca, et al. Model-Based Clustering, Classification, and Density Estimation Using mclust in R. Chapman and Hall/CRC, 2023. [Link]
- Evans, Richard W., Computational Methods for Economists using Python, Open access Jupyter Book, v#.#.#, 2023. [Link]
- Wooldridge, Jeffrey M.. Introductory Econometrics: A Modern Approach. Brésil, Cengage Learning, 2020. [Link]
- Martin, Osvaldo A., Ravin Kumar, and Junpeng Lao. Bayesian modeling and computation in Python. CRC Press, 2021. [Link]
- McElreath, Richard. Statistical rethinking: A Bayesian course with examples in R and Stan. Chapman and Hall/CRC, 2020. [Link]
- Soch, Joram, et al. StatProofBook/StatProofBook.Github.Io: StatProofBook 2021. 2021, Zenodo, 2022. [Link]
- Wasserman, Larry. All of nonparametric statistics. Springer Science & Business Media, 2006. [Link]
- Wasserman, Larry. All of statistics: a concise course in statistical inference. Vol. 26. New York: Springer, 2004. [Link]
- Van Buuren, Stef. Flexible imputation of missing data. CRC Press, 2018. [Link]
- McNulty, Keith. Handbook of regression modeling in people analytics: with examples in R and Python. CRC Press, 2021. [Link]
- Agresti, Alan. Categorical data analysis. Vol. 792. John Wiley & Sons, 2012. [Link]
- Kuhn, Max, and Julia Silge. Tidy modeling with R. " O'Reilly Media, Inc.", 2022. [Link]
- Wickham, H., Çetinkaya-Rundel, M., & Grolemund, G. (2023). R for data science. " O'Reilly Media, Inc.". [Link]
- Cochrane, John H. "Time series for macroeconomics and finance." (1997). [Link]
- Hyndman, R.J., & Athanasopoulos, G. (2021) Forecasting: principles and practice, 3rd edition, OTexts: Melbourne, Australia. [Link]
- Neusser, Klaus. Time series econometrics. Springer publication, 2016. [Link]
- Cunningham, Scott et al. Mixtape Sessions: Causal Inference. 2022. [Link]
- Ding, Peng. "A First Course in Causal Inference." arXiv preprint arXiv:2305.18793 (2023). [Link]
- Canay, Ivan. Econ 480-3 - Introduction to Econometrics. Northwestern University, 2021. [Link]
- De Haan, Monique. ECON4150 - Introductory Econometrics. University of Oslo, 2018. [Link]
- Kunin, Daniel, et al. Seeing Theory. Brown University, 2016. [Link]
- Kozyrkov, Cassie. Statistical Thinking. YouTube, 2019. [Link]
- Godahewa, Rakshitha, et al. "Monash time series forecasting archive." arXiv preprint arXiv:2105.06643 (2021). [Link]
- "6 Free, High-Quality, Marketing Mix Modeling Datasets | Forecastegy." Web. 10/14/2023 [Link]
- Gaël Bernard and Periklis Andritsos. Datasets Simulating Customer Journeys. [Link]
- Fold. Fast Adaptive Time Series ML Engine. [Link]
- Functime. Time-series machine learning at scale. Built on Polars for embarrassingly parallel feature engineering and forecasts. [Link]
- HierarchicalForecast. Probabilistic Hierarchical forecasting 👑 with statistical and econometric methods. [Link]
- mlforecast. Scalable machine 🤖 learning for time series forecasting. [Link]
- NeuralForecast. Scalable and user-friendly neural 🧠 forecasting algorithms. [Link]
- StatsForecast. Lightning ⚡️ fast forecasting with statistical and econometric models. [Link]
- ThymeBoost. Forecasting with Gradient Boosted Time Series Decomposition. [Link]
- Blei, David M. Build, compute, critique, repeat: Data analysis with latent variable models. Annual Review of Statistics and Its Application 1 (2014): 203-232. [Link]
- Blei, David M., Alp Kucukelbir, and Jon D. McAuliffe. "Variational inference: A review for statisticians." Journal of the American Statistical Association 112.518 (2017): 859-877. [Link]
- Dieng, Adji Bousso. Deep Probabilistic Graphical Modeling. Columbia University, 2020. [Link]
- Figurnov, Mikhail, Shakir Mohamed, and Andriy Mnih. "Implicit reparameterization gradients." Advances in neural information processing systems 31 (2018). [Link]
- Gelman, Andrew, Xiao-Li Meng, and Hal Stern. "Posterior predictive assessment of model fitness via realized discrepancies." Statistica sinica (1996): 733-760. [Link]
- Clarke, Bertrand, and Yuling Yao. "A Cheat Sheet for Bayesian Prediction." arXiv preprint arXiv:2304.12218 (2023). [Link]
- Assaad, Charles K., Emilie Devijver, and Eric Gaussier. "Survey and evaluation of causal discovery methods for time series." Journal of Artificial Intelligence Research 73 (2022): 767-819. [Link]
- Leemis, Lawrence M., and Jacquelyn T. McQueston. "Univariate distribution relationships." The American Statistician 62.1 (2008): 45-53. [Paper] [Website].
- Gelman, Andrew. “Commentary: P Values and Statistical Practice.” Epidemiology, vol. 24, no. 1, 2013, pp. 69–72. JSTOR. Accessed 10 Dec. 2023. [Link]
- Greenland, Sander et al. “Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations.” European journal of epidemiology vol. 31,4 (2016): 337-50. doi:10.1007/s10654-016-0149-3 [Link]
- Lakens, Daniël. “Equivalence Tests: A Practical Primer for t Tests, Correlations, and Meta-Analyses.” Social psychological and personality science vol. 8,4 (2017): 355-362. doi:10.1177/1948550617697177 [Link]
- Lin, Mingfeng, et al. “Research Commentary: Too Big to Fail: Large Samples and the p-Value Problem.” Information Systems Research, vol. 24, no. 4, 2013, pp. 906–17. JSTOR. Accessed 10 Dec. 2023. [Link]
- Lumley, Thomas et al. “The importance of the normality assumption in large public health data sets.” Annual review of public health vol. 23 (2002): 151-69. doi:10.1146/annurev.publhealth.23.100901.140546 [Link]
- Mohd Razali, Nornadiah, and Bee Yap. ‘Power Comparisons of Shapiro-Wilk, Kolmogorov-Smirnov, Lilliefors and Anderson-Darling Tests’. J. Stat. Model. Analytics, vol. 2, 01 2011. [Link]
- Morey, Richard D et al. “The fallacy of placing confidence in confidence intervals.” Psychonomic bulletin & review vol. 23,1 (2016): 103-23. doi:10.3758/s13423-015-0947-8 [Link]
- Olszewski, Adrian. On the p-values - links library significance ditching. Adrian Olszewski, 2022. [Link]
- Pernet, Cyril. “Null hypothesis significance testing: a short tutorial.” F1000Research vol. 4 621. 25 Aug. 2015, doi:10.12688/f1000research.6963.3 [Link]
- Serdar, Ceyhan Ceran et al. “Sample size, power and effect size revisited: simplified and practical approaches in pre-clinical, clinical and laboratory studies.” Biochemia medica vol. 31,1 (2021): 010502. doi:10.11613/BM.2021.010502 [Link]
- Verhagen, Arianne P., et al. ‘Is the p Value Really so Significant?*’. Australian Journal of Physiotherapy, vol. 50, no. 4, 2004, pp. 261–262. [Link]
- Yao, Yuling. Bayes is guaranteed to overfit, for any model, any prior, and every data point. Yuling Yao, 2023. [Link]
- gung Reinstate Monica (https://stats.stackexchange.com/users/7290/gung-reinstate monica). Algorithms for Automatic Model Selection. Cross Validated, https://stats.stackexchange.com/q/20856. [Link]
- Chopin, Nicolas, et al. "Bayesian Causal Inference for Real World Interactive Systems." Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. 2021.[Link]
- Maxim Kochurov. State of Bayes Lecture Series. PyMC Labs, 2023. [Link]
- Hakenes, Hendrik. Ito's Lemma -- Some intuitive explanations on the solution of stochastic differential equations. University of Bonn, 2021. [Link]
- Silge, Julia, and David Robinson. Text mining with R: A tidy approach. " O'Reilly Media, Inc.", 2017. [Link]
- Horwood, Ghraham V. Humanitarian Assistance and Disaster Relief (HA/DR) Articles and Lexicon. V1, Harvard Dataverse, 2017, doi:10.7910/DVN/TGOPRU. [Link]
- Goldberg, Yoav. "A primer on neural network models for natural language processing." Journal of Artificial Intelligence Research 57 (2016): 345-420. [Link]