Skip to content

BinRoot/Haskell-Data-Analysis-Cookbook

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

36 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Haskell-Data-Analysis-Cookbook

This is the accompanying source code for Haskell Data Analysis Cookbook.

The latest source code is available on GitHub: https://github.com/BinRoot/Haskell-Data-Analysis-Cookbook

Summary

The Hunt for Data, identifies core approaches in reading data from various external sources such as CSV, JSON, XML, HTML, MongoDB, and SQLite.

Integrity and Inspection, explains the importance of cleaning data through recipes about trimming whitespace, lexing, and regular expression matching.

The Science of Words, introduces common string manipulation algorithms including base conversions, substring matching, and computing the edit distance.

Data Hashing, covers essential hashing functions such as MD5, SHA256, GeoHashing, and perceptual hashing.

A Dance with Trees, establishes an understanding of the tree data structure through examples including tree traversals, balancing trees, and Huffman coding.

Graph Fundamentals, manifests rudimentary algorithms for graphical networks such as graph traversals, visualization, and maximal clique detection.

Statistics and Analysis, begins the investigation of important data analysis techniques encompassing regression algorithms, Bayesian networks, and neural networks.

Clustering and Classification, involves quintessential analysis methods involving k-means clustering, hierarchical clustering, constructing decision trees, and implementing the k-Nearest Neighbors classifier.

Parallel and Concurrent Design, introduces advance topics in Haskell such as forking IO actions, mapping over lists in parallel, and benchmarking performance.

Real-time Fugue, incorporates streamed data interactions from Twitter, Internet Relay Chat (IRC), and sockets.

Stunning Visuals, comprises of sundry approaches to plotting graphs including line charts, bar graphs, scatter plots, and D3.js visualizations.

The Final Exporting, concludes the book with an enumeration of algorithms for exporting data to CSV, JSON, HTML, MongoDB, and SQLite.

Artwork Attribution

Illustrations by Lonku.