A comprehensive pipeline to analyze Sequential Data with Machine learning algorithm.
Problem description:
- Use industrial data as input to classify output with eight categories.
Transform the raw text data into csv file data.
Visualize the raw data in GIF format.
Dynamic Time Warping + K Nearest Neighbors
Build some cross-validation functions to examine the performance of model.
Montero, P and Vilar, J.A. (2014) TSclust: An R Package for Time Series Clustering. Journal of Statistical Software, http://www.jstatsoft.org/v62/i01/