This project was implemented for academic purposes by Vasilis Kordalis and Aristeidis Oikonomou.
The aim of this project is to process bulk data. The data are latitude-longitude-timestamp rows in a CSV file, and each row is part of a bus trajectory.
We start with a file containing the trajectories of different buses (data_sets/train_set.csv).
After a quick edit of that file, a new file containing the routes is created (results/First_Group_of_Data/trips.csv).
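As a rough illustration of how rows might be turned into routes, here is a minimal sketch that groups points by journey and sorts them by time. The row layout (journey_id, timestamp, lon, lat) and the function name group_into_trips are assumptions made for this example; they are not taken from the project's code or from the actual columns of train_set.csv.

```python
from collections import defaultdict


def group_into_trips(rows):
    """Group (journey_id, timestamp, lon, lat) rows into one point list per journey.

    The row layout is an assumption for illustration only.
    """
    trips = defaultdict(list)
    for journey_id, timestamp, lon, lat in rows:
        trips[journey_id].append((timestamp, lon, lat))
    # Sort each journey's points by timestamp so they form an ordered route.
    return {jid: sorted(points) for jid, points in trips.items()}
```

Each value in the returned dictionary is one route: a time-ordered list of (timestamp, lon, lat) points that could then be written out as one line per trip.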
This file is then cleaned of corrupted data, and a new file is created (results/Clean_Routes/tripsClean.csv).
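The exact cleaning rules are not described here, so the following is only a sketch of what such a step could look like. It assumes that a point is "corrupted" when its values cannot be parsed as numbers or its coordinates fall outside the valid latitude/longitude ranges, and that a trip with fewer than min_points valid points is dropped. Both rules, and the names is_valid_point and clean_trips, are hypothetical.

```python
def is_valid_point(timestamp, lon, lat):
    """Return True if the point parses as numbers and lies in valid coordinate ranges."""
    try:
        float(timestamp)
        lon, lat = float(lon), float(lat)
    except (TypeError, ValueError):
        return False
    return -180.0 <= lon <= 180.0 and -90.0 <= lat <= 90.0


def clean_trips(trips, min_points=2):
    """Drop invalid points, then drop trips left with fewer than min_points points.

    `trips` maps a journey id to a list of (timestamp, lon, lat) tuples.
    The validity rules are assumptions, not the project's actual criteria.
    """
    cleaned = {}
    for journey_id, points in trips.items():
        valid = [p for p in points if is_valid_point(*p)]
        if len(valid) >= min_points:
            cleaned[journey_id] = valid
    return cleaned
```

Filtering per point first and per trip second keeps routes that have a few bad readings while still removing routes that are too damaged to use.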
Afterwards, a few things happen:
Classification comes next. For this purpose, some features are extracted. These are: