These two files is almost 50mb altogether https://github.com/dotnet/machinelearning/blob/master/test/data/taxi-fare-test.csv https://github.com/dotnet/machinelearning/blob/master/test/data/taxi-fare-train.csv https://github.com/dotnet/machinelearning/pull/170 allows to download files from external sources. Can we move these files to separate repository and clean history?