@codemzs codemzs commented Dec 5, 2019

This change improves the overall performance of the image classification API and the TensorFlow transform in two ways: it makes the graph runner more memory efficient, and it saves the cached bottleneck values in the image classification API in binary format instead of text format.
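To illustrate the text-versus-binary point, here is a minimal language-neutral sketch in Python. It is not the code from this PR: the function names and the byte layout (a row/column header followed by little-endian float32 values) are assumptions made only for the example.

```python
import struct

def encode_text(vectors):
    """Text cache: one comma-separated line per vector; every value must be parsed on load."""
    return "".join(",".join(repr(x) for x in v) + "\n" for v in vectors)

def encode_binary(vectors):
    """Binary cache (hypothetical layout): uint32 rows, uint32 cols, then float32 values."""
    rows = len(vectors)
    cols = len(vectors[0]) if rows else 0
    if any(len(v) != cols for v in vectors):
        raise ValueError("all vectors must have the same length")
    parts = [struct.pack("<II", rows, cols)]
    for v in vectors:
        parts.append(struct.pack(f"<{cols}f", *v))
    return b"".join(parts)

def decode_binary(blob):
    """Read the header, then slice fixed-width rows; no string parsing is needed."""
    rows, cols = struct.unpack_from("<II", blob, 0)
    out = []
    offset = 8
    for _ in range(rows):
        out.append(list(struct.unpack_from(f"<{cols}f", blob, offset)))
        offset += 4 * cols
    return out
```

In this sketch each value costs a fixed 4 bytes and loading is a plain byte copy, whereas the text form has to format and re-parse every number.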

This change also adds an option in the image classification API to create a validation set from the train set when the user does not provide one. The validation set is used for early stopping.
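As a rough illustration of carving a validation set out of the train set, here is a small Python sketch. It is not the ML.NET implementation: the function name, the `validation_fraction` parameter, and the seeded shuffle are all assumptions made for the example.

```python
import random

def split_train_validation(examples, validation_fraction, seed=0):
    """Hold out a fraction of the training examples as a validation set.

    Hypothetical helper: shuffles indices with a fixed seed so the split is
    reproducible, then keeps the original order within each partition.
    """
    if not 0.0 < validation_fraction < 1.0:
        raise ValueError("validation_fraction must be between 0 and 1")
    indices = list(range(len(examples)))
    random.Random(seed).shuffle(indices)
    n_val = int(len(examples) * validation_fraction)
    held_out = set(indices[:n_val])
    train = [e for i, e in enumerate(examples) if i not in held_out]
    validation = [e for i, e in enumerate(examples) if i in held_out]
    return train, validation
```

The held-out portion would then be scored after each epoch, and training stopped once that score stops improving.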

@codemzs codemzs changed the title from "Image classification performance improvements and store cache bottleneck files in binary format." to "Image classification performance improvements and option to create validation set from train set." Dec 6, 2019
@codemzs codemzs marked this pull request as ready for review December 6, 2019 09:42
@codemzs codemzs requested a review from a team as a code owner December 6, 2019 09:42

@harshithapv harshithapv left a comment

Do you have any statistics on the improvements in training time? Could you share them?

codecov bot commented Dec 12, 2019

Codecov Report

Merging #4522 into master will decrease coverage by 0.02%.
The diff coverage is 92.64%.

@@            Coverage Diff             @@
##           master    #4522      +/-   ##
==========================================
- Coverage   75.12%   75.09%   -0.03%     
==========================================
  Files         909      909              
  Lines      160262   160188      -74     
  Branches    17257    17267      +10     
==========================================
- Hits       120395   120298      -97     
- Misses      35051    35056       +5     
- Partials     4816     4834      +18

| Flag | Coverage Δ |
|---|---|
| #Debug | 75.09% <92.64%> (-0.03%) ⬇️ |
| #production | 70.48% <92.61%> (-0.04%) ⬇️ |
| #test | 90.29% <100%> (-0.01%) ⬇️ |

| Impacted Files | Coverage Δ |
|---|---|
| src/Microsoft.ML.TensorFlow/TensorflowTransform.cs | 73.67% <100%> (-0.4%) ⬇️ |
| ...cenariosWithDirectInstantiation/TensorflowTests.cs | 92.08% <100%> (-0.03%) ⬇️ |
| src/Microsoft.ML.Vision/DnnRetrainTransform.cs | 57.03% <83.33%> (-0.23%) ⬇️ |
| .../Microsoft.ML.Vision/ImageClassificationTrainer.cs | 91.06% <93.07%> (-2.58%) ⬇️ |
| src/Microsoft.ML.TensorFlow/TensorflowUtils.cs | 70.65% <94.87%> (+0.12%) ⬆️ |
| ...rc/Microsoft.ML.TensorFlow/TensorTypeExtensions.cs | 59.09% <0%> (-22.73%) ⬇️ |
| ...c/Microsoft.ML.FastTree/Utils/ThreadTaskManager.cs | 79.48% <0%> (-20.52%) ⬇️ |
| src/Microsoft.ML.Maml/MAML.cs | 24.75% <0%> (-1.46%) ⬇️ |
| ...soft.ML.Data/DataLoadSave/Text/TextLoaderCursor.cs | 84.9% <0%> (-0.21%) ⬇️ |
| ...L.AutoML/TrainerExtensions/TrainerExtensionUtil.cs | 86.89% <0%> (+1.74%) ⬆️ |

... and 2 more
