ML.NET 1.5.0
New Features
- New anomaly detection algorithm (#5135). ML.NET has previously supported anomaly detection through DetectAnomalyBySrCnn. This function operates in a streaming manner by computing anomalies around each arriving point and examining a window around it. Now we introduce a new function
DetectEntireAnomalyBySrCnn
that computes anomalies by considering the entire dataset and also supports the ability to set sensitivity and output margin. - Root Cause Detection (#4925) ML.NET now also supports root cause detection for anomalies detected in time series data.
Enhancements
- Updates to TextLoader
- Onnxruntime updated to v1.3 (#5104). This brings support for additional data types for the HashingEstimator.
- Onnx export for OneHotHashEncodingTransformer and HashingTransormer (#5013, #5152, #5138)
- Support for Categorical features in CalculateFeatureContribution of LightGBM (#5018)
Bug Fixes
In this release we have traced down every bug that would occur randomly and sporadically and fixed many subtle bugs. As a result, we have also re-enabled a lot of tests listed in the Test Updates section below.
- Fixed race condition for test MulticlassTreeFeaturizedLRTest (#4950)
- Fix SsaForecast bug (#5023)
- Fixed x86 crash (#5081)
- Fixed and added unit tests for EnsureResourceAsync hanging issue (#4943)
- Added IDisposable support for several classes (#4939)
- Updated libmf and corresponding MatrixFactorizationSimpleTrainAndPredict() baselines per build (#5121)
- Fix MatrixFactorization trainer's warning (#5071)
- Update CodeGenerator's console project to netcoreapp3.1 (#5066)
- Let ImageLoadingTransformer dispose the last image it loads (#5056)
- [LightGBM] Fixed bug for empty categorical values (#5048)
- Converted potentially large variables to type long (#5041)
- Made resource downloading more robust (#4997)
- Updated MultiFileSource.Load to fix inconsistent behavior with multiple files (#5003)
- Removed WeakReference already cleaned up by GC (#4995)
- Fixed Bitmap(file) locking the file. (#4994)
- Remove WeakReference list in PredictionEnginePoolPolicy. (#4992)
- Added the assembly name of the custom transform to the model file (#4989)
- Updated constructor of ImageLoadingTransformer to accept empty imageFolder paths (#4976)
Onnx bug fixes
- ColumnSelectingTransformer now infers ONNX shape (#5079)
- Fixed KMeans scoring differences between ORT and OnnxRunner (#4942)
- CountFeatureSelectingEstimator no selection support (#5000)
- Fixes OneHotEncoding Issue (#4974)
- Fixes multiclass logistic regression (#4963)
- Adding vector tests for KeyToValue and ValueToKey (#5090)
AutoML fixes
- Handle NaN optimization metric in AutoML (#5031)
- Add projects capability in CodeGenerator (#5002)
- Simplify CodeGen - phase 2 (#4972)
- Support sweeping multiline option in AutoML (#5148)
Test updates
- Fix libomp installation for MacOS Builds(#5143, #5141)
- address TF test download fail, use resource manager with retry download (#5102)
- Adding OneHotHashEncoding Test (#5098)
- Changed Dictionary to ConcurrentDictionary (#5097)
- Added SQLite database to test loading of datasets in non-Windows builds (#5080)
- Added ability to compare configuration specific baselines, updated baslines for many tests and re-enabled disabled tests (#5045, #5059, #5068, #5057, #5047, #5029, #5094, #5060)
- Fixed TestCancellation hanging (#4999)
- fix benchmark test hanging issue (#4985)
- Added working version of checking whether file is available for access (#4938)
Documentation Updates
- Update OnnxTransformer Doc XML (#5085)
- Updated build docs for .NET Core 3.1 (#4967)
- Updated OnnxScoringEstimator's documentation (#4966)
- Fix xrefs in the LDSVM trainer docs (#4940)
- Clarified parameters on time series (#5038)
- Update ForecastBySsa function specifications and add seealso (#5027)
- Add see also section to TensorFlowEstimator docs (#4941)
Breaking Changes
- None