Pull from upstream #1

justinormont · 2018-06-04T20:17:42Z

No description provided.

Added a badge for NuGet status and fixed a few typos on the readme.md

We've shipped 0.1.0, we should start producing higher versions now. Fix #85

…lpful. (#50) * Comments added to LearningPipeline class in accordance with #Bug 240636: Intellisense is not helpful with filling in pipeline components. * Comments added to LearningPipeline class in accordance with #Bug 240636: Intellisense is not helpful with filling in pipeline components. * Fixed a typo in namespace * Addressed reviewers' comments. * Addressed reviewers' comments. * Addressed reviewers' comments.

* Issue #104: Update the build tools to 2.1.200 Issues: This closes #104 * Updating .NET Version in the right file.

… '+' signs in them. (#102) * Handle generic types and types with multiple '+' signs in them. * Delete commented code. * Address PR comments. * Skip codegen unit test. * Add example to comment. * Assert that the full type name has a dot in it. * Trigger build.

… required but not found." (#121) * Checking for both ColumnAttribute and ColumnNameAttribute when creating schema in CreateBatchPredictionEngine. * Addressed reviewers' comments.

Make a 'not supported field type' exception more readable, so the developer could figure out why he can't load the data This closes #128

…there is '#' infront of header row in Iris.txt. * Removed '#' from the start of header line in Iris.txt. '#' causes header to be ignored. * Updating test instead of iris.txt because the file is being used at many places.

…stic (#133) * Removed calls to DateTime.Now The codebase now uses DateTime.UtcNow, instead of DateTime.Now, to be locale agnostic, except in cases where timezone info is actually needed. Also replaced one starttime measurement with stopwatch.

…ized Additive Models predictor resilient when feature map is not available. (#122) * Instantiate feature map for disk transpose and make Generalized Additive Models predictor resilient when feature map is not available.

…sleading) (#139) * Fix name for Logistic Regression

* Update NuGet packages to fill out all metadata. Also, a minor build change (move property ordering) to fix SourceLink with our packages. Fix #43 Fix #103 * Adding package icon URL. * Update Parquet package description. * Add source code control properties to the NuGet packages. Also, fix a small bug with the nupkgproj files. The intermediate output folders conflict between the nupkgproj and csproj with the same name. This causes issues because the project.assets.json file is being shared between the two projects, which isn't correct.

…er entry point names (#113) * Update suffix of trainer entry point names by trainer kind. * Address PR comments. * Add unit test. * Update C# API * Move unit test to TestAutoInference and fix EntryPointCatalog test. * Trigger build. * Add reference to the test project to make the sweeper entry point visible to EntryPointCatalog test.

The previous 2 changes conflicted. Resovling the break that happened between them.

Add symbols package for ML.Parquet package. Put common NuGet package logic in props file. Fix #144

…ading data from file) (#106) * in memory loader * add test file for memory collection * even in afterlife EntryPointCatalog will chase me down. * Address some comments. * update tests * address more comments. * remove empty param description * hide collectionloader * refactor classes a little. * pesky new lines! * slightly better comments. but only slighty * rename it * make class static * not a loader * remove alias in entrypoint * address comments

…eline (#154) * Prevent learning pipeline from adding null transform model to the pipeline. * Add test.

* Benchmark * Changed to .NET Core app * Added Accuracy Reporting * fixed build * Feedback from Gleb * Added batch prediction tests * Resolved conflicts the sln file * Renamed the new file to match type name * Removed duplicated method

* Publishing nuget packages to myget feed. Also - set the symbols expiration days default based on feedback from the .NET core-eng team. Fixes #11 * Shorten nuget push timeout to match corefx and coreclr.

…#131) * Fix a bug in Tree leaf featurizer entry point, and add a test for it. * Improve unit test * Update unit test * Decrease number of trees and leaves in unit test

* no need to add combiner if you don't have transforms. * fix NextSigned

* Changing name "Documentation" to "docs" for consistency in the repo. Fixes #143

Removed two NextFloat() extension methods from RandomUtils and replaced all usage of them with `IRandom.NextSingle()`.

Copy our native assemblies using MSBuild when a consumer is using NuGet packages.config, since NuGet doesn't do this automatically. Also, add an error when a project is not targeting x64. ML.NET only supports x64. Fix #93

…issue #78 (#187)

handle boolean type in construction utils.

*Code generate support for IDataLoader *Make TextLoader API code generated so that it's at functional parity with the text loader in the ML.Net infrastructure. *Move TextLoader API under Microsoft.ML.Data namespace *Add convenience TextLoader API. *Add error checking for invalid loader arguments such as ordinal, column names. *Update baselines. *Update samples with new loader API and backward compatibility with old loader API.

* Test Enabled, Zbaseline files added for debug and release

* Test Enabled, Zbaseline files added

* Test Enabled, added Debug and release zbaselines

* Test Enabled, Debug and Release baseline files added

* example * add Clusters tests * cleanup * address comments * bring clustering reference back * rephrasing

Add small fix in Microsof.ML.sln

* Scores to label mapping for multi-class classification problem.

*Cross Validation. *Train Test.

* refactor code from test into functions make it more readable * sprinkle some vars

…263) * Move ZBaselines to test/BaselineOutput * Fix the path in BaseTestBaseline * Move Samples\UCI to test\data

…assifierTesterThresholdingTest (#255) * Tests Enabled & Dataset Moved to correct place in test\BaselineOutput * Correcting path for adult data set for autoInference class, and removing @ from path

) * Linear classifier test enabled * Files added to test\BaselineOutput * Extra space removed * Average Preceptron Pav Caliberator test enabled

Spaces in build scripts now properly quoted.

* introduce IUnsupervisedLearningWithWeights * add test to check KMeans don't need label and can handle presence of weight column. also extract real weight value from cursor.

* Changes to RocketEngine to fix take top k logic. * Add namespace information to allow file to reference correct version of Formatting object.

* make class partial so I can add constuctor in separate file. add constructros for testing * formatting

…rics. Made the private const strings in two classes public. (#276)

* add missing subcomponents * right one * more cleanup

* first attempt * add comments * specify seed for random. make constructor internal.

* Fix for SupportedMetric.ByName() method. Include new unit test for function. * Fix for SupportedMetric.ByName() method. Include new unit test for function. * Fix for SupportedMetric.ByName() method. Include new unit test for function. * Removed unnecessary field filter, per review comment.

When training a FastTreeRanker using the `testFrequency` parameter, it is expected that NDCG is prented every testFrequency iterations. However, instead of NDCG, only empty strings are printed. The root cause was that the MaxDCG property of the dataset was never calculated, so the NDCG calculation is aborted, leaving an empty string as a result. This PR fixes the problem by computing the MaxDCG for the dataset when the Tests are defined (so that if the tests are not defined, the MaxDCG will never be calculated). Closes #242

* Added placeholder * Cleaned up Infos (replaced with ColumnPairs) * Added ColumnInfo * Added all the Create() methods. * Added Mapper * Commented out the EntryPoint * Added PcaEstimator2 * PcaWorkout test passes * Added pigsty api * Fixed EntryPoint * Fixed the arguments * Fixed tests and added pigsty test * Deleted Wrapped PCA transform * Float -> float * Cleaned docstrings * Removed some unnecessary checks * Simplified unnecessary code * Moved some fields to ColumnInfo for simplifications * Simplified weight columns * Address PR comments #1 * Addressed PR comments #2 * Moved the static test * PR comments #3 * Moved schema related information out of ColumnInfo and into Mapper.ColumnSchemaInfo. * PR comments * PR comments * Updated manifest for entrypoint PcaCalculator * Fixed schema exceptions

* Implement VBuffer master plan WIP #1 * Getting everything to build and tests passing * Keep moving to the master plan of VBuffer. * Remove the rest of the VBuffer.Count usages in ML.Data * Remove the rest of the VBuffer.Count usages and make VBuffer.Count private. * Fix two failing tests. * Fix FastTreeBinaryClassificationCategoricalSplitTest by remembering the underlying arrays in the column buffer in Transposer. Also enable a Transposer test, since it passes.

Users/dmitrya/ci testing2

* Draft PR for SrCnn batch detection API interface (#1) * POC Batch transform * SrCnn batch interface * Removed comment * Handled some APIreview comments. * Handled other review comments. * Resolved review comments. Added sample. Co-authored-by: Yael Dekel <yaeld@microsoft.com> * Implement SrCnn entire API by function * Fix bugs and add test * Resolve comments * Change names and add documentation * Handling review comments * Resolve the array allocating issue * Move modeler initializing to CreateBatch and other minor fix. * Fix 3 remaining comments * Fixed code analysis issue. * Fixed minor comments Co-authored-by: klausmh <klausmh@microsoft.com> Co-authored-by: Yael Dekel <yaeld@microsoft.com>

forki and others added 30 commits May 9, 2018 09:00

Expand on NuGet installation (#90)

051b62c

Added a badge for NuGet status and fixed a few typos on the readme.md

Bump version number to 0.2. (#95)

4f9ee89

We've shipped 0.1.0, we should start producing higher versions now. Fix #85

publish symbols enabled (#99)

e1db5c5

Fix reversed hyperparameters in Scenarios Tests. Closes #25. (#94)

ff5fb14

Issue #104: Update the build tools to 2.1.200 (#105)

e9cd4bc

* Issue #104: Update the build tools to 2.1.200 Issues: This closes #104 * Updating .NET Version in the right file.

Fixed exception: "InvalidOperationException: Source column 'Label' is…

ea07be8

… required but not found." (#121) * Checking for both ColumnAttribute and ColumnNameAttribute when creating schema in CreateBatchPredictionEngine. * Addressed reviewers' comments.

Update TextLoader.cs (#129)

3780923

Make a 'not supported field type' exception more readable, so the developer could figure out why he can't load the data This closes #128

Fix entry point name for Logistic Regression (LogisticRegressor is mi…

1b08ae4

…sleading) (#139) * Fix name for Logistic Regression

Fix build break (#146)

80a95b5

The previous 2 changes conflicted. Resovling the break that happened between them.

Remove special case for Logistic Regression in MacroUtils.cs (#147)

3102180

Add Parquet symbols nuget package. (#145)

160e9e4

Add symbols package for ML.Parquet package. Put common NuGet package logic in props file. Fix #144

Prevent learning pipeline from adding null transform model to the pip…

52ea962

…eline (#154) * Prevent learning pipeline from adding null transform model to the pipeline. * Add test.

Added Microsoft.ML.Benchmarks Project (#62)

83f9bac

* Benchmark * Changed to .NET Core app * Added Accuracy Reporting * fixed build * Feedback from Gleb * Added batch prediction tests * Resolved conflicts the sln file * Renamed the new file to match type name * Removed duplicated method

Publishing nuget packages to myget feed. (#155)

436700a

* Publishing nuget packages to myget feed. Also - set the symbols expiration days default based on feedback from the .NET core-eng team. Fixes #11 * Shorten nuget push timeout to match corefx and coreclr.

Fix a bug in Tree leaf featurizer entry point, and add a test for it. (…

efa2644

…#131) * Fix a bug in Tree leaf featurizer entry point, and add a test for it. * Improve unit test * Update unit test * Decrease number of trees and leaves in unit test

no need to add combiner if you don't have transforms. (#172)

c5d168d

* no need to add combiner if you don't have transforms. * fix NextSigned

Change "Documentation" folder to "docs" (#87)

616e75f

* Changing name "Documentation" to "docs" for consistency in the repo. Fixes #143

Fixed RandomUtils.NextFloat() extension methods (#177)

ae1ecef

Removed two NextFloat() extension methods from RandomUtils and replaced all usage of them with `IRandom.NextSingle()`.

Support packages.config (#165)

720ecdc

Copy our native assemblies using MSBuild when a consumer is using NuGet packages.config, since NuGet doesn't do this automatically. Also, add an error when a project is not targeting x64. ML.NET only supports x64. Fix #93

Compile CpuMathNative and FastTreeNative with charset=utf-8, fix for …

f16737c

…issue #78 (#187)

handle boolean type in construction utils. (#183)

3f586bd

handle boolean type in construction utils.

Anipik and others added 25 commits May 24, 2018 15:05

Enabling FastTreeBinaryClassificationNoOpGroupIdTest (#227)

5ebf614

* Test Enabled, Zbaseline files added for debug and release

Enables FastTreeHighMinDocsTest (#228)

b069ba2

* Test Enabled, Zbaseline files added

Enables RandomCalibratorPerceptronTest (#231)

38cd636

* Test Enabled, added Debug and release zbaselines

Enables Calibrators Tests for Linear Svm (#233)

5860a4e

* Test Enabled, Debug and Release baseline files added

Add examples for clustering (#222)

b1bbceb

* example * add Clusters tests * cleanup * address comments * bring clustering reference back * rephrasing

Change "Documentation" folder to "docs" (#240)

5d368b8

Add small fix in Microsof.ML.sln

Scores to Label mapping (#239)

ca8d46a

* Scores to label mapping for multi-class classification problem.

Cross Validation and TrainTest (#212)

11d5ba7

*Cross Validation. *Train Test.

Cleanup SentimentPredictionTests (#260)

9f3076d

* refactor code from test into functions make it more readable * sprinkle some vars

Move ZBaselines => test/BaselineOutput and Samples/UCI => test/data (#…

20c36ca

…263) * Move ZBaselines to test/BaselineOutput * Fix the path in BaseTestBaseline * Move Samples\UCI to test\data

Enables FastTreeBinaryClassificationCategoricalSplitTest and BinaryCl…

ee7a669

…assifierTesterThresholdingTest (#255) * Tests Enabled & Dataset Moved to correct place in test\BaselineOutput * Correcting path for adult data set for autoInference class, and removing @ from path

LinearClassifierTest And PAVCalibratorPerceptronTest being Enabled (#253

0233d71

) * Linear classifier test enabled * Files added to test\BaselineOutput * Extra space removed * Average Preceptron Pav Caliberator test enabled

Fixes build error when path contains space on Linux (#247)

ed57712

Spaces in build scripts now properly quoted.

introduce IUnsupervisedLearningWithWeights (#236)

5ff56ba

* introduce IUnsupervisedLearningWithWeights * add test to check KMeans don't need label and can handle presence of weight column. also extract real weight value from cursor.

Remove references to ILAsmVersion.txt from build script (#266)

62da34e

Bump master to v0.3 (#269)

10508f8

RocketEngine fix for selecting top learners (#270)

c259863

* Changes to RocketEngine to fix take top k logic. * Add namespace information to allow file to reference correct version of Formatting object.

small code cleanup (#271)

9d19d0e

Preparation for syncing sources with internal repo (#275)

fbd4de0

* make class partial so I can add constuctor in separate file. add constructros for testing * formatting

Changes to use evaluator metrics names in PipelineSweeperSupportedMet…

ba9c0f6

…rics. Made the private const strings in two classes public. (#276)

add missing subcomponents to sweepers (#278)

5dc7848

* add missing subcomponents * right one * more cleanup

remove lotus references. (#252)

71e7ff3

Random seed and concurrency for tests (#277)

fb06f38

* first attempt * add comments * specify seed for random. make constructor internal.

justinormont merged commit f03189d into justinormont:master Jun 4, 2018

justinormont pushed a commit that referenced this pull request Apr 15, 2019

Merge pull request #1 from dotnet/users/dmitrya/ci-testing2

ef3a6b3

Users/dmitrya/ci testing2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Pull from upstream #1

Pull from upstream #1

Uh oh!

justinormont commented Jun 4, 2018

Uh oh!

Uh oh!

Pull from upstream #1

Pull from upstream #1

Uh oh!

Conversation

justinormont commented Jun 4, 2018

Uh oh!

Uh oh!