Use L-BFGS-B Fortran library for native logistic regression benchmark #8

bibikar · 2019-06-13T15:36:04Z

This PR swaps out the native logistic regression benchmark's solver for the same one used in SciPy. L-BFGS-B is wrapped in a minimal DAAL optimization_solver class and directly set as the solver for DAAL's logistic regression. This brings native performance for this benchmark to match our optimized scikit-learn performance for a sample binary classification problem with 100k samples and 1k features.

Because we must now directly call BLAS and LAPACK functions instead of using DAAL's internal BLAS and LAPACK implementations, we add a dependency on MKL as well.

… scipy v0.10.0

oleksandr-pavlyk · 2019-06-25T14:01:54Z

native/Makefile

@@ -26,10 +27,19 @@ all: $(addprefix bin/,$(BENCHMARKS))
 bin:
 	mkdir -p bin

+bin/log_reg_lbfgs: log_reg_lbfgs_bench.cpp $(FOBJ) | bin
+	$(CXX) $^ $(CXXINCLUDE) $(CXXFLAGS) $(LDFLAGS) -lmkl_rt -lm -lifcore \


Linking against mkl_rt, are we making sure that MKL is using TBB as the underlying threading layer?

MKL's default is to use OpenMP, as since DAAL's default is to use TBB we end up incurring the cost of runtime of both.

One can either set the threading layer in the benchmark itself, by calling mkl_set_threading_layer, or use explicit dynamic linking, see MKL Linking Advisor.

I added a call to mkl_set_threading_layer in native/log_reg_lbfgs_bench.cpp.

oleksandr-pavlyk

Thanks!

This PR adds logistic regression replicating the results of sklearn.linear_model.LogisticRegression but implemented in daal4py. It supports solvers lbfgs and newton-cg. test_fit uses the canonical form of logistic regression for the binary case (which in scikit-learn is multi_class='ovr') and the multinomial (softmax of exponentiated scores, multi_class='multinomial') for the multi-class case. test_predict supports any combination of n_classes and multi_class, but we use the same multi_class used in test_fit. While we cannot directly use daal4py's logistic regression because it isn't as easy as native DAAL to pass in a custom solver (see #8), we use daal4py's logistic regression objective functions and math primitives to compute logistic regression.

optional build mode without distributed mode

* adding cycling notebook example * Making adjustments to the Jupyter notebook * adding in main landing page changes * Adding in examples page * Fixing examples, formatting, and wording * fixing removed items and typos in example code

bibikar added 9 commits June 13, 2019 10:21

Add L-BFGS-B sources for version 3.0 and begin work on DAAL wrapper

befd446

More work on DAAL driver for L-BFGS-B

19aa7e3

Fix compilation of lbfgsb_daal.h

0237c80

Add extern "C" where needed

a17d2ae

L-BFGS-B minimizer appears to work now

f2d93c3

Use --verbose arg to set L-BFGS-B verbosity

371d7dc

Replace included LINPACK functions with wrappers calling LAPACK as in…

2d5b46a

… scipy v0.10.0

Fix compilation of logistic regression native bench

8bda71a

Note dependency on ifort and MKL for native benches

5bda5c7

bibikar requested a review from oleksandr-pavlyk June 13, 2019 15:36

oleksandr-pavlyk reviewed Jun 25, 2019

View reviewed changes

bibikar added 3 commits June 25, 2019 14:32

Ask MKL to use TBB threading before anything else

6dd9d86

Do not segfault inside L-BFGS-B

5de42cc

Decrease tolerance requirement for newton-cg solver

60334cd

oleksandr-pavlyk approved these changes Jun 27, 2019

View reviewed changes

bibikar mentioned this pull request Jun 27, 2019

Add daal4py logistic regression benchmark #10

Merged

Add LICENSES_bundled like in scipy

cc1e4af

bibikar merged commit a475be0 into IntelPython:master Jun 27, 2019

bibikar deleted the feature/l_bfgs_b branch June 27, 2019 18:10

razdoburdin pushed a commit to razdoburdin/scikit-learn_bench that referenced this pull request Jun 13, 2023

Merge pull request IntelPython#8 from SAT/feature/nodist

176452f

optional build mode without distributed mode

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Use L-BFGS-B Fortran library for native logistic regression benchmark #8

Use L-BFGS-B Fortran library for native logistic regression benchmark #8

Uh oh!

bibikar commented Jun 13, 2019

Uh oh!

oleksandr-pavlyk Jun 25, 2019

Uh oh!

bibikar Jun 25, 2019

Uh oh!

oleksandr-pavlyk left a comment

Uh oh!

Uh oh!

Use L-BFGS-B Fortran library for native logistic regression benchmark #8

Use L-BFGS-B Fortran library for native logistic regression benchmark #8

Uh oh!

Conversation

bibikar commented Jun 13, 2019

Uh oh!

oleksandr-pavlyk Jun 25, 2019

Choose a reason for hiding this comment

Uh oh!

bibikar Jun 25, 2019

Choose a reason for hiding this comment

Uh oh!

oleksandr-pavlyk left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!