[MXNET-424] dtype option for multinomial #10970
Conversation
Kernel<SampleMultinomialKernel, xpu>::Launch(
    s, N, K, M, inputs[0].dptr<DType>(), uniform.dptr_, outputs[0].dptr<int>(),
    param.get_prob ? outputs[1].dptr<DType>() : nullptr);
MSHADOW_TYPE_SWITCH(outputs[0].type_flag_, IType, {
This kind of two-layer type switch is very slow to compile. Why do we need type support for the output?
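The compile-time cost comes from combinatorics: each `MSHADOW_TYPE_SWITCH` expands its body once per supported dtype, so nesting two switches instantiates the kernel once for every (DType, IType) pair. A back-of-the-envelope sketch (the dtype lists here are illustrative, not mshadow's exact supported set):

```python
# Each MSHADOW_TYPE_SWITCH expands its body once per supported dtype,
# so two nested switches multiply the number of kernel instantiations.
outer_dtypes = ["float16", "float32", "float64"]        # input DType
inner_dtypes = ["float16", "float32", "float64",
                "uint8", "int32", "int64"]              # output IType

single_switch = len(outer_dtypes)                        # 3 instantiations
nested_switches = len(outer_dtypes) * len(inner_dtypes)  # 18 instantiations
print(single_switch, nested_switches)
```

Every extra instantiation is another compiled copy of the kernel, which is why nested switches grow both build time and binary size.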
Sometimes the multinomial samples need further processing in floating point arithmetic, so they must be copied into a new array of floating point type, and that copy slows down training. For example, in an RBM the samples are fed to linalg.gemm, which supports only floating point arrays.
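The overhead can be sketched with NumPy standing in for the MXNet arrays (names and shapes here are illustrative): integer samples must be copied into a new floating point buffer before any gemm-style call can accept them.

```python
import numpy as np

rng = np.random.default_rng(0)

# Categorical samples come back with an integer dtype (fixed to int32 before this PR).
probs = np.full(6, 1.0 / 6.0)
samples = rng.choice(6, size=100_000, p=probs).astype(np.int32)

# gemm-style routines accept only floating point inputs, so every training
# step pays for a full O(n) copy just to change the dtype:
samples_f = samples.astype(np.float32)

weights = rng.standard_normal(100_000).astype(np.float32)
dot = weights @ samples_f  # the actual floating point computation
```

With a `dtype` option on the sampler, `samples` could be produced as `float32` directly and the `astype` copy disappears from the training loop.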
A simple cast shouldn't cost that much?
Nested switches like this are really slow to compile and make the binary much bigger. We need to make sure it really justifies the cost.
The binary size increases by about 0.1% for both the shared and the static library (CUDA, cuDNN, MKL). Compiling MXNet already takes quite a long time, so the relative increase in compile time is also tiny.

I'm working with some variants of RBM, and the use of .astype('float32') in several places increases training time by over 20%. For a basic RBM it adds about 10% to training time in my MNIST test. Of course this depends on the hyperparameters and data, but I think that in general the cost cannot be ignored for applications using heavy Monte Carlo sampling of discrete states.
This change would also make things a bit more consistent with the other samplers. For all the rest (uniform, gamma, etc.) we consistently use a floating point return type (32-bit by default), even though some of them (Poisson, negative binomial) are distributions on integer values. And in the cases I have seen where these samplers are used in practice, users in fact needed floating point data for further processing.
OK, we can add this, but using float to represent int is only accurate within certain ranges. Please add checks on the input dimensions for the various types.
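The range limitation is easy to demonstrate: a float32 has a 24-bit significand, so integers above 2**24 are no longer all exactly representable, and for float16 the limit is already 2**11:

```python
import numpy as np

# float32: every integer in [0, 2**24] is exact; 2**24 + 1 is not.
assert np.float32(2**24 - 1) + np.float32(1) == np.float32(2**24)
assert np.float32(2**24 + 1) == np.float32(2**24)  # 16777217 rounds to 16777216

# float16: an 11-bit significand, so the limit is 2**11 = 2048.
assert np.float16(2**11 + 1) == np.float16(2**11)  # 2049 rounds to 2048
```

This is why the sampler has to verify that the number of categories (the last input dimension) fits inside the requested dtype's exact-integer range.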
I added the check:
@@ -67,6 +70,10 @@ inline bool SampleMultinomialOpShape(const nnvm::NodeAttrs& attrs,
   const TShape& ishape = (*in_attrs)[0];
   if (!ishape.ndim()) return false;

   MSHADOW_TYPE_SWITCH(param.dtype, DType, {
     CHECK_LE(ishape[ishape.ndim() - 1], mxnet::common::MaxIntegerValue<DType>());
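For reference, the bound supplied by `MaxIntegerValue<DType>()` is the largest value below which every integer is exactly representable in `DType`. A Python sketch of the same table (illustrative, not MXNet's actual implementation):

```python
import numpy as np

def max_integer_value(dtype) -> int:
    """Largest n such that every integer in [0, n] is exact in `dtype` (illustrative)."""
    dtype = np.dtype(dtype)
    if dtype.kind in "iu":  # integer dtypes represent their whole range exactly
        return int(np.iinfo(dtype).max)
    # Floating point: exact integers run up to 2**(significand bits).
    return 1 << (np.finfo(dtype).nmant + 1)

assert max_integer_value(np.float16) == 2**11   # 2048
assert max_integer_value(np.float32) == 2**24   # 16777216
assert max_integer_value(np.float64) == 2**53
```

The shape check above then amounts to: the last input dimension (the number of categories) must not exceed `max_integer_value(dtype)`.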
Need to output a message saying why it failed.
I added an error message.
otherwise LGTM
* dtype option for multinomial
* Add missing test for uint8
* Add check to ensure dtype has a sufficient precision.
* Fix lint
* Error message for the dtype precision check
* Retrigger CI
Description

This PR adds a `dtype` option to set the data type of the sample output array of `random.multinomial`, which is fixed as 'int32' in the current implementation. The default value is `int32`.

Checklist
Essentials
Please feel free to remove inapplicable items for your PR.