rfc: dropout primitive attribute #1708
Conversation
Thanks for the RFC and prototype. Could you clarify the parameters for the attribute? For example, is the memory descriptor for an output mask or an input mask? I see two ways:
- the user passes a probability and the primitive computes the mask;
- the user passes the mask and the primitive just applies it.
If you have training in mind, I wonder if we need to support both (on the forward pass, generate the mask; on the backward pass, consume it).
Other questions:
- What kind of randomness is required of oneDNN implementations (how much entropy? what distribution? ...)?
- What is the plan to validate this attribute (the masking part should be fine; I am asking about the randomness)?
- We might want to add a knob for the user to set the random seed, so that runs can be reproduced.
rfcs/20230818-Dropout/README.md
Outdated
/// otherwise.
dnnl_status_t DNNL_API dnnl_primitive_attr_get_dropout(
        const_dnnl_primitive_attr_t attr,
        float *p, const_dnnl_memory_desc_t *drop_desc);
Do I understand correctly that, if a convolution has the dropout attribute set, on forward we would:
- compute the dropout mask,
- apply the mask to the destination,
- and write the mask to a new output memory?
The question is how that mask would be applied on backward. For example, say we have Conv -> Dropout -> Relu. Do we expect the user to pass the dropout mask to the Relu backward computation or to the Conv backward computation? I would think the former would be simpler to implement.
Yes, the mentioned algorithm is what we expect from this primitive attribute: compute and apply the mask.
On the backward pass, the mask from forward is multiplied by the backward input, i.e. the incoming gradient (see native_dropout_backward in PyTorch).
The main idea was to move mask generation inside oneDNN. Otherwise, the user can just use a binary_mul post-op to apply an existing mask.
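To make that concrete, below is a minimal plain-C sketch of the reference behavior being discussed (this is not oneDNN code; the rand()-based generator is only a stand-in, and the inverted-dropout 1/(1 - p) scaling follows the PyTorch reference): forward generates a keep-mask, applies it to the destination, and writes the mask out; backward multiplies the incoming gradient by the same mask, as native_dropout_backward does.

```c
#include <stdlib.h>

/* Reference (inverted) dropout, forward pass: keep each element with
 * probability 1 - p, scale kept elements by 1/(1 - p), and record the 0/1
 * keep-mask so the backward pass can reuse it. */
static void dropout_fwd(const float *src, float *dst, float *mask, size_t n,
        float p, unsigned seed) {
    srand(seed); /* placeholder RNG, just for the sketch */
    const float scale = 1.0f / (1.0f - p);
    for (size_t i = 0; i < n; ++i) {
        const float keep
                = ((float)rand() / (float)RAND_MAX) >= p ? 1.0f : 0.0f;
        mask[i] = keep;
        dst[i] = src[i] * keep * scale;
    }
}

/* Backward pass: multiply the incoming gradient by the saved mask (and the
 * same scale), mirroring PyTorch's native_dropout_backward. In a
 * Conv -> Dropout -> Relu chain, this is applied to the gradient coming out
 * of Relu backward before it is fed into (or fused with) Conv backward. */
static void dropout_bwd(const float *diff_dst, const float *mask,
        float *diff_src, size_t n, float p) {
    const float scale = 1.0f / (1.0f - p);
    for (size_t i = 0; i < n; ++i)
        diff_src[i] = diff_dst[i] * mask[i] * scale;
}
```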
Thanks. That clarifies the scope of the RFC (the new attribute is for forward only).
BTW, what datatypes is oneDNN expected to support for the mask?
Currently, we test only float32. I think bfloat16 support will probably also be needed in the future.
rfcs/20230818-Dropout/README.md
Outdated
/// Sets probability for drop-out primitive attribute.
///
/// @param attr Primitive attributes.
/// @param p Drop-out probability
Is the probability needed at creation time, or can we take it at execution time?
In particular, if this parameter changes for each execution of a given primitive, passing the probability at execution time would increase the primitive cache hit rate.
In our models/benchmarks we use the same probability for all layers, but I think it is possible to make this a runtime parameter.
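For illustration, here is a rough sketch of what execution-time parameters could look like if the probability (and seed) are accepted at execution time. The DNNL_ARG_ATTR_DROPOUT_* argument names, the scalar-memory convention, and the pre-created engine/stream/primitive/memory handles are assumptions made for this sketch, not part of the proposal text.

```c
#include <stdint.h>
#include "dnnl.h"

/* Sketch: execute a convolution that was created with the dropout attribute,
 * supplying probability and seed at execution time. All handles are assumed
 * to exist already; a CPU engine is assumed so host pointers can be used
 * directly, and error handling is omitted. */
static void run_conv_with_dropout(dnnl_engine_t engine, dnnl_stream_t stream,
        dnnl_primitive_t conv_prim, dnnl_memory_t src_mem,
        dnnl_memory_t wei_mem, dnnl_memory_t dst_mem, dnnl_memory_t mask_mem) {
    /* Scalar memories holding the runtime probability (f32) and seed (s32). */
    dnnl_dims_t one = {1};
    dnnl_memory_desc_t prob_md, seed_md;
    dnnl_memory_desc_create_with_tag(&prob_md, 1, one, dnnl_f32, dnnl_a);
    dnnl_memory_desc_create_with_tag(&seed_md, 1, one, dnnl_s32, dnnl_a);

    float prob = 0.5f;    /* may change between executions of the primitive */
    int32_t seed = 12345; /* exposed so runs can be reproduced */
    dnnl_memory_t prob_mem, seed_mem;
    dnnl_memory_create(&prob_mem, prob_md, engine, &prob);
    dnnl_memory_create(&seed_mem, seed_md, engine, &seed);

    dnnl_exec_arg_t args[] = {
            {DNNL_ARG_SRC, src_mem},
            {DNNL_ARG_WEIGHTS, wei_mem},
            {DNNL_ARG_DST, dst_mem},
            {DNNL_ARG_ATTR_DROPOUT_MASK, mask_mem},        /* written output */
            {DNNL_ARG_ATTR_DROPOUT_PROBABILITY, prob_mem}, /* scalar f32 */
            {DNNL_ARG_ATTR_DROPOUT_SEED, seed_mem},        /* scalar s32 */
    };
    dnnl_primitive_execute(conv_prim, stream,
            (int)(sizeof(args) / sizeof(args[0])), args);
    dnnl_stream_wait(stream);
}
```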
From the PoC, it seems that in the ref path the dropout is applied after post-ops and not before. Is that intentional?
No, I'll change it. Thank you for finding it!
rfcs/20230818-Dropout/README.md
Outdated
/// @returns #dnnl_success on success and a status describing the error
/// otherwise.
dnnl_status_t DNNL_API dnnl_primitive_attr_set_dropout(
        dnnl_primitive_attr_t attr, uint8_t enable_drop,
I would suggest removing enable_drop from both APIs:
- In the getter, rely on the md returned from the API call. If it is a zero md, then dropout was not set by the user.
- In the setter, once the user has called dnnl_primitive_attr_set_dropout(attr, mask), dropout is set.
The current API is a bit confusing when the user calls dnnl_primitive_attr_set_dropout(attr, **false**, mask_desc).
@igorsafo, thanks for your suggestion!
I updated the README and PoC.
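For reference, a minimal sketch of how the updated attribute API might be used (based on the setter/getter proposed in this RFC after the change; the "zero md means dropout is not set" check follows the suggestion above, the dimensions are arbitrary, and error handling/cleanup are omitted):

```c
#include "dnnl.h"

/* Sketch of the updated attribute API: enable dropout on an attribute and
 * query it back. The f32 mask data type matches what the PoC currently tests. */
static int dropout_attr_roundtrip(void) {
    dnnl_dims_t dims = {32, 64, 28, 28};
    dnnl_memory_desc_t mask_md;
    dnnl_memory_desc_create_with_tag(&mask_md, 4, dims, dnnl_f32, dnnl_nchw);

    /* Setter: no enable flag; calling it is what turns dropout on. */
    dnnl_primitive_attr_t attr;
    dnnl_primitive_attr_create(&attr);
    dnnl_primitive_attr_set_dropout(attr, mask_md);

    /* Getter: a zero md (ndims == 0) would mean dropout was not requested. */
    const_dnnl_memory_desc_t queried_md;
    dnnl_primitive_attr_get_dropout(attr, &queried_md);
    int ndims = 0;
    dnnl_memory_desc_query(queried_md, dnnl_query_ndims_s32, &ndims);
    return ndims != 0; /* dropout enabled? */
}
```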
and runtime dropout arguments: output mask, which can be used in backward pass,
dropout probability and seed.
(random spot): We might need to introduce a standalone Dropout primitive to support frameworks like ONNX that register the operations supported by the backend. In the current proposal, Dropout functionality is limited to a few fusion patterns, and the remaining patterns will not be able to implement Dropout using oneDNN. Please double-check with the frameworks that the solution works for them as well.
+@georgen117
I guess the question here is "is there a benefit to supporting a dropout primitive in oneDNN?"
For the case of ONNX, isn't it OK if the oneDNN provider implements the dropout operation without oneDNN, but uses oneDNN when a dropout fusion occurs?
rfcs/20230818-Dropout/README.md
Outdated
uint8_t enabled_u8;
error::wrap_c_api(
        dnnl_primitive_attr_get_dropout(get(), &enabled_u8, &cdesc),
        "could not get parameters of a dropout attribute");
dnnl_memory_desc_t cloned_md = nullptr;
error::wrap_c_api(dnnl_memory_desc_clone(&cloned_md, cdesc),
        "could not clone a memory descriptor");
mask_desc = memory::desc(cloned_md);
enabled = enabled_u8;
Suggested change:
error::wrap_c_api(
        dnnl_primitive_attr_get_dropout(get(), &cdesc),
        "could not get parameters of a dropout attribute");
dnnl_memory_desc_t cloned_md = nullptr;
error::wrap_c_api(dnnl_memory_desc_clone(&cloned_md, cdesc),
        "could not clone a memory descriptor");
mask_desc = memory::desc(cloned_md);
Thanks, changed
This is a proposal to support the dropout operation in oneDNN via a primitive attribute.
Link to rendered document
Link to PoC
Performance data for DGL (with PyTorch backend) GNN training benchmarks on an Icelake server machine: