-
Notifications
You must be signed in to change notification settings - Fork 2
Description
Hey, just see this project from your reddit post. We are working on something related called nnvm https://github.com/dmlc/nnvm
Here are the few things I like to clarify.
-
NNVM does do symbolic differentiation, which enables higher order differentiation if the gradient operator's gradient is well defined. In fact mxnet also creates new nodes for gradient, except that in the front-end it exposes module based API.
-
NNVM's optimization relies on attributes such as shape function, type inference function, which are strict. The observation is at such high level, it is hard to enumerate the operations a DL system want. This being said, it is indeed helpful to have a collection of primitive operators, which can be hinted through additional attributes. We used such thing for fusion module
I am hoping to see if there is any chance for discussion and lure you into NNVM to build interesting abstractions together, or making the abstraction better for everyone