Currently we are still slow on CPU (#1222).
There are several things we can do:
- Do a thorough profile on a chosen set of benchmarks to understand the bottlenecks (see the timing sketch after this list). Some candidates:
  - MNIST CNN model: since the MNIST model is very small, performance will suffer if system overhead is high, so it will expose potential bottlenecks in non-CNN operations.
  - CIFAR: this is much heavier than MNIST, so it will mostly show us the performance of the underlying libraries we are using, i.e. OpenBLAS/MKL + MShadow. I think the configuration of these libraries (e.g. the number of threads) has quite a lot of impact on overall performance.
- Integrate libraries like NNPACK (https://github.com/Maratyszcza/NNPACK) and MKLDNN (https://software.intel.com/en-us/articles/deep-neural-network-technical-preview-for-intel-math-kernel-library-intel-mkl); a baseline convolution timing is sketched below.
- Improve the operators not covered by NNPACK and MKLDNN. This would involve some code in MShadow and some in the mxnet operators; the per-operator microbenchmark at the end could help decide where to start.
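
As a starting point for the profiling item, a minimal timing harness along these lines could drive both benchmarks. This is a sketch only: it assumes a recent MXNet build with the Gluon API, the model is a stand-in for the real MNIST CNN script, and the thread counts are placeholder values to sweep over, not recommendations.

```python
import os
# Thread configuration has to happen before mxnet (and the OpenBLAS/MKL
# it links against) is loaded. These are values to sweep, not defaults.
os.environ.setdefault('OMP_NUM_THREADS', '4')
os.environ.setdefault('MKL_NUM_THREADS', '4')
os.environ.setdefault('MXNET_CPU_WORKER_NTHREADS', '4')

import time
import mxnet as mx
from mxnet.gluon import nn

ctx = mx.cpu()

# Stand-in for the MNIST CNN benchmark model; swap in the real network.
net = nn.HybridSequential()
net.add(nn.Conv2D(32, kernel_size=3, activation='relu'),
        nn.MaxPool2D(),
        nn.Flatten(),
        nn.Dense(10))
net.initialize(ctx=ctx)
net.hybridize()

x = mx.nd.random.uniform(shape=(64, 1, 28, 28), ctx=ctx)
net(x).wait_to_read()  # warm-up: parameter init and graph construction

start = time.time()
for _ in range(100):
    net(x)
mx.nd.waitall()        # execution is asynchronous; block before timing
elapsed = time.time() - start
print('%.1f images/sec' % (100 * 64 / elapsed))
```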
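
Since convolution dominates CNN runtime on CPU, it is the first thing NNPACK or MKLDNN would take over. A quick way to establish the current baseline they have to beat is to time the convolution operator in isolation (the shapes here are arbitrary):

```python
import time
import mxnet as mx

ctx = mx.cpu()
data = mx.nd.random.uniform(shape=(64, 64, 32, 32), ctx=ctx)
weight = mx.nd.random.uniform(shape=(128, 64, 3, 3), ctx=ctx)
bias = mx.nd.zeros((128,), ctx=ctx)

# Warm-up call so lazy initialization is not timed.
mx.nd.Convolution(data=data, weight=weight, bias=bias,
                  kernel=(3, 3), num_filter=128).wait_to_read()

start = time.time()
for _ in range(50):
    mx.nd.Convolution(data=data, weight=weight, bias=bias,
                      kernel=(3, 3), num_filter=128)
mx.nd.waitall()
print('conv: %.2f ms/call' % ((time.time() - start) / 50 * 1e3))
```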
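
And to decide which of the remaining operators are worth improving first, the same pattern can be applied per operator. The operator list and shapes below are illustrative, not a claim about where the time actually goes:

```python
import time
import mxnet as mx

ctx = mx.cpu()
x = mx.nd.random.uniform(shape=(64, 128, 16, 16), ctx=ctx)

# A few candidate operators; extend this table with whatever the profile flags.
ops = {
    'relu':    lambda: mx.nd.Activation(x, act_type='relu'),
    'pooling': lambda: mx.nd.Pooling(x, kernel=(2, 2), pool_type='max'),
    'softmax': lambda: mx.nd.softmax(x.reshape((64, -1))),
}

for name, fn in ops.items():
    fn().wait_to_read()  # warm-up
    start = time.time()
    for _ in range(100):
        fn()
    mx.nd.waitall()
    print('%-8s %.3f ms/call' % (name, (time.time() - start) / 100 * 1e3))
```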