-
Notifications
You must be signed in to change notification settings - Fork 5.9k
Closed
Description
We plan to tune fluid's performance with a loop with three steps:
- Profile: To figure out which part of the fluid is slow.
- Find problems & Give a fix: We will discuss and find the problems based on profile results.
- Profile: To confirm the problems has been solved and the performance is improved.
There are several jobs for these three steps:
- Find a machine with docker and GPU for profiling. @jacquesqiao
- Neural network configurations for CNN, LSTM, etc. @qingqing01 @dzhwinter
- Setup an environment for profiling. @chengduoZH
- Use cProfile for Python, yap for Python/C++, nvprof for CUDA
- Find problems: All members together.
- Fix GPU problems: @jacquesqiao @qingqing01
- Fix CPU problems: @dzhwinter
- Fix Python problems: TODO
qingqing01, pkuyym, chengduoZH, huxiaoman7, tonyyang-svail and 2 more