[FIXED]: Increase in training time after each batch #3
After refactoring the neural net code to use the new tensor lib, training slowed down noticeably, and it got progressively slower after each batch. The problem was the line that updates the weights: each update is itself a tensor op, and because of how the `requires_grad` flag was implemented, those ops kept adding nodes to the computational graph. As a result, backprop after each batch had to run over a larger and larger graph, which caused the slowdown.

The issue has been fixed temporarily by doing the weight update on raw ndarrays instead of tensors (so the computational graph is not involved at all) and then assigning a new tensor from the result once all the ops are done.
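Below is a minimal sketch of the pattern described above. The `Tensor` class, its attributes (`data`, `grad`, `requires_grad`, `_prev`), and the `update_weights_*` helpers are assumptions standing in for the project's actual tensor library, not its real API; the point is only to contrast a graph-building update with the raw-ndarray workaround.

```python
import numpy as np

# Hypothetical stand-in for the project's tensor library (not its real API).
class Tensor:
    def __init__(self, data, requires_grad=False):
        self.data = np.asarray(data, dtype=np.float64)
        self.grad = None              # assumed to be filled in by backprop (not shown)
        self.requires_grad = requires_grad
        self._prev = ()               # parent nodes in the computational graph

    def __mul__(self, other):
        other = other if isinstance(other, Tensor) else Tensor(other)
        out = Tensor(self.data * other.data,
                     requires_grad=self.requires_grad or other.requires_grad)
        out._prev = (self, other)     # every op appends nodes to the graph
        return out

    def __sub__(self, other):
        other = other if isinstance(other, Tensor) else Tensor(other)
        out = Tensor(self.data - other.data,
                     requires_grad=self.requires_grad or other.requires_grad)
        out._prev = (self, other)
        return out


# Problematic pattern: the scale and subtract are tensor ops, so the "new"
# weights keep references to the old graph and the graph grows every batch.
def update_weights_slow(w, lr):
    return w - Tensor(lr) * w.grad    # update itself becomes part of the graph


# Temporary fix from this issue: do the arithmetic on raw ndarrays and wrap
# the result in a fresh Tensor, so the update creates no graph nodes at all.
def update_weights_fast(w, lr):
    new_data = w.data - lr * w.grad.data    # plain NumPy, no graph bookkeeping
    return Tensor(new_data, requires_grad=True)
```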