Hello author,
I want to reproduce your experiments, but my only GPU has 8 GB of memory. During the test stage, after some training iterations, GPU memory usage is over 7 GB even with the batch size set to 1 (your paper uses a batch size of 20).
Could you tell me what the GPU requirements for your experiments are?