Right now, the implementation clones the online networks periodically, as in Deep-CACLA and DQN. softUpdate is not implemented yet.
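
For reference, here is a minimal sketch of the two update styles. It assumes softUpdate is meant as the usual Polyak-averaged target update (target = (1 - tau) * target + tau * online), which the note above does not confirm. The parameter layout (a dict of flat float lists), the function names `hard_update` and `soft_update`, and the default `tau` are all hypothetical and only for illustration.

```python
from typing import Dict, List

# Hypothetical parameter container: layer name -> flat list of weights.
Params = Dict[str, List[float]]


def hard_update(target: Params, online: Params) -> None:
    """Periodic cloning: overwrite the target weights with a copy of the online weights."""
    for name, values in online.items():
        target[name] = list(values)  # copy so the two networks do not share storage


def soft_update(target: Params, online: Params, tau: float = 0.01) -> None:
    """Assumed soft update: move each target weight a fraction tau toward the online weight."""
    if not 0.0 <= tau <= 1.0:
        raise ValueError("tau must be in [0, 1]")
    for name, online_values in online.items():
        target_values = target[name]
        for i, w in enumerate(online_values):
            target_values[i] = (1.0 - tau) * target_values[i] + tau * w


if __name__ == "__main__":
    online = {"w": [1.0, 2.0]}
    target = {"w": [0.0, 0.0]}
    soft_update(target, online, tau=0.5)
    print(target)
```

With `tau = 1.0` the soft update reduces to a full clone, so the periodic cloning can be seen as a special case applied every N steps, while a small `tau` applied every step lets the target track the online network gradually.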