Closed
Description
Thank you for your code.
Following the original VAT paper, consistency_func
in hparams.py should be reverse_kl
for VAT, although it is set to forward_kl
in your code.
The adversarial noise r in VAT is obtained by maximizing D_KL(p(y|x)||p(y|x+r)), however, the consistency loss D_KL(p(y|x+r)||p(y|x)) is used when consistency_func=forward_kl
. It matters because of the asymmetricity of KL divergence, I think.
Metadata
Metadata
Assignees
Labels
No labels