Putting all weights to a single component when defining a separate Q(z|x, y) for each y

Hi @RuiShu, thanks for sharing your code and thoughts on this problem. I've been playing with your proposed models and implementations in pytorch using MNIST data. I noticed that if I changed the Q(z|x, y) implementation to a separate model for each mixture component, the model will put all training data onto a single component (Q(y|x) is degenerate). Do you know why this is happening? Thank you so much!!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Putting all weights to a single component when defining a separate Q(z|x, y) for each y #15

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Putting all weights to a single component when defining a separate Q(z|x, y) for each y #15

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions