PPCA Tutorial #499

timshell · 2017-03-04T10:21:22Z

Re: discussion in Gitter channel

Not sure what level of detail is expected. I used the other tutorials as my guide. Wanted to get one tutorial done in line with what you expect before I start doing more.

All feedback appreciated!

dustinvtran · 2017-03-04T17:28:15Z

Thanks for writing this. Your description of PPCA is accurate and succinct.

I think the ideal model tutorial should roughly explain its key ideas through an illustration with a data set, followed by the model, followed by an algorithm to infer it, followed by a check of its fit. (Good references are http://edwardlib.org/tutorials/unsupervised and http://edwardlib.org/tutorials/gan. Some of the other model tutorials are lacking in this regard.) So you could probably scaffold your current writing into sections with the data and output of the script.

timshell · 2017-03-07T20:23:27Z

Thanks for the quick feedback @dustinvtran and sorry it took this long to revise. Let me know if I can improve this in any way. Thanks!

dustinvtran · 2017-03-08T04:28:18Z

docs/tex/bib.bib

+    year = {1999},
+    volume = {61},
+    pages = {611--622}
+


Missing }. Adding the closing brace lets it compile successfully for me.

dustinvtran · 2017-03-08T04:28:41Z

docs/tex/bib.bib

@@ -667,6 +667,14 @@ @article{marin2012approximate
 pages = {1167--1180}
 }

+@ARTICLE{Tipping99probabilisticprincipal,


I recommend the citekey format tipping1999probabilistic, following Google Scholar.

dustinvtran · 2017-03-08T04:29:22Z

docs/tex/tutorials/probabilistic-pca.tex

+
+\subsubsection{Data}
+
+We simulate our data points below. We'll talk about the individual variables and what they stand for in the next section. For this example, $\mathbf{x}\in\mathbb{R}^2$.


"For this example, each data point is 2-dimensional, $\mathbf{x}_n\in\mathbb{R}^2$."
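To make the "each data point is 2-dimensional" wording concrete, here is a minimal NumPy sketch of simulating such data. It is not the code from the tutorial under review: the function name `simulate_ppca_data`, the latent dimension `K=1`, the scale of 2 on the principal axes, and the sample size are all assumptions chosen for illustration.

```python
import numpy as np


def simulate_ppca_data(N, D, K, sigma=1.0, seed=0):
    """Draw N points in R^D from a PPCA-style generative process.

    Hypothetical helper for illustration; not taken from the tutorial.
    """
    rng = np.random.default_rng(seed)
    w = rng.normal(0.0, 2.0, size=(D, K))        # principal axes (scale 2 is an assumption)
    z = rng.normal(0.0, 1.0, size=(K, N))        # one latent vector z_n per data point
    noise = rng.normal(0.0, sigma, size=(D, N))  # isotropic Gaussian noise
    x = w @ z + noise                            # column n is the data point x_n
    return x, w, z


x_train, w_true, z_true = simulate_ppca_data(N=500, D=2, K=1)
print(x_train.shape)  # (2, 500)
```

Storing data points as columns keeps the index n on the second axis, so `x_train[:, n]` is the single point $\mathbf{x}_n\in\mathbb{R}^2$.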

dustinvtran · 2017-03-08T04:30:14Z

docs/tex/tutorials/probabilistic-pca.tex

+
+\subsubsection{Model}
+
+Consider a dataset $X = \{(\mathbf{x}_1, \mathbf{x}_2,\ldots , \mathbf{x}_n)\}$ where $\mathbf{x}_i \in \Bbb{R}^D$.


Following the code, I prefer the notation that N denotes the data set size and n denotes an index to one data point.

timshell (reply): Just to clarify, are you saying to change the i's to n? Or something else?

timshell (reply): Ignore - figured it out by looking at http://edwardlib.org/tutorials/unsupervised
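Read with that convention, the model could be written as in the sketch below. This is the standard PPCA formulation rather than the final text of the tutorial; the latent dimension $K$ and the latent variables $\mathbf{z}_n$ do not appear in the quoted diff and are assumed here.

```latex
\begin{align*}
X &= \{\mathbf{x}_1, \mathbf{x}_2, \ldots, \mathbf{x}_N\},
  \qquad \mathbf{x}_n \in \mathbb{R}^D, \\
\mathbf{z}_n &\sim \mathcal{N}(\mathbf{0}, \mathbf{I}_K), \\
\mathbf{x}_n \mid \mathbf{z}_n &\sim \mathcal{N}(\mathbf{W}\mathbf{z}_n, \sigma^2 \mathbf{I}_D),
  \qquad n = 1, \ldots, N.
\end{align*}
```

Here $N$ is the data set size and $n$ indexes a single data point, which removes the clash between the index $i$ and the upper limit $n$ in the quoted line.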

dustinvtran · 2017-03-08T04:35:17Z

docs/tex/tutorials/probabilistic-pca.tex

+
+Note here that regular PCA is simply the specific case of Probabilistic PCA, where $\sigma^2 \to 0$.
+
+We set up our model below.


I would also describe in the model section that you're placing a distribution over principal axes, either viewed as a prior or as a regularizer.
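The two readings of that distribution can be sketched as follows, assuming a standard normal prior on each entry of $\mathbf{W}$ (the diff does not show which prior the tutorial uses, so the unit variance is an assumption) and with $K$ denoting the assumed latent dimension.

```latex
\begin{align*}
p(\mathbf{W}) &= \prod_{d=1}^{D} \prod_{k=1}^{K} \mathcal{N}(w_{dk} \mid 0, 1), \\
\log p(\mathbf{W}) &= -\tfrac{1}{2} \|\mathbf{W}\|_F^2 + \text{const}, \\
\hat{\mathbf{W}}_{\text{MAP}} &= \arg\max_{\mathbf{W}}
  \Big[ \log p(X \mid \mathbf{W}) - \tfrac{1}{2} \|\mathbf{W}\|_F^2 \Big].
\end{align*}
```

Viewed as a prior, the first line is part of the generative model. Viewed as a regularizer, the last line shows the same term acting as a squared Frobenius-norm penalty on the principal axes during point estimation.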

dustinvtran · 2017-03-08T04:35:39Z

docs/tex/tutorials/probabilistic-pca.tex

+
+\subsubsection{Inference}
+
+Since $\mathbf{W}$ cannot be analytically determined, we must use some approximation method. Below, we set up our inference variables and then run the approximation algorithm. For this example, our method is to minimize the $\text{KL}(q\|p)$ divergence measure.


"Since ... determined, we must use some inference method." In this model, I think the posterior is actually normally distributed.
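The comment does not say which posterior is meant. One case that is known in closed form: with $\mathbf{W}$ and $\sigma$ held fixed, the posterior over each latent $\mathbf{z}_n$ is Gaussian with mean $\mathbf{M}^{-1}\mathbf{W}^\top\mathbf{x}_n$ and covariance $\sigma^2\mathbf{M}^{-1}$, where $\mathbf{M} = \mathbf{W}^\top\mathbf{W} + \sigma^2\mathbf{I}$. The NumPy sketch below computes it; the function name `latent_posterior` and the toy inputs are hypothetical, and this may not be the posterior the reviewer had in mind.

```python
import numpy as np


def latent_posterior(x, w, sigma):
    """Posterior p(z_n | x_n, W) for PPCA with W and sigma held fixed.

    x: (D, N) data, w: (D, K) principal axes, sigma: noise standard deviation.
    Returns the (K, N) posterior means and the shared (K, K) covariance.
    Hypothetical helper for illustration; not taken from the tutorial.
    """
    k = w.shape[1]
    m = w.T @ w + sigma**2 * np.eye(k)   # M = W^T W + sigma^2 I
    mean = np.linalg.solve(m, w.T @ x)   # M^{-1} W^T x_n for every column n
    cov = sigma**2 * np.linalg.inv(m)    # sigma^2 M^{-1}, identical for all n
    return mean, cov


rng = np.random.default_rng(0)
w = rng.normal(size=(2, 1))              # toy principal axes, D=2 and K=1
x = rng.normal(size=(2, 5))              # five toy data points as columns
mean, cov = latent_posterior(x, w, sigma=1.0)
print(mean.shape, cov.shape)  # (1, 5) (1, 1)
```

When $\mathbf{W}$ is itself given a prior and inferred jointly with the latents, this closed form no longer covers the whole problem, which is where an approximate method such as minimizing $\text{KL}(q\|p)$ comes in.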

dustinvtran · 2017-03-08T04:36:21Z

@timshell: The tutorial looks great. Only minor suggestions above. Happy to merge it once those are made.

timshell · 2017-03-08T08:02:05Z

Sweet, just made the changes @dustinvtran !

ppca tutorial

dd56015

revising ppca tutorial

7831192

Merge branch 'master' into tutorials

bd90076

dustinvtran reviewed Mar 8, 2017


timshell added 2 commits March 8, 2017 07:53

minor changes post code review

804b67e

merge

27b4b3e

dustinvtran merged commit e65a5f9 into blei-lab:master Mar 8, 2017

timshell deleted the tutorials branch March 8, 2017 11:19
