Skip to content

Commit

Permalink
Merge pull request cgpotts#35 from insop/insop/master
Browse files Browse the repository at this point in the history
Minor typo fix
  • Loading branch information
cgpotts authored Dec 26, 2019
2 parents 98ce0d3 + 3d985a6 commit b514014
Showing 1 changed file with 4 additions and 4 deletions.
8 changes: 4 additions & 4 deletions vsm_01_distributional.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -82,7 +82,7 @@
"\n",
"Why build distributed representations? There are potentially many reasons. The two we will emphasize in this course:\n",
"\n",
"1. __Understanding words in context__: There is value to linguists in seeing what these data-rich approaches can teach use about natural language lexicons, and there is value for social scientists in understanding how words are being used.\n",
"1. __Understanding words in context__: There is value to linguists in seeing what these data-rich approaches can teach us about natural language lexicons, and there is value for social scientists in understanding how words are being used.\n",
"\n",
"1. __Feature representations for other models__: As we will see, many models can benefit from representing examples as distributed representations."
]
Expand Down Expand Up @@ -908,7 +908,7 @@
"\n",
"Let's do a quick worked-out example. Suppose we have the count matrix $X$ = \n",
"\n",
"| | a | b | rowsum |\n",
"| . | a | b | rowsum |\n",
"|----------|----|----|-------|\n",
"| __x__ | 34 | 11 | 45 |\n",
"| __y__ | 47 | 7 | 54 |\n",
Expand All @@ -920,7 +920,7 @@
"\n",
"And the full table looks like this:\n",
"\n",
"| | a | b | \n",
"| . | a | b | \n",
"|--------|------|------|\n",
"| __x__ | 0.92 | 1.34 | \n",
"| __y__ | 1.06 | 0.71 |"
Expand Down Expand Up @@ -1380,7 +1380,7 @@
" \n",
" to read in an adjective $\\times$ adverb matrix derived from the Gigaword corpus. Each cell contains the number of times that the modifier phrase __ADV ADJ__ appeared in Gigaword as given by dependency parses of the data. __ADJ__ is the row value and __ADV__ is the column value. Using the above techniques and measures, try to get a feel for what can be done with this matrix.\n",
"\n",
"1. [Turney and Pantel (2010)](http://www.jair.org/media/2934/live-2934-4846-jair.pdf), p. 158, propose a \"contextual discounting\" extension of PMI to try to address its bias for low-frequency events. Extend `vsm.pmi` so that the user has the option of performing this discounting with the keyword argument `discounting=True`."
"1. [Turney and Pantel (2010)](https://jair.org/index.php/jair/article/view/10640), p. 158, propose a \"contextual discounting\" extension of PMI to try to address its bias for low-frequency events. Extend `vsm.pmi` so that the user has the option of performing this discounting with the keyword argument `discounting=True`."
]
}
],
Expand Down

0 comments on commit b514014

Please sign in to comment.