We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent 95e748f commit 103e903Copy full SHA for 103e903
Part 7 - Natural Language Processing/NLP.R
@@ -53,7 +53,9 @@ corpus = tm_map(corpus, removePunctuation)
53
# 4. non-relevant words -> remove
54
corpus = tm_map(corpus, removeWords, stopwords())
55
56
+# 5. stemming - reduce the total number of words -> getting the root of each word
57
+corpus = tm_map(corpus, stemDocument)
58
-
59
+# 6. extra spaces -> remove (extra spaces left from removing numbers for example)
60
+corpus = tm_map(corpus, stripWhitespace)
61
0 commit comments