Skip to content

Updating 20180908 #1

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 280 commits into from
Sep 9, 2018
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
280 commits
Select commit Hold shift + click to select a range
ccbbc42
added sample codebook using PROPPR
kirstenfrank Sep 26, 2014
61e2560
added a sample codebook using dogwalking
kirstenfrank Sep 27, 2014
01f3215
added linked to curated knowledge
Sep 27, 2014
869cdfd
Merge pull request #33 from yoke2/master
seankross Sep 29, 2014
69bcffb
Update getclean.md
kirstenfrank Oct 3, 2014
18a576c
removed two sample files
kirstenfrank Oct 3, 2014
fabc666
fixed formatting typos
seankross Oct 7, 2014
d3d606c
Update curated.md
marcmtk Oct 7, 2014
5d79a8c
Merge pull request #34 from marcmtk/master
seankross Oct 7, 2014
3090ff6
Update getclean.md
schnee Oct 9, 2014
3bad92a
Merge pull request #35 from schnee/master
seankross Oct 9, 2014
e483f01
add link to curated knowledge
Oct 11, 2014
26178f6
Merge pull request #36 from wittyfilter/master
seankross Oct 11, 2014
9509d56
added link to prebuilt virtual machine
Oct 13, 2014
77d788d
Merge branch 'queirozfcom-master'
seankross Oct 13, 2014
ab80963
added other virtual machines
seankross Oct 13, 2014
4a2c406
Update curated.md
collect Oct 14, 2014
89a253d
Merge pull request #38 from stoked/patch-2
seankross Oct 14, 2014
67cf76b
updated to match teachers
kirstenfrank Oct 14, 2014
5226adf
cleaned up merge text
kirstenfrank Oct 14, 2014
0a70ba8
Merge pull request #32 from kirstenfrank/master
seankross Oct 17, 2014
8098250
Added DSC link.
aglagla Nov 5, 2014
3278504
Update curated.md
aglagla Nov 5, 2014
a179c2b
Create Networking
charany1 Nov 6, 2014
d314029
Added networking field.
charany1 Nov 6, 2014
6e3cda2
Merge pull request #39 from aglagla/dss
seankross Nov 6, 2014
b64a296
Merge branch 'master' of https://github.com/charany1/DataScienceSpeci…
seankross Nov 8, 2014
28b2bda
added group pages to curated section
seankross Nov 8, 2014
144a342
delete .Rhistory
elmerehbi Nov 9, 2014
64c256c
added md live demo link
elmerehbi Nov 10, 2014
6aa8642
added reference books
elmerehbi Nov 10, 2014
0838cf5
organized links
seankross Nov 10, 2014
7b12d45
add ROC links to Practical Machine Learning page
justmarkham Nov 20, 2014
e7550ab
Merge pull request #42 from justmarkham/master
seankross Nov 20, 2014
1393311
added links on RStudio projects & gitignore
elmerehbi Dec 6, 2014
536ca14
added '.Rhistory' to '.gitignore'
elmerehbi Dec 6, 2014
fe23c65
Merge pull request #43 from elmerehbi/master
seankross Dec 8, 2014
6b88fdb
Added link to Shiny 401K simulator
eddyod Dec 10, 2014
e690b34
Merge pull request #45 from Mephistosoftware/master
seankross Dec 11, 2014
162158d
Create lubridate
peperomia Jan 7, 2015
c7f2aa2
added data science ontology link and corrected typos
yoke2 Jan 7, 2015
b0312e0
added data science ontology link and corrected typos
yoke2 Jan 7, 2015
7329b5f
Added Virtual machine with RStudio server and github setup in other r…
tboloo Jan 7, 2015
a787948
Add link to Code School's Try Git
felipebalbi Jan 8, 2015
026efa6
Add link to Code School's Try R free course
felipebalbi Jan 8, 2015
b4198e6
r language cheatsheet
Jan 9, 2015
260cdf6
Merge pull request #51 from startupjing/master
seankross Jan 9, 2015
314b5e5
Merge pull request #49 from tboloo/master
seankross Jan 9, 2015
dc328b7
Merge pull request #48 from yoke2/master
seankross Jan 9, 2015
bdee96d
ammend pull request
seankross Jan 9, 2015
04f6a29
t push origin masterMerge branch 'felipebalbi-add-try-git-and-try-r-l…
seankross Jan 9, 2015
a75e2c1
Merge branch 'master' of https://github.com/peperomia/DataScienceSpec…
seankross Jan 9, 2015
72c570c
added lubridate info
seankross Jan 9, 2015
26ba0af
Merge branch 'peperot push origin mastermia-master'
seankross Jan 9, 2015
432a746
Fixed 'R' header to match others
Jan 13, 2015
c74d638
Merge pull request #52 from Isweet/header
seankross Jan 14, 2015
4c05b3d
Add Google Developers R Programming Video Lectures
snagargoje Jan 14, 2015
9c97623
fixed extra space
seankross Jan 15, 2015
d37880a
Merge branch 'snagargoje-master'
seankross Jan 15, 2015
a07fa2c
RShiny Youtube Video Tutorials
aagarw30 Jan 15, 2015
e633e9e
added shiny playlist
seankross Jan 15, 2015
229eb06
Merge branch 'aagarw30-patch-1't push origin master
seankross Jan 15, 2015
92e8899
Added github tutorial
chribonn Jan 15, 2015
d36bfbd
Added link to Data Elixir under Further Reading
Jan 17, 2015
b5e83a8
initial commit
cbryant1000 Jan 21, 2015
ff89a3d
Merge pull request #58 from cbryant1000/master
seankross Jan 21, 2015
eda9606
Merge pull request #56 from LonRiesberg/master
seankross Jan 21, 2015
3869a95
Merge pull request #55 from chribonn/master
seankross Jan 21, 2015
709cb7e
added pa1test link
cbryant1000 Jan 28, 2015
83ec820
Merge pull request #59 from cbryant1000/master
seankross Jan 28, 2015
2e0a6cd
Update README.md
scottwdavidson Feb 5, 2015
925ff0e
Merge pull request #60 from scottwdavidson/master
seankross Feb 5, 2015
d877309
Choropleth maps of sum of PM2.5 by county for each of the 4 years of …
BillSeliger Feb 5, 2015
56bf4a8
Emissions Data Choropleth Maps
BillSeliger Feb 6, 2015
806c778
updated eda
seankross Feb 6, 2015
30b9f93
Merge branch 'BillSeliger-master'
seankross Feb 6, 2015
696dde4
Trello Board Collection of Data Science Resources
thiakx Feb 12, 2015
91ba7f0
Trello Board Collection of Data Science Resources
thiakx Feb 12, 2015
e9ff1fc
Merge pull request #63 from thiakx/master
seankross Feb 12, 2015
7d7c6c3
Diving Into Data Science Flipboard
thiakx Feb 12, 2015
00ad838
Merge pull request #64 from thiakx/master
seankross Feb 12, 2015
4257453
added link to an Rscript
elmerehbi Feb 13, 2015
a6b1227
Merge pull request #65 from elmerehbi/master
seankross Feb 18, 2015
6ab13c6
add comparison of supervised learning algorithms to PML page
justmarkham Feb 27, 2015
c1c5bc2
Merge pull request #67 from justmarkham/master
seankross Feb 27, 2015
fc21325
Create Interactive_Git_Game
theendless219 Mar 1, 2015
6836a0d
Update curated.md
theendless219 Mar 1, 2015
7ce91f1
added git branch game
seankross Mar 1, 2015
cb5fd15
Merge branch 'theendless219-master't push origin master
seankross Mar 1, 2015
b8af6de
Added the 8th entry to this list
alkashef Mar 4, 2015
198d463
Changed the title of the 8th entry
alkashef Mar 4, 2015
09fa1a1
Merge pull request #69 from alkashef/master
seankross Mar 5, 2015
75ed32e
add new dplyr tutorial to getclean and improve description of older t…
justmarkham Mar 10, 2015
a475214
Merge pull request #71 from justmarkham/master
seankross Mar 10, 2015
e8d696e
suggested addition
Manu58 Mar 14, 2015
7d35821
Merge pull request #72 from Manu58/master
seankross Mar 17, 2015
b43681b
Create statinf-exp-distro
ProgramErgoSum Mar 19, 2015
4afcac9
codebook template
JorisSchut Mar 22, 2015
31662eb
added births gist
seankross Mar 24, 2015
59fcee2
Merge branch 'ProgramErgoSum-master't push origin master
seankross Mar 24, 2015
8afb04d
Merge branch 'codebook-template' of https://github.com/JorisSchut/Dat…
seankross Mar 24, 2015
a205ec7
added codebook gist
seankross Mar 24, 2015
8752180
Merge branch 'JorisSchut-codebook-template't push origin master
seankross Mar 24, 2015
ec64fab
Added links to the notes I compiled for all 9 classes
sux13 Mar 24, 2015
4674aee
clean up conflicts
seankross Mar 24, 2015
d70dba3
Merge branch 'sux13-master't push origin master
seankross Mar 24, 2015
dab179d
Upload benefit-cost analysis exercise in knitr
dkillian Apr 8, 2015
35e0c36
add link to machine learning video on PML page
justmarkham Apr 9, 2015
04765df
Merge pull request #78 from justmarkham/master
seankross Apr 9, 2015
601d05c
Update repres.md
dkillian Apr 9, 2015
a15e476
Delete Benefit-cost analysis of park user fee.html
dkillian Apr 9, 2015
f0aaa43
Update repres.md
dkillian Apr 11, 2015
5197556
Merge pull request #77 from dkillian/master
seankross Apr 12, 2015
d8db891
Update curated.md
Aratinga Apr 13, 2015
d3b31e4
Merge pull request #79 from Aratinga/master
seankross Apr 13, 2015
d29df74
fixed link
seankross Apr 13, 2015
0437d0e
Add link to Probability and Statistics Cookbook
wxn0000 Apr 23, 2015
eee4951
modified statinf
seankross Apr 24, 2015
d1af785
Merge branch 'wxn0000-master'
seankross Apr 24, 2015
96a2637
add link to post about how to use bash commands
edgarshurtado Apr 24, 2015
a150d46
Merge pull request #81 from edgarshurtado/master
seankross Apr 24, 2015
130e725
Tutorial for those struggling with PA2
DanieleP Apr 26, 2015
a602f96
Merge pull request #82 from DanieleP/patch-1
seankross Apr 26, 2015
71e6a91
Tutorial for those struggling with PA3
DanieleP Apr 28, 2015
daca391
Merge pull request #83 from DanieleP/master
seankross Apr 28, 2015
bf75551
added awesome lists for R & ML
elmerehbi Apr 29, 2015
7871ec1
Merge pull request #84 from elmerehbi/master
seankross Apr 30, 2015
8ed4584
Additions to Curated page
Aratinga May 3, 2015
2145f68
Fixed one line
Aratinga May 3, 2015
f7ef288
fixed conflicts
seankross May 6, 2015
23b5651
Merge branch 'Aratinga-master'
seankross May 6, 2015
fe6a280
Added link to rprog.md for alternative submit script
rchampoux May 12, 2015
35f4fca
Merge pull request #86 from rchampoux/master
seankross May 12, 2015
f11116a
RPub NOAA Data
May 22, 2015
4fb145b
Merge pull request #87 from shanemeister/master
seankross May 22, 2015
7d983f1
Updated eda.md to add new link
impiyush Jun 2, 2015
79a0953
Merge pull request #1 from impiyush/impiyush-eda-link
impiyush Jun 2, 2015
f0d6621
Update curated.md
isvaldo Jun 4, 2015
b3a1862
Update curated.md
isvaldo Jun 4, 2015
3c71788
Merge pull request #90 from isvaldo/patch-1
seankross Jun 5, 2015
86f6ef1
Merge branch 'master' of https://github.com/impiyush/DataScienceSpeci…
seankross Jun 9, 2015
0f69e8b
clean up eda
seankross Jun 9, 2015
3e48af4
Merge branch 'impiyush-master'
seankross Jun 9, 2015
54dc7a8
Update curated.md
larspijnappel Jun 17, 2015
c3a6f2f
Merge pull request #93 from larspijnappel/patch-1
seankross Jun 17, 2015
e8ae917
add link to Shiny simulation tutorial
homerhanumat Jul 9, 2015
146042f
Merge pull request #94 from homerhanumat/homerhanumat-patch-1
seankross Jul 9, 2015
ca0fa60
Update toolbox.md
msoltysik Jul 11, 2015
ca48b7e
added CL to curated
seankross Jul 13, 2015
bde84c0
Merge branch 'msoltysik-master'
seankross Jul 13, 2015
b62cea8
add link to Hadley online book: R Packages
paternogbc Aug 6, 2015
94ddb51
Merge pull request #97 from paternogbc/patch-1
seankross Aug 6, 2015
03187ea
Remove add from Other Resources page
jelford Aug 9, 2015
a12de1c
Merge pull request #98 from jelford/master
seankross Aug 9, 2015
3e1ae39
Update ddp.md
seankross Aug 13, 2015
8bc14a8
Update other.md
flaviobarros Aug 24, 2015
1243a0a
Update other.md
flaviobarros Aug 24, 2015
2c99cc6
moved from other the ddp
seankross Aug 25, 2015
03f451a
Merge branch 'flaviobarros-master'
seankross Aug 25, 2015
89e112d
Added link to 'real world example' reading ACS 2000 PUMS data.
lgreski Sep 7, 2015
b52d1ab
Merge pull request #101 from lgreski/master
seankross Sep 7, 2015
7a38b36
Added David Hood advice for the course
thoughtfulbloke Sep 9, 2015
db97986
Merge pull request #102 from thoughtfulbloke/master
seankross Sep 9, 2015
8465fb8
Update ddp.md
seankross Oct 27, 2015
816edbe
Added link to http://reproducibleresearch.net/
carsten-j Oct 29, 2015
dd1ba39
Merge pull request #105 from carsten-j/master
seankross Oct 29, 2015
cf21437
Added an article containing step by step instructions for using Githu…
lgreski Dec 1, 2015
86e4e43
Merge pull request #106 from lgreski/master
seankross Dec 1, 2015
21b4c2d
Added links for Configuring RStudio to work with Git / Github, Mac an…
lgreski Dec 2, 2015
f346b56
Merge pull request #107 from lgreski/master
seankross Dec 2, 2015
37ae85a
Added articles written to support students in R Programming -- strate…
lgreski Dec 2, 2015
874c7df
Merge pull request #108 from lgreski/master
seankross Dec 2, 2015
5d1cf2b
Update index.md
seankross Dec 12, 2015
be0a48a
Added two links: makeCacheMatrix as an Object, and S Objects, R Objec…
lgreski Dec 25, 2015
0763f5a
Merge pull request #110 from lgreski/master
seankross Dec 26, 2015
0f621d0
Update ddp.md
flaviobarros Dec 28, 2015
608e561
Update other.md
flaviobarros Dec 28, 2015
e131272
Merge pull request #111 from flaviobarros/patch-1
seankross Dec 28, 2015
1c15dad
fixed conflict
seankross Jan 1, 2016
3169146
Merge branch 'flaviobarros-master'
seankross Jan 1, 2016
2035298
Add link for Improving Runtime Performance of caret::train() article.
lgreski Jan 3, 2016
6f2e407
Merge remote-tracking branch 'upstream/master'
lgreski Jan 3, 2016
55d8537
Add two articles: Common Mistakes / overwriting R functions with data…
lgreski Jan 9, 2016
814f6ad
Add five articles related to statinf class.
lgreski Jan 9, 2016
3417573
Add article on configuring shinyapps.io application timeout.
lgreski Jan 9, 2016
c966671
Merge pull request #114 from lgreski/master
seankross Jan 9, 2016
1a9c230
add link to ProjectTemplate blog post
padamson Jan 18, 2016
d48e466
Merge pull request #115 from padamson/projecttemplate
seankross Jan 19, 2016
f5c7e09
added link to interactive CI repo
amcadie Feb 26, 2016
1eb1be9
fixed line break
amcadie Feb 26, 2016
08319fd
fixed conflict
seankross Feb 27, 2016
203da71
Merge branch 'amcadie-master'
seankross Feb 27, 2016
6f08d7b
Added Len Greski to list of community contributors.
lgreski Apr 24, 2016
e1da253
Added link to MiKTeX install walkthrough on Windows 10.
lgreski Apr 25, 2016
93b5aa0
fixed conflict
seankross Apr 25, 2016
d1ff42f
Merge branch 'lgreski-master'
seankross Apr 25, 2016
488e98c
Merge pull request #1 from DataScienceSpecialization/master
lgreski May 7, 2016
d6132ab
Add article describing "optimal" sample size relative to power calcul…
lgreski May 7, 2016
f21d1df
Merge pull request #118 from lgreski/master
seankross May 8, 2016
035ae68
Merge pull request #2 from DataScienceSpecialization/master
lgreski May 15, 2016
925de92
Add R Onboarding for SAS Users
lgreski May 15, 2016
c87f1ee
Merge pull request #119 from lgreski/master
seankross May 16, 2016
bf50b36
Add article on forms of the Extract Operator
lgreski May 30, 2016
18b7e95
Merge pull request #120 from lgreski/master
seankross May 30, 2016
0223ab1
Added article explaining use of binomial theorem in Combining Predict…
lgreski Jun 18, 2016
318b3c1
add 2 articles: breaking down pollutantmean, and a SAS version of pol…
lgreski Jun 18, 2016
ea32fb6
Updated other.md
Jun 20, 2016
febffa1
Merge pull request #121 from lgreski/master
seankross Jun 20, 2016
a859f53
Merge pull request #3 from DataScienceSpecialization/master
lgreski Jul 4, 2016
4a18de5
Add article on permutation tests.
lgreski Jul 4, 2016
2810cd9
Merge pull request #122 from lgreski/master
seankross Jul 9, 2016
53c1f9e
Add url for Demystifying makeVector() article.
lgreski Aug 10, 2016
79c1e5c
Merge pull request #123 from lgreski/master
seankross Aug 11, 2016
3c73cb4
Add article illustrating how to use R to download lecture videos.
lgreski Aug 21, 2016
1b42d0c
Merge pull request #124 from lgreski/master
seankross Aug 22, 2016
370ac7d
Merge pull request #1 from voshchevoz/voshchevoz-patch-proxy
Nov 11, 2016
bff339e
Merge pull request #125 from voshchevoz/master
seankross Nov 11, 2016
9308f29
Update broken link #126
Nov 29, 2016
31283db
Merge pull request #127 from Devinsuit/master
seankross Nov 29, 2016
a1c0e1c
Merge pull request #4 from DataScienceSpecialization/master
lgreski Jan 8, 2017
9cbaa23
Added a "getting started" section, added DSS value proposition articl…
lgreski Jan 8, 2017
6738b9c
Merge pull request #128 from lgreski/master
seankross Jan 9, 2017
39a027e
Fixed broken link
MMohey Apr 18, 2017
451285b
Merge pull request #1 from MMohey/MMohey-patch-1
MMohey Apr 18, 2017
24707fe
Merge pull request #129 from MMohey/master
seankross Apr 18, 2017
579546e
Merge pull request #5 from DataScienceSpecialization/master
lgreski May 20, 2017
5db3cfd
Add articles related to R programming course
lgreski May 20, 2017
4908a93
Merge branch 'master' of https://github.com/lgreski/DataScienceSpecia…
lgreski May 20, 2017
497330b
add articles
lgreski May 20, 2017
51c9e76
Add capstone page to index, and content for capstone page
lgreski May 20, 2017
4dc95e6
Added shiny choropleth app
amsilvr May 23, 2017
8453199
Merge pull request #1 from amsilvr/amsilvr-patch-1
amsilvr May 23, 2017
452a587
Update ddp.md
amsilvr May 23, 2017
4cd7fc3
Add capstone page.
lgreski May 23, 2017
cf0e2fe
Add articles to capstone page
lgreski May 27, 2017
1e2c829
Merge pull request #130 from lgreski/master
seankross May 27, 2017
7263dd1
Merge pull request #131 from amsilvr/master
seankross May 27, 2017
ff0aaf4
Add article explaining why one cannot calculate the area under a spec…
lgreski Aug 5, 2017
a276511
Merge pull request #6 from DataScienceSpecialization/master
lgreski Aug 5, 2017
3c72981
Merge pull request #133 from lgreski/master
seankross Aug 7, 2017
eb0e024
Add missing dash in bullet list
lgreski Aug 19, 2017
5bd2fff
Merge pull request #134 from lgreski/master
seankross Aug 21, 2017
253d615
added link to my pdf file
DocOfi Jan 21, 2018
5549c14
Added a link to my presentation in Rpubs
DocOfi Jan 21, 2018
6fb62a8
Merge pull request #137 from DocOfi/master
seankross Jan 21, 2018
5a351e3
added name and link in about.md
DocOfi Jan 27, 2018
64aba82
added link to metricsgraphics tutorial
DocOfi Jan 27, 2018
83c8abc
Merge pull request #138 from DocOfi/master
seankross Jan 29, 2018
3a05c60
adding a leaflet plot example
DocOfi Mar 23, 2018
aa0b465
Merge pull request #139 from DocOfi/master
seankross Mar 26, 2018
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 2 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
@@ -1,2 +1,4 @@
_site
.DS_Store
.Rhistory
.Rproj.user
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,7 +4,7 @@ Since the beginning of the Data Science Specialization we've noticed the unbelie

## Contributing

If you've created a web page, video, sideshow, or any other kind of media you think should be shared through this directory you should:
If you've created a web page, video, slideshow, or any other kind of media you think should be shared through this directory you should:

1. Fork this repository.
2. Add a link to your content on the appropriate course page.
Expand Down
6 changes: 5 additions & 1 deletion about.md
Original file line number Diff line number Diff line change
Expand Up @@ -19,6 +19,10 @@ The [Data Science Specialization](https://www.coursera.org/specialization/jhudat
- [Kevin Markham](http://www.dataschool.io/)
- Derek Franks
- David Hood
- [Leonard Greski](https://github.com/lgreski)
- Michael Sachs
- Allan Inocêncio de Souza Costa
- [stepds](https://github.com/stepds)
- [stepds](https://github.com/stepds)
- Bastiaan Quast
- [Xing Su](http://sux13.github.io/DataScienceSpCourseNotes/)
- [Edmund julian Ofilada](https://github.com/DocOfi)
14 changes: 14 additions & 0 deletions capstone.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,14 @@
---
title: "Capstone"
permalink: /capstone/
layout: page
---
## Reference Material

- [Speech and Language Processing, 3rd Edition](https://web.stanford.edu/~jurafsky/slp3/) Working version of Jurafsky, et. al. book on natural language processing whose content on n-grams is helpful for the capstone.

## Course Project

- [n-gram Computations and Computer Capacity](http://bit.ly/2couvxh) Explains the amount of memory required to convert the text files for the course project into n-grams, using the <strong>quanteda</strong> package.
- [Capstone Strategy](http://bit.ly/2rGcgc6) Describes a general strategy to get through the Capstone: use the simplest approaches possible.
- [Choosing a Text Analysis Package](http://bit.ly/2qagsPa) Reviews pros and cons of various R packages used for natural language processing, in the context of requirements for the Capstone project.
81 changes: 80 additions & 1 deletion curated.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,85 @@
---
layout: page
title: Curated Knowledge
title: Curated Pages
permalink: /curated/
---

### Analytics

- [Huge Trello Board Collection of Data Science Resources](https://trello.com/b/rbpEfMld/data-science)
- [Diving Into Data Science Flipboard](https://flipboard.com/@thiakx/diving-into-data-science-5823ectuy)
- [OLAP Operation in R](http://architects.dzone.com/articles/olap-operation-r)
- [Journal of Statistical Software: Tidy data](http://www.jstatsoft.org/v59/i10/paper)
- [Verzani: simpleR – Using R for Introductory Statistics](http://cran.r-project.org/doc/contrib/Verzani-SimpleR.pdf)
- [Data Visualization packages](http://www.datavis.ca/R/)
- [Visualization hints: plotting numeric data by groups](http://www.r-bloggers.com/visualization-series-insight-from-cleveland-and-tufte-on-plotting-numeric-data-by-groups/)
- [Matrix rotation for image and contour plots in R](http://blog.snap.uaf.edu/2012/06/08/matrix-rotation-for-image-and-contour-plots-in-r/)
- [Fig Data: 11 Tips on How to Handle Big Data in R (and 1 Bad Pun)](http://theodi.org/blog/fig-data-11-tips-how-handle-big-data-r-and-1-bad-pun)
- [Data from 538](https://github.com/fivethirtyeight/data)

### Command Line

- [explainshell.com - match command-line arguments to their help text](http://explainshell.com/)
- [The Command Line Crash Course - Quick course in using the command line](http://cli.learncodethehardway.org/book/)
- [Mastering the command line, in one page](https://github.com/jlevy/the-art-of-command-line/blob/master/README.md)

### R

- [Try R](http://tryr.codeschool.com/)
- [The R Book by Michael J. Crawley](https://archive.org/details/TheRBook/)
- [Univ. of Calif. Riverside R Programming](http://manuals.bioinformatics.ucr.edu/home/programming-in-r#TOC-R-Basics)
- [G. Sanchez - Strings in R](http://gastonsanchez.com/Handling_and_Processing_Strings_in_R.pdf)
- [The Lubridate Package](http://www.jstatsoft.org/v40/i03/paper)
- [Google Developers R Programming Video Lectures](http://www.r-bloggers.com/google-developers-r-programming-video-lectures/)
- [awesome R](https://github.com/qinwf/awesome-R) - A curated list of awesome R frameworks, packages and software.
- [awesome machine learning](https://github.com/josephmisiti/awesome-machine-learning#r) - A curated list of awesome Machine Learning frameworks, libraries and software.
- [Google's R Style Guide](https://google-styleguide.googlecode.com/svn/trunk/Rguide.xml)
- [Tufte-style HTML in rmarkdown](http://sachsmc.github.io/tufterhandout/)
- [Creating an R Package](http://hilaryparker.com/2014/04/29/writing-an-r-package-from-scratch/)
- [R Packages (Hadley online book)](http://r-pkgs.had.co.nz/) - How to write your own R packages.
- [Beautiful ggplot2 Cheatsheet](http://zevross.com/blog/2014/08/04/beautiful-plotting-in-r-a-ggplot2-cheatsheet-3/)
- [Intro to Graphics](http://bcb.dfci.harvard.edu/~aedin/courses/Bioconductor/2.Plotting.pdf)
- [data.table cheat sheet](https://s3.amazonaws.com/assets.datacamp.com/img/blog/data+table+cheat+sheet.pdf)
- [Exploratory Data Analysis with data.table](http://varianceexplained.org/RData/lessons/lesson4/)
- [Fast summary statistics in R with data.table](http://blog.yhathq.com/posts/fast-summary-statistics-with-data-dot-table.html)
- [R online in r-fiddle.org](http://www.r-fiddle.org/)

### Probability and Statistics

- [Probability and Statistics Cookbook](http://matthias.vallentin.net/probability-and-statistics-cookbook/)

### GitHub

- [Official Git Tutorial](http://git-scm.com/docs/gittutorial)
- [Git - Simple Guide](http://rogerdudler.github.io/git-guide/)
- [Git Immersion - A guided tour through the fundamentals of Git](http://gitimmersion.com/)
- [GitHub - Dealing with Multiple Accounts](http://hmkcode.com/git-tutorial/how-to-deal-with-multiple-github-accounts-on-one-computer/)
- [Try Git](https://try.github.io/levels/1/challenges/1)
- [Learn Git Branching: Interactive Game](http://pcottle.github.com/learnGitBranching/)
- [Atlassian Git Tutorials - Branches](https://www.atlassian.com/git/tutorials/using-branches/)

### Reproducible Research
- [Markdown live demo](http://markdown-here.com/livedemo.html)
- [Boosting Slides by Ron Meir](https://github.com/Aratinga/Misc/blob/master/BoostingTutorial.pdf)
- [Reproducible Research website](http://reproducibleresearch.net/)

### Machine Learning
- [UC Irvine Machine Learning Data Repository](http://archive.ics.uci.edu/ml/)

### Textbooks
- [OpenIntro textbook](https://www.openintro.org/stat/textbook.php)
- [Statlect - The digital textbook on probability and statistics](http://www.statlect.com/)
- [An Introduction to Statistical Learning with Applications in R](http://www-bcf.usc.edu/~gareth/ISL/) [[PDF, 4th printing]](http://www-bcf.usc.edu/~gareth/ISL/ISLR%20Fourth%20Printing.pdf)
- [The Elements of Statistical Learning: Data Mining, Inference, and Prediction](http://statweb.stanford.edu/~tibs/ElemStatLearn/) [[PDF, 10th ed]](http://statweb.stanford.edu/~tibs/ElemStatLearn/printings/ESLII_print10.pdf)

### Further Reading

- [Data Elixir - Free weekly newsletter of the best data-related resources and inspirations from around the web.](http://dataelixir.com/?referred=true)
- [Linkedin - Top 10 Big Data and Analytics References](https://www.linkedin.com/pulse/article/20140810194033-111366377-top-10-big-data-and-analytics-references)
- [Linkedin - Let's Get Nerdy: Data Analytics for Business Leaders Explained](https://www.linkedin.com/pulse/article/20140918162814-111366377-let-s-get-nerdy-data-analytics-for-business-leaders-explained)
- [Data Science Central : a great repository of news and resources for data science practitioners.](http://www.datasciencecentral.com)
- [Data Science Ontology - A visualized overview of Data Science concepts and tools](http://datascienceontology.com/)

### Data Science Groups, Meetups, and Networking

- [LinkedIn Data Science Specialisation Group](https://www.linkedin.com/groups/Coursera-Specialization-Data-Science-7495000?home=&gid=7495000&trk=anet_ug_hm&goback=%2Egmp_7495000)
18 changes: 18 additions & 0 deletions ddp.md
Original file line number Diff line number Diff line change
Expand Up @@ -5,3 +5,21 @@ permalink: /ddp/
---

- [Slidify to Github walkthrough](http://rpubs.com/thoughtfulbloke/25103)
- [ggvis and rmarkdown slides with interactive plots](http://qua.st/ggvis-shiny-html5-slides)

## Shiny
- Choropleth of PBS WARN Distribution of Wireless Emergency Alerts
- [Code for Shiny App](https://github.com/amsilvr/shiny_choropleth)
- [App running on shinyapps.ip](https://silverman.shinyapps.io/warn_wea/)
- [Shiny app to simulate 401K growth with interactive plots](http://www.mephistosoftware.com/shiny/401k_simulator/)
- [Shiny Video Tutorials Playlist on Youtube](http://www.youtube.com/playlist?list=PL6wLL_RojB5xNOhe2OTSd-DPkMLVY9DfB)
- [Tutorial on writing Shiny simulation apps](https://github.com/homerhanumat/shinyTutorials)
- [Dockerize a Shiny App](http://www.rmining.net/2015/04/30/dockerizing-a-shiny-app/)
- [Git pushing Shiny Apps with Docker/Dokku](http://www.rmining.net/2015/05/11/git-pushing-shiny-apps-with-docker-dokku/)
- [Share your Shiny Apps with Docker and Kitematic](http://www.rmining.net/2015/08/10/share-your-shiny-apps-with-docker-and-kitematic/)
- [Shinyapps.io: Configuring Application Timeout](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/dataProd-shinyTimeoutConfig.md)
- [Plotting Natural Disasters](http://www.rpubs.com/DocOfi/367052)

## Comprehensive Notes

- Complete notes for [Developing Data Products](http://sux13.github.io/DataScienceSpCourseNotes/)
10 changes: 10 additions & 0 deletions eda.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,3 +4,13 @@ title: Exploratory Data Analysis
permalink: /eda/
---

- [Creating a Kite Graph](http://rpubs.com/thoughtfulbloke/kitegraph)
- [Analyzing Top/Green500 Supercomputer Technology Trends](http://github.com/ww44ss/Exascalar-Analysis-)
- [Emissions Choropleth Maps](https://github.com/BillSeliger/ExData_Plotting2)
- [Data Analysis using Twitter API and Python](http://blog.impiyush.com/2015/03/data-analysis-using-twitter-api-and.html)
- [Exploratory Data Analysis using Flexdashboard](http://rpubs.com/DocOfi/350830)
- [Plotting using Metricsgraphics](http://www.rpubs.com/DocOfi/352947)

## Comprehensive Notes

- Complete notes for [Exploratory Data Analysis](http://sux13.github.io/DataScienceSpCourseNotes/)
19 changes: 19 additions & 0 deletions getclean.md
Original file line number Diff line number Diff line change
Expand Up @@ -6,3 +6,22 @@ permalink: /getclean/

- [Subsetting example walkthrough](http://rpubs.com/thoughtfulbloke/subset)
- [Apples to Oranges Data Organisation Challenge](https://github.com/thoughtfulbloke/faoexample)
- [dplyr introductory tutorial](https://www.youtube.com/watch?v=jWjqLW-u3hc) and [R Markdown document](http://rpubs.com/justmarkham/dplyr-tutorial): A 39-minute video tutorial that covers the five basic dplyr "verbs" and a dozen other dplyr functions. dplyr is an [update](http://blog.rstudio.org/2014/01/17/introducing-dplyr/) to the plyr package, useful for subsetting, sorting, summarizing, and merging data using a more intuitive syntax than plyr or base R.
- [dplyr "going deeper" tutorial](https://www.youtube.com/watch?v=2mh1PqfsXVI) and [R Markdown document](http://rpubs.com/justmarkham/dplyr-tutorial-part-2): A 37-minute video tutorial that covers the new functionality in dplyr versions 0.3 and 0.4.
- [Downloading files general advice](http://rpubs.com/thoughtfulbloke/downloadtips)
- [Codebook sample](https://gist.github.com/kirstenfrank/218c36a1938055d0f4e4)
- [Second Codebook sample](https://gist.github.com/kirstenfrank/699abe3e16fd1dc36e5d)
- [Query string (and other fields-within-fields) unrolling](http://rpubs.com/schnee/32988)
- [Pre-processing Excel files before loading them into R](https://github.com/alkashef/cleaningexceldata)
- [Codebook template that can be used in the Getting and Cleaning Data project](https://gist.github.com/JorisSchut/dbc1fc0402f28cad9b41)
- ["Real world" example - reading American Community Survey 2000 PUMS Data:](https://github.com/lgreski/acsexample) Demonstrates how to extract records of a given type from a data file containing multiple record types, and how to use an Excel-based code book to specify arguments for reading a fixed-width file.
- [18 Months of CTA advice](https://thoughtfulbloke.wordpress.com/2015/08/31/hello-world)
- [Common Problems: Quiz 1 - Missing Java Runtime](http://bit.ly/2jjtyXM) Explains how to solve the problem of a missing Java Runtime for the question that requires students to process a Microsoft Excel spreadsheet.
- [Strategy for Reading Files & APIs / Quiz 2](http://bit.ly/2e4L5oF)
- [Common Problems: Quiz 2 - sqldf() driver fails to connect](http://bit.ly/2kD2KTY)
- [Tutorial: Downloading Files](http://bit.ly/2iP2suj) Illustrates various ways of downloading files, including binary and text files.
- [Creating dataframes from xml data](https://www.dropbox.com/s/7bbzzp4bwsmfl5y/CreatingDataframesfrom%20XmlFiles.odt?dl=0)

## Comprehensive Notes

- Complete notes for [Getting and Cleaning Data](http://sux13.github.io/DataScienceSpCourseNotes/)
5 changes: 3 additions & 2 deletions index.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,7 +4,7 @@ layout: page

## Table of Contents

This is site is meant to serve as a directory for the amazing content the
This site is meant to serve as a directory for the amazing content the
community has created around the Data Science Specialization. If you are
interested in contributing [click here](https://github.com/DataScienceSpecialization/DataScienceSpecialization.github.io#contributing).

Expand All @@ -17,6 +17,7 @@ interested in contributing [click here](https://github.com/DataScienceSpecializa
7. [Regression Models](/regmod/)
8. [Practical Machine Learning](/pml/)
9. [Developing Data Products](/ddp/)
10. [Capstone](/capstone/)

- [Other Resources](/other/)
- [Curated Knowledge](/curated/)
- [Curated Pages](/curated/)
25 changes: 24 additions & 1 deletion other.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,7 +4,30 @@ title: Other Resources
permalink: /other/
---

## Troubleshooting
## Configuring R and RStudio (Linux)

- [Installing xlsx and XML packages on Debian Wheezy](http://allanino.me/blog/programming/installing-some-r-packages/)
- [Rscript to customize R environment](http://bit.ly/r-customize-script) - Installs packages used in the specialization.
- [Installing Some Basic R Packages in Ubuntu; Ibrahim El Merehbi](http://elmerehbi.wordpress.com/2014/09/09/installing-some-basic-r-packages-in-ubuntu)
- [Using Projects in RStudio](https://support.rstudio.com/hc/en-us/articles/200526207-Using-Projects)
- [Using Version Control with RStudio](https://support.rstudio.com/hc/en-us/articles/200532077-Version-Control-with-Git-and-SVN)
- [Using R behind HTTP/HTTPS Proxy](https://support.rstudio.com/hc/en-us/articles/200488488-Configuring-R-to-Use-an-HTTP-or-HTTPS-Proxy)

### Ignoring R & RStudio files
- [gitignore template for R](https://github.com/github/gitignore/blob/master/R.gitignore) (source:[gitignore](https://github.com/github/gitignore))
- [Github Help - Using Git / Ignoring files](https://help.github.com/articles/ignoring-files/)

## Troubleshooting
- [Windows batch file to work around RStudio startup issues](https://github.com/stepds/contrib-DataScienceSpecialization/blob/master/README.md)

## Pre-built virtual machines for R development.
- [Here's a pre-built lightweight Linux machine with R and RStudio already installed](https://github.com/queirozfcom/r-box). You just need to install [vagrant](https://www.vagrantup.com/downloads.html), download (or clone) the github repository and you'll get a clean ubuntu machine with the tools you'll need for the assignments.

- [Data Science Toolbox](http://datasciencetoolbox.org/) - A virtual environment that allows you to start doing data science in a matter of minutes.

- [Virtual machine with RStudio server and github setup](https://github.com/tboloo/vagrant-rstudio) - A VirtualBox, Vagrant & chef-solo managed virtual machine which provides RStudio server with git & github setup

## Deploying and sharing Shiny Apps with Docker
- [Dockerize a Shiny App](http://www.rmining.net/2015/04/30/dockerizing-a-shiny-app/)
- [Git pushing Shiny Apps with Docker/Dokku](http://www.rmining.net/2015/05/11/git-pushing-shiny-apps-with-docker-dokku/)
- [Share your Shiny Apps with Docker and Kitematic](http://www.rmining.net/2015/08/10/share-your-shiny-apps-with-docker-and-kitematic/)
28 changes: 28 additions & 0 deletions pml.md
Original file line number Diff line number Diff line change
Expand Up @@ -7,3 +7,31 @@ permalink: /pml/
## Model Evaluation

- [Simple Guide to Confusion Matrix Terminology (sensitivity, specificity, etc.)](http://www.dataschool.io/simple-guide-to-confusion-matrix-terminology/)
- ROC curves and Area Under the Curve explained: [video tutorial](http://youtu.be/OAl6eAyP-yo), [companion blog post](http://www.dataschool.io/roc-curves-and-auc-explained/) (with video transcript and screenshots)

## Supplementary Videos

- [What is machine learning, and how does it work?](https://www.youtube.com/watch?v=elojMnjn4kk): A high-level overview of machine learning in a 10-minute video
- [Video lectures from "An Introduction to Statistical Learning"](http://www.dataschool.io/15-hours-of-expert-machine-learning-videos/): Videos for Chapters 4, 5, 6, 8, and 10 can help to deepen your understanding of the topics presented in this course.

## Machine Learning Competitions

- [Participating in Kaggle's Allstate Purchase Prediction Challenge](http://www.dataschool.io/kaggle-allstate-purchase-prediction-challenge/): Description of what it's like to compete in a Kaggle competition, including links to a project paper, R code, presentation slides, and a presentation video.

## Choosing a Machine Learning Model

- [Comparing Supervised Learning Algorithms](http://www.dataschool.io/comparing-supervised-learning-algorithms/): Comparing 8 common supervised learning algorithms (for regression and classification) on 13 different dimensions.

## Content Related to the Lectures

- Complete notes for [Practical Machine Learning](http://sux13.github.io/DataScienceSpCourseNotes/)
- [Week 4: Combining Predictors -- Math Explained](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/pml-combiningPredictorsBinomial.md)

## Configuring Github Pages with RStudio for PML Project

- Step by step instructions to [Configure Github Pages with RStudio](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/pml-ghPagesSetup.md) to support the PML course project.

## Improving Runtime Performance of Caret

- Step by step instructions to [implement parallel processing in caret::train()](https://github.com/lgreski/datasciencectacontent/blob/master/markdown/pml-randomForestPerformance.md) on a random forest model, along with runtime performance analysis for a variety of laptops, ranging from an Intel Atom-based tablet to a quad-core i7 processor.

7 changes: 7 additions & 0 deletions regmod.md
Original file line number Diff line number Diff line change
Expand Up @@ -4,3 +4,10 @@ title: Regression Models
permalink: /regmod/
---

## Supplementary Videos

- [Video lectures from "An Introduction to Statistical Learning"](http://www.dataschool.io/15-hours-of-expert-machine-learning-videos/): Videos for Chapter 3 can help to deepen your understanding of regression.

## Comprehensive Notes

- Complete notes for [Regression Models](http://sux13.github.io/DataScienceSpCourseNotes/)
7 changes: 7 additions & 0 deletions repres.md
Original file line number Diff line number Diff line change
Expand Up @@ -6,4 +6,11 @@ permalink: /repres/

- [Turning a RPubs document into a Github website walkthrough](https://github.com/thoughtfulbloke/appleorange)
- [Introduction to knitr with rmarkdown](https://sachsmc.github.io/knit-git-markr-guide/knitr/knit.html)
- [Trends and severity of Data Breaches](http://rpubs.com/ww44ss/29389)
- [Benefit-cost analysis of a park user fee](https://rstudio-pubs-static.s3.amazonaws.com/72135_dc45211d976842c2a9a8c8b5f2472ff0.html)
- [Data Lake Integrity](http://rpubs.com/rshane/81297)
- [ProjectTemplate in RStudio with Git](http://padamson.github.io/r/rstudio/projecttemplate/git/2016/01/17/projecttemplate-in-rstudio-with-git.html)

## Comprehensive Notes

- Complete notes for [Reproducible Research](http://sux13.github.io/DataScienceSpCourseNotes/)
Loading