##The purpose and the goal This repository is a result of coursera getdata-015 course project. The purpose of this project was to demonstrate ability to collect, work with, and clean a data set.
The goal was to prepare tidy data that can be used for later analysis.
##The repository contains
- A main tidy data set as described below
- An additional tidy data set as described below
run_analysis.R
script which performs necessary transformation of raw data necessary to get tidy data sets- A code book called CodeBook.md that describes the variables, the data, and any transformations and work that is neccessary to clean up the data.
- This README.md file with explanations of how all of the scripts work and how they are connected
getrawdata.R
script which contains code to obtain and unpack initial raw data
##About the area The course project is about one of the most exciting areas in all of data science right now - wearable computing. Recomended article
Companies like Fitbit, Nike, and Jawbone Up are racing to develop the most advanced algorithms to attract new users. The data linked to from the course website represent data collected from the accelerometers from the Samsung Galaxy S smartphone.
A full description is available at the site where the data was obtained
##Initial data
Link to the zip file with raw data for the project
##Actual tasks of the course project
Create one R script called run_analysis.R
that does the following.
- Merges the training and the test sets to create one data set.
- Extracts only the measurements on the mean and standard deviation for each measurement.
- Uses descriptive activity names to name the activities in the data set
- Appropriately labels the data set with descriptive variable names.
- From the data set in step 4, creates a second, independent tidy data set with the average of each variable for each activity and each subject.