GitHub - sukalpomitra/AccelerometerDataAnalysis: Raw to tidy data using R code (http://archive.ics.uci.edu/ml/datasets/Human+Activity+Recognition+Using+Smartphones)

title	ReadMe
subtitle
author	Sukalpo Mitra
mode	selfcontained

The instruction list

Step 1 - Download the zip file from Coursera and extract it.
Step 2 - Copy the folder UCI HAR Dataset, with all its contents, into your R working directory.
Step 3 - Open R (version 3.1.2) and install the dplyr package by typing install.packages("dplyr").
Step 4 - Load the package by typing library(dplyr).
Step 5 - Source the run_analysis function by typing source(filelocation), where filelocation is the full path to run_analysis.R, including the file name. For example, if run_analysis.R is kept at the root of the C drive, filelocation should be "C:/run_analysis.R".
Step 6 - Run the run_analysis function by typing run_analysis().
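
The steps above can be typed at the R console roughly as follows. This is a minimal sketch rather than part of the repository: the path is the example from Step 5 and should be replaced with wherever run_analysis.R is actually saved, and the checks with requireNamespace and file.exists are additions so that a missing package or folder produces a readable message.

```r
# Example path from Step 5; adjust to where run_analysis.R is saved.
script_path <- "C:/run_analysis.R"

# Step 3: install dplyr only if it is not already available.
if (!requireNamespace("dplyr", quietly = TRUE)) {
  install.packages("dplyr")
}

# Step 4: load the package.
library(dplyr)

# Steps 5 and 6: source the script, then call the function it defines.
# The dataset folder must sit in the current working directory (see getwd()).
if (file.exists(script_path) && file.exists("UCI HAR Dataset")) {
  source(script_path)
  run_analysis()
} else {
  message("Check that run_analysis.R and the UCI HAR Dataset folder are in place.")
}
```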

Explanation of the code

Step 1 - The program checks whether the UCI HAR Dataset directory exists in the working directory and whether the files X_train.txt, y_train.txt and subject_train.txt exist under its train folder. It also checks whether the files X_test.txt, y_test.txt and subject_test.txt exist under its test folder. If any of these is missing, the program stops and shows a validation error message.
Step 2 - Once validation passes, all the files are read and the data are stored as data frames.
Step 3 - The data in y_train.txt is added as a new column to the data read from X_train.txt. The column is named activity.
Step 4 - The data in subject_train.txt is added as a new column to the data read from X_train.txt. The column is named subject.
Step 5 - The data in y_test.txt is added as a new column to the data read from X_test.txt. The column is named activity.
Step 6 - The data in subject_test.txt is added as a new column to the data read from X_test.txt. The column is named subject.
Step 7 - The two datasets read from X_test.txt and X_train.txt are then joined together into a single merged dataset.
Step 8 - The columns holding the mean and standard deviation of the measures are then extracted from the merged dataset.
Step 9 - The activity column is then transformed to a factor using the levels and labels from activity_labels.txt.
Step 10 - The columns of the merged dataset are then given meaningful names.
Step 11 - The merged dataset is then grouped by the Activity and Subject columns.
Step 12 - The grouped dataset is then summarized by calculating the average of each measure column, and the result is written to a text file called "tidydataset.txt" in the working directory.
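
The twelve steps can be illustrated end to end on a tiny made-up dataset. The sketch below is not the code in run_analysis.R. It assumes that "joined together" means stacking rows with rbind, that mean and standard deviation columns are picked by matching mean() and std() in the column names, and that "meaningful names" can be approximated by stripping punctuation. The three feature names, all numeric values, and the helper names (write_lines, read_set) are hypothetical. It writes to a temporary directory instead of the working directory so nothing is overwritten, and it uses across(), which needs dplyr 1.0 or later.

```r
library(dplyr)

# Build a toy copy of the folder layout in a temporary directory (made-up values).
root <- file.path(tempdir(), "UCI HAR Dataset")
dir.create(file.path(root, "train"), recursive = TRUE, showWarnings = FALSE)
dir.create(file.path(root, "test"), recursive = TRUE, showWarnings = FALSE)
write_lines <- function(lines, ...) writeLines(lines, file.path(root, ...))
write_lines(c("1 0.2 9.9", "3 0.4 9.9", "5 0.6 9.9"), "train", "X_train.txt")
write_lines(c("1", "1", "2"), "train", "y_train.txt")
write_lines(c("1", "1", "1"), "train", "subject_train.txt")
write_lines("7 0.8 9.9", "test", "X_test.txt")
write_lines("2", "test", "y_test.txt")
write_lines("2", "test", "subject_test.txt")
write_lines(c("1 WALKING", "2 SITTING"), "activity_labels.txt")

# Hypothetical column names for the three toy measures.
feature_names <- c("tBodyAcc-mean()-X", "tBodyAcc-std()-X", "tBodyAcc-max()-X")

# Step 1: stop with a validation message if any input file is absent.
required <- c(
  file.path(root, "train", c("X_train.txt", "y_train.txt", "subject_train.txt")),
  file.path(root, "test", c("X_test.txt", "y_test.txt", "subject_test.txt"))
)
absent <- required[!file.exists(required)]
if (length(absent) > 0) {
  stop("Missing input files: ", paste(absent, collapse = ", "))
}

# Steps 2 to 6: read one set and attach its activity and subject columns.
read_set <- function(set) {
  x <- read.table(file.path(root, set, paste0("X_", set, ".txt")),
                  col.names = feature_names, check.names = FALSE)
  x$activity <- read.table(file.path(root, set, paste0("y_", set, ".txt")))[[1]]
  x$subject <- read.table(file.path(root, set, paste0("subject_", set, ".txt")))[[1]]
  x
}

# Step 7: stack the train and test rows.
merged <- rbind(read_set("train"), read_set("test"))

# Step 8: keep only the mean() and std() measures plus the two id columns.
keep <- grepl("mean\\(\\)|std\\(\\)", names(merged))
merged <- merged[, keep | names(merged) %in% c("activity", "subject")]

# Step 9: turn the activity codes into a labelled factor.
labels <- read.table(file.path(root, "activity_labels.txt"),
                     col.names = c("id", "name"), stringsAsFactors = FALSE)
merged$activity <- factor(merged$activity, levels = labels$id, labels = labels$name)

# Step 10: tidy the column names, e.g. "tBodyAcc-mean()-X" becomes "tBodyAcc_mean_X".
names(merged) <- gsub("-", "_", gsub("\\(\\)", "", names(merged)))

# Steps 11 and 12: average every measure per activity and subject, then save.
tidy <- merged %>%
  group_by(activity, subject) %>%
  summarise(across(everything(), mean), .groups = "drop")
out_file <- file.path(tempdir(), "tidydataset.txt")
write.table(tidy, out_file, row.names = FALSE)
```

Calling print(tidy) afterwards shows one row per activity and subject pair, which is the shape described in Step 12.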
