Mean and Median Vs Summary

Tags: r, rstudio, knitr, mean

I'm currently doing a Reproducible Data course on Coursera, and one of the questions asks for the mean and median of steps per day. I have computed these, but when I check them with the summary function, the mean and median it reports are different. I'm running this via knitr.

Why would this be? Below is an edit showing all of my script so far, including a link to the raw data:

## Download the data

You have to change https to http to get this to work in knitr.

target_url <- "http://d396qusza40orc.cloudfront.net/repdata%2Fdata%2Factivity.zip"
target_localfile <- "ActivityMonitoringData.zip"
if (!file.exists(target_localfile)) {
  download.file(target_url, destfile = target_localfile)
}
Unzip the file into the "extract" directory

unzip(target_localfile, exdir="extract", overwrite=TRUE)
List the extracted files

list.files("./extract")
## [1] "activity.csv"
Load the extracted data into R

activity.csv <- read.csv("./extract/activity.csv", header = TRUE)
activity1 <- activity.csv[complete.cases(activity.csv),]
str(activity1)
## 'data.frame':    15264 obs. of  3 variables:
##  $ steps   : int  0 0 0 0 0 0 0 0 0 0 ...
##  $ date    : Factor w/ 61 levels "2012-10-01","2012-10-02",..: 2 2 2 2 2 2 2 2 2 2 ...
##  $ interval: int  0 5 10 15 20 25 30 35 40 45 ...
Use a histogram to view the number of steps taken each day

histData <- aggregate(steps ~ date, data = activity1, sum)
h <- hist(histData$steps,  # Save histogram as object
          breaks = 11,  # "Suggests" 11 bins
          freq = TRUE,  # Plot counts rather than densities
          col = "thistle1",
          main = "Histogram of Activity",
          xlab = "Number of daily steps")


Obtain the Mean and Median of the daily steps

steps <- histData$steps
mean(steps)
## [1] 10766
median(steps)
## [1] 10765
summary(histData$steps)
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##      41    8840   10800   10800   13300   21200
summary(steps)
##    Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
##      41    8840   10800   10800   13300   21200
sessionInfo()
## R version 3.1.1 (2014-07-10)
## Platform: i386-w64-mingw32/i386 (32-bit)
## 
## locale:
## [1] LC_COLLATE=English_Australia.1252  LC_CTYPE=English_Australia.1252   
## [3] LC_MONETARY=English_Australia.1252 LC_NUMERIC=C                      
## [5] LC_TIME=English_Australia.1252    
## 
## attached base packages:
## [1] stats     graphics  grDevices utils     datasets  methods   base     
## 
## other attached packages:
## [1] knitr_1.6
## 
## loaded via a namespace (and not attached):
## [1] evaluate_0.5.5 formatR_1.0    stringr_0.6.2  tools_3.1.1

asked Oct 14 '14 11:10

Chris

1 Answer

Actually, the answers are correct; you are just printing them with too few digits. The digits option is being set somewhere.

Put this at the top of your script:

options(digits=12)

And you'll have:

mean(steps)
# [1] 10766.1886792
median(steps)
# [1] 10765
summary(steps)
#      Min.    1st Qu.     Median       Mean    3rd Qu.       Max. 
#   41.0000  8841.0000 10765.0000 10766.1887 13294.0000 21194.0000

Note that summary uses max(3, getOption("digits") - 3) as the number of significant digits to print, so it still rounds a little (10766.1887 instead of 10766.1886792).
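
The rounding can be reproduced by hand. This is a minimal sketch, assuming summary() rounds each statistic to that number of significant digits, which is what the outputs in this thread suggest (newer R versions may apply the rounding only when printing). `summary_digits` is a hypothetical helper written for this illustration, not part of R, and 10766.1886792 is the mean printed above.

```r
# Mean of the daily step totals, copied from the output above
x <- 10766.1886792

# Hypothetical helper: significant digits summary() would use
# for a given value of the "digits" option
summary_digits <- function(d) max(3, d - 3)

summary_digits(4)   # 3
summary_digits(7)   # 4 (7 is R's default "digits")
summary_digits(12)  # 9

# Round the mean the same way; format() with digits = 12 is used
# so the display does not depend on the current session options
format(signif(x, summary_digits(4)),  digits = 12)  # "10800"
format(signif(x, summary_digits(7)),  digits = 12)  # "10770"
format(signif(x, summary_digits(12)), digits = 12)  # "10766.1887"
```

A digits value of 4 would be consistent with the 10800 shown in the question, although that value is an inference from the output rather than something the question states.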

answered Oct 07 '22 21:10

m0nhawk
