Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

How to Plot "Reverse" Cumulative Frequency Graph With ECDF

Tags:

r

statistics

I have no problem plotting the following cumulative frequency graph plot like this.

     library(Hmisc)
     pre.test <- rnorm(100,50,10)
     post.test <- rnorm(100,55,10)
     x <- c(pre.test, post.test)
     g <- c(rep('Pre',length(pre.test)),rep('Post',length(post.test)))
     Ecdf(x, group=g, what="f", xlab='Test Results', label.curves=list(keys=1:2))

But I want to show the graph in forms of the "reverse" cumulative frequency of values > x. (i.e. something equivalent to what="1-f").

Is there a way to do it?

Other suggestions in R other than using Hmisc are also very much welcomed.

like image 754
neversaint Avatar asked Dec 12 '22 23:12

neversaint


2 Answers

Using Musa suggestion:

pre.ecdf <- ecdf(pre.test)
post.ecdf <- ecdf(post.test)

r <- range(pre.test,post.test)
curve(1-pre.ecdf(x), from=r[1], to=r[2], col="red", xlim=r)
curve(1-post.ecdf(x), from=r[1], to=r[2], col="blue", add=TRUE)

Proportions

You could set some parameters like title, legend, etc.

If you want frequency instead of proportion simple solution will be:

pre.ecdf <- ecdf(pre.test)
post.ecdf <- ecdf(post.test)

rx <- range(pre.test,post.test)
ry <- max(length(pre.test),length(post.test))
curve(length(pre.test)*(1-pre.ecdf(x)), from=rx[1], to=rx[2], col="red", xlim=rx, ylim=c(0,ry))
curve(length(post.test)*(1-post.ecdf(x)), from=rx[1], to=rx[2], col="blue", add=TRUE)

Frequencies

like image 151
Marek Avatar answered Jan 05 '23 12:01

Marek


The more general Ecdf function from the Hmisc has a what= option for that:

Arguments:

   x: a numeric vector, data frame, or Trellis/Lattice formula

what: The default is ‘"F"’ which results in plotting the fraction
      of values <= x.  Set to ‘"1-F"’ to plot the fraction > x or
      ‘"f"’ to plot the cumulative frequency of values <= x.

So with that we can modify the answer from your earlier question and add what="1-F":

 # Example showing how to draw multiple ECDFs from paired data
 pre.test <- rnorm(100,50,10)
 post.test <- rnorm(100,55,10)
 x <- c(pre.test, post.test)
 g <- c(rep('Pre',length(pre.test)),rep('Post',length(post.test)))
 Ecdf(x, group=g, what="1-F", xlab='Test Results', label.curves=list(keys=1:2))
like image 35
Dirk Eddelbuettel Avatar answered Jan 05 '23 10:01

Dirk Eddelbuettel