How to compute the mean survival time

Tags: r, survival-analysis

I'm using the survival library. After computing the Kaplan-Meier estimator of a survival function:

km = survfit(Surv(time, flag) ~ 1)

I know how to compute percentiles:

quantile(km, probs = c(0.05,0.25,0.5,0.75,0.95))

But, how do I compute the mean survival time?


asked Apr 02 '17 20:04

pyon

1 Answer

Calculate Mean Survival Time

The mean survival time generally depends on the value chosen for the maximum survival time, because the Kaplan-Meier curve does not reach zero when the longest observation is censored. You can get the restricted mean survival time with print(km, print.rmean=TRUE). By default, this takes the upper limit to be the longest survival time in the data. You can set a different limit by adding an rmean argument (e.g., print(km, print.rmean=TRUE, rmean=250)).
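
To make the dependence on the upper limit concrete, here is a minimal sketch that computes the restricted mean by hand as the area under the Kaplan-Meier step function. It assumes the lung data set that ships with survival as a stand-in for your time and flag vectors; rmst_by_hand is a hypothetical helper, not part of the package, and it only handles a single curve.

```r
library(survival)

# Assumption: the bundled `lung` data stands in for the asker's data.
km <- survfit(Surv(time, status) ~ 1, data = lung)

# Hypothetical helper: area under the step function S(t) from 0 to tau.
# If tau exceeds the last observed time, the last S(t) value is carried
# forward to tau.
rmst_by_hand <- function(fit, tau) {
  keep  <- fit$time <= tau
  times <- c(0, fit$time[keep], tau)   # interval end points
  surv  <- c(1, fit$surv[keep])        # height of S(t) on each interval
  sum(diff(times) * surv)              # sum of rectangle areas
}

rmst_by_hand(km, tau = 250)
rmst_by_hand(km, tau = 500)
```

The two calls give different answers, which is why the upper limit has to be chosen deliberately.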

Extract Value of Mean Survival Time and Store in an Object

In response to your comment: I initially figured one could extract the mean survival time from the object returned by print(km, print.rmean=TRUE), but it turns out that print.survfit just prints text to the console rather than returning the computed values.

Instead, I looked through the code of print.survfit (you can see it by typing getAnywhere(print.survfit) in the console) to see where the mean survival time is calculated. It turns out that a function called survmean takes care of this, but it isn't exported, meaning R won't find it when you call it like a "normal" function. To access it, use the ::: operator as in the code below (note that you need to set rmean explicitly):

survival:::survmean(km, rmean=60)

You'll see that the function returns a list whose first element holds several named values, including the mean and the standard error of the mean. So, to extract the mean survival time, for example, you would do:

survival:::survmean(km, rmean=60)[[1]]["*rmean"]
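
Because survmean is internal, its return value is not guaranteed to be stable. As far as I know, the element is named "*rmean" in some versions of survival and "rmean" in others. The sketch below looks the name up instead of hard-coding it. It again assumes the bundled lung data as a stand-in for your own, and get_rmean is a hypothetical helper.

```r
library(survival)

# Assumption: the bundled `lung` data stands in for the asker's data.
km <- survfit(Surv(time, status) ~ 1, data = lung)

# Hypothetical helper: pull the restricted mean out of survmean()'s first
# element, whether it is a named vector (one curve) or a matrix (several).
get_rmean <- function(fit, tau) {
  tab <- survival:::survmean(fit, rmean = tau)[[1]]
  nms <- if (is.matrix(tab)) colnames(tab) else names(tab)
  idx <- grep("^\\*?rmean\\*?$", nms)   # matches "*rmean", "rmean", "rmean*"
  stopifnot(length(idx) == 1)
  if (is.matrix(tab)) tab[, idx] else tab[[idx]]
}

mean_surv <- get_rmean(km, tau = 60)
mean_surv
```

If you would rather not depend on an unexported function, summary(km, rmean=60)$table appears to expose the same values, but check this against your installed version.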

Details on How the Mean Survival Time is Calculated

The help for print.survfit provides details on the options and how the restricted mean is calculated:

?print.survfit

The mean and its variance are based on a truncated estimator. That is, if the last observation(s) is not a death, then the survival curve estimate does not go to zero and the mean is undefined. There are four possible approaches to resolve this, which are selected by the rmean option. The first is to set the upper limit to a constant, e.g., rmean=365. In this case the reported mean would be the expected number of days, out of the first 365, that would be experienced by each group. This is useful if interest focuses on a fixed period. Other options are "none" (no estimate), "common" and "individual". The "common" option uses the maximum time for all curves in the object as a common upper limit for the auc calculation. For the "individual" option the mean is computed as the area under each curve, over the range from 0 to the maximum observed time for that curve. Since the end point is random, values for different curves are not comparable and the printed standard errors are an underestimate as they do not take into account this random variation. This option is provided mainly for backwards compatibility, as this estimate was the default (only) one in earlier releases of the code. Note that SAS (as of version 9.3) uses the integral up to the last event time of each individual curve; we consider this the worst of the choices and do not provide an option for that calculation.
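
The options only differ when the object holds more than one curve. As a rough illustration, the sketch below fits two curves and prints the restricted mean under three of the settings from the help text. The grouping by sex in the bundled lung data is an assumption for demonstration and is not part of the original question.

```r
library(survival)

# Assumption: two curves (by sex) from the bundled `lung` data.
fit <- survfit(Surv(time, status) ~ sex, data = lung)

# Fixed upper limit: expected days survived out of the first 365.
print(fit, print.rmean = TRUE, rmean = 365)

# Common upper limit: the largest time across all curves in the object.
print(fit, print.rmean = TRUE, rmean = "common")

# Individual upper limits: each curve up to its own maximum time.
print(fit, print.rmean = TRUE, rmean = "individual")
```

The fixed and "common" limits give means that can be compared across groups. As the help text notes, the "individual" option does not.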


answered Oct 05 '22 17:10

eipi10
