I want to apply statistics to the columns of a dataframe in an iterated fashion: columns number 1: 'A' represents the tags that I want to discriminate: <pre class="prettyprint"><code>for (i in names(dataframe)) { i <- as.name(i) group1 <- i[A=="locationX"] group2 <- i[A!="locationX"] p <- wilcox.test(group1,group2,na.action(na.omit))$p.value } </code></pre> however, the <code>as.name()</code> is to try to remove the inverted commas from the column names generated by <code>names(dataframe)</code>. Unfortunately it gives me the error: <blockquote> Error in i[A == "locationX"] : object of type 'symbol' is not subsettable </blockquote> I think <code>as.name()</code> is not the right way to do it. Any clues?

The only way this makes sense if for "A" to be a vector with multiple instances of "locationX" and multiple instance of the converse and for the length of "A" to be the same as the number of rows in "dataframe". If that is the case then something like this might work: <pre class="prettyprint"><code>p <- list() for (i in names(dataframe)) { # using as.names not needed and possibly harmful group1 <- dataframe[[i]][A == "locationX"] group2 <- dataframe[[i]][A != "locationX"] p[i] <- wilcox.test(group1,group2,na.action(na.omit))$p.value } </code></pre> Note that even if you did not get an error with your code that you would still have been overwriting the "p" every time through the loop.

R iterate over columns dataframe

Tags:

dataframe

r

I want to apply statistics to the columns of a dataframe in an iterated fashion:

columns number 1: 'A' represents the tags that I want to discriminate:

for (i in names(dataframe)) {
    i <- as.name(i)
    group1 <- i[A=="locationX"]
    group2 <- i[A!="locationX"]
    p <- wilcox.test(group1,group2,na.action(na.omit))$p.value
}

however, the as.name() is to try to remove the inverted commas from the column names generated by names(dataframe).

Unfortunately it gives me the error:

Error in i[A == "locationX"] : object of type 'symbol' is not subsettable

I think as.name() is not the right way to do it.

Any clues?

290

asked Jan 18 '12 00:01

user1155073

1 Answers

The only way this makes sense if for "A" to be a vector with multiple instances of "locationX" and multiple instance of the converse and for the length of "A" to be the same as the number of rows in "dataframe". If that is the case then something like this might work:

p <- list()
for (i in names(dataframe)) {
   # using as.names not needed and possibly harmful
    group1 <- dataframe[[i]][A == "locationX"]
    group2 <- dataframe[[i]][A != "locationX"]
    p[i] <- wilcox.test(group1,group2,na.action(na.omit))$p.value
}

Note that even if you did not get an error with your code that you would still have been overwriting the "p" every time through the loop.

answered Sep 21 '22 07:09

IRTFM

Related questions
                            
                                rpy2: Converting a data.frame to a numpy array
                            
                                ESS/AucTeX/Sweave integration
                            
                                How I can create a new ties.method with the R rank() function? [duplicate]
                            
                                odbcConnectExcel function from RODBC package for R not found on Ubuntu
                            
                                Apply over two data frames
                            
                                How to use acast (reshape2) within a function in R?
                            
                                best time date format for R [duplicate]
                            
                                Drawing maps without margins in R
                            
                                What is the second column of `str` report in R and what does `atomic` in this column mean?
                            
                                Remove NA when using "order"
                            
                                Regression evaluation in R
                            
                                Removing Unused Factors from a Facet in ggplot2
                            
                                Add statistical information to the bottom of a graph
                            
                                Finding list of positions in multidimensional structure (array)
                            
                                Storing specific XML node values with R's xmlEventParse
                            
                                rgdal package lat/long -> UTM
                            
                                R RODBC putting list of numbers into an IN() statement
                            
                                available.packages by publication date
                            
                                When running R, how to exit from Emacs-ESS gracefully?
                            
                                Suppress C warning messages in R

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With