Simplest way to get rbind to ignore column names

Tags:

rbind

This came up just in an answer to another question here. When you rbind two data frames, it matches columns by name rather than index, which can lead to unexpected behavior:

> df<-data.frame(x=1:2,y=3:4) > df   x y 1 1 3 2 2 4 > rbind(df,df[,2:1])   x y 1 1 3 2 2 4 3 1 3 4 2 4

Of course, there are workarounds. For example:

rbind(df,rename(df[,2:1],names(df))) data.frame(rbind(as.matrix(df),as.matrix(df[,2:1])))

On edit: rename from the plyr package doesn't actually work this way (although I thought I had it working when I originally wrote this...). The way to do this by renaming is to use SimonO101's solution:

rbind(df,setNames(df[,2:1],names(df)))

Also, maybe surprisingly,

data.frame(rbindlist(list(df,df[,2:1])))

works by index (and if we don't mind a data table, then it's pretty concise), so this is a difference between do.call(rbind).

The question is, what is the most concise way to rbind two data frames where the names don't match? I know this seems trivial, but this kind of thing can end up cluttering code. And I don't want to have to write a new function called rbindByIndex. Ideally it would be something like rbind(df,df[,2:1],byIndex=T).

443

asked Oct 10 '13 13:10

mrip

2 Answers

You might find setNames handy here...

rbind(df, setNames(rev(df), names(df))) #  x y #1 1 3 #2 2 4 #3 3 1 #4 4 2

I suspect your real use-case is somewhat more complex. You can of course reorder columns in the first argument of setNames as you wish, just use names(df) in the second argument, so that the names of the reordered columns match the original.

166

answered Sep 19 '22 13:09

Simon O'Hanlon

This seems pretty easy:

mapply(c,df,df[,2:1])      x y [1,] 1 3 [2,] 2 4 [3,] 3 1 [4,] 4 2

For this simple case, though, you have to turn it back into a dataframe (because mapply simplifies it to a matrix):

as.data.frame(mapply(c,df,df[,2:1]))   x y 1 1 3 2 2 4 3 3 1 4 4 2

Important note 1: There appears to be a downside of type coercion when your dataframe contains vectors of different types:

df<-data.frame(x=1:2,y=3:4,z=c('a','b')) mapply(c,df,df[,c(2:1,3)])      x y z [1,] 1 3 2 [2,] 2 4 1 [3,] 3 1 2 [4,] 4 2 1

Important note 2: It also is terrible if you have factors.

df<-data.frame(x=factor(1:2),y=factor(3:4)) mapply(c,df[,1:2],df[,2:1])      x y [1,] 1 1 [2,] 2 2 [3,] 1 1 [4,] 2 2

So, as long as you have all numeric data, it's okay.

answered Sep 23 '22 13:09

Thomas

Related questions
                            
                                Fill and border colour in geom_point (scale_colour_manual) in ggplot
                            
                                Grouped bar plot in ggplot
                            
                                How can I count runs in a sequence?
                            
                                Replace values in a dataframe based on lookup table
                            
                                heatmap with values (ggplot2)
                            
                                Put whisker ends on boxplot
                            
                                Aggregate multiple columns at once [duplicate]
                            
                                preallocate list in R
                            
                                Unimplemented type list when trying to write.table
                            
                                Parsing command line arguments in R scripts
                            
                                Invalid multibyte string in read.csv
                            
                                Replace characters from a column of a data frame R
                            
                                How to plot just the legends in ggplot2?
                            
                                R: Find the last dot in a string
                            
                                Save multiple ggplots using a for loop
                            
                                Create an ID (row number) column
                            
                                How do you convert dates/times from one time zone to another in R?
                            
                                Replace missing values (NA) with most recent non-NA by group
                            
                                the difference between doMC and doParallel in R
                            
                                Raise to power in R

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With