Split data.frame into groups by column name

Tags:

I'm new to R. I have a data frame with column names of such type:

file_001   file_002   block_001   block_002   red_001   red_002 ....etc'  
  0.05       0.2        0.4         0.006       0.05       0.3
  0.01       0.87       0.56        0.4         0.12       0.06

I want to split them into groups by the column name, to get a result like this:

group_file
file_001   file_002
  0.05       0.2
  0.01       0.87

group_block
block_001   block_002
  0.4        0.006
  0.56       0.4

group_red
red_001    red_002
  0.05       0.3
  0.12       0.06

...etc'

My file is huge. I don't have a certain number of groups. It needs to be just by the column name's start.

266

asked Nov 14 '17 14:11

Keity

1 Answers

In base R, you can use sub and split.default like this to return a list of data.frames:

myDfList <- split.default(dat, sub("_\\d+", "", names(dat)))

this returns

myDfList
$block
  block_001 block_002
1      0.40     0.006
2      0.56     0.400

$file
  file_001 file_002
1     0.05     0.20
2     0.01     0.87

$red
  red_001 red_002
1    0.05    0.30
2    0.12    0.06

split.default will split data.frames by variable according to its second argument. Here, we use sub and the regular expression "_\d+" to remove the underscore and all numeric values following it in order to return the splitting values "block", "file", and "red".

As a side note, it is typically a good idea to keep these data.frames in a list and work with them through functions like lapply. See gregor's answer to this post for some motivating examples.

166

answered Oct 19 '22 04:10

lmo

Related questions
                            
                                Creating a function with a FUN input in r
                            
                                Remove rows from data frame using row indices where row indices might be zero length vector
                            
                                Using ggplot's sec.axis with a non-monotonic transformation
                            
                                Merge two lists of dataframes
                            
                                Save Rmarkdown's report tables and figures to file
                            
                                Unicode character in R-package Authors Name
                            
                                checking if word exist in english dictionary r
                            
                                Turning an igraph.vs into a data frame
                            
                                Multiple leaflets in a grid
                            
                                Possible to combine DT, formattable and shiny?
                            
                                Labels in italics for x axis
                            
                                How to give color to a given interval of rows of a DT table?
                            
                                Fill in gaps (e.g. not single cells) of NA values in raster using a neighborhood analysis
                            
                                Applying a gradient fill on a density plot in ggplot2
                            
                                Suppress Messages from zip in R
                            
                                map a vector of characters to lm formula in r
                            
                                Reactive CSS properties in R Shiny
                            
                                Automatically stack every nth column of a dataframe
                            
                                convert matrix to numeric data frame
                            
                                What does the error "the condition has length > 1 and only the first element will be used" mean? [duplicate]

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Split data.frame into groups by column name

Tags:

dataframe

r

strsplit

Keity

People also ask

1 Answers

lmo

Recent Activity

Donate For Us