Sum columns row-wise with similar names

Tags:

I have a dataframe that has lots of columns that are something like this:

data <- data.frame (a.1 = 1:5, a.2b = 3:7, a.5 = 5:9, bt.16 = 4:8, bt.12342 = 7:11)

I'd like a result with columns that sum the variables that have the same prefix. In this example, I want to return a dataframe: a = (9:13), bt = (11:15)

My real data set is quite a bit more complicated (I want to combine page view counts for web pages with different utm parameters) but a solution for this case should put me on the right track.

762

asked Apr 16 '18 13:04

TSim

2 Answers

Here a solution with base R:

> prefixes = unique(sub("\\..*", "", colnames(data)))
> sapply(prefixes, function(x)rowSums(data[,startsWith(colnames(data), x)]))
      a bt
[1,]  9 11
[2,] 12 13
[3,] 15 15
[4,] 18 17
[5,] 21 19

166

answered Sep 29 '22 22:09

user1981275

You can try

library(tidyverse)
data.frame (a.1 = 1:5, a.2b = 3:7, a.5 = 5:9, bt.16 = 4:8, bt.12342 = 7:11) %>% 
  rownames_to_column() %>% 
  gather(k, v, -rowname) %>% 
  separate(k, letters[1:2]) %>% 
  group_by(rowname, a) %>% 
  summarise(Sum=sum(v)) %>% 
  spread(a, Sum)
#> # A tibble: 5 x 3
#> # Groups:   rowname [5]
#>   rowname     a    bt
#>   <chr>   <int> <int>
#> 1 1           9    11
#> 2 2          12    13
#> 3 3          15    15
#> 4 4          18    17
#> 5 5          21    19

Created on 2018-04-16 by the reprex package (v0.2.0).

You can also do:

data.frame (a.1 = 1:5, a.2b = 3:7, a.5 = 5:9, bt.16 = 4:8, bt.12342 = 7:11) %>% 
  rownames_to_column() %>% 
  pivot_longer(-1, names_to = c(".value", "set"), names_sep = "[.]") %>% 
  group_by(rowname) %>% 
  summarise(across(a:bt,sum, na.rm=T))
# A tibble: 5 x 3
  rowname     a    bt
  <chr>   <int> <int>
1 1           9    11
2 2          12    13
3 3          15    15
4 4          18    17
5 5          21    19

answered Sep 29 '22 21:09

Roman

Related questions
                            
                                Dynamically sorting columns in dplyr via passing ordered vector with column names to select
                            
                                Plot 2 tmap objects side-by-side
                            
                                Is there a function to recognize a word?
                            
                                How to combine two rows in R?
                            
                                Why is standard R median function so much slower than a simple C++ alternative?
                            
                                Aggregate data.frame for each day
                            
                                Faster way to unlist a list of large matrices?
                            
                                How to get the table counts for unique values in column
                            
                                Extract pattern from string in R without distinguishing between upper and lower case letters
                            
                                Shift geom_bar right (not center-aligned)
                            
                                Preserve order of input variables and factor levels in summary table, using dplyr tidyr
                            
                                Get value of last non-NA row per column in data.table
                            
                                filter or subset list by partial object name in R
                            
                                How to extract the "domain" from an email address
                            
                                Complete column with group_by and complete
                            
                                Continuous gradient color & fixed scale heatmap ggplot2
                            
                                Cant set working directory in R notebook chunk, strange error
                            
                                Rapidly generating ~ 10^9 steps of a random process in R
                            
                                adding correlation test results to ggplot
                            
                                How to create a vector from elements in common in vectors R

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Sum columns row-wise with similar names

Tags:

dataframe

r

sum

rowwise

TSim

People also ask

2 Answers

user1981275

Roman

Recent Activity

Donate For Us