Say that I have this data frame: <pre class="prettyprint"><code> 1 2 3 4 100 8 12 5 14 99 1 6 4 3 98 2 5 4 11 97 5 3 7 2 </code></pre> In this above data frame, the values indicate counts of how many observations take on <code>(100, 1), (99, 1)</code>, etc. In my context, the diagonals have the same meanings: <pre class="prettyprint"><code> 1 2 3 4 100 A B C D 99 B C D E 98 C D E F 97 D E F G </code></pre> How would I sum across the diagonals (i.e., sum the counts of the like letters) in the first data frame? This would produce: <pre class="prettyprint"><code>group sum A 8 B 13 C 13 D 28 E 10 F 18 G 2 </code></pre> For example, <code>D</code> is <code>5+5+4+14</code>

You can use <code>row()</code> and <code>col()</code> to identify row/column relationships. <pre class="prettyprint"><code>m <- read.table(text=" 1 2 3 4 100 8 12 5 14 99 1 6 4 3 98 2 5 4 11 97 5 3 7 2") vals <- sapply(2:8, function(j) sum(m[row(m)+col(m)==j])) </code></pre> or (as suggested in comments by ?@thelatemail) <pre class="prettyprint"><code>vals <- sapply(split(as.matrix(m), row(m) + col(m)), sum) data.frame(group=LETTERS[seq_along(vals)],sum=vals) </code></pre> or (@Frank) <pre class="prettyprint"><code>data.frame(vals = tapply(as.matrix(m), (LETTERS[row(m) + col(m)-1]), sum)) </code></pre> <code>as.matrix()</code> is required to make <code>split()</code> work correctly ...

How to sum over diagonals of data frame

Tags:

dataframe

r

sum

diagonal

Say that I have this data frame:

     1   2   3   4      
100  8   12  5   14 
99   1   6   4   3   
98   2   5   4   11  
97   5   3   7   2

In this above data frame, the values indicate counts of how many observations take on (100, 1), (99, 1), etc.

In my context, the diagonals have the same meanings:

     1   2   3   4
100  A   B   C   D 
99   B   C   D   E  
98   C   D   E   F 
97   D   E   F   G

How would I sum across the diagonals (i.e., sum the counts of the like letters) in the first data frame?

This would produce:

For example, D is 5+5+4+14

252

asked Apr 29 '15 23:04

bill999

1 Answers

You can use row() and col() to identify row/column relationships.

m <- read.table(text="
    1   2   3   4      
100  8   12  5   14 
99   1   6   4   3   
98   2   5   4   11  
97   5   3   7   2")

vals <- sapply(2:8,
       function(j) sum(m[row(m)+col(m)==j]))

or (as suggested in comments by ?@thelatemail)

vals <- sapply(split(as.matrix(m), row(m) + col(m)), sum)
data.frame(group=LETTERS[seq_along(vals)],sum=vals)

or (@Frank)

data.frame(vals = tapply(as.matrix(m), 
       (LETTERS[row(m) + col(m)-1]), sum))

as.matrix() is required to make split() work correctly ...

142

answered Oct 26 '22 11:10

Ben Bolker

Related questions
                            
                                How to count frequencies of certain character in a string?
                            
                                initctl: Unknown instance: error after Rstudio conf change
                            
                                Easy way to convert long to wide format with counts [duplicate]
                            
                                How to set the default language of date in R
                            
                                Error : "sh: gfortran: command not found" | Ubuntu 16.04
                            
                                Selecting specific elements from a matrix all at once
                            
                                Calculate frequency of occurrence in an array using R
                            
                                Can i host a shiny app on a windows machine?
                            
                                Align bars of histogram centered on labels
                            
                                Model matrix with all pairwise interactions between columns
                            
                                Select/Deselect All Button for shiny variable selection
                            
                                Can I add a "go to top" button to an HTML document rendered in R Markdown?
                            
                                How to put a complicated equation into a R formula?
                            
                                tidyr separate only first n instances [duplicate]
                            
                                ggplot2: Changing the layout of the legend
                            
                                How to create a pivot table in R with multiple (3+) variables
                            
                                Enriching a ggplot2 plot with multiple geom_segment in a loop?
                            
                                Error bars for barplot only in one direction
                            
                                Replace NA values by row means
                            
                                Select only rows if its value in a particular column is 'NA' in R

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With