Table frequency from multiple col and multiple row in R

Tags: r, frequency

I am trying to get a frequency table from this dataframe:

tmp2 <- structure(list(a1 = c(1L, 0L, 0L), a2 = c(1L, 0L, 1L),
                       a3 = c(0L, 1L, 0L), b1 = c(1L, 0L, 1L),
                       b2 = c(1L, 0L, 0L), b3 = c(0L, 1L, 1L)),
                       .Names = c("a1", "a2", "a3", "b1", "b2", "b3"),
                       class = "data.frame", row.names = c(NA, -3L))


# alternatively, read the same data from a file:
# tmp2 <- read.csv("tmp2.csv", sep=";")

> tmp2
  a1 a2 a3 b1 b2 b3
1  1  1  0  1  1  0
2  0  0  1  0  0  1
3  0  1  0  1  0  1

I tried to build the frequency table as follows:

table(tmp2[,1:3], tmp2[,4:6])

But I get:

Error in sort.list(y) : 'x' must be atomic for 'sort.list'
Have you called 'sort' on a list?

Expected output:

[Image: expected output table, https://i.stack.imgur.com/MC9XA.png]

Note: the result does not have to be a square matrix. For instance, I should be able to add b4 and b5 while keeping a1, a2 and a3.

asked Apr 13 '16 10:04

S12000

2 Answers

An option:

matrix(colSums(tmp2[,rep(1:3,3)] & tmp2[,rep(4:6,each=3)]),
       ncol=3,nrow=3,
       dimnames=list(colnames(tmp2)[1:3],colnames(tmp2)[4:6]))
#   b1 b2 b3
#a1  1  1  0
#a2  2  1  1
#a3  0  0  1

If you have a different number of a and b columns, you can try:

acols<-1:3 #state the indices of the a columns
bcols<-4:6 #same for b; if you add a column this should be 4:7
matrix(colSums(tmp2[,rep(acols,length(bcols))] & tmp2[,rep(bcols,each=length(acols))]),
           ncol=length(bcols),nrow=length(acols),
           dimnames=list(colnames(tmp2)[acols],colnames(tmp2)[bcols]))
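
The same counts can also be obtained with a matrix product. This is a minimal sketch, not part of the answer above, and it assumes every column holds only 0/1 values: in that case the sum over rows of a*b equals the number of rows where both columns are 1.

```r
# Same data as in the question
tmp2 <- data.frame(a1 = c(1L, 0L, 0L), a2 = c(1L, 0L, 1L), a3 = c(0L, 1L, 0L),
                   b1 = c(1L, 0L, 1L), b2 = c(1L, 0L, 0L), b3 = c(0L, 1L, 1L))

acols <- 1:3  # indices of the a columns
bcols <- 4:6  # indices of the b columns

# crossprod(A, B) computes t(A) %*% B; row names come from the columns of A,
# column names from the columns of B
res <- crossprod(as.matrix(tmp2[acols]), as.matrix(tmp2[bcols]))
res
#    b1 b2 b3
# a1  1  1  0
# a2  2  1  1
# a3  0  0  1
```

Because the dimensions follow from acols and bcols, this should also work when the number of a and b columns differs.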

answered Sep 30 '22 23:09

nicola

Here's a possible solution:

aIdxs <- 1:3
bIdxs <- 4:6  # tmp2 has only b1..b3 here; use 4:7 once a b4 column is added

# init matrix
m <- matrix(0,
            nrow = length(aIdxs), ncol=length(bIdxs),
            dimnames = list(colnames(tmp2)[aIdxs],colnames(tmp2)[bIdxs]))

# create all combinations of a's and b's column indexes
idxs <- expand.grid(aIdxs,bIdxs)

# for each row and each (a, b) combination, add 1
# to the matrix if both the a and the b column are 1
for(r in seq_len(nrow(tmp2))){
  m <- m + matrix(apply(idxs,1,function(x){ all(tmp2[r,x]==1) }),
                  nrow=length(aIdxs), byrow=FALSE)
}
> m
   b1 b2 b3
a1  1  1  0
a2  2  1  1
a3  0  0  1
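
To illustrate the non-square case mentioned in the question, here is a hedged sketch that wraps the same idea in a function. The name pair_counts and the extra column b4 (with made-up values) are assumptions for illustration only and do not come from the question or the answers.

```r
# Hypothetical helper: counts, for each (a, b) pair of columns,
# the rows where both columns equal 1.
pair_counts <- function(df, aIdxs, bIdxs) {
  counts <- sapply(bIdxs, function(j) {
    sapply(aIdxs, function(i) sum(df[[i]] == 1 & df[[j]] == 1))
  })
  # rebuild the shape explicitly, since sapply returns a plain vector
  # when aIdxs has length 1
  matrix(counts, nrow = length(aIdxs), ncol = length(bIdxs),
         dimnames = list(colnames(df)[aIdxs], colnames(df)[bIdxs]))
}

# Same data as in the question, plus an invented b4 column
tmp2 <- data.frame(a1 = c(1L, 0L, 0L), a2 = c(1L, 0L, 1L), a3 = c(0L, 1L, 0L),
                   b1 = c(1L, 0L, 1L), b2 = c(1L, 0L, 0L), b3 = c(0L, 1L, 1L))
tmp3 <- cbind(tmp2, b4 = c(1L, 1L, 0L))

m4 <- pair_counts(tmp3, aIdxs = 1:3, bIdxs = 4:7)
m4
#    b1 b2 b3 b4
# a1  1  1  0  1
# a2  2  1  1  1
# a3  0  0  1  1
```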

answered Sep 30 '22 21:09

digEmAll
