Create dataframe from a matrix

Tags:

dataframe

r

matrix

How can I get a data frame containing the same data as an existing matrix?

A simplified example of my matrix:

mat <- matrix(c(0, 0.5, 1, 0.1, 0.2, 0.3, 0.3, 0.4, 0.5),
              ncol = 3, nrow = 3,
              dimnames = list(NULL, c("time", "C_0", "C_1")))

> mat
     time C_0 C_1
[1,]  0.0 0.1 0.3
[2,]  0.5 0.2 0.4
[3,]  1.0 0.3 0.5

I would like to create a data frame that looks like this:

     name   time   val
1    C_0    0.0    0.1
2    C_0    0.5    0.2
3    C_0    1.0    0.3
4    C_1    0.0    0.3
5    C_1    0.5    0.4
6    C_1    1.0    0.5

All my attempts are quite clumsy, for example:

data.frame(cbind(c(rep("C_0", 3), rep("C_1", 3)),
                 rbind(cbind(mat[,"time"], mat[,"C_0"]),
                       cbind(mat[,"time"], mat[,"C_1"]))))

Does anyone have an idea of how to do this more elegantly? Please note that my real data has a few more columns (40 columns).

asked Apr 08 '13 17:04

user1981275

4 Answers

If you change your time column into row names, then you can use as.data.frame(as.table(mat)) for simple cases like this.

Example:

data <- c(0.1, 0.2, 0.3, 0.3, 0.4, 0.5)
dimnames <- list(time=c(0, 0.5, 1), name=c("C_0", "C_1"))
mat <- matrix(data, ncol=2, nrow=3, dimnames=dimnames)
as.data.frame(as.table(mat))
  time name Freq
1    0  C_0  0.1
2  0.5  C_0  0.2
3    1  C_0  0.3
4    0  C_1  0.3
5  0.5  C_1  0.4
6    1  C_1  0.5

In this case time and name are both factors. You may want to convert time back to numeric, or it may not matter.
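Converting time back to numeric needs an intermediate step, because calling as.numeric() directly on a factor returns its internal integer codes rather than the labels. A minimal sketch, assuming the matrix from the example above; the name long and the renamed val column are illustrative choices, not part of the original answer:

```r
# Rebuild the example matrix with time stored in the row names
mat <- matrix(c(0.1, 0.2, 0.3, 0.3, 0.4, 0.5), ncol = 2, nrow = 3,
              dimnames = list(time = c(0, 0.5, 1), name = c("C_0", "C_1")))

# responseName sets the name of the value column (the default is "Freq")
long <- as.data.frame(as.table(mat), responseName = "val")

# Go through character first: as.numeric() on a factor yields integer codes
long$time <- as.numeric(as.character(long$time))

# Reorder the columns to the layout asked for in the question
long <- long[, c("name", "time", "val")]
```

After this, long$time is a plain numeric column, while long$name is still a factor.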

answered Oct 03 '22 00:10

Greg Snow

You can use stack from base R. However, you first need to coerce your matrix to a data.frame, and then reorder the columns once the data is stacked.

mat <- as.data.frame(mat)
res <- data.frame(time = mat$time, stack(mat, select = -time))
res[, c(3, 1, 2)]

  ind time values
1 C_0  0.0    0.1
2 C_0  0.5    0.2
3 C_0  1.0    0.3
4 C_1  0.0    0.3
5 C_1  0.5    0.4
6 C_1  1.0    0.5

Note that stack is generally more efficient than melt() from the reshape2 package.
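For a wide matrix such as the 40-column one mentioned in the question, the same long layout can also be built straight from the matrix, without coercing it to a data.frame first. This is only a sketch of a tidier version of the manual cbind/rbind attempt, not part of the answer above; value_cols is a hypothetical helper name, and the code assumes every column other than time holds values:

```r
# The example matrix from the question
mat <- matrix(c(0, 0.5, 1, 0.1, 0.2, 0.3, 0.3, 0.4, 0.5),
              ncol = 3, nrow = 3,
              dimnames = list(NULL, c("time", "C_0", "C_1")))

# Every column except "time" is treated as a value column (assumption)
value_cols <- setdiff(colnames(mat), "time")

res <- data.frame(
  name = rep(value_cols, each = nrow(mat)),              # one label per value
  time = rep(mat[, "time"], times = length(value_cols)), # time repeated per column
  val  = as.vector(mat[, value_cols])                    # values in column-major order
)
```

Because as.vector() unrolls a matrix column by column, the values line up with the repeated names and times, however many value columns there are.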

answered Oct 02 '22 23:10

agstudy

melt() from the reshape2 package gets you close ...

library(reshape2)
(res <- melt(as.data.frame(mat), id="time"))
#   time variable value
# 1  0.0      C_0   0.1
# 2  0.5      C_0   0.2
# 3  1.0      C_0   0.3
# 4  0.0      C_1   0.3
# 5  0.5      C_1   0.4
# 6  1.0      C_1   0.5

... although you may want to post-process its results to get your preferred column names and ordering.

setNames(res[c("variable", "time", "value")], c("name", "time", "val"))
#   name time val
# 1  C_0  0.0 0.1
# 2  C_0  0.5 0.2
# 3  C_0  1.0 0.3
# 4  C_1  0.0 0.3
# 5  C_1  0.5 0.4
# 6  C_1  1.0 0.5

answered Oct 02 '22 23:10

Josh O'Brien

Using dplyr and tidyr:

library(dplyr)
library(tidyr)

df <- as_tibble(mat) %>%          # convert the matrix to a tibble (as_data_frame() is deprecated)
  gather(name, val, C_0:C_1) %>%  # convert the data frame from wide to long
  select(name, time, val)         # reorder the columns

df
# A tibble: 6 x 3
   name  time   val
  <chr> <dbl> <dbl>
1   C_0   0.0   0.1
2   C_0   0.5   0.2
3   C_0   1.0   0.3
4   C_1   0.0   0.3
5   C_1   0.5   0.4
6   C_1   1.0   0.5
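In tidyr 1.0.0 and later, gather() is superseded by pivot_longer(). The following is a sketch of the same reshaping with the newer function, assuming current dplyr and tidyr are installed. Unlike gather(), pivot_longer() emits the rows time by time, so an arrange() step is added here to reproduce the ordering requested in the question:

```r
library(dplyr)
library(tidyr)

# The example matrix from the question
mat <- matrix(c(0, 0.5, 1, 0.1, 0.2, 0.3, 0.3, 0.4, 0.5),
              ncol = 3, nrow = 3,
              dimnames = list(NULL, c("time", "C_0", "C_1")))

df <- as_tibble(mat) %>%                         # matrix to tibble
  pivot_longer(-time,                            # pivot every column except time
               names_to = "name",
               values_to = "val") %>%
  arrange(name, time) %>%                        # group rows by name, then time
  select(name, time, val)                        # reorder the columns
```

Selecting with -time instead of C_0:C_1 avoids having to name the first and last value columns, which may be convenient with 40 columns.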

answered Oct 03 '22 01:10

sbha
