Transition matrix

Tags: matrix

Consider the following dataframe:

 df = data.frame(cusip = paste("A", 1:10, sep = ""), xt = c(1,2,3,2,3,5,2,4,5,1), xt1 = c(1,4,2,1,1,4,2,2,2,5))

The data fall into five states, 1 through 5, which are really quantiles. The xt column gives each cusip's state at time t, and the xt1 column gives its state at time t+1.

I would like to compute a kind of transition matrix for the five states. The entries of the matrix would mean the following:

(Row, Col) = (1,1) : % of cusips that were in quantile 1 at time t and stayed in quantile 1 at time t+1
(Row, Col) = (1,2) : % of cusips that were in quantile 1 at time t and moved to quantile 2 at time t+1
etc.

I am really not sure how to do this in an efficient way. I have the feeling the answer is trivial, but I just can't get my head around it.

Could anyone please help?

asked Jan 27 '14 21:01

Mayou

1 Answer

res <- with(df, table(xt, xt1)) ## table() to form transition matrix
res/rowSums(res)                ## /rowSums() to normalize by row
#    xt1
# xt          1         2         4         5
#   1 0.5000000 0.0000000 0.0000000 0.5000000
#   2 0.3333333 0.3333333 0.3333333 0.0000000
#   3 0.5000000 0.5000000 0.0000000 0.0000000
#   4 0.0000000 1.0000000 0.0000000 0.0000000
#   5 0.0000000 0.5000000 0.5000000 0.0000000

## As an alternative to the 2nd line above, use sweep(), which does not rely on
## implicit recycling of the vector returned by rowSums(res)
sweep(res, MARGIN = 1, STATS = rowSums(res), FUN = `/`)
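
Note that the output above has only four columns: no cusip is in state 3 at time t+1, so table() drops that column. If a square 5 x 5 matrix is wanted, one possible approach (a sketch, not part of the original answer) is to convert both columns to factors with the same levels before tabulating. The name `states` and the assumption that the full state space is 1 to 5 are mine, taken from the question's description; prop.table() is used here as another way to normalize by row.

```r
# Same data as in the question.
df <- data.frame(cusip = paste("A", 1:10, sep = ""),
                 xt  = c(1, 2, 3, 2, 3, 5, 2, 4, 5, 1),
                 xt1 = c(1, 4, 2, 1, 1, 4, 2, 2, 2, 5))

# Assumption: the complete state space is 1..5, as described in the question.
states <- 1:5

# Factors with identical levels make table() keep states that never occur
# (here, state 3 at time t+1), so the result is a square 5 x 5 table.
counts <- table(xt  = factor(df$xt,  levels = states),
                xt1 = factor(df$xt1, levels = states))

# prop.table() with margin = 1 divides each row by its row sum.
trans <- prop.table(counts, margin = 1)
trans
```

With this data every state occurs at time t, so every row sums to 1. If some state never occurred at time t, its row would be 0/0 and show up as NaN, which would need separate handling.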

answered Sep 26 '22 13:09

Josh O'Brien
