how to fill an empty matrix with data frame values

Tags:

I am desperately trying to fill a matrix with values from a data frame. It is trade data, so the data frame looks something like this:

country1 country2 value
1 Afghanistan  Albania    30
2 Afghanistan  Albania    81
3 Afghanistan    China     5
4     Albania  Germany     6
5       China  Germany     8
6       China   Turkey   900
7     Germany   Turkey    12
8     Germany      USA     3
9     Germany   Zambia   700

Using the unique and sort commands I have created a list of all countries that occur in the df (and converted it to a matrix):

     countries_sorted
[1,] "Afghanistan"   
[2,] "Albania"       
[3,] "China"         
[4,] "Germany"       
[5,] "Turkey"        
[6,] "USA"           
[7,] "Zambia"

Using this "list", I have created an empty trade matrix (7x7):

             Afghanistan Albania China Germany Turkey USA Zambia
Afghanistan          NA      NA    NA      NA     NA  NA     NA
Albania              NA      NA    NA      NA     NA  NA     NA
China                NA      NA    NA      NA     NA  NA     NA
Germany              NA      NA    NA      NA     NA  NA     NA
Turkey               NA      NA    NA      NA     NA  NA     NA
USA                  NA      NA    NA      NA     NA  NA     NA
Zambia               NA      NA    NA      NA     NA  NA     NA

I am now hopelessly failing to fill this matrix with the numbers/sums from the value column of df. I have tried something like this:

a<-cast(df, country1~country2 , sum)

which works to a degree BUT the matrix does not retain its original 7x7 format, which is what I need to have a matrix where the diagonal is all 0s.

> a
     country1 Albania China Germany Turkey USA Zambia
1 Afghanistan     111     5       0      0   0      0
2     Albania       0     0       6      0   0      0
3       China       0     0       8    900   0      0
4     Germany       0     0       0     12   3    700

Please, anyone with a solution????

972

asked Sep 23 '15 09:09

samyandi

1 Answers

Starting with these 2 data sets:

#your data.frame
df <- read.table(header=T, file='clipboard', stringsAsFactors = F)
#the list of unique countries
countries <- unique(c(df$country1,df$country2))

You could do:

#create all the country combinations
newdf <- expand.grid(countries, countries)
#change names
colnames(newdf) <- c('country1', 'country2')
#add a value of 0 for the new combinations (won't affect outcome)
newdf$value <- 0
#row bind with original dataset
df2 <- rbind(df, newdf)


#and create the table using xtabs:
#the aggregate function will create the sum of the value for each combination
> xtabs(value ~ country1 + country2, aggregate(value~country1+country2,df2,sum))
             country2
country1      Afghanistan Albania China Germany Turkey USA Zambia
  Afghanistan           0     111     5       0      0   0      0
  Albania               0       0     0       6      0   0      0
  China                 0       0     0       8    900   0      0
  Germany               0       0     0       0     12   3    700
  Turkey                0       0     0       0      0   0      0
  USA                   0       0     0       0      0   0      0
  Zambia                0       0     0       0      0   0      0

187

answered Oct 11 '22 18:10

LyzandeR

Related questions
                            
                                Using substitute to do variable substitutions inside R expressions
                            
                                Create new column in data frame using a for loop to calculate value in R?
                            
                                unable to install 'XML' package dependency for 'pmml' on Ubuntu
                            
                                Fill missing sequence values with dplyr
                            
                                Increase counter by 1 for each unique group of values
                            
                                Replacing multiple occurrences of a character or string inside parentheses in R
                            
                                str_count with overlapping substrings
                            
                                In R, how can I check for the existence of a function in an unloaded package?
                            
                                How use if else in mutate function in R
                            
                                forecasting multiple time series in R using auto.arima
                            
                                User-Function, formals object defined but not found by code
                            
                                data.table operation chaining
                            
                                Arrange points and lines in an r plot legend
                            
                                R In SQL Server [closed]
                            
                                ggplot errorbar position multi-factor problems
                            
                                fill=TRUE will fail when different number of column occurr after 5 rows in read.table? [duplicate]
                            
                                Subset list based on TRUE/FALSE
                            
                                R gsub everything after blank
                            
                                R - Split data frame row into two rows
                            
                                How to do a matrix calculation to get the cross products of variables

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

how to fill an empty matrix with data frame values

Tags:

casting

r

matrix

samyandi

People also ask

1 Answers

LyzandeR

Recent Activity

Donate For Us