R repeat in column based on value in row

I have a dataframe like the following:

Name    School   Weight Days
Antoine Bach     0.03   5
Antoine Ken      0.02   7
Barbara Franklin 0.04   3

I would like to obtain an output like the following:

Name    School   1    2    3    4    5    6    7
Antoine Bach     0.03 0.03 0.03 0.03 0.03 NA   NA
Antoine Ken      0.02 0.02 0.02 0.02 0.02 0.02 0.02
Barbara Franklin 0.04 0.04 0.04 NA   NA   NA   NA

Reproducible Sample Data:

library(tibble)

df <- tribble(
  ~Name,    ~School,   ~Weight, ~Days,
  "Antoine", "Bach",     0.03,   5,
  "Antoine", "Ken",      0.02,   7,
  "Barbara", "Franklin", 0.04,   3
)

asked Apr 11 '21 15:04

user15462606

2 Answers

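Using data.table, you can build a long version by repeating each row's Weight value Days times, then dcast it to a wide format, using the rowid of the new variable (V1) as the column. Note that rowid(V1) numbers the occurrences of each distinct Weight value, so this relies on Weight being different in every row; numbering by rowid(Name, School) instead should avoid that assumption.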

library(data.table)
setDT(df)

dcast(df[, .(rep(Weight, Days)), .(Name, School)], 
      Name + School ~ rowid(V1))

#       Name   School    1    2    3    4    5    6    7
# 1: Antoine     Bach 0.03 0.03 0.03 0.03 0.03   NA   NA
# 2: Antoine      Ken 0.02 0.02 0.02 0.02 0.02 0.02 0.02
# 3: Barbara Franklin 0.04 0.04 0.04   NA   NA   NA   NA
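
To make the intermediate long table easier to picture, here is a rough base R sketch of the same first step (repeat each row Days times and number the repeats). It is only an illustration, not the data.table code above; the names idx, long, and Day are introduced here and do not appear in the original answer.

```r
df <- data.frame(
  Name   = c("Antoine", "Antoine", "Barbara"),
  School = c("Bach", "Ken", "Franklin"),
  Weight = c(0.03, 0.02, 0.04),
  Days   = c(5, 7, 3)
)

# Repeat each row index Days times: 1 1 1 1 1 2 2 2 2 2 2 2 3 3 3
idx <- rep(seq_len(nrow(df)), df$Days)

# Long table: one line per (row, day), with Day counting 1..Days within each row
long <- data.frame(
  Name   = df$Name[idx],
  School = df$School[idx],
  Day    = sequence(df$Days),
  Weight = df$Weight[idx]
)

nrow(long)  # 15, i.e. 5 + 7 + 3
head(long)
```

Casting this long table to wide, with Day as the column, produces the NA cells for rows that have fewer days than the maximum.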

You could also repeat Weight Days times, then pad with NA up to the maximum number of days to complete the row.

max_days <- max(df$Days) 

df[, as.list(rep(c(Weight, NA), c(Days, max_days - Days))), 
   .(Name, School)]

#       Name   School   V1   V2   V3   V4   V5   V6   V7
# 1: Antoine     Bach 0.03 0.03 0.03 0.03 0.03   NA   NA
# 2: Antoine      Ken 0.02 0.02 0.02 0.02 0.02 0.02 0.02
# 3: Barbara Franklin 0.04 0.04 0.04   NA   NA   NA   NA
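
The padding trick relies on rep() accepting a vector of counts, one per element. A minimal base R sketch of that single step follows; pad_row, weights, and days are hypothetical names used only for this illustration.

```r
# Repeat `weight` `days` times, then NA for the remaining (max_days - days) slots
pad_row <- function(weight, days, max_days) {
  rep(c(weight, NA), c(days, max_days - days))
}

pad_row(0.03, 5, 7)
# [1] 0.03 0.03 0.03 0.03 0.03   NA   NA

# Applied to every row of the sample data, this gives a 3 x 7 matrix
weights <- c(0.03, 0.02, 0.04)
days    <- c(5, 7, 3)
wide <- t(mapply(pad_row, weights, days,
                 MoreArgs = list(max_days = max(days))))
dim(wide)
```

When days equals max_days the second count is 0, so no NA is appended and the row is already complete.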

answered Sep 21 '22 14:09

IceCreamToucan

You can use pmap_dfr to apply a function to each row and then row-bind the resulting list into a tibble. The function's named arguments are matched to column names; the remaining row values are captured by the ellipsis (...).

library(purrr)
library(dplyr)

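# For each row: keep the other columns (...), then append Weight repeated
# Days times, named 1..Days so the rows can be matched by column name
pmap_dfr(df, function(Weight, Days, ...) c(..., setNames(rep(Weight, Days), seq_len(Days)))) %>% 
  mutate(across(3:last_col(), as.numeric))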

Because an atomic vector in R can hold only one type, c() coerces everything in the row to character. The mutate therefore converts the newly created columns back to numeric.
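
The coercion can be seen on a small vector built by hand. This is only a sketch with made-up values (two repeats instead of a full row); row_vec is a name introduced for the illustration.

```r
# Mixing character and numeric values in c() yields a character vector
row_vec <- c(Name = "Antoine", School = "Bach", setNames(rep(0.03, 2), 1:2))

class(row_vec)   # "character"
names(row_vec)   # "Name" "School" "1" "2"

# The numbers survive as text and can be converted back
as.numeric(row_vec[["1"]])  # 0.03
```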

setNames is used to name the newly created columns, which is required to bind by row.

Output

  Name    School     `1`   `2`   `3`   `4`   `5`   `6`   `7`
  <chr>   <chr>    <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl>
1 Antoine Bach      0.03  0.03  0.03  0.03  0.03 NA    NA   
2 Antoine Ken       0.02  0.02  0.02  0.02  0.02  0.02  0.02
3 Barbara Franklin  0.04  0.04  0.04 NA    NA    NA    NA

Note: pmap_dfr is from the purrr package, and mutate, across, and last_col are all from dplyr.

How it works

When you use pmap in the way shown above, the named function arguments are matched to columns with the same name. So Weight and Days, as function arguments, are matched to the columns of the same name in each row.

The ... collects the remaining columns that are still passed to the function, but are unused (by name) in the function. Essentially, the ellipsis collects Name and School in your case.
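
The matching can be mimicked in base R by calling a function with one row supplied as a named list. This is a sketch of the idea rather than what purrr does internally; row_fun and first_row are hypothetical names.

```r
# Weight and Days are picked out by name; everything else lands in ...
row_fun <- function(Weight, Days, ...) {
  rest <- list(...)
  list(matched = c(Weight = Weight, Days = Days), rest = names(rest))
}

first_row <- list(Name = "Antoine", School = "Bach", Weight = 0.03, Days = 5)
res <- do.call(row_fun, first_row)

res$matched  # Weight 0.03, Days 5
res$rest     # "Name" "School"
```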

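Since Name and School already have names, they are passed to c() first to maintain your column order. The repeated Weight values are then appended and named as well. The output for a single row (here the first, with Days = 5) is then:

     Name    School         1         2         3         4         5 
"Antoine"    "Bach"    "0.03"    "0.03"    "0.03"    "0.03"    "0.03" 

Columns 6 and 7 do not exist for this row yet; they are filled with NA when the rows are bound, because the second row supplies those names.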

The output of pmap is a list. The _dfr suffix denotes the variant that row-binds (hence the r) these list elements into a data frame/tibble (hence the df).
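
The two stages can be separated to see the list before it is bound. The sketch below assumes purrr and dplyr are installed and rebuilds the sample data so it runs on its own; expand_row, rows, and wide are names introduced for illustration.

```r
library(purrr)
library(dplyr)

df <- tibble(
  Name   = c("Antoine", "Antoine", "Barbara"),
  School = c("Bach", "Ken", "Franklin"),
  Weight = c(0.03, 0.02, 0.04),
  Days   = c(5, 7, 3)
)

# Same per-row function as in the answer, given a name for readability
expand_row <- function(Weight, Days, ...) {
  c(..., setNames(rep(Weight, Days), seq_len(Days)))
}

# Stage 1: pmap returns a plain list, one named character vector per row
rows <- pmap(df, expand_row)
lengths(rows)  # 7 9 5

# Stage 2: row-bind by name; names missing from a row become NA
wide <- bind_rows(rows)
dim(wide)      # 3 9
```

The resulting columns are still character at this point, which is why the answer follows up with mutate(across(..., as.numeric)).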

answered Sep 19 '22 14:09

LMc

Tags: dataframe, r, repeat, long-integer
