Applying dplyr's rename to all columns while using pipe operator

Tags: syntax, dataframe, r, dplyr

I'm working with an imported data set that corresponds to the extract below:

set.seed(1)
dta <- data.frame("This is Column One" = runif(n = 10),
                     "Another amazing Column name" = runif(n = 10),
                     "!## This Columns is so special€€€" = runif(n = 10),
                    check.names = FALSE)

I'm cleaning this data with dplyr. I would like to change the column names to syntactically valid ones and, as a second step, remove the punctuation. What I have tried so far:

dta_cln <- dta %>% 
    rename(make.names(names(dta)))

generates an error:

> dta_clean <- dta %>% 
+     rename(make.names(names(dta)))
Error: All arguments to rename must be named.

Desired result

What I want to achieve can be done in base R:

names(dta) <- gsub("[[:punct:]]","",make.names(names(dta)))

which would return:

> names(dta)
[1] "ThisisColumnOne"          "AnotheramazingColumnname" "XThisColumnsissospecial"

I want to achieve the same effect, but using dplyr and %>%.

561

asked Dec 04 '15 15:12

Konrad

1 Answer

I know this is an old question, and I'm sure you found the solution by now, but I stumbled here searching for the same question, and ultimately found a few new ways to do this.

Dplyr

Using dplyr 0.6.0 and above, there is now a rename_all function:

  dta %>% 
    # funs() is deprecated since dplyr 0.8.0; a formula works instead, and `.` is the vector of current names
    rename_all(~ gsub("[[:punct:]]", "", make.names(.)))

This works, but it looks a little messy to me. If you want more control over which columns are renamed, dplyr also offers:

rename_at
rename_if
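
On newer dplyr, rename_with() covers the same ground as rename_all, rename_at and rename_if. Below is a minimal sketch, assuming dplyr 1.0.0 or later; strip_names is a hypothetical helper name chosen for this example, not part of dplyr:

```r
library(dplyr)

set.seed(1)
dta <- data.frame("This is Column One" = runif(n = 10),
                  "Another amazing Column name" = runif(n = 10),
                  "!## This Columns is so special€€€" = runif(n = 10),
                  check.names = FALSE)

# Hypothetical helper: make the names syntactically valid, then drop punctuation
strip_names <- function(x) gsub("[[:punct:]]", "", make.names(x))

# rename_with() passes the current column names to the function
# and uses the returned vector as the new names
dta_cln <- dta %>%
  rename_with(strip_names)

# .cols restricts the renaming to a selection (the role rename_at used to play)
dta_part <- dta %>%
  rename_with(toupper, .cols = starts_with("This"))

names(dta_cln)
names(dta_part)
```

Keeping the cleaning logic in a named function means the pipeline no longer refers back to dta by name, so the same step can be reused on any data frame.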

Janitor

This is a pretty nice package (with plenty of additional utility) that can easily clean up column names:

library(janitor)

dta %>% 
  clean_names()

This renames and cleans all column names to the following:

[1] "this_is_column_one"  "another_amazing_column_name"  "x_this_columns_is_so_special"

Everything becomes snake_case rather than CamelCase, but overall clean_names is very flexible in the column names it handles. If that IS a deal breaker, you can use yet another package, snakecase, for its function to_big_camel_case() (called to_upper_camel_case() in current releases) within the rename_all function, although that is starting to get a little too esoteric.
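
For the CamelCase variant, here is a rough sketch of two routes. It assumes a janitor release whose clean_names() accepts a case argument, a snakecase release that exports to_upper_camel_case(), and dplyr 1.0.0 or later for rename_with(). The object names camel_janitor and camel_snakecase are made up for the example:

```r
library(dplyr)
library(janitor)

set.seed(1)
dta <- data.frame("This is Column One" = runif(n = 10),
                  "Another amazing Column name" = runif(n = 10),
                  "!## This Columns is so special€€€" = runif(n = 10),
                  check.names = FALSE)

# Route 1: ask clean_names() for upper camel case directly
camel_janitor <- dta %>%
  clean_names(case = "upper_camel")

# Route 2: clean to snake_case first, then convert the case with snakecase
camel_snakecase <- dta %>%
  clean_names() %>%
  rename_with(snakecase::to_upper_camel_case)

names(camel_janitor)
names(camel_snakecase)
```

How the third column comes out depends on how the installed janitor version treats symbols such as # and €, so it is worth printing the names before relying on them.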

123

answered Oct 22 '22 12:10

Dave Gruenewald
