After importing a file, I always try try to remove spaces from the column names to make referral to column names easier. Is there a better way to do this other then using transform and then removing the extra column this command creates? This is what I use now: <pre class="prettyprint"><code>names(ctm2) #tranform function does this, but requires some action ctm2<-transform(ctm2,dymmyvar=1) #remove dummy column ctm2$dymmyvar <- NULL names(ctm2) </code></pre>

There is a very useful package for that, called <code>janitor</code> that makes cleaning up column names very simple. It removes all unique characters and replaces spaces with <code>_</code>. <pre class="prettyprint"><code>library(janitor) #can be done by simply ctm2 <- clean_names(ctm2) #or piping through `dplyr` ctm2 <- ctm2 %>% clean_names() </code></pre>

How to fix spaces in column names of a data.frame (remove spaces, inject dots)?

Tags:

dataframe

r

After importing a file, I always try try to remove spaces from the column names to make referral to column names easier.

Is there a better way to do this other then using transform and then removing the extra column this command creates?

This is what I use now:

names(ctm2) #tranform function does this, but requires some action ctm2<-transform(ctm2,dymmyvar=1) #remove dummy column ctm2$dymmyvar <- NULL names(ctm2)

785

asked May 21 '12 15:05

userJT

2 Answers

There exists more elegant and general solution for that purpose:

tidy.name.vector <- make.names(name.vector, unique=TRUE)

make.names() makes syntactically valid names out of character vectors. A syntactically valid name consists of letters, numbers and the dot or underline characters and starts with a letter or the dot not followed by a number.

Additionally, flag unique=TRUE allows you to avoid possible dublicates in new column names.

As code to implement

d<-read_delim(urltxt,delim='\t',) names(d)<-make.names(names(d),unique = TRUE)

answered Oct 07 '22 01:10

Convex

There is a very useful package for that, called janitor that makes cleaning up column names very simple. It removes all unique characters and replaces spaces with _.

library(janitor)  #can be done by simply ctm2 <- clean_names(ctm2)  #or piping through `dplyr` ctm2 <- ctm2 %>%         clean_names()

answered Oct 07 '22 01:10

camnesia

Related questions
                            
                                How can one work fully generically in data.table in R with column names in variables
                            
                                Is it possible to use spread on multiple columns in tidyr similar to dcast? [duplicate]
                            
                                Rearrange dataframe to a table, the opposite of "melt" [duplicate]
                            
                                two-column layouts in RStudio presentations/slidify/pandoc
                            
                                Using functions of multiple columns in a dplyr mutate_at call
                            
                                Diagnosing R package build warning: "LaTeX errors when creating PDF version"
                            
                                How to merge color, line style and shape legends in ggplot
                            
                                R and Python in one Jupyter notebook
                            
                                R: Break for loop
                            
                                Add panel border to ggplot2
                            
                                Select the top N values by group
                            
                                calculate the mean for each column of a matrix in R
                            
                                R Not in subset [duplicate]
                            
                                How to merge 2 vectors alternating indexes?
                            
                                ggplot2, change title size
                            
                                Putting x-axis at top of ggplot2 chart
                            
                                Cleaning up factor levels (collapsing multiple levels/labels)
                            
                                Place a border around points
                            
                                Adding time to POSIXct object in R
                            
                                Left join using data.table

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With