While reading a data set using <code>fread</code>, I've noticed that sometimes I'm getting duplicated column names, for example (<code>fread</code> doesn't have <code>check.names</code> argument) <pre class="prettyprint"><code>> data.table( x = 1, x = 2) x x 1: 1 2 </code></pre> The question is: is there any way to remove 1 of 2 columns if they have the same name?

<code>.SDcols</code> approaches would return a copy of the columns you're selecting. Instead just remove those duplicated columns using <code>:=</code>, by reference. <pre class="prettyprint"><code>dt[, which(duplicated(names(dt))) := NULL] # x # 1: 1 </code></pre>

How to remove duplicated (by name) column in data.tables in R?

Tags:

r

data.table

While reading a data set using fread, I've noticed that sometimes I'm getting duplicated column names, for example (fread doesn't have check.names argument)

> data.table( x = 1, x = 2)
   x x
1: 1 2

The question is: is there any way to remove 1 of 2 columns if they have the same name?

217

asked Mar 16 '15 21:03

Marcin Kosiński

1 Answers

.SDcols approaches would return a copy of the columns you're selecting. Instead just remove those duplicated columns using :=, by reference.

dt[, which(duplicated(names(dt))) := NULL]
#    x
# 1: 1

134

answered Sep 22 '22 12:09

Arun

Related questions
                            
                                Aggregate data in one column based on values in another column
                            
                                Using grep in R to delete rows from a data.frame
                            
                                Removing x-axis label from dendrogram in r
                            
                                R how many element satisfy a condition?
                            
                                Boxplot of table using ggplot2
                            
                                Find consecutive sequence of zeros in R
                            
                                Add a new column between other dataframe columns [duplicate]
                            
                                Formatting of persp3d plot
                            
                                Calculating Time Difference between two columns
                            
                                stringr str_extract capture group capturing everything
                            
                                R: Sample a vector with replacement multiple times
                            
                                Too few periods for decompose() [closed]
                            
                                Removing leading zeros from alphanumeric characters in R
                            
                                How to make gradient color filled timeseries plot in R
                            
                                using leaflet library to output multiple popup values
                            
                                "RTextTools" create_matrix got an error
                            
                                Improving model training speed in caret (R)
                            
                                Interpretation of ordered and non-ordered factors, vs. numerical predictors in model summary
                            
                                R Extract day from datetime
                            
                                dim(X) must have a positive length when applying function in data frame

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With