Grouping low-occurring levels in a data frame in R

Tags:

r

Suppose that I have a data frame with a column called C. C has many levels that occur only once. How would I replace all of the levels that occur only once with a single new level (called z)?

A  B  C   
a  a  a  
a  b  b  
a  a  c  
a  b  d  
a  b  a

The above would turn into:

A  B  C   
a  a  a  
a  b  z  
a  a  z  
a  b  z  
a  b  a

asked Jul 23 '15 18:07

kevin ko

1 Answer

What about this (assuming your data is df and its third column, C, is a factor)?

levels(df[,3])[table(df[,3])==1] <- "z"
df
  A B C
1 a a a
2 a b z
3 a a z
4 a b z
5 a b a
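
For anyone who wants to run this end to end, here is a minimal sketch that rebuilds the example data and applies the same idea. The construction of df is an assumption (the question does not show how the data frame was created), and C is made a factor explicitly because levels() only works this way on factors. The names counts and rare are just illustrative.

```r
# Rebuild the example data (assumed construction; not shown in the question).
# C is converted to a factor explicitly, since data.frame() does not turn
# strings into factors by default in R >= 4.0.0.
df <- data.frame(
  A = c("a", "a", "a", "a", "a"),
  B = c("a", "b", "a", "b", "b"),
  C = factor(c("a", "b", "c", "d", "a"))
)

# table() on a factor reports counts in the same order as levels(),
# so the logical vector below lines up with the levels it selects.
counts <- table(df$C)
rare <- counts == 1

# Assigning one label to several levels merges them into a single level.
levels(df$C)[rare] <- "z"

print(df)
print(levels(df$C))
```

If levels that occur fewer than some number of times should be grouped instead, changing the comparison (for example counts < k for a threshold k of your choosing) should work the same way.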

answered Sep 23 '22 19:09

DatamineR
