I am trying to get a simple way to count the number of distinct categories in a column of a dataframe. For example, in the iris data frame, there are 150 rows with one of the columns being species, of which there are 3 different species. I want to be able to run this bit of code and determine that there are 3 different species in that column. I do not care how many rows each of those unique entries correspond to, just how many distinct variables there are, which is mostly what I found in my research. I was thinking something like this: <pre class="prettyprint"><code>df <- iris choices <- count(unique(iris$Species)) </code></pre> Does a solution as simple as this exist? I have looked at these posts, but they either examine the entire data frame rather than a single column in that data frame or provide a more complicated solution than what I am hoping for. count number of instances in data frame Count number of occurrences of categorical variables in data frame (R) How to count number of unique character vectors within a subset of data

The following should do the job: <pre class="prettyprint"><code>choices <- length(unique(iris$Species)) </code></pre>

Count number of unique levels of a variable

Tags:

r

I am trying to get a simple way to count the number of distinct categories in a column of a dataframe.

For example, in the iris data frame, there are 150 rows with one of the columns being species, of which there are 3 different species. I want to be able to run this bit of code and determine that there are 3 different species in that column. I do not care how many rows each of those unique entries correspond to, just how many distinct variables there are, which is mostly what I found in my research.

I was thinking something like this:

df <- iris
choices <- count(unique(iris$Species))

Does a solution as simple as this exist? I have looked at these posts, but they either examine the entire data frame rather than a single column in that data frame or provide a more complicated solution than what I am hoping for.

count number of instances in data frame

Count number of occurrences of categorical variables in data frame (R)

How to count number of unique character vectors within a subset of data

563

asked Jul 21 '16 00:07

User247365

1 Answers

The following should do the job:

choices <- length(unique(iris$Species))

170

answered Oct 05 '22 21:10

Imran Ali

Related questions
                            
                                Possible to combine position_jitter with position_dodge?
                            
                                Scatter plot with ggplot2 colored by dates
                            
                                R: Dimension names in tables and multi-dimensional arrays
                            
                                BUGS error messages
                            
                                How to print three venn diagrams in the same window
                            
                                Efficient R code for finding indices associated with unique values in vector
                            
                                Combine/merge lists by elements names (list in list)
                            
                                Obtaining Separate Summary Statistics by Categorical Variable with Stargazer Package
                            
                                how to snip or crop or white-fill a large. expanded (by 10%) rectangle outside of a polygon with ggplot2
                            
                                Multiple ggplots with magrittr tee operator
                            
                                ggplot line graph with NA values
                            
                                dplyr and tail to change last value in a group_by in r
                            
                                Successfully coercing paginated JSON object to R dataframe
                            
                                How can I reduce the height of shiny input widgets?
                            
                                Fastest way of finding matching rows
                            
                                R: Efficiently remove singleton dimensions from array
                            
                                package is in use and will not be installed
                            
                                Mixing surface and scatterplot in a single 3D plot
                            
                                Hebrew Encoding Hell in R and writing a UTF-8 table in Windows
                            
                                Aggregating data with ggplot

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Count number of unique levels of a variable

Tags:

dataframe

r

User247365

People also ask

1 Answers

Imran Ali

Recent Activity

Donate For Us