How to remove outliers from a dataset

Tags: r, statistics, outliers

I've got some multivariate data of beauty vs. age. The ages range from 20 to 40 at intervals of 2 (20, 22, 24, ..., 40), and each record has an age and a beauty rating from 1 to 5. When I draw boxplots of this data (ages on the x-axis, beauty ratings on the y-axis), some outliers are plotted outside the whiskers of each box.

I want to remove these outliers from the data frame itself, but I'm not sure how R calculates outliers for its box plots. Below is an example of what my data might look like. [image]

546

asked Jan 24 '11 21:01

Dan Q

1 Answer

Nobody has posted the simplest answer:

x[!x %in% boxplot.stats(x)$out]
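To see what that one-liner relies on, here is a minimal sketch with a made-up vector (the values are hypothetical, not the asker's data). With the default coef = 1.5, boxplot.stats() flags any value more than 1.5 times the hinge spread (roughly the IQR) below the lower hinge or above the upper hinge, and returns those values in $out.

```r
# Hypothetical ratings; 9 is deliberately far from the rest
x <- c(2, 3, 3, 3, 4, 4, 4, 5, 9)

stats <- boxplot.stats(x)   # default coef = 1.5

# Five numbers: lower whisker, lower hinge, median, upper hinge, upper whisker
stats$stats

# Values lying beyond the whiskers, i.e. the points drawn as outliers
stats$out

# Keep only the values that are not flagged
x_clean <- x[!x %in% stats$out]
```

Here the hinges are 3 and 4, so the fences sit at 1.5 and 5.5 and only the 9 is dropped.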

Also see this: http://www.r-statistics.com/2011/01/how-to-label-all-the-outliers-in-a-boxplot/
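Since the question is about a data frame with one box per age, the same idea probably needs to be applied within each age group rather than to the whole column. The following is only a sketch under assumptions: the data frame df and its column names age and beauty are made up for illustration, and ave() is just one base-R way to do the grouping.

```r
# Hypothetical data frame; the column names `age` and `beauty` are assumptions
df <- data.frame(
  age    = rep(c(20, 22), each = 8),
  beauty = c(1, 4, 4, 4, 4, 5, 5, 5,
             2, 2, 2, 3, 3, 3, 3, 5)
)

# ave() runs the function separately within each age group and returns a
# vector aligned with the rows of df (logicals are coerced to 0/1)
is_outlier <- ave(df$beauty, df$age,
                  FUN = function(v) v %in% boxplot.stats(v)$out) == 1

# Drop the flagged rows from the data frame itself
df_clean <- df[!is_outlier, ]
```

In this made-up data a rating of 5 is kept for age 20 but removed for age 22, because the fences are computed per group, which is also how boxplot(beauty ~ age, data = df) decides which points to draw outside the whiskers.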

100

answered Oct 31 '22 13:10

J. Win.

