How to find top n% of records in a column of a dataframe using R

Tags:

I have a dataset showing the exchange rate of the Australian Dollar versus the US dollar once a day over a period of about 20 years. I have the data in a data frame, with the first column being the date, and the second column being the exchange rate. Here's a sample from the data:

>data              V1     V2 1    12/12/1983 0.9175 2    13/12/1983 0.9010 3    14/12/1983 0.9000 4    15/12/1983 0.8978 5    16/12/1983 0.8928 6    19/12/1983 0.8770 7    20/12/1983 0.8795 8    21/12/1983 0.8905 9    22/12/1983 0.9005 10   23/12/1983 0.9005

How would I go about displaying the top n% of these records? E.g. say I want to see the days and exchange rates for those days where the exchange rate falls in the top 5% of all exchange rates in the dataset?

926

asked Oct 14 '09 02:10

Bryce Thomas

1 Answers

For the top 5%:

n <- 5 data[data$V2 > quantile(data$V2,prob=1-n/100),]

answered Sep 21 '22 15:09

Rob Hyndman

Related questions
                            
                                Change arrowhead of arrows()
                            
                                Applying a function to two lists?
                            
                                Remove legend entries for some factors levels
                            
                                How to split Shiny app code over multiple files in RStudio? [closed]
                            
                                R - ordering in boxplot
                            
                                Why are Xs added to data frame variable names when using read.csv?
                            
                                Chi-Squared test in Python
                            
                                How to convert integer into categorical data in R?
                            
                                readRDS(file) in R
                            
                                What are the disadvantages of using .Rdata files compared to HDF5 or netCDF?
                            
                                Display HTML file in Shiny App
                            
                                Disable/suppress tcltk popup for CRAN mirror selection in R
                            
                                how do i exclude specific variables from a glm in R?
                            
                                Thousand separator in label of x or y axis
                            
                                Using lists inside data.table columns
                            
                                Efficiency of operations on R data structures
                            
                                Use character string as function argument
                            
                                move axis labels ggplot
                            
                                How can I change paper size when using Knit PDF in RStudio?
                            
                                Print pretty data.frames/tables to console

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to find top n% of records in a column of a dataframe using R

Tags:

dataframe

r

Bryce Thomas

People also ask

1 Answers

Rob Hyndman

Recent Activity

Donate For Us