Group values that occur only once into an OTHER field

Tags:

ggplot2

I've got a collection of Roman Coins that I'm trying to represent using ggplot in R.

There are 25 different denominations in the data, but I'd like to merge together all denominations that only have one occurrence into an 'OTHER' field, so that the graphs are easier to read.

  Medium Method Denom            Date                  Era
1 Silver Struck Denarius         112 B.C.E.:111 B.C.E. Period V – c. 119-91 B.C.E.
2 Bronze   Cast Χαλκα μεγεθους   181 B.C.E.:174 B.C.E. Period III – c. 187-155 B.C.E.
3 Bronze Struck Litra:Half-litra            269 B.C.E. Period I – 269 - c. 222 B.C.E.
4 Bronze Struck Litra:Half-litra            269 B.C.E. Period I – 269 - c. 222 B.C.E.
5 Silver Struck Didrachm         275 B.C.E.:270 B.C.E. Period I – 269 - c. 222 B.C.E.
6 Bronze Struck Double-litra     275 B.C.E.:270 B.C.E. Period I – 269 - c. 222 B.C.E.

Using the data.frame sample above, the "Denom" column needs to have every value that occurs only one time grouped together and displayed as "other." I think I should do this in the data before starting the plot. Please point me in the right direction.

Here's the code I'm using for ggplot if that helps.

ggplot(data=longbadian, aes(x=Era, fill=Denom)) 
+    geom_bar(aes(x=Era2), data = longbadian, stat="bin") 
+ theme(axis.text.x =  element_text(angle=75, hjust=1), 
                       legend.title=element_blank()) 
+ xlab("Sydenham Periods") 
+ ylab("Coins by Denomination")

Here's an example graph:

enter image description here

525

asked Jun 01 '15 15:06

David Beales

1 Answers

Something like this:

## example data
dd <- data.frame(DENOM=rep(LETTERS[1:7],c(10,5,4,rep(1,4))))
tt <- table(dd$DENOM)                  ## count occurrences
singletons <- names(tt)[tt==1]         ## find singletons
tmpc <- as.character(dd$DENOM)         ## convert from factor to char
tmpc[tmpc %in% singletons] <- "OTHER"  ## replace values
dd$DENOM <- factor(tmpc)               ## convert back to factor

The only problem with this solution is that it will mess up any pre-existing non-default ordering in your DENOM factor.

answered Oct 15 '22 01:10

Ben Bolker

Related questions
                            
                                R beginning match count
                            
                                match() versus %in% operator
                            
                                Merge matrices from two lists in R
                            
                                Compute the mean for each subject for different conditions
                            
                                R Keeps Downloading Packages to tmp Directory
                            
                                foreach and dopar returning NULL instead of the desired answer
                            
                                How to extract correct date from POSIXct element? [duplicate]
                            
                                Parallel RJAGS with convergence testing
                            
                                extract the difference ("relative complement") between two strings in r
                            
                                rvest: how can I visualize the html session webpage in Rstudio?
                            
                                Error while installing Apache SparkR package
                            
                                R: Evaluate an expression in a data frame with arguments that are passed as an object
                            
                                How to put legend right next to heatmap.2 color key
                            
                                R : Int vs Num Anomaly in Vector
                            
                                R - Subsetting to group of matrices based on a condition
                            
                                Dynamic data point label Positioning in ggmap
                            
                                How to assign identical unique IDs to matching observations between two dataframes in r?
                            
                                unable to install rgdal in ubuntu 14.04 (undefined to reference to 'pj_ctx_fclose')
                            
                                R files in capital and lowercase. Difference?
                            
                                Is it safe to use macros in R?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With