I have this text: <pre class="prettyprint"><code>F <- "hhhappy birthhhhhhdayyy" </code></pre> and I want to remove the repeat characters, I tried this code https://stackoverflow.com/a/11165145/10718214 and it works, but I need to remove repeat characters if it repeats more than 2, and if it repeated 2 times keep it. so the output that I expect is <pre class="prettyprint"><code>"happy birthday" </code></pre> any help?

Try using <code>sub</code>, with the pattern <code>(.)\\1{2,}</code>: <pre class="prettyprint"><code>F <- ("hhhappy birthhhhhhdayyy") gsub("(.)\\1{2,}", "\\1", F) [1] "happy birthday" </code></pre> Explanation of regex: <pre class="prettyprint"><code>(.) match and capture any single character \\1{2,} then match the same character two or more times </code></pre> We replace with just the single matching character. The quantity <code>\\1</code> represents the first capture group in <code>sub</code>.

Remove characters which repeat more than twice in a string [duplicate]

Tags:

regex

r

text-mining

I have this text:

F <- "hhhappy birthhhhhhdayyy"

and I want to remove the repeat characters, I tried this code

https://stackoverflow.com/a/11165145/10718214

and it works, but I need to remove repeat characters if it repeats more than 2, and if it repeated 2 times keep it.

so the output that I expect is

"happy birthday"

any help?

579

asked Apr 10 '19 07:04

Fatima

1 Answers

Try using sub, with the pattern (.)\\1{2,}:

F <- ("hhhappy birthhhhhhdayyy")
gsub("(.)\\1{2,}", "\\1", F)

[1] "happy birthday"

Explanation of regex:

(.)          match and capture any single character
\\1{2,}      then match the same character two or more times

We replace with just the single matching character. The quantity \\1 represents the first capture group in sub.

148

answered Sep 16 '22 11:09

Tim Biegeleisen

Related questions
                            
                                What is the purpose difference between README and vignette in R package?
                            
                                grep in R using a character vector with multiple patterns with same order as vector
                            
                                Something weird in pheatmap (a bug?)
                            
                                Multiple gganimate plots side by side
                            
                                How to control node color in ggraph?
                            
                                Simple example of call-by-need
                            
                                Blogdown doesnt render properly on netlify (theme tranquilpeak)
                            
                                dplyr unquoting does not work with filter function
                            
                                dplyr: case_when() over multiple columns with multiple conditions
                            
                                'as.tibble' causes error in tibble 2.0.1 but not 1.4.2
                            
                                gganimate barchart: smooth transition when bar is replaced
                            
                                Animated sorted bar chart: problem with overlapping bars
                            
                                knitr generating errors in document but generates figures correctly regardless
                            
                                Drawing a contour line around connected cells in a heatmap in R
                            
                                Keep auxiliary TeX files when rendering a rmarkdown document
                            
                                geom_point() rainbow colors
                            
                                Find all subsequences with specific length in sequence of numbers in R
                            
                                R datatable search option doesn't handle exotic encoding (latin1)
                            
                                Simplest way to extract date from timestamp
                            
                                How to solve prcomp.default(): cannot rescale a constant/zero column to unit variance

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With