How can I remove repeated characters in a string with R?

I would like to implement a function in R that removes repeated characters from a string. For instance, say the function is named removeRS; it should work this way:

  removeRS('Buenaaaaaaaaa Suerrrrte')
  Buena Suerte
  removeRS('Hoy estoy tristeeeeeee')
  Hoy estoy triste

My function is going to be used on strings written in Spanish, where it is uncommon (or at least not correct) to find words with more than three successive vowels. I am not concerned with the sentiment the repetition may be meant to express. There are, however, words that can have two successive identical consonants (especially ll and rr), but our function can ignore that case.

So, to sum up, the function should replace any letter that appears at least three times in a row with a single copy of that letter. In one of the examples above, aaaaaaaaa is replaced with a.

Could you give me any hints to carry out this task with R?

asked Jun 22 '12 21:06

nhern121

1 Answer

I have not thought about this very carefully, but here is a quick solution using backreferences in regular expressions:

gsub('([[:alpha:]])\\1+', '\\1', 'Buenaaaaaaaaa Suerrrrte')
# [1] "Buena Suerte"

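() captures a letter, \\1 refers back to that captured letter, and + means to match it one or more times. Put together, the pattern matches any letter that occurs two or more times in a row, and the replacement \\1 leaves a single copy of it. Note that this also collapses legitimate double letters such as ll and rr.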
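The question asks for runs of at least three identical letters, while the pattern above already fires on two. A minimal sketch of one way to tighten it, assuming a bounded quantifier on the backreference is acceptable; the wrapper name removeRS is taken from the question and is not an existing R function:

```r
# Hypothetical helper named after the question's removeRS; not part of base R.
removeRS <- function(x) {
  # ([[:alpha:]]) captures one letter; \\1{2,} requires at least two more
  # copies of it, so only runs of three or more letters are collapsed.
  gsub('([[:alpha:]])\\1{2,}', '\\1', x)
}

removeRS('Buenaaaaaaaaa Suerrrrte')  # [1] "Buena Suerte"
removeRS('Hoy estoy tristeeeeeee')   # [1] "Hoy estoy triste"
removeRS('calle arriba')             # [1] "calle arriba" (ll and rr are kept)
```

Because gsub is vectorized over its input, the same helper can be applied to a whole character vector of sentences at once.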

To include other characters besides letters, replace [[:alpha:]] with a regex matching whatever you wish to include.
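As a rough illustration of swapping the character class, the sketch below tries two alternatives on a made-up sample string: [[:alnum:]] for letters and digits, and . for any character at all. The variable names and the input are only for demonstration.

```r
# Made-up sample input with repeated letters, punctuation, and digits.
x <- 'Wowww!!! 10000'

# Letters and digits only: the repeated '!' is left untouched.
letters_and_digits <- gsub('([[:alnum:]])\\1+', '\\1', x)

# Any character: punctuation runs are collapsed as well.
any_character <- gsub('(.)\\1+', '\\1', x)

print(letters_and_digits)  # [1] "Wow!!! 10"
print(any_character)       # [1] "Wow! 10"
```

Matching is case-sensitive, so the leading W and the following w characters are treated as different letters.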

answered Sep 23 '22 14:09

Yihui Xie
