Remove text after the second space

Tags: string, regex, r

I have a matrix like this (each row is a string):

m <- matrix(c("Agarista revoluta (Spreng.) Hook. f. ex Nied.",
              "Amaioua intermedia Mart.",
              "Baccharis reticularia DC."), ncol = 1)

I would like to remove the text after the second space and to return:

Agarista revoluta
Amaioua intermedia
Baccharis reticularia

I tried some combinations with gsub but I did not succeed.

Can anyone help me with this?

asked Dec 21 '16 13:12

Karlo Guidoni Martins

1 Answer

You may use

x <- c("Agarista revoluta (Spreng.) Hook. f. ex Nied.", "Amaioua intermedia Mart.", "Baccharis reticularia DC.")
sub("^(\\S*\\s+\\S+).*", "\\1", x)
## => [1] "Agarista revoluta"     "Amaioua intermedia"    "Baccharis reticularia"

Pattern details:

^ - start of string
(\\S*\\s+\\S+) - Group 1 capturing 0+ non-whitespace chars, then 1+ whitespaces, and then 1+ non-whitespaces
.* - any 0+ chars, as many as possible (up to the end of string).
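
The question stores the names in a one-column matrix rather than a plain vector. The following is a minimal sketch, not part of the original answer, that applies the same pattern to that matrix and uses base R's regexec() and regmatches() to show what Group 1 captures. The variable names pattern, parts and group1 are illustrative assumptions.

```r
# Names from the question, stored as a one-column character matrix
m <- matrix(c("Agarista revoluta (Spreng.) Hook. f. ex Nied.",
              "Amaioua intermedia Mart.",
              "Baccharis reticularia DC."), ncol = 1)

# Same pattern as in the answer (illustrative variable name)
pattern <- "^(\\S*\\s+\\S+).*"

# regexec() locates the full match and each capture group;
# regmatches() extracts the matched text as a list of character vectors
parts <- regmatches(m[, 1], regexec(pattern, m[, 1]))

# Element 1 of each entry is the full match, element 2 is Group 1
group1 <- vapply(parts, function(p) p[2], character(1))
group1
## => [1] "Agarista revoluta"     "Amaioua intermedia"    "Baccharis reticularia"

# Assigning into m[] replaces the values while keeping the matrix shape
m[] <- sub(pattern, "\\1", m)
dim(m)
## => [1] 3 1
```

Because .* consumes the rest of the string, the whole string is matched and replaced by Group 1 alone, which is why sub() returns only the first two words.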

Note that if your strings may have leading whitespace and you do not want it counted as the first space, you should use

sub("^\\s*(\\S+\\s+\\S+).*", "\\1", x)
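
To make the difference between the two patterns concrete, here is a small sketch with made-up input: the leading spaces and the single-word entry are assumptions added for illustration and do not come from the question's data.

```r
# Hypothetical input: one string with leading spaces, one normal,
# and one with no space at all
x <- c("  Agarista revoluta (Spreng.)", "Amaioua intermedia Mart.", "Baccharis")

# First pattern: \\S* matches nothing, so the leading spaces are
# treated as the first whitespace run
first <- sub("^(\\S*\\s+\\S+).*", "\\1", x)
first
## => [1] "  Agarista"         "Amaioua intermedia" "Baccharis"

# Second pattern: \\s* skips the leading spaces outside the group
second <- sub("^\\s*(\\S+\\s+\\S+).*", "\\1", x)
second
## => [1] "Agarista revoluta"  "Amaioua intermedia" "Baccharis"
```

A string without any whitespace does not match either pattern, so sub() returns it unchanged.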

answered Sep 30 '22 03:09

Wiktor Stribiżew
