Extract string between /

Q: How do I extract a string between two words in Python?

To find a string between two strings in Python, use the re.search() function from the built-in re module. It scans a string for the first place where the pattern matches and returns a Match object, or None if there is no match. If the pattern matches in more than one place, only the first occurrence is returned.

Q: How do you extract a certain part of a string?

In JavaScript, the substr() method extracts part of a string: it begins at a specified position and returns a specified number of characters, without changing the original string. To extract characters from the end of the string, use a negative start position.

Tags: regex, r

If I have these strings:

mystrings <- c("X2/D2/F4",
               "X10/D9/F4",
               "X3/D22/F4",
               "X9/D22/F9")

How can I extract 2, 9, 22, 22? These are the digits between the two slashes, after the first character that follows the first slash.

I would like to do this in a vectorized fashion and, if possible, add the result as a new column with transform, with which I am familiar.

I think this regex gets me somewhere near all the characters between the slashes:

^.*\\'(.*)'\\.*$


asked Jan 03 '13 20:01

user1320502

1 Answer

> gsub("(^.+/[A-Z]+)(\\d+)(/.+$)", "\\2", mystrings)
[1] "2"  "9"  "22" "22"

You can "read" (or "parse") that regex pattern as splitting any matched string into three parts:

1) anything up to and including the first forward slash, followed by a sequence of capital letters,

2) a sequence of digits (= "\d") before the next slash, and

3) everything from that next slash to the end.

The replacement "\\2" then returns only the second part.
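To make the three parts concrete, here is a small sketch that returns each capture group in turn. The names pattern, part1, part2 and part3 are introduced only for illustration; the pattern and the input are the ones used in this answer.

```r
mystrings <- c("X2/D2/F4", "X10/D9/F4", "X3/D22/F4", "X9/D22/F9")
pattern <- "(^.+/[A-Z]+)(\\d+)(/.+$)"

# Replace the whole match with one capture group at a time
part1 <- gsub(pattern, "\\1", mystrings)  # start of string through the capital letters
part2 <- gsub(pattern, "\\2", mystrings)  # the digits being extracted
part3 <- gsub(pattern, "\\3", mystrings)  # second slash through the end

print(data.frame(mystrings, part1, part2, part3))
```

Pasting the three parts back together with paste0(part1, part2, part3) reproduces the original strings, which is a quick way to check that the groups cover the whole input.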

Non-matched character strings would be returned unaltered.
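Since the question asks for a new column via transform, the following is one possible sketch of that step. The data frame df, the column names middle and middle_num, and the extra "no-slashes" row are assumptions added for illustration; the guard with grepl() is an optional addition so that non-matching strings become NA instead of being passed through unchanged.

```r
# Hypothetical data frame; the last row is added to show the non-matching case
df <- data.frame(
  mystrings = c("X2/D2/F4", "X10/D9/F4", "X3/D22/F4", "X9/D22/F9", "no-slashes"),
  stringsAsFactors = FALSE
)
pattern <- "(^.+/[A-Z]+)(\\d+)(/.+$)"

# Without a guard, a non-matching string comes back unaltered
unguarded <- gsub(pattern, "\\2", df$mystrings)

# With a guard, non-matching rows get NA; sub() is vectorized over the column
df <- transform(
  df,
  middle = ifelse(grepl(pattern, mystrings),
                  sub(pattern, "\\2", mystrings),
                  NA_character_)
)

# Convert to integer if numeric values are wanted
df$middle_num <- as.integer(df$middle)
print(df)
```

Whether to keep the unmatched strings or turn them into NA depends on how the column will be used later; the NA version makes failures easier to spot before converting to numbers.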

answered Oct 14 '22 13:10

IRTFM
