Extracting a specific word using gsub and regex

Tags: regex, r, gsub

Following on from a previous question, I'm having a problem with the proper regular expression syntax to isolate a specific word.

Given a data frame:

DL<-c("Dark_ark","Light-Lis","dark7","DK_dark","The_light","Lights","Lig_dark","D_Light")
Col1<-c(1,12,3,6,4,8,2,8)
DF<-data.frame(Col1)
row.names(DF)<-DL

I'm looking to extract every "Dark" and "Light" (ignoring upper vs lower case) from the row names and make a second column containing only the string "Dark" or "Light":

Col2<-c("Dark","Light","dark","dark","light","Light","dark","Light")
DF$Col2<-Col2

          Col1  Col2
Dark_ark     1  Dark
Light-Lis   12 Light
dark7        3  dark
DK_dark      6  dark
The_light    4 light
Lights       8 Light
Lig_dark     2  dark
D_Light      8 Light

I've changed the original data a bit to illustrate my current issue, but working off an excellent answer from Tyler Rinker, I used this:

DF$Col2<-gsub("[^dark|light]", "", row.names(DF), ignore.case = TRUE)

But gsub gets tripped up on the letters that the other parts of the row names share with "dark" and "light". Searching the message boards for how to isolate an exact word with regex, it looks like the answer should be to use a double backslash with either

\\<light\\>

\\blight\\b

So why does the line

DF$Col2<-gsub("[^\\<dark\\>|\\<light\\>]", "", row.names(DF), ignore.case = TRUE)

not produce the desired column above? Instead I get:

          Col1    Col2
Dark_ark     1 Darkark
Light-Lis   12 LightLi
dark7        3    dark
DK_dark      6  DKdark
The_light    4 Thlight
Lights       8   Light
Lig_dark     2 Ligdark
D_Light      8  DLight


asked Jul 28 '13 22:07

Vinterwoo

1 Answer

How about this?

unlist(regmatches(rownames(DF), gregexpr("dark|light", rownames(DF), ignore.case=TRUE)))
# [1] "Dark"  "Light" "dark"  "dark"  "light" "Light" "dark"  "Light"
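One caveat with the line above: unlist() only lines up with the rows when every row name contains exactly one match. As a rough sketch (not part of the original answer), regexpr() can be used instead of gregexpr() to take just the first match per name and leave NA where nothing matches. The data is the DF from the question; the variable names m, hit and Col2 are only placeholders.

```r
# Rebuild the data frame from the question
DL <- c("Dark_ark", "Light-Lis", "dark7", "DK_dark",
        "The_light", "Lights", "Lig_dark", "D_Light")
DF <- data.frame(Col1 = c(1, 12, 3, 6, 4, 8, 2, 8), row.names = DL)

# regexpr() returns the position of the first match per element, or -1 if none
m <- regexpr("dark|light", rownames(DF), ignore.case = TRUE)
hit <- as.integer(m) != -1L

# Start with NA everywhere, then fill in only the rows that matched;
# regmatches() with a regexpr() result returns just the matched substrings
Col2 <- rep(NA_character_, nrow(DF))
Col2[hit] <- regmatches(rownames(DF), m)

DF$Col2 <- Col2
print(DF)
```

With the row names from the question every name matches, so this should give the same column as the one-liner above.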

gsub(".*(dark|light).*$", "\\1", row.names(DF), ignore.case = TRUE)
# [1] "Dark"  "Light" "dark"  "dark"  "light" "Light" "dark"  "Light"
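As for why the original attempt fails, my understanding is that square brackets define a set of single characters, not a list of words. So "[^dark|light]" means "any one character that is not d, a, r, k, |, l, i, g, h or t", and gsub() deletes exactly those characters and keeps everything else. Putting \\< and \\> inside the brackets does not help, because inside brackets they no longer act as word-boundary anchors. A small sketch of the difference, using two of the row names from the question (x, stripped and words are just illustrative names):

```r
x <- c("Dark_ark", "The_light")

# Negated character class: removes every character that is NOT one of
# d, a, r, k, |, l, i, g, h, t (case-insensitively), one character at a time
stripped <- gsub("[^dark|light]", "", x, ignore.case = TRUE)
print(stripped)
# "Darkark" "Thlight"  (the same values as in the question's output)

# Alternation outside brackets matches the whole word "dark" or "light"
words <- regmatches(x, regexpr("dark|light", x, ignore.case = TRUE))
print(words)
# "Dark" "light"
```

Also note that the gsub() version above returns the input string unchanged when a name contains neither word, so it may be worth checking for that if your real data has such rows.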


answered Nov 02 '22 23:11

Arun
