Extract string before "|" [duplicate]

Tags: r, substr, extract

I have a data set wherein a column looks like this:

ABC|DEF|GHI,   ABCD|EFG|HIJK,   ABCDE|FGHI|JKL,   DEF|GHIJ|KLM,   GHI|JKLM|NO|PQRS,   BCDE|FGHI|JKL

.... and so on

I need to extract the characters that appear before the first | symbol.

In Excel, we would use a combination of MID and SEARCH, or LEFT and SEARCH. R has substr().

The syntax is substr(x, start, stop).

In my case, start will always be 1. For stop, we need to find the position of the first |. How can we achieve this? Are there alternative ways to do this?

692

asked Jul 10 '16 12:07

Shounak Chakraborty

2 Answers

We can use sub

sub("\\|.*", "", str1)
#[1] "ABC"
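The pattern escapes | because an unescaped | means alternation in a regular expression. As a minimal sketch, assuming a small hypothetical vector x modelled on the column in the question, here are three equivalent base R ways to treat | as a literal character:

```r
# Hypothetical sample vector, modelled on the column in the question
x <- c("ABC|DEF|GHI", "GHI|JKLM|NO|PQRS")

# Escape the pipe with a double backslash, then drop it and everything after it
a <- sub("\\|.*", "", x)

# Put the pipe in a character class, where it loses its special meaning
b <- sub("[|].*", "", x)

# Keep the leading run of non-pipe characters instead of deleting the rest
c3 <- regmatches(x, regexpr("^[^|]*", x))

print(a)
```

All three forms should return the same values for this input; which one to use is largely a matter of readability.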

Or with strsplit

strsplit(str1, "[|]")[[1]][1]
#[1] "ABC"
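The [[1]][1] indexing above only picks the first piece of the first string. For a whole column, one possible sketch (the vector x is a hypothetical stand-in for the real column) is to split every element and take the first piece of each:

```r
# Hypothetical sample vector standing in for the data column
x <- c("ABC|DEF|GHI", "ABCD|EFG|HIJK", "GHI|JKLM|NO|PQRS")

# fixed = TRUE treats "|" as a literal string, so no regex escaping is needed
parts <- strsplit(x, "|", fixed = TRUE)

# Take the first piece of each split; vapply guarantees a character vector back
first <- vapply(parts, function(p) p[1], character(1))

print(first)
```

vapply is used here rather than sapply only so that the result type is checked; sapply(parts, `[`, 1) would be a common alternative.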

Update

If we use the data from @hrbrmstr

sub("\\|.*", "", df$V1)
#[1] "ABC"   "ABCD"  "ABCDE" "DEF"   "GHI"   "BCDE"

These are all base R methods. No external packages used.
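For completeness, the substr() route described in the question (the analogue of LEFT-SEARCH in Excel) can also be sketched in base R, with regexpr() playing the role of SEARCH. The vector x and the name end_pos are hypothetical, and the fallback for strings without a | is an assumption about the desired behaviour:

```r
# Hypothetical sample vector; the last element has no pipe at all
x <- c("ABC|DEF|GHI", "ABCD|EFG|HIJK", "NOPIPE")

# Position of the first literal "|" in each string, or -1 if there is none
pos <- as.integer(regexpr("|", x, fixed = TRUE))

# Stop one character before the pipe; keep the whole string when no pipe exists
end_pos <- ifelse(pos > 0, pos - 1, nchar(x))

# substr() is vectorised over its stop argument
first <- substr(x, 1, end_pos)

print(first)
```

This is more verbose than sub(), but it makes the start and stop positions explicit, which can help when the same positions are needed elsewhere.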

data

str1 <- "ABC|DEF|GHI ABCD|EFG|HIJK ABCDE|FGHI|JKL DEF|GHIJ|KLM GHI|JKLM|NO|PQRS BCDE|FGHI|JKL"

107

answered Oct 22 '22 08:10

akrun

Another option is the word function from the stringr package

library(stringr)
word(df1$V1, 1, sep = "\\|")

Data

df1 <- read.table(text = "ABC|DEF|GHI,   ABCD|EFG|HIJK,   ABCDE|FGHI|JKL,   DEF|GHIJ|KLM,   GHI|JKLM|NO|PQRS,   BCDE|FGHI|JKL")

answered Oct 22 '22 08:10

user2100721
