I'm not new to R but I am relatively new to regular expressions. A similar question can be found in here, but it asks to split on the first comma rather than the last one. As an example, if I use <pre class="prettyprint"><code>> lastcomma_strsplit("UK, USA, Germany", ", ") [[1]] [1] "UK" "USA" "Germany" </code></pre> I want to get <pre class="prettyprint"><code>[[1]] [1] "UK, USA" "Germany" </code></pre> And if I use <pre class="prettyprint"><code>> lastcomma_strsplit("London, Washington, D.C., Berlin", ", ") [[1]] [1] "London" "Washington" "D.C." "Berlin" </code></pre> I want to get <pre class="prettyprint"><code>[[1]] [1] "London, Washington, D.C." "Berlin" </code></pre> One viable way I think is to replace the last comma by something else such as <pre class="prettyprint"><code>$, #, *, ... </code></pre> then use <pre class="prettyprint"><code>strsplit() </code></pre> to split the string by the one you replaced (Make sure it is unique!), but I'm more happy if you can deal with the problem using some built in function directly. So how can I do that?

Here's one approach: <pre class="prettyprint"><code>strsplit("UK, USA, Germany", ",(?=[^,]+$)", perl=TRUE) ## [[1]] ## [1] "UK, USA" " Germany" </code></pre> You may want: <pre class="prettyprint"><code>strsplit("UK, USA, Germany", ",\\s*(?=[^,]+$)", perl=TRUE) ## [[1]] ## [1] "UK, USA" "Germany" </code></pre> As it will match if there is no space after the comma: <pre class="prettyprint"><code>strsplit(c("UK, USA, Germany", "UK, USA,Germany"), ",\\s*(?=[^,]+$)", perl=TRUE) ## [[1]] ## [1] "UK, USA" "Germany" ## ## [[2]] ## [1] "UK, USA" "Germany" </code></pre>

string split on last comma in R

Tags:

string

split

r

comma

I'm not new to R but I am relatively new to regular expressions.

A similar question can be found in here, but it asks to split on the first comma rather than the last one.

As an example, if I use

> lastcomma_strsplit("UK, USA, Germany", ", ")
[[1]]
[1] "UK"      "USA"     "Germany"

I want to get

[[1]]
[1] "UK, USA"     "Germany"

And if I use

> lastcomma_strsplit("London, Washington, D.C., Berlin", ", ")
[[1]]
[1] "London"     "Washington" "D.C."       "Berlin"

I want to get

[[1]]
[1] "London, Washington, D.C."       "Berlin"

One viable way I think is to replace the last comma by something else such as

$, #, *, ...

then use

strsplit()

to split the string by the one you replaced (Make sure it is unique!), but I'm more happy if you can deal with the problem using some built in function directly.

So how can I do that?

275

asked Jul 24 '14 15:07

Jiqing Huang

2 Answers

Here's one approach:

strsplit("UK, USA, Germany", ",(?=[^,]+$)", perl=TRUE)

## [[1]]
## [1] "UK, USA" " Germany"

You may want:

strsplit("UK, USA, Germany", ",\\s*(?=[^,]+$)", perl=TRUE)

## [[1]]
## [1] "UK, USA" "Germany"

As it will match if there is no space after the comma:

strsplit(c("UK, USA, Germany", "UK, USA,Germany"), ",\\s*(?=[^,]+$)", perl=TRUE)

## [[1]]
## [1] "UK, USA" "Germany"
## 
## [[2]]
## [1] "UK, USA" "Germany"

153

answered Oct 05 '22 17:10

Tyler Rinker

You can use stri_split function from stringi package

x <- "USA,UK,Poland"
stri_split_fixed(x,",") # standard split by comma
[[1]]
[1] "USA"    "UK"     "Poland"

stri_split_fixed(x,",",n = 2) # set the max number of elements
[[1]]
[1] "USA"       "UK,Poland"

Unfortunately there is no parameter to change the starting point for splitting (from begin/end) but we can handle this another way - using stri_reverse

stri_split_fixed(stri_reverse(x),",",n = 2) #reverse
[[1]]
[1] "dnaloP" "KU,ASU"

stri_reverse(stri_split_fixed(stri_reverse(x),",",n = 2)[[1]]) #reverse back
[1] "Poland" "USA,UK"
stri_reverse(stri_split_fixed(stri_reverse(x),",",n = 2)[[1]])[2:1] #and again :)
[1] "USA,UK" "Poland"

answered Oct 05 '22 17:10

bartektartanus

Related questions
                            
                                Quicker way to read single column of CSV file
                            
                                Calculating Entropy
                            
                                Select the last n columns of data frame in R
                            
                                Insert a linebreak in title
                            
                                Changing the dataset of a ggplot object
                            
                                R How to mutate a subset of rows
                            
                                Reading objects from shiny output object not allowed?
                            
                                If Column Contains String then enter value for that row
                            
                                How to customize hover information in ggplotly object?
                            
                                Add new variable to list of data frames with purrr and mutate() from dplyr
                            
                                R subtracting 1 month from today's date gives NA
                            
                                Can't download data from Yahoo Finance using Quantmod in R
                            
                                Unable to send email using mailR package
                            
                                Sweave xtable: how to position tables between text?
                            
                                Replace values in a vector based on another vector
                            
                                Making a series of plots that proceed by a click
                            
                                How to compute P-value and standard error from correlation analysis of R's cor()
                            
                                How to directly read an image file from a url address in R
                            
                                writing png plots into a pdf file in R
                            
                                R ggplot2: legend should be discrete and not continuous

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With