regex, multiline extract in R

Question

I am having some problems with deleting everything after the first occurrence of a pattern in R. I have imported the data with paste(readLines(url), collapse=" ").

For example, my string is, \"id=\"fruit_info\"> <tr class='thead'> <th colspan=2>Strawberries</th></table> </tr> </table> <tr class.

I want to remove everything after the first occurrence of </table>. What I want to see is;

\"id=\"fruit_info\"> <tr class='thead'> <th colspan=2>Strawberries</th>

The methods I am trying do not seem to register the first </table> occurrence and not providing the intended results.

Thanks!

hwnd · Accepted Answer

Try using the inline (?s) modifier which forces the dot . to span across newline sequences.

sub('(?s)</table>.*', '', x, perl = TRUE)

regex, multiline extract in R

Tags:

regex

multiline

r

jim mako

1 Answers

hwnd

Recent Activity

Donate For Us

regex, multiline extract in R

Tags:

regex

multiline

r

jim mako

1 Answers

hwnd

Related questions

Recent Activity

Donate For Us