I am working on cleaning up text within a google doc. The challenge is that the copy contains <code>HTML</code> markup and I am trying to remove it to be left with clean text. I have created the following, but it seems to remove only the first instance of <code>HTML</code> code in the cell, how do I get it all out? <pre class="prettyprint"><code>= regexreplace(C9,"\<[a-zA-Z0-9-?]*\>","") </code></pre>

try this regular expression : <pre class="prettyprint"><code>= regexreplace(C9,"<.*?>","") </code></pre>

How to remove HTML markup from a body of text within a Google Spreadsheet?

Tags:

regex

google-docs

I am working on cleaning up text within a google doc. The challenge is that the copy contains HTML markup and I am trying to remove it to be left with clean text.

I have created the following, but it seems to remove only the first instance of HTML code in the cell, how do I get it all out?

= regexreplace(C9,"\<[a-zA-Z0-9-?]*\>","")

820

asked Apr 04 '13 14:04

Greg Hay

1 Answers

try this regular expression :

= regexreplace(C9,"<.*?>","")

126

answered Jan 01 '23 07:01

Oussama Jilal

Related questions
                            
                                sed: cannot solve this regular expression
                            
                                How to make Regex in Objective-C [closed]
                            
                                with regex, is using both "is" and "is not" range definitons within the same range possible?
                            
                                Regex in java to find pattern like ${...} from given string
                            
                                Regex pattern to match positive and negative number values in a String
                            
                                Are there JavaScript equivalents of the Vim regular expression start and end of word atoms "\<" and "\>"?
                            
                                regular expression for c# verbatim like strings (processing ""-like escapes)
                            
                                Find and Replace All But Text Between Double Quotes in VS2010
                            
                                Use of findall and parenthesis in Python
                            
                                Regex: Split string on number/string?
                            
                                Regex to validate that a string contains only 0 - 9, +, #, *, [ and ]
                            
                                Bash - correct way to escape dollar in regex
                            
                                What are the differences between lazy, greedy and possessive quantifiers?
                            
                                Split using RegEx in JavaScript
                            
                                regex match on R gregexpr
                            
                                Why OrientDB doesn't use indexes for searching with "LIKE" operator?
                            
                                Using perl as a better grep to match multiple lines using single line mode m/RE/s
                            
                                Regular expression for conditionally formatting a number string
                            
                                C# Regex Pattern Conundrum
                            
                                Combine Multiple Regexp Patterns

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With