I have a string name s, <pre class="prettyprint"><code>String s = "<NOUN>Sam</NOUN> , a student of the University of oxford , won the Ethugalpura International Rating Chess Tournament which concluded on Dec.22 at the Blue Olympiad Hotel"; </code></pre> I want to remove all <NOUN> and </NOUN> tags from the string. I used this to remove tags, <pre class="prettyprint"><code>s.replaceAll("[<NOUN>,</NOUN>]",""); </code></pre> Yes it removes the tag. but it also removes letter 'U' and 'O' characters from the string which gives me following output. <pre class="prettyprint"><code> Sam , a student of the niversity of oxford , won the Ethugalpura International Rating Chess Tournament which concluded on Dec.22 at the Blue lympiad Hotel </code></pre> Can anyone please tell me how to do this correctly?

Try: <pre class="prettyprint"><code>s.replaceAll("<NOUN>|</NOUN>", ""); </code></pre> In RegEx, the syntax <code>[...]</code> will match every character inside the brackets, regardless of the order they appear in. Therefore, in your example, all appearances of "<", "N", "O" etc. are removed. Instead use the pipe (<code>|</code>) to match both "<NOUN>" and "</NOUN>". The following should also work (and could be considered more DRY and elegant) since it will match the tag both with and without the forward slash: <pre class="prettyprint"><code>s.replaceAll("</?NOUN>", ""); </code></pre>

String.replaceAll() takes a regular expression as its first argument. The regexp: <pre class="prettyprint"><code>"[<NOUN>,</NOUN>]" </code></pre> defines within the brackets the set of characters to be identified and thus removed. Thus you're asking to remove the characters <code><</code>,<code>></code>,<code>/</code>,<code>N</code>,<code>O</code>,<code>U</code> and comma. Perhaps the simplest method to do what you want is to do: <pre class="prettyprint"><code>s.replaceAll("<NOUN>","").replaceAll("</NOUN>",""); </code></pre> which is explicit in what it's removing. More complex regular expressions are obviously possible.

How to remove a specific special character pattern from a string

Tags:

java

string

I have a string name s,

String s = "<NOUN>Sam</NOUN> , a student of the University of oxford , won the Ethugalpura International Rating Chess Tournament which concluded on Dec.22 at the Blue Olympiad Hotel";

I want to remove all <NOUN> and </NOUN> tags from the string. I used this to remove tags,

s.replaceAll("[<NOUN>,</NOUN>]","");

Yes it removes the tag. but it also removes letter 'U' and 'O' characters from the string which gives me following output.

 Sam , a student of the niversity of oxford , won the Ethugalpura International Rating Chess Tournament which concluded on Dec.22 at the Blue lympiad Hotel

Can anyone please tell me how to do this correctly?

515

asked Aug 03 '12 08:08

Roshanck

2 Answers

Try:

s.replaceAll("<NOUN>|</NOUN>", "");

In RegEx, the syntax [...] will match every character inside the brackets, regardless of the order they appear in. Therefore, in your example, all appearances of "<", "N", "O" etc. are removed. Instead use the pipe (|) to match both "<NOUN>" and "</NOUN>".

The following should also work (and could be considered more DRY and elegant) since it will match the tag both with and without the forward slash:

s.replaceAll("</?NOUN>", "");

191

answered Oct 05 '22 08:10

Hubro

String.replaceAll() takes a regular expression as its first argument. The regexp:

"[<NOUN>,</NOUN>]"

defines within the brackets the set of characters to be identified and thus removed. Thus you're asking to remove the characters <,>,/,N,O,U and comma.

Perhaps the simplest method to do what you want is to do:

s.replaceAll("<NOUN>","").replaceAll("</NOUN>","");

which is explicit in what it's removing. More complex regular expressions are obviously possible.

answered Oct 05 '22 09:10

Brian Agnew

Related questions
                            
                                Is it clone safe to pass a classes enum to a clone?
                            
                                Not able to set LD_LIBRARY_PATH for Java process
                            
                                Sampling with no replacement in Java from an ArrayList
                            
                                Confused about naming of JavaBean properties, with respect to getters and setters
                            
                                Logs: log4j or lucene? [closed]
                            
                                converting byte[] to string
                            
                                Rotate Image Clockwise using LibGDX
                            
                                Finding Latitude and Longitude via Zip Codes in Java
                            
                                Java image rotation with AffineTransform outputs black image, but works well when resized
                            
                                Is there a more terse/elegant way to format the following Social Security Number like String with or without Groovy?
                            
                                How to best store a timestamp or date in a database?
                            
                                Add ActionListener to Column Header of JTable
                            
                                Pass by value/reference, what?
                            
                                Generate Java classes with JAXB from a DTD file - how can I modify the DTD?
                            
                                How to avoid code duplication in overloaded constructors?
                            
                                dynamically read/add value to the parameter of conf file with Properties
                            
                                SimpleDateFormat parse does not work properly
                            
                                Path intersection in android
                            
                                Java swing GUI freezes
                            
                                Java collection to allow adding and removing while iterating

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With