Unidentified whitespace character in Java

Question

While extracting some HTML from a web page, I found elements whose text ends in an unknown whitespace character, one that does not match "\s" in a regex:

<span>Monday </span>

In Java, to check what this character is, I am doing:

String s = getTheSpanContent();
char c = s.charAt(s.length() -1);
int i = (int) c;

and the value of i is: 160

Does anyone know what this is, and how I can match it?

Thanks

Michael Myers · Accepted Answer

It's a non-breaking space (U+00A0). According to the Pattern Javadocs, \s matches [ \t\n\x0B\f\r], so you'll have to explicitly add \xA0 to your regex if you want to match it.
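
As a rough sketch of that suggestion (not part of the original answer): the class name NbspDemo, the helper stripTrailing, and the sample string "Monday" followed by U+00A0 are made up for illustration, standing in for whatever getTheSpanContent() returns. It identifies the character, shows that the default \s does not match it, and then matches it with \xA0 added to the character class.

```java
import java.util.regex.Pattern;

class NbspDemo {
    // \s extended with U+00A0 (non-breaking space), anchored at the end of the input.
    private static final Pattern TRAILING_WS = Pattern.compile("[\\s\\xA0]+$");

    // Hypothetical helper: removes trailing whitespace, including non-breaking spaces.
    static String stripTrailing(String s) {
        return TRAILING_WS.matcher(s).replaceAll("");
    }

    public static void main(String[] args) {
        // Assumed sample input: "Monday" followed by a non-breaking space.
        String s = "Monday\u00A0";
        char c = s.charAt(s.length() - 1);

        System.out.println((int) c);                    // 160
        System.out.println(Character.getName(c));       // NO-BREAK SPACE
        System.out.println(Character.isWhitespace(c));  // false: Java excludes non-breaking spaces
        System.out.println(Character.isSpaceChar(c));   // true: Unicode space separator

        System.out.println(s.matches(".*\\s"));         // false: default \s misses U+00A0
        System.out.println(s.matches(".*[\\s\\xA0]"));  // true once \xA0 is added
        System.out.println("[" + stripTrailing(s) + "]"); // [Monday]
    }
}
```

If the HTML may contain other Unicode spaces as well, listing each one in the character class gets tedious; that case is outside what the answer covers, so treat this only as a starting point.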

Tags:

java

whitespace

Richard H
