How would one convert -lrb-300-rrb- 922-6590 to -lrb-300-rrb- 922-6590 in java?
Have tried the following:
t.lemma = lemma.replaceAll("\\p{C}", " ");
t.lemma = lemma.replaceAll("[\u0000-\u001f]", " ");
Am probably missing something conceptual. Will appreciate any pointers to the solution.
Thank you
Try the next:
str = str.replaceAll("[^\\p{ASCII}]", " ");By the way, \p{ASCII} is all ASCII: [\x00-\x7F].
In ahother hand, you need to use a constant of Pattern for avoid recompiled the expression every time.
private static final Pattern REGEX_PATTERN = 
        Pattern.compile("[^\\p{ASCII}]");
public static void main(String[] args) {
    String input = "-lrb-300-rrb- 922-6590";
    System.out.println(
        REGEX_PATTERN.matcher(input).replaceAll(" ")
    );  // prints "-lrb-300-rrb- 922-6590"
}
See also:
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With