Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Unicode normalization in GWT [duplicate]

I tried s=Normalizer.normalize(s, Normalizer.Form.NFD).replaceAll("[^\\p{ASCII}]", ""); But it seems that GWT API doesn't provide such fonction.

I tried also :

s=s.replace("é",e);

But it doesn't work either

The scenario is I'am trying to générate token from the clicked Widget's text for the history management

like image 642
Momo Avatar asked Nov 25 '22 00:11

Momo


2 Answers

You can take ASCII folding filter from Lucene and add to your project. You can just take foldToASCII() method from ASCIIFoldingFilter (the method does not have any dependencies). There is also a patch in Jira that has a full class for that without any dependencies - see here. It should be compiled by GWT without any problems. License should be also OK, since it is Apache License, but don't quote me on it - you should ask a real lawyer.

like image 159
okrasz Avatar answered Dec 06 '22 06:12

okrasz


@okrasz, the foldToASCII() worked but I found a shorter one Transform a String to URL standard String in Java

like image 23
Momo Avatar answered Dec 06 '22 05:12

Momo