How to generate a diacritized vowel table automatically?

Q: What is a vowel diacritic?

The diaeresis (/daɪˈɛrəsɪs, -ˈɪər-/ dy-ERR-ə-sis, -EER-) is a diacritical mark used to indicate the separation of two distinct vowels in adjacent syllables when an instance of diaeresis (or hiatus) occurs, so as to distinguish them from a digraph or diphthong.

Tags:

java

.net

php

wolfram-mathematica

diacritics

I want a table of vowels with diacritics, but I don't want to search symbol tables manually.

Is it possible to generate this table by crossing a list of vowels with a list of diacritics in one of the following languages: Java, PHP, Wolfram Mathematica, .NET languages and so on?

I need actual Unicode characters as output.

Java Solution

I found that there is a special Unicode feature for this: http://en.wikipedia.org/wiki/Unicode_normalization

Java has supported it since 1.6: http://docs.oracle.com/javase/6/docs/api/java/text/Normalizer.html

So, the sample code is:

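import java.text.Normalizer;
import javax.swing.JOptionPane;

public class DiacritizedVowels {
    public static void main(String[] args) {
        String vowels = "aeiou";
        // Combining macron, acute accent, grave accent, and caron.
        char[] diacritics = {'\u0304', '\u0301', '\u0300', '\u030C'};
        StringBuilder sb = new StringBuilder();

        for (int v = 0; v < vowels.length(); ++v) {
            for (int d = 0; d < diacritics.length; ++d) {
                // Base vowel followed by a combining mark (decomposed form).
                sb.append(vowels.charAt(v));
                sb.append(diacritics[d]);
                sb.append(' ');
            }
            // End each row with the plain vowel.
            sb.append(vowels.charAt(v));
            sb.append('\n');
        }

        // NFC composes each vowel + mark pair into one precomposed character.
        String ans = Normalizer.normalize(sb.toString(), Normalizer.Form.NFC);

        JOptionPane.showMessageDialog(null, ans);
    }
}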

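That is, we put a combining diacritic after each vowel and then normalize the string to NFC, which composes each vowel-plus-mark pair into a single precomposed character where Unicode defines one.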
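
The same idea can be packaged as a small helper that returns the table as data instead of showing a dialog. The sketch below is only an illustration and not part of the original post: the class name DiacriticTable and the methods compose and table are hypothetical. It also marks any cell that did not collapse into a single code point, since NFC leaves the base letter and the combining mark as separate characters when no precomposed form exists for that pair.

```java
import java.text.Normalizer;

public class DiacriticTable {

    // Compose a base letter with one combining mark via NFC normalization.
    public static String compose(char base, char mark) {
        String decomposed = new String(new char[] {base, mark});
        return Normalizer.normalize(decomposed, Normalizer.Form.NFC);
    }

    // Build a vowels x marks table of NFC-normalized strings.
    public static String[][] table(String vowels, char[] marks) {
        String[][] result = new String[vowels.length()][marks.length];
        for (int v = 0; v < vowels.length(); v++) {
            for (int d = 0; d < marks.length; d++) {
                result[v][d] = compose(vowels.charAt(v), marks[d]);
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // Macron, acute, grave, caron: the same marks as in the question.
        char[] marks = {'\u0304', '\u0301', '\u0300', '\u030C'};
        for (String[] row : table("aeiou", marks)) {
            StringBuilder line = new StringBuilder();
            for (String cell : row) {
                // A single code point means a precomposed character was found.
                boolean precomposed = cell.codePointCount(0, cell.length()) == 1;
                line.append(cell).append(precomposed ? "  " : "* ");
            }
            System.out.println(line.toString().trim());
        }
    }
}
```

Cells followed by an asterisk are pairs that stayed decomposed. Whether the console displays the characters correctly depends on its encoding and font, which is presumably why the original code uses a Swing dialog.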


asked Jan 08 '12 11:01

Dims

1 Answer

To be honest, I haven't completely deciphered what Szabolcs' code is doing, but in this particular case the following seems to produce the same result in Mathematica using slightly less code:

data = Import["http://unicode.org/Public/UNIDATA/NamesList.txt", "Lines"];

codes = Cases[data, 
 b_String /; StringMatchQ[
  b, ___ ~~ "LATIN " ~~ "CAPITAL" | "SMALL" ~~ " LETTER " ~~ 
   "A" | "E" | "I" | "O" | "U" ~~ " WITH " ~~ ___] :> 
    FromDigits[StringTake[b, 4], 16], Infinity];

FromCharacterCode[codes]

which produces

"ÀÁÂÃÄÅÈÉÊËÌÍÎÏÒÓÔÕÖØÙÚÛÜàáâãäåèéêëìíîïòóôõöøùúûüĀāĂăĄąĒēĔĕĖėĘęĚěĨĩĪīĬ\
ĭĮįİŌōŎŏŐőŨũŪūŬŭŮůŰűŲųƗƟƠơƯưǍǎǏǐǑǒǓǔǕǖǗǘǙǚǛǜǞǟǠǡǪǫǬǭǺǻǾǿȀȁȂȃȄȅȆȇȈȉȊȋȌȍ\
ȎȏȔȕȖȗȦȧȨȩȪȫȬȭȮȯȰȱȺɆɇɨᶏᶒᶖᶙḀḁḔḕḖḗḘḙḚḛḜḝḬḭḮḯṌṍṎṏṐṑṒṓṲṳṴṵṶṷṸṹṺṻẚẠạẢảẤấẦầẨ\
ẩẪẫẬậẮắẰằẲẳẴẵẶặẸẹẺẻẼẽẾếỀềỂểỄễỆệỈỉỊịỌọỎỏỐốỒồỔổỖỗỘộỚớỜờỞởỠỡỢợỤụỦủỨứỪừỬửỮ\
ữỰựⱥⱸⱺꝊꝋꝌꝍ"
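
For readers working in Java rather than Mathematica, a rough analogue of this name-based filter can be sketched with Character.getName (available since Java 7), which returns the Unicode character name known to the running JDK. This is a sketch under assumptions rather than a port of the code above: the class name NamedVowels and the method collect are hypothetical, only the Basic Multilingual Plane is scanned, and the result depends on the Unicode version bundled with the JDK, so it may not match the list above exactly.

```java
import java.util.regex.Pattern;

public class NamedVowels {

    // Mirrors the Mathematica pattern: a Latin capital/small vowel "WITH" something.
    // Anchored at the start because getName returns only the character name.
    private static final Pattern NAME = Pattern.compile(
            "LATIN (CAPITAL|SMALL) LETTER [AEIOU] WITH .*");

    // Collect every BMP code point whose Unicode name matches the pattern.
    public static String collect() {
        StringBuilder sb = new StringBuilder();
        for (int cp = 0; cp <= 0xFFFF; cp++) {
            String name = Character.getName(cp); // null for unassigned code points
            if (name != null && NAME.matcher(name).matches()) {
                sb.appendCodePoint(cp);
            }
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        System.out.println(collect());
    }
}
```

Unlike the normalization approach in the question, this does not cross two lists. It simply picks up every named vowel variant, including letters with strokes or hooks that have no combining-mark decomposition.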


answered Nov 01 '22 09:11

Heike
