
Here you can see some of my transliterator in action. I have removed support for Hakka, Taiwanese and Kanji because (1) adding them would bloat the applet a lot, and (2) I'm still making sure that I can freely post these tables. At present, Bopomofo (Zhuyin), Cherokee, Chinese (Cantonese, Mandarin, Shanghaiese) Greek, Japanese (kana), Korean (Hangeul and Hanja), Russian and Thai work reasonably well. The transliterator has limited support for Arabic and Hebrew (consonants only, in most cases), and very incomplete support for Vietnamese Chữ Nôm and Tang Dynasty Chinese). Gujarati and Devanagari (Hindi) still have a way to go, but you can try them out for yourselves to see the progress that I have made. I hope to fix things up a little more over the summer, and add some more Chinese character tables, so please check back soon.
HOW TO USE:First, make sure that it has loaded properly. You should see a box with various options and two buttons in between the input and output text boxes. If you instead see a blue or gray box, something went wrong. In this case, check that you have the latest Java Plug-in, at java.com. If you do, the problem might be that the Microsoft Virtual Machine is overriding it. You can fix this (in Windows XP, at least) is by running "regsvr32 /u msjava.dll", which effectively unregisters the Microsoft VM. If you decide that you want it back, just run "regsvr32 msjava.dll". This might work with older versions of Windows too. Run these commands at your own risk!
Now that you know it's working, simply type or paste foreign language text in the top box, select a transliterator (Unicode, unless you're doing Gujarati), and click "Transliterate!". If you have foreign language text that is encoded, my transliterator can decode it for you too! Just click on "Options", then "Set Encoding", and pick your encoding.
The configuration button is geared more toward the Chinese character transliterators, so only CJK_Unified_Ideographs are configurable in this demo is "configurable". If you click on "Configure" when the transliterator is set to Unicode, you can get it to disable transliteration, and instead get hexadecimal codes and character sets for your characters. If you click on "CJK_Unified_Ideographs" from the Unicode menu, and click "Configure...", you can enable or disable tonal output, or restrict each character's transliteration to just one pronunciation. You can also change the priority of the tables. If one table does not have the pronunciation of a character, the transliterator will look at the next table in the list. If no table has the pronunciation, it will yield [???] for that character.
| Your input: |
| Transliterated Output: |
Please see the list of text sources and table sources.
KNOWN TRANSLITERATION ISSUES:So much for "write once, run anywhere"... If this applet does not run in your browser, you are not alone. Support for it seems to vary by browser and operating system. I've managed to fix most of the compatibility issues, but some problems are still unsolved. If you are having trouble running it, make sure that you are running the latest version of the Java plug-in. I've had some trouble making it work on Opera, but it should work on Mozilla and MS Internet Explorer.