IndicNotes

From WorkOutWiki2008

Jump to: navigation, search

Sorting

  1. Canonical equivalence of ୋ and େ + ା
  2. Order on Indlinux wiki for Telugu is correct, and not the one on the FOSS.IN/2008 workout
  3. Differences in Hindi and Marathi sorting: ksha in Marathi considered similar to consonant.
  4. Look at XYZSort on Indlinux wiki.
  5. Bengali/Assamese: Separate sorting. Check sorting already built into as_IN.
  6. Links:
  7. Santhosh notes:
    • In Malayalam, ka + halant sorts before ka, unlike Hindi, Oriya, etc.
    • Anusvara = half-ma always for Malayalam. So, should always sort the same as half-ma.
  8. Suggestions on paper from Dr. Pavanaja.
  9. Final tasklist
    • Pravin, and Rahul to complete Indic collation in iso14651_t1_common, and submit to glibc. Will put sorting examples into http://www.indlinux.org/wiki/index.php/XYZSort for checking by community. Timescale: 4 months.
    • ICU: Gopal
    • Unicode CLDR: Pavanaja, Karunakar
    • Chhatisgarhi locale: Gora to connect Pravin, and Rahul to Ravishankar Shrivastava.

Spell-checking

  1. Copying aspell phonetic tables to Indlinux wiki, e.g., http://www.indlinux.org/wiki/index.php/XYZPhonetic
  2. Final tasklist
    • Gora: Incorporate changes into aspell dictionaries.
    • Santhosh: Look at how to incorporate phonetic rules into Hunspell. Post on indlinux-group list. If easy to do, do it himself.
    • Web interface for dictionary review, including aspell-like affix rules. need volunteer.
    • Agglutinative languages in aspell, hunspell. Hunspell apparently has the capability to handle such languages. Bangla, Malayalam, Tamil. Need volunteer to do first a problem definition, and list possible approaches under existing spell-checking frameworks. Malayalam needs a run-length of up to 10.
    • Merge Hunspell, and aspell
    • Spell-checking middleware: Sonnet, enchant, gtkspell. Need a common one for all applications. Need volunteers.
    • Plugins for applications: Scribus, OpenOffice, Mozilla, IM, chat.

Documentation for the Indic desktop user

Personal tools