Speaker Name Gora Mohanty
Organization Sarai, CSDS
Type Workout
Slides (Not yet available)

Sorting in Indic locales/Indian language spell-checking enhancements

Abstract There are two main aspects to the workout, which are
combined here as both should fit into a 3-hour span:
1. There is benefit in having a single collation order
defined for all Indian languages, including English.
The natural place to define the sorting order is in
the glibc locales. Technical problems here have
essentially been solved, but there is a significant
amount of work left in testing, bug-fixing, and
submitting a final patch to glibc. We plan on doing
this at the event.
2. There are now two main open-source spell-checking
engines in use, aspell and hunspell. Both support
certain advanced features, of which phonetic tables
and affix rules are particularly important in the
context of Indian languages. Such rules have so far
been added to the dictionaries of only a few Indian
languages, and here we aim to use a similar
approach to add rules for most other Indian
languages. Various other issues, such as harmonising
the dictionaries for aspell and hunspell and working
on a common, language-neutral interface for
spell-checking, will also be taken up.
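As an illustration of the first aspect, the sketch below shows how an application picks up the collation order defined in a glibc locale. It is a minimal example, not part of the planned patch: the helper name sort_with_locale is hypothetical, and it assumes the requested locale (for instance hi_IN.UTF-8) is installed on the system, falling back to plain code-point order when it is not.

```python
import locale


def sort_with_locale(words, locale_name):
    """Return words sorted by the collation rules of locale_name.

    Hypothetical helper for illustration. If the locale is not
    installed, the words are returned in plain code-point order.
    """
    saved = locale.setlocale(locale.LC_COLLATE)  # remember the current setting
    try:
        locale.setlocale(locale.LC_COLLATE, locale_name)
    except locale.Error:
        return sorted(words)  # locale unavailable: code-point order
    try:
        # strxfrm maps each string to a key that compares in locale order
        return sorted(words, key=locale.strxfrm)
    finally:
        locale.setlocale(locale.LC_COLLATE, saved)  # restore the setting
```

Calling sort_with_locale(words, "hi_IN.UTF-8") would then order a word list according to whatever collation rules that locale defines, which is why fixing the order once in the locale benefits every program that sorts through it.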
Please see http://www.indlinux.org/wiki/index.php/FOSS.IN2008#Workouts for details.
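To make the idea of affix rules in the second aspect concrete, here is a simplified sketch of how a suffix rule expands one dictionary stem into several accepted word forms. It is loosely modelled on the strip/add/condition fields of suffix rules in spell-checker affix files, but the SuffixRule type and expand function are hypothetical, and the condition is treated as a literal word ending rather than a full pattern.

```python
from typing import List, NamedTuple


class SuffixRule(NamedTuple):
    strip: str      # characters removed from the end of the stem
    add: str        # characters appended after stripping
    condition: str  # the stem must end with this string (literal, simplified)


def expand(stem: str, rules: List[SuffixRule]) -> List[str]:
    """Return the stem followed by every form the rules derive from it."""
    forms = [stem]
    for rule in rules:
        if stem.endswith(rule.condition) and stem.endswith(rule.strip):
            base = stem[:len(stem) - len(rule.strip)]  # safe when strip is ""
            forms.append(base + rule.add)
    return forms


# Illustrative Hindi rule: a final "ा" is replaced by "े" (कमरा -> कमरे)
plural_rule = SuffixRule(strip="ा", add="े", condition="ा")
print(expand("कमरा", [plural_rule]))
```

With rules like this, a dictionary only needs to list stems together with the rules that apply to them, rather than every inflected form.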
Pre-requisites 1. Familiarity with issues in sorting, and/or knowledge of the language. An introduction to the topic will be made available on the IndLinux Wiki.
2. Knowledge of the Indian language that the participant wants to work on, and a basic understanding of issues in spell-checking. Links to material are already available on http://www.indlinux.org/wiki/index.php/FOSS.IN2008#Workouts
Programming ability in C/C++ and knowledge of SWIG will be useful for some of the topics that we plan to address.
Speaker Profile Gora Mohanty is an astrophysicist by training, but works mostly in the FOSS world these days. Most of his effort goes into issues related to Indian language computing, and he is part of the IndLinux project. He is also involved in a variety of other open-source projects. Besides software development, he works on promoting open source through a variety of activities, including reviewing and mentoring student projects, giving evangelistic talks, and helping manage ILUG-Delhi and Freed.in.


Copyright © 2005-2008 FOSS.IN. Some Rights Reserved

This work is licensed under a Creative Commons License.
