| Speaker Name | Gora Mohanty |
|
| Organization | Sarai, CSDS | |
| Type | Workout | |
| Slides | (Not yet available) | |
Sorting in Indic locales/Indian language spell-checking enhancements |
||
| Abstract | There are two main aspects to the workout, which are combined here as both should fit into a 3-hour span: 1. There is benefit in having a single collation order defined for all Indian languages, including English. The natural place to define the sorting order is in the glibc locales. Technical problems here have essentially been solved, but there is a significant amount of work left in testing, bug-fixing, and submitting a final patch to glibc. We plan on doing this at the event. 2. There are now two main open-source spell-checking engines in use, aspell, and hunspell. Both of them allow certain advanced features, of which defining a phonetic table, and affix rules are important in the context of Indian languages. Such rules have been added to the dictionaries only for a few Indian languages, and here we aim to use a similar approach to add rules for most other Indian languages. Various other issues, like harmonising dictionaries for aspell and hunspell, working on a common, language-neutral interface for spell- checking, etc., will also be taken up. Please see http://www.indlinux.org/wiki/index.php/FOSS.IN2008#Workouts for details. |
|
| Pre-requisites | 1. Familiarity with issues in sorting, and/or knowledge of the language. An introduction to the topic will be made available on the IndLinux Wiki. 2. Knowledge of the Indian language that the participant wants to work on, and a basic understanding of issues in spell-checking. Links to material are already available on http://www.indlinux.org/wiki/index.php/FOSS.IN2008#Workouts Programming ability in C/C++, and knowledge of SWIG will be of use for some of the topics that we plan to address | |
| Speaker Profile | Gora Mohanty is an astrophysicist by training, but works mostly in the FOSS world these days. Most of his effort goes into issues related to Indian language computing, and he is part of the IndLinux project. He is also involved in a variety of other open-source projects. Besides software development, he also works on promoting open-source through a variety of activities, including reviewing and mentoring student projects, evangelistic talks, and helping manage ILUG-Delhi, and Freed.in | |






