IDL-BNC @ IDRC >
IDRC / CRDI >
IDRC Research Results / Résultats de recherches du CRDI >
Please use this identifier to cite or link to this item:
|Title: ||PAN localization : a study on collation of languages from developing Asia|
|Authors: ||Hussain, Sarmad|
|Keywords: ||ASIAN LANGUAGE COMPUTING|
UNICODE COLLATION ALGORITHM
ACCESS TO INFORMATION
|Issue Date: ||2008|
|Publisher: ||Center for Research in Urdu Language Processing, National University of Computer and Emerging Science, Lahore, PK|
|Abstract: ||Collation of all written languages are defined in their dictionaries, developed over
centuries, and are thus very representative of cultural tradition. However, though it is
well understood in these cultures, it is not always thoroughly documented or well
understood in the context of existing character encodings, especially the Unicode.
This volume aims to address the complex algorithms needed for sorting out the words
in sequence for a small but diverse set of scripts and languages chosen from
developing Asian region. The set is chosen for the variety it exhibits and to show the
challenges it poses to solve the collation puzzle.
This work must be taken as an initial step towards addressing the collation of
languages in the region as there is still more which can be said about collation of these
languages, and there are many more languages which need to be documented.
The data on different languages has been obtained from the dictionaries published in
these languages, and through interacting with the PAN Localization project teams in
|Description: ||Copublished with and copyrighted by International Development Research Center|
|Project Number: ||102042|
|Project Title: ||PAN Localization Phase II: Building Local Language Computing Capacity in Asia|
|Appears in Collections:||Research Results (Pan Asia) / Résultats de recherches (Pan Asie)|
IDRC Research Results / Résultats de recherches du CRDI
2000-2009 / Années 2000-2009
Files in This Item:
|129903.pdf||1.42 MB||Adobe PDF|
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.