A distributed platform for Sanskrit processing

Date
2012-12-01
Authors
Goyal, Pawan
Huet, Gérard
Kulkarni, Amba
Scharf, Peter
Bunker, Ralph
Abstract
Sanskrit, the classical language of India, presents specific challenges for computational linguistics: exact phonetic transcription in writing that obscures word boundaries, rich morphology and an enormous corpus, among others. Recent international cooperation has developed innovative solutions to these problems and significant resources for linguistic research. Solutions include efficient segmenting and tagging algorithms and dependency parsers based on constraint programming. The integration of lexical resources, text archives and linguistic software is achieved by distributed interoperable Web services. Resources include a morphological tagger and tagged corpus. © 2012 The COLING.
Keywords
Indian language technology, Morphology and POS tagging, Parsing, Resources and annotation
Citation
24th International Conference on Computational Linguistics - Proceedings of COLING 2012: Technical Papers