Applied Linguistics and Translation Studies - Publications
Permanent URI for this collection
Browse
Browsing Applied Linguistics and Translation Studies - Publications by Author "Bharati, Akshar"
Results Per Page
Sort Options
-
ItemGrammar extraction from treebanks for Hindi and telugu( 2010-01-01) Kolachina, Prasanth ; Kolachina, Sudheer ; Singh, Anil Kumar ; Husain, Samar ; Naidu, Viswanatha ; Sangal, Rajeev ; Bharati, AksharGrammars play an important role in many Natural Language Processing (NLP) applications. The traditional approach to creating grammars manually, besides being labor-intensive, has several limitations. With the availability of large scale syntactically annotated tree-banks, it is now possible to automatically extract an approximate grammar of a language in any of the existing formalisms from a corresponding treebank. In this paper, we present a basic approach to extract grammars from dependency treebanks of two Indian languages, Hindi and Telugu. The process of grammar extraction requires a generalization mechanism. Towards this end, we explore an approach which relies on generalization of argument structure over the verbs based on their syntactic similarity. Such a generalization counters the effect of data sparseness in the treebanks. A grammar extracted using this system can not only expand already existing knowledge bases for NLP tasks such as parsing, but also aid in the creation of grammars for languages where none exist. Further, we show that the grammar extraction process can help in identifying annotation errors and thus aid in the task of the treebank validation.
-
ItemImproving the performance of the link parser( 2009-12-01) Naidu, Y. Viswanatha ; Singh, Anil Kumar ; Sharma, Dipti Misra ; Bharati, AksharThe paper describes an approach to extend the coverage of a Link Grammar based parser on the constructions that are not being handled currently by the grammar. There are about thirty types of constructions which we have identified till now. In order to make Link Grammar handle these constructions, we introduce a preprocessor and a postprocessor. The idea is to handle such constructions via some analysis and transformations in a preprocessing phase before the sentence is given to the Link Parser and then by adding the missing links in the postprocessing phase. The main part of the paper discusses the constructions not handled by the parser and introduces rule based preprocessor and postprocessor. This simple and flexible approach is able to increase the coverage of the parser significantly and allows even a relatively naive user to improve the performance of the parser without disturbing the core grammar. © 2009 IEEE.