SABER

Authors
Sidorov Grigori

Title	English-Spanish Large Statistical Dictionary of Inflectional Forms
Type	Conference
Subtype	Proceedings
Description	7th Int. Conf. on Language Resources and Evaluation, LREC-2010
Abstract	The paper presents an approach for constructing a weighted bilingual dictionary of inflectional forms using a traditional bilingual dictionary, rather than parallel corpora, as input data. An algorithm is developed that generates all possible morphological (inflectional) forms and weights them using information on the distribution of the corresponding grammar sets (grammar information) in large corpora for each language. The algorithm also takes into account the compatibility of grammar sets in a language pair; for example, a verb in the past tense in one language is normally expected to be translated by a verb in the past tense in the other language. We consider the developed method universal, i.e., it can be applied to any pair of languages. The obtained dictionary is freely available. It can be used in several NLP tasks, for example, statistical machine translation.
Notes
Place	Valletta
Country	Malta
Pages	277-281
Vol. / Chap.
Start	2010-05-17
End	2010-05-23
ISBN/ISSN