Autores
Gelbukh Alexander
Título Generalized Mongue-Elkan Method for Approximate Text String Comparison
Tipo Congreso
Sub-tipo SCOPUS
Descripción Lecture Notes in Computer Science
Resumen The Mongue-Elkan method is a general text string comparison method based on an internal character-based similarity measure (e.g. edit distance) combined with a token level (i.e. word level) similarity measure. We propose a generalization of this method based on the notion of the generalized arithmetic mean instead of the simple average used in the expression to calculate the Monge-Elkan method. The experiments carried out with 12 well-known name-matching data sets show that the proposed approach outperforms the original Monge-Elkan method when character-based measures are used to compare tokens.
Observaciones 10th International Conference on Computational Linguistics and Intelligent Text Processing, CICLing 2009; Code 76623; ISBN: 3642003818;978-364200381-3
Lugar Ciudad de México
País Mexico
No. de páginas 559-570
Vol. / Cap. 5449
Inicio 2009-03-01
Fin 2009-03-07
ISBN/ISSN 3642003818;978-36420