Título |
Computing Text Similarity using Tree Edit Distance |
Tipo |
Congreso |
Sub-tipo |
Memoria |
Descripción |
Fuzzy Information Processing Society (NAFIPS) held jointly with 2015 5th World Conference on Soft Computing (WConSC), 2015 Annual Conference of the North American |
Resumen |
In this paper, we propose the application of the
Tree Edit Distance (TED) for calculation of similarity between
syntactic n-grams for further detection of soft similarity between
texts. The computation of text similarity is the basic task for many
natural language processing problems, and it is an open research
field. Syntactic n-grams are text features for Vector Space Model
construction extracted from dependency trees. Soft similarity is
application of Vector Space Model taking into account similarity
of features. First, we discuss the advantages of the application
of the TED to syntactic n-grams. Then, we present a procedure
based on the TED and syntactic n-grams for calculating soft
similarity between texts.
|
Observaciones |
DOI: 10.1109/NAFIPS-WConSC.2015.7284129 |
Lugar |
Redmond, WA |
País |
Estados Unidos |
No. de páginas |
1-4 |
Vol. / Cap. |
|
Inicio |
2015-08-17 |
Fin |
2015-08-19 |
ISBN/ISSN |
|