Title |
Author Clustering using Hierarchical Clustering Analysis. Notebook for PAN at CLEF 2017 |
Type |
Conference |
Subtype |
Proceedings paper |
Description |
18th Working Notes of CLEF Conference and Labs of the Evaluation Forum, CLEF 2017 |
Abstract |
This paper presents our approach to the Author Clustering task at PAN 2017. We performed a hierarchical clustering analysis of different document features: typed and untyped character n-grams, and word n-grams. We experimented with two feature representation methods, the log-entropy model and tf-idf, while tuning minimum frequency threshold values to reduce dimensionality. Our system was ranked 1st in both subtasks, author clustering and authorship-link ranking. |
Notes |
CEUR Workshop Proceedings, v. 1866 |
Place |
Dublin |
Country |
Ireland |
No. of pages |
7 p. |
Vol. / Ch. |
|
Start |
2017-09-11 |
End |
2017-09-14 |
ISBN/ISSN |
|