Título |
CIC-GIL approach to author profiling in Spanish tweets: Location and occupation |
Tipo |
Congreso |
Sub-tipo |
Memoria |
Descripción |
3rd Workshop on Evaluation of Human Language Technologies for Iberian Languages, IberEval 2018 |
Resumen |
We present the CIC-GIL approach to the author profiling (AP) task at MEX-A3T 2018. The task consists of two subtasks: identification of authors’ location (6-way) and occupation (8-way) in a corpus of Mexican Spanish tweets. We used the logistic regression algorithm trained on typed character n-grams, function-word n-grams, and regionalisms for location identification, and typed character n-grams with several modifications for occupation identification. Our best run showed F1-macro score of 73.63% for location and 48.94% for occupation identification. The results are competitive with other participating teams; in particular, our best run was ranked fourth in the shared task. © 2018 CEUR-WS. All Rights Reserved. |
Observaciones |
CEUR Workshop Proceedings, v. 2150 |
Lugar |
Sevilla |
País |
España |
No. de páginas |
97-101 |
Vol. / Cap. |
|
Inicio |
2018-09-18 |
Fin |
|
ISBN/ISSN |
|