Authors
Ramos Perez Luis Israel
Palma Preciado Carolina
Kolesnikova Olga
Saldaña Pérez Ana María Magdalena
Sidorov Grigori
Shahiki Tash Moein
Title IntelliLeksika at HOMO-MEX 2024: Detection of Homophobic Content in Spanish Lyrics with Machine Learning
Type Conference
Subtype Proceedings
Description 6th Iberian Languages Evaluation Forum, IberLEF 2024
Abstract Hate speech analysis in texts is important, and developing models to detect it is a challenge that calls for a range of approaches, particularly methods based on natural language processing. The identification of homophobic terms in songs, as proposed in Track 3 of the HOMO-MEX 2024 shared task, is of interest because such shared tasks generate new knowledge in the area. This paper applies both traditional machine learning and deep learning algorithms and compares their performance. Among the submitted runs, the team's best results came from a Decision Tree with NNLM embeddings, which attained a macro F1 score of 0.482, and from a BERT-like model (BETO), which obtained a macro F1 score of 0.486. The difference is not significant, indicating that the models do not behave substantially differently on this problem, and further investigation is needed since the overall scores were low. © 2024 Copyright for this paper by its authors.
Notes CEUR Workshop Proceedings, v. 3756
Place Valladolid
Country Spain
No. of pages
Vol. / Ch.
Start 2024-09-24
End
ISBN/ISSN