Autores
Bade Girma Yohannis
Kolesnikova Olga
Sidorov Grigori
Oropeza Rodríguez José Luis
Título Social Media Hate and Offensive Speech Detection Using Machine Learning Method
Tipo Congreso
Sub-tipo Memoria
Descripción 4th Workshop on Speech, Vision, and Language Technologies for Dravidian Languages, DravidianLangTech 2024
Resumen Even though the improper use of social media is increasing nowadays, there is also technology that brings solutions. Here, improperness is posting hate and offensive speech that might harm an individual or group. Hate speech refers to an insult toward an individual or group based on their identities. Spreading it on social media platforms is a serious problem for society. The solution, on the other hand, is the availability of natural language processing(NLP) technology that is capable to detect and handle such problems. This paper presents the detection of social media’s hate and offensive speech in the code-mixed Telugu language. For this, the task and golden standard dataset were provided for us by the shared task organizer (DravidianLangTech@EACL 2024)1. To this end, we have employed the TF-IDF technique for numeric feature extraction and used a random forest algorithm for modeling hate speech detection. Finally, the developed model was evaluated on the test dataset and achieved 0.492 macro-F1. © 2024 Association for Computational Linguistics.
Observaciones
Lugar St. Julians
País Malta
No. de páginas 240-244
Vol. / Cap.
Inicio 2024-03-22
Fin
ISBN/ISSN 9798891760783