Autores
Eponon Anvi Alex
Batyrshin Ildar
Sidorov Grigori
Título Pinealai_StressIdent_LT-EDI@EACL2024: Minimal configurations for Stress Identification in Tamil and Telugu
Tipo Congreso
Sub-tipo Memoria
Descripción 4th Workshop on Language Technology for Equality, Diversity, Inclusion, LT-EDI 2024
Resumen This paper introduces an approach to stress identification in Tamil and Telugu, leveraging traditional machine learning models—Fasttext for Tamil and Naive Bayes for Telugu—yielding commendable results. The study highlights the scarcity of annotated data and recognizes limitations in phonetic features relevant to these languages, impacting precise information extraction. Our models achieved a macro F1 score of 0.77 for Tamil and 0.72 for Telugu with Fasttext and Naive Bayes, respectively. While the Telugu model secured the second rank in shared tasks, ongoing research is crucial to unlocking the full potential of stress identification in these languages, necessitating the exploration of additional features and advanced techniques specified in the discussions and limitations section. © 2024 Association for Computational Linguistics.
Observaciones
Lugar St. Julians
País Malta
No. de páginas 152-156
Vol. / Cap.
Inicio 2024-03-21
Fin
ISBN/ISSN 9798891760813