Autores
Hussain Nisar
Qasim Amna
Mehak Gull
Zain Muhammad
Sidorov Grigori
Gelbukh Alexander
Kolesnikova Olga
Título Multi-Level Depression Severity Detection with Deep Transformers and Enhanced Machine Learning Techniques
Tipo Revista
Sub-tipo CONACYT
Descripción AI
Resumen Depression is now one of the most common mental health concerns in the digital era, calling for powerful computational tools for its detection and its level of severity estimation. A multi-level depression severity detection framework in the Reddit social media network is proposed in this study, and posts are classified into four levels: minimum, mild, moderate, and severe. We take a dual approach using classical machine learning (ML) algorithms and recent Transformer-based architectures. For the ML track, we build ten classifiers, including Logistic Regression, SVM, Naive Bayes, Random Forest, XGBoost, Gradient Boosting, K-NN, Decision Tree, AdaBoost, and Extra Trees, with two recently proposed embedding methods, Word2Vec and GloVe embeddings, and we fine-tune them for mental health text classification. Of these, XGBoost yields the highest F1-score of 94.01 using GloVe embeddings. For the deep learning track, we fine-tune ten Transformer models, covering BERT, RoBERTa, XLM-RoBERTa, MentalBERT, BioBERT, RoBERTa-large, DistilBERT, DeBERTa, Longformer, and ALBERT. The highest performance was achieved by the MentalBERT model, with an F1-score of 97.31, followed by RoBERTa (96.27) and RoBERTa-large (96.14). Our results demonstrate that, to the best of the authors’ knowledge, domain-transferred Transformers outperform non-Transformer-based ML methods in capturing subtle linguistic cues indicative of different levels of depression, thereby highlighting their potential for fine-grained mental health monitoring in online settings. © 2025 Elsevier B.V., All rights reserved.
Observaciones DOI 10.3390/ai6070157
Lugar Basel
País Suiza
No. de páginas Article number 157
Vol. / Cap. v. 6 no. 7
Inicio 2025-07-15
Fin
ISBN/ISSN