Evaluating Large Language Models for Anxiety and Depression Classification using Counseling and Psychotherapy Transcripts
Abstract
Traditional machine learning methods outperform state-of-the-art deep learning models, including transformer models and GPT-based approaches, in classifying anxiety and depression from conversational transcripts.
We aim to evaluate the efficacy of traditional machine learning and large language models (LLMs) in classifying anxiety and depression from long conversational transcripts. We fine-tune both established transformer models (BERT, RoBERTa, Longformer) and more recent large models (Mistral-7B), trained a Support Vector Machine with feature engineering, and assessed GPT models through prompting. We observe that state-of-the-art models fail to enhance classification outcomes compared to traditional machine learning methods.
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper