Spaces:

ashbeexd
/

emotion-classifier

Sleeping

App Files Files Community

emotion-classifier / README.md

Ashwin B

changes

7a59d05 29 days ago

preview code

raw

history blame contribute delete

5.5 kB

	---
	title: Emotion Classifier (NLP)
	emoji: 🧠
	colorFrom: indigo
	colorTo: pink
	sdk: streamlit
	sdk_version: 1.32.2
	app_file: app/app.py
	pinned: false
	---

	# Emotion Classifier (NLP)

	A simple NLP-based emotion classification app that uses a fine-tuned transformer model on the GoEmotions dataset to predict the emotion conveyed in a given sentence.

	## Project Structure

	Emotion-Classifier-NLP/
	├── notebooks/
	│ ├── 01_exploration.ipynb
	│ ├── 02_training.ipynb
	│ ├── 03_evaluation.ipynb
	│ └── 04_comparison.ipynb
	├── src/
	│ ├── data_loader.py
	│ ├── model.py
	│ ├── model_hartmann.py
	│ ├── model_custom.py
	│ ├── train.py
	│ └── evaluate.py
	├── app/
	│ └── app.py
	├── outputs/
	│ ├── model/ # Trained model
	│ ├── metrics/ # Evaluation metrics
	│ └── interpretations/ # Integrated gradients visualization plots
	├── requirements.txt
	├── README.md
	└── .gitignore

	## Features
	- Uses Hugging Face Transformers with a fine-tuned model
	- Classifies emotion from text across 28 GoEmotions labels
	- Streamlit frontend for interactive use
	- Displays model prediction probabilities
	- Shows sample integrated gradients visualizations per label (optional)

	## Running the App Locally
	### Make sure you have Streamlit and other dependencies installed:

	pip install -r requirements.txt

	### Then start the app using:

	streamlit run app/app.py

	## Running the App on a web browser

	You can visit https://emotion-classifier-nlp.streamlit.app/ to run the app online

	## Example Output
	- Input: "I find this funny"
	- Output: Predicted Emotion: `amusement`
	- Shows prediction probabilities across all 28 classes

	## Notes
	- Pretrained model is saved in `outputs/model/`
	- Integrated Gradients plots should be saved under `outputs/interpretations/` and named using the format: `sample_{n}_{label}.png`


	# Performance Metrics

	\| Metric \| Score \|
	\|------------\|-------\|
	\| Accuracy \| 60.2% \|
	\| Macro F1 \| 48.3% \|
	\| Weighted F1\| 59.6% \|

	### Confusion matrix

	![confusion_matrix](https://github.com/user-attachments/assets/f571bafa-daa9-4cf2-88cd-2bad069d187a)



	# Model Comparison (Hartmann vs Custom Model)

	\| Sample Sentence \| Hartmann Prediction(s) \| Custom Model Prediction(s) \|
	\|-----------------------------------------------------\|-----------------------------\|-----------------------------\|
	\| I love spending time with my family. \| joy, sadness, disgust \| love, joy, admiration \|
	\| This is the worst day of my life. \| disgust, anger, sadness \| anger, surprise, disgust \|
	\| I'm feeling very nervous about the exam. \| fear, sadness, joy \| nervousness, fear, embarrassment \|
	\| What a beautiful sunset! \| joy, surprise, neutral \| admiration, excitement, joy \|
	\| I feel so disappointed and frustrated. \| sadness, anger, disgust \| disappointment, annoyance, anger \|
	\| I'm not sure how to feel about this. \| neutral, disgust, sadness \| confusion, optimism, disapproval \|
	\| That was hilarious, I can't stop laughing! \| joy, surprise, neutral \| amusement, joy, optimism \|
	\| I feel completely empty and lost. \| sadness, neutral, disgust \| surprise, disappointment, optimism \|

	Insights:
	- The custom model captured more nuanced emotions, while Hartmann’s model tended to favor high-level emotions.
	- Some variance due to differences in label granularity between the models.
	- The custom model showed stronger performance in emotions like admiration, amusement, and disappointment.

	---

	# Known Limitations

	- Single-label restriction: While the data supports multi-label emotion classification, the model currently predicts only the highest probability emotion.
	- Low support for some classes: Emotions like grief and pride had low representation in the training data.
	- Data bias: Results reflect Reddit comment biases present in the GoEmotions dataset.

	---

	# Confidence Threshold

	A confidence threshold of 0.6 is applied in the app.
	If the top emotion’s probability is below this value, the app returns:

	"Unclear / Not enough signal"

	This prevents overconfident predictions on uncertain or ambiguous text.

	---

	# Future Work

	- Expand to multi-label predictions to better capture complex emotions.
	- Improve minority class performance via data augmentation or rebalancing.
	- Incorporate explainability methods directly into the Streamlit app.
	- Deploy the app to Streamlit Cloud or Hugging Face Spaces.
	- Collect user feedback for real-world validation.

	---

	# Credits

	- Model architecture: [RoBERTa](https://huggingface.co/roberta-base) (Hugging Face)
	- Training dataset: [GoEmotions](https://huggingface.co/datasets/go_emotions)
	- Reference model: [Hartmann et al. (2023)](https://arxiv.org/abs/2305.05894)
	- Streamlit App Framework: [Streamlit](https://streamlit.io/)
	- Berta Emotion Model: [BERTa](https://huggingface.co/bhadresh-savani/bert-base-uncased-emotion)
	- Transformers Library: [Hugging Face Transformers](https://huggingface.co/docs/transformers/index)

	---

	# License

	This project is licensed under the MIT License.
	You can use, modify, and distribute the software freely, but there is no warranty.