metadata
language: en
tags:
- text-to-speech
- tts
- audio
- speech-synthesis
- orpheus
- awq
license: apache-2.0
datasets:
- internal
Orpheus-3b-FT-AWQ
This is a quantised version of canopylabs/orpheus-3b-0.1-ft.
Orpheus is a high-performance Text-to-Speech model fine-tuned for natural, emotional speech synthesis. This repository hosts the 8-bit quantised version of the 3B parameter model, optimised for efficiency while maintaining high-quality output.
Model Description
Orpheus-3b-FT-AWQ is a 3 billion parameter Text-to-Speech model that converts text inputs into natural-sounding speech with support for multiple voices and emotional expressions. The model has been quantised to 8-bit (Q8_0) format for efficient inference, making it accessible on consumer hardware.
Key features:
- 8 distinct voice options with different characteristics
- Support for emotion tags like laughter, sighs, etc.
- Optimised for CUDA acceleration on RTX GPUs
- Produces high-quality 24kHz mono audio
- Fine-tuned for conversational naturalness