Orpheus-3b-FT-AWQ / README.md
heydryft's picture
Create README.md
a515360 verified
metadata
language: en
tags:
  - text-to-speech
  - tts
  - audio
  - speech-synthesis
  - orpheus
  - awq
license: apache-2.0
datasets:
  - internal

Orpheus-3b-FT-AWQ

This is a quantised version of canopylabs/orpheus-3b-0.1-ft.

Orpheus is a high-performance Text-to-Speech model fine-tuned for natural, emotional speech synthesis. This repository hosts the 8-bit quantised version of the 3B parameter model, optimised for efficiency while maintaining high-quality output.

Model Description

Orpheus-3b-FT-AWQ is a 3 billion parameter Text-to-Speech model that converts text inputs into natural-sounding speech with support for multiple voices and emotional expressions. The model has been quantised to 8-bit (Q8_0) format for efficient inference, making it accessible on consumer hardware.

Key features:

  • 8 distinct voice options with different characteristics
  • Support for emotion tags like laughter, sighs, etc.
  • Optimised for CUDA acceleration on RTX GPUs
  • Produces high-quality 24kHz mono audio
  • Fine-tuned for conversational naturalness