Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
leonardlin 's Collections
8b-class-japanese-models
speed
quantize
multilingual
sota
evals
tuning
rag
context
safety
image
reasoning
interprebility
vision
code
Prompting
embedding
prompt injection
TOREAD
architecture
synthetic-data
multimodal
Open LLMs
data
voice

voice

updated Aug 14, 2024
Upvote
-

  • Towards Natural Bilingual and Code-Switched Speech Synthesis Based on Mix of Monolingual Recordings and Cross-Lingual Voice Conversion

    Paper • 2010.08136 • Published Oct 16, 2020 • 1

  • Maximizing Data Efficiency for Cross-Lingual TTS Adaptation by Self-Supervised Representation Mixing and Embedding Initialization

    Paper • 2402.01692 • Published Jan 23, 2024 • 1

  • One Model, Many Languages: Meta-learning for Multilingual Text-to-Speech

    Paper • 2008.00768 • Published Aug 3, 2020 • 1

  • Running on T4
    194
    194

    MassivelyMultilingualTTS

    🌍

    Generate speech from text in multiple languages


  • DSE-TTS: Dual Speaker Embedding for Cross-Lingual Text-to-Speech

    Paper • 2306.14145 • Published Jun 25, 2023 • 1
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs