novateur/WavTokenizer

Text-to-Speech
audio-feature-extraction
speech-language-models
gpt4-o
tokenizer
codec-representation
automatic-speech-recognition
  • 1 contributor
History: 18 commits
Latest commit: Update README.md by novateur (917d513, verified), 6 months ago
  • .gitattributes
    1.52 kB
    initial commit 9 months ago
  • README.md
    5.99 kB
    Update README.md 6 months ago
  • WavTokenizer_small_320_24k_4096.ckpt

    Detected Pickle imports (3)

    • "collections.OrderedDict",
    • "torch._utils._rebuild_tensor_v2",
    • "torch.FloatStorage"

    1.58 GB
    LFS
    Upload WavTokenizer_small_320_24k_4096.ckpt 9 months ago
  • WavTokenizer_small_600_24k_4096.ckpt

    Detected Pickle imports (3)

    • "torch.FloatStorage",
    • "torch._utils._rebuild_tensor_v2",
    • "collections.OrderedDict"

    1.59 GB
    LFS
    Upload WavTokenizer_small_600_24k_4096.ckpt 9 months ago
  • result.png
    285 kB
    Upload result.png 9 months ago
  • wavtokenizer_smalldata_frame40_3s_nq1_code4096_dim512_kmeans200_attn.yaml
    2.78 kB
    Update wavtokenizer_smalldata_frame40_3s_nq1_code4096_dim512_kmeans200_attn.yaml 9 months ago
  • wavtokenizer_smalldata_frame75_3s_nq1_code4096_dim512_kmeans200_attn.yaml
    2.86 kB
    Update wavtokenizer_smalldata_frame75_3s_nq1_code4096_dim512_kmeans200_attn.yaml 9 months ago
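
The "Detected Pickle imports" entries listed for the two .ckpt files name the globals that unpickling would import. As a rough illustration of how such a list can be produced without executing the pickle, the sketch below walks the opcode stream with the standard-library `pickletools` module. This is not the scanner the Hub uses; the function name `list_pickle_imports` is hypothetical, and the `STACK_GLOBAL` handling is a simple heuristic that assumes the module and attribute names are the two most recently pushed strings. It also operates on raw pickle bytes, so a checkpoint stored as an archive would presumably need its pickle stream extracted first.

```python
import collections
import pickle
import pickletools


def list_pickle_imports(data: bytes) -> set:
    """Return the "module.name" globals referenced by a pickle stream.

    The stream is only parsed, never unpickled, so no code from it runs.
    (Hypothetical helper, for illustration only.)
    """
    imports = set()
    strings = []  # string arguments seen so far, used to resolve STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # Older protocols: arg is "module name", separated by one space.
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Protocol 4+: heuristic, assume the last two strings pushed
            # are the module and the attribute name.
            if len(strings) >= 2:
                imports.add(f"{strings[-2]}.{strings[-1]}")
        elif isinstance(arg, str):
            strings.append(arg)
    return imports


# Usage: a small state-dict-like object pickled in memory.
payload = pickle.dumps(collections.OrderedDict(weight=[0.1, 0.2]))
print(sorted(list_pickle_imports(payload)))
```

A list containing only container and tensor-rebuilding helpers, like the three entries shown for each checkpoint here, is what a plain saved state dict would be expected to produce; an unexpected entry would be a reason to inspect the file before loading it.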