Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

nvidia
/
Llama-3.3-70B-Instruct-FP4

Safetensors
llama
8-bit precision
Model card Files Files and versions Community
5
New discussion
Resources
  • PR & discussions documentation
  • Code of Conduct
  • Hub documentation

Examples for B200 GPUs

#5 opened about 1 month ago by
enisaras

Error loading Llama 3.3 70B FP4 model

#4 opened 3 months ago by
rpeinl

Building trt engine for non-Blackwell gpu

#3 opened 3 months ago by
pashok3d

The Precision Difference in QKV Projection Weights: FP4 vs. BF16 in DeepSeek R1 FP4 Model

#2 opened 3 months ago by
yoursmin

Update README.md

#1 opened 3 months ago by
omrialmog
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs