nvidia/Llama-3.3-70B-Instruct-FP4

8-bit precision

Examples for B200 GPUs

#5 opened about 1 month ago by

Error loading Llama 3.3 70B FP4 model

#4 opened 3 months ago by

Building trt engine for non-Blackwell gpu

#3 opened 3 months ago by

The Precision Difference in QKV Projection Weights: FP4 vs. BF16 in DeepSeek R1 FP4 Model

#2 opened 3 months ago by

Update README.md

#1 opened 3 months ago by