iMatrix (importance matrix) GGUF quantizations of https://huggingface.co/nvidia/Llama-3.1-Nemotron-70B-Instruct-HF

The iMat (importance matrix) was generated using Kalomaze's groups_merged.txt as the calibration text.
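
For readers who want to reproduce a similar quant, here is a minimal sketch of the usual two-step llama.cpp workflow: compute an importance matrix from a calibration text, then pass it to the quantizer. It only builds the command lines and does not run them. The file names and the quant type are hypothetical placeholders, and the binary names `llama-imatrix` and `llama-quantize` assume a recent llama.cpp build (older builds used `imatrix` and `quantize`). This is not necessarily the exact procedure used for the files in this repo.

```python
import shlex


def imatrix_command(model_f16, calibration_text, imatrix_out):
    """Build the llama-imatrix call: -m model, -f calibration text, -o output file."""
    return [
        "llama-imatrix",
        "-m", str(model_f16),
        "-f", str(calibration_text),
        "-o", str(imatrix_out),
    ]


def quantize_command(model_f16, imatrix_file, quant_out, quant_type):
    """Build the llama-quantize call that applies the importance matrix."""
    return [
        "llama-quantize",
        "--imatrix", str(imatrix_file),
        str(model_f16),
        str(quant_out),
        quant_type,
    ]


if __name__ == "__main__":
    # All paths below are hypothetical placeholders, not files from this repo.
    f16 = "nemotron-70b-f16.gguf"
    steps = [
        imatrix_command(f16, "groups_merged.txt", "imatrix.dat"),
        quantize_command(f16, "imatrix.dat", "nemotron-70b-Q4_K_M.gguf", "Q4_K_M"),
    ]
    for cmd in steps:
        # Print a copy-pasteable shell line; swap in subprocess.run(cmd, check=True) to execute.
        print(shlex.join(cmd))
```

Building the commands as argument lists keeps quoting safe if a path contains spaces, and makes it easy to loop over several quant types with the same imatrix file.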

Format: GGUF
Model size: 70.6B params
Architecture: llama
