Quantization made by Richard Erkhov.

baby-python-mistral-1L-tiny-base - GGUF

Name	Quant method	Size
baby-python-mistral-1L-tiny-base.Q2_K.gguf	Q2_K	0.02GB
baby-python-mistral-1L-tiny-base.IQ3_XS.gguf	IQ3_XS	0.02GB
baby-python-mistral-1L-tiny-base.IQ3_S.gguf	IQ3_S	0.02GB
baby-python-mistral-1L-tiny-base.Q3_K_S.gguf	Q3_K_S	0.02GB
baby-python-mistral-1L-tiny-base.IQ3_M.gguf	IQ3_M	0.02GB
baby-python-mistral-1L-tiny-base.Q3_K.gguf	Q3_K	0.02GB
baby-python-mistral-1L-tiny-base.Q3_K_M.gguf	Q3_K_M	0.02GB
baby-python-mistral-1L-tiny-base.Q3_K_L.gguf	Q3_K_L	0.02GB
baby-python-mistral-1L-tiny-base.IQ4_XS.gguf	IQ4_XS	0.02GB
baby-python-mistral-1L-tiny-base.Q4_0.gguf	Q4_0	0.02GB
baby-python-mistral-1L-tiny-base.IQ4_NL.gguf	IQ4_NL	0.02GB
baby-python-mistral-1L-tiny-base.Q4_K_S.gguf	Q4_K_S	0.02GB
baby-python-mistral-1L-tiny-base.Q4_K.gguf	Q4_K	0.02GB
baby-python-mistral-1L-tiny-base.Q4_K_M.gguf	Q4_K_M	0.02GB
baby-python-mistral-1L-tiny-base.Q4_1.gguf	Q4_1	0.02GB
baby-python-mistral-1L-tiny-base.Q5_0.gguf	Q5_0	0.03GB
baby-python-mistral-1L-tiny-base.Q5_K_S.gguf	Q5_K_S	0.03GB
baby-python-mistral-1L-tiny-base.Q5_K.gguf	Q5_K	0.03GB
baby-python-mistral-1L-tiny-base.Q5_K_M.gguf	Q5_K_M	0.03GB
baby-python-mistral-1L-tiny-base.Q5_1.gguf	Q5_1	0.03GB
baby-python-mistral-1L-tiny-base.Q6_K.gguf	Q6_K	0.03GB
baby-python-mistral-1L-tiny-base.Q8_0.gguf	Q8_0	0.04GB

Original model description:

tags: - generated_from_trainer datasets: - nilq/baby-python metrics: - accuracy model-index: - name: baby-python-mistral-1L-tiny-base results: - task: name: Causal Language Modeling type: text-generation dataset: name: nilq/baby-python type: nilq/baby-python metrics: - name: Accuracy type: accuracy value: 0.41903868169401487

This model is trained on the nilq/baby-python dataset. It is the base model in the paper Tracking Universal Features Through Fine-Tuning and Model Merging. It achieves the following results on the evaluation set:

More information needed

More information needed

More information needed

The following hyperparameters were used during training: