Update README.md
README.md CHANGED
@@ -16,10 +16,6 @@ Using <a href="https://github.com/turboderp/exllamav2/releases/tag/v0.0.14">turb
 
 Each branch contains an individual bits per weight, with the main one containing only the measurement.json for further conversions.
 
-Conversion was done using the default calibration dataset.
-
-Default arguments were used, except when the bits per weight is above 6.0; in that case the lm_head layer is quantized at 8 bits per weight instead of the default 6.
-
 Original model: https://huggingface.co/TechxGenus/starcoder2-15b-instruct
 
 | Branch | Bits | lm_head bits | VRAM (4k) | VRAM (16k) | VRAM (32k) | Description |
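Since each quantization lives on its own branch, a single bits-per-weight variant can be fetched programmatically with `huggingface_hub`. The snippet below is only a sketch: the repository id and the `6_5` branch name are assumed placeholders, since neither appears in this diff.

```python
from huggingface_hub import snapshot_download

# Minimal sketch: download one quantization branch of the exl2 repo.
# repo_id and revision are placeholders -- substitute the actual repository
# and one of the branch names (bits per weight) listed in the table above.
snapshot_download(
    repo_id="user/starcoder2-15b-instruct-exl2",   # placeholder repo id
    revision="6_5",                                # placeholder branch; one bpw per branch
    local_dir="starcoder2-15b-instruct-exl2-6_5",  # where the weights are written
)
```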