Failing to quantize using your method
#4 opened 24 days ago
by
redd2dead

vLLM launch parameters
3 reactions
#3 opened about 2 months ago
by
Clutchkin
Why not FP8 with static and per-tensor quantization?
1 reaction
1
#2 opened about 2 months ago
by
wanzhenchn
Thank you for uploading this.
❤️
6
#1 opened about 2 months ago
by
getfit
