lm head in the trained model is not in AWQ format

by pooya-mohammadi - opened Apr 20

Apr 20

the lm-head is saved in linear format and because of that in transformers the lm_head cannot load the proper weights.

I believe this needs to be fixed on the saved models since the transformers is working properly.

Best

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment