transformers torch accelerate bitsandbytes # for 4-bit quant gradio # Gradio UI + auto-API peft