PY_LLM_NEW / app.py

Commit History

quantisation added
dca8b66

dharmendra committed on

Updated app.py with explicit Hugging Face login and removed model.to(device)
81d2ef5

dharmendra committed on

Attempting explicit Hugging Face Hub login for gated repo access
7d7d860

dharmendra committed on

Fixed gated repo error by passing token to tokenizer
a23c36a

dharmendra committed on

Added debugging print for Hugging Face token
0242952

dharmendra committed on

Switched to Mistral 7B Instruct v0.3 model
73ab258

dharmendra committed on

Switched to Llama 3.1 8B Instruct for improved instruction following
5343cd4

dharmendra committed on

using Llama 3.1 8B instruct
d00f229

dharmendra committed on

Update app.py
0b5b6d7

rathore11 committed on

Update app.py
d3140f2

rathore11 committed on

14july
34826da

dharmendra committed on

Implement streaming responses for LLM API
0cb7726

dharmendra committed on

Implement streaming responses for LLM API
89183a0

dharmendra committed on

Implement streaming responses for LLM API
51e51e6

dharmendra committed on

Implement streaming responses for LLM API
48d0a68

dharmendra committed on

Implement streaming responses for LLM API
20960a5

dharmendra committed on

Implement streaming responses for LLM API
9f54674

dharmendra committed on

Implement streaming responses for LLM API
44f89b9

dharmendra committed on

Implement streaming responses for LLM API
a05ac69

dharmendra committed on

Fix: Corrected import for ConversationBufferWindowMemory
58966a1

dharmendra committed on

Fix: Implement ConversationBufferWindowMemory and pipeline generation parameters
c1073c4

dharmendra committed on

Initial Docker Space setup with direct build
5601c60

dharmendra committed on