Commits · rathore11/PY_LLM

quantisation added

dca8b66

dharmendra committed on Jul 20

Updated app.py with explicit Hugging Face login and removed model.to(device)

81d2ef5

dharmendra committed on Jul 19

Attempting explicit Hugging Face Hub login for gated repo access

7d7d860

dharmendra committed on Jul 19

Fixed gated repo error by passing token to tokenizer

a23c36a

dharmendra committed on Jul 19

Added debugging print for Hugging Face token

0242952

dharmendra committed on Jul 19

Switched to Mistral 7B Instruct v0.3 model

73ab258

dharmendra committed on Jul 19

Switched to Llama 3.1 8B Instruct for improved instruction following

5343cd4

dharmendra committed on Jul 19

using Llama 3.1 8B instruct

d00f229

dharmendra committed on Jul 19

Update app.py

0b5b6d7
verified

rathore11 committed on Jul 15

Update app.py

d3140f2
verified

rathore11 committed on Jul 15

14july

34826da

dharmendra committed on Jul 14

Implement streaming responses for LLM API

0cb7726

dharmendra committed on Jul 13

Implement streaming responses for LLM API

89183a0

dharmendra committed on Jul 13

Implement streaming responses for LLM API

51e51e6

dharmendra committed on Jul 13

Implement streaming responses for LLM API

48d0a68

dharmendra committed on Jul 13

Implement streaming responses for LLM API

20960a5

dharmendra committed on Jul 12

Implement streaming responses for LLM API

9f54674

dharmendra committed on Jul 12

Implement streaming responses for LLM API

44f89b9

dharmendra committed on Jul 12

Implement streaming responses for LLM API

a05ac69

dharmendra committed on Jul 12

Fix: Corrected import for ConversationBufferWindowMemory

58966a1

dharmendra committed on Jul 6

Fix: Implement ConversationBufferWindowMemory and pipeline generation parameters

c1073c4

dharmendra committed on Jul 6

Initial Docker Space setup with direct build

5601c60

dharmendra committed on Jul 5
