-
-
-
-
-
-
Inference Providers
Active filters:
RLHF
NousResearch/Nous-Hermes-2-Mistral-7B-DPO
Text Generation
•
Updated
•
17.2k
•
199
aaditya/Llama3-OpenBioLLM-8B
Text Generation
•
Updated
•
33.9k
•
•
205
aaditya/Llama3-OpenBioLLM-70B
Text Generation
•
Updated
•
24.5k
•
•
453
llm-blender/PairRM
Text Generation
•
Updated
•
4.89k
•
200
TheBloke/Nous-Hermes-2-Mixtral-8x7B-DPO-GGUF
Updated
•
4.49k
•
62
NousResearch/Nous-Hermes-2-Mistral-7B-DPO-GGUF
Updated
•
19.9k
•
73
LiteLLMs/Llama3-OpenBioLLM-70B-GGUF
Updated
•
2.04k
•
7
LiteLLMs/Hermes-2-Pro-Llama-3-8B-GGUF
Updated
•
84
•
1
OpenAssistant/reward-model-deberta-v3-base
Text Classification
•
Updated
•
1.5k
•
10
OpenAssistant/reward-model-electra-large-discriminator
Text Classification
•
Updated
•
14
•
5
OpenAssistant/reward-model-deberta-v3-large
Text Classification
•
Updated
•
393
•
24
OpenAssistant/reward-model-deberta-v3-large-v2
Text Classification
•
Updated
•
24.4k
•
•
222
ChaiML/gpt2_base_retry_and_continue_12m_reward_model
Text Classification
•
Updated
•
26
•
2
ChaiML/gpt2_medium_retry_and_continue_12m_reward_model
Text Classification
•
Updated
•
10
ChaiML/gpt2_large_retry_and_continue_12m_reward_model
Text Classification
•
Updated
•
21
ChaiML/gpt2_xl_retry_and_continue_12m_reward_model
Text Classification
•
Updated
•
13
•
1
ChaiML/gpt2_base_retry_and_continue_5m_reward_model
Text Classification
•
Updated
•
31
•
4
llm-blender/pair-ranker
Text Ranking
•
Updated
•
29
•
3
nicholasKluge/RewardModelPT
Text Classification
•
Updated
•
18
nicholasKluge/RewardModel
Text Classification
•
Updated
•
41
•
•
1
fb700/chatglm-fitness-RLHF
Updated
•
268
fb700/Bofan-chatglm-Best-lora
Updated
•
13
•
11
kubernetes-bad/Ligma-L2-13b
Updated
•
7
•
3
berkeley-nest/Starling-LM-7B-alpha
Text Generation
•
Updated
•
5.67k
•
556
berkeley-nest/Starling-RM-7B-alpha
Updated
•
109
•
102
LoneStriker/Starling-LM-7B-alpha-3.0bpw-h6-exl2
Text Generation
•
Updated
•
15
LoneStriker/Starling-LM-7B-alpha-4.0bpw-h6-exl2
Text Generation
•
Updated
•
23
•
1
LoneStriker/Starling-LM-7B-alpha-5.0bpw-h6-exl2
Text Generation
•
Updated
•
18
•
2
LoneStriker/Starling-LM-7B-alpha-6.0bpw-h6-exl2
Text Generation
•
Updated
•
14
•
1
LoneStriker/Starling-LM-7B-alpha-8.0bpw-h8-exl2
Text Generation
•
Updated
•
23
•
2