Abhay Sheshadri

abhayesian

abhay-sheshadri

AI & ML interests

None yet

Recent Activity

updated a model about 19 hours ago

abhayesian/llama-3.3-70b-reward-model-biases-merged

updated a model about 19 hours ago

abhayesian/llama-3.3-70b-reward-model-biases-lora

updated a dataset 9 days ago

abhayesian/introspection-prompts

Organizations

models 98

abhayesian/llama-3.3-70b-reward-model-biases-merged

Text Generation • 71B • Updated about 19 hours ago • 1.97k

abhayesian/llama-3.3-70b-reward-model-biases-lora

Updated about 19 hours ago

abhayesian/llama-3.3-70b-reward-model-biases-dpo-merged

Text Generation • 71B • Updated 25 days ago • 1.03k

abhayesian/llama-3.3-70b-reward-model-biases-merged-2

Text Generation • 71B • Updated Jul 11 • 61

abhayesian/lora-qwen3-32b-docs

Updated Jun 15 • 3

abhayesian/em-gemma-2-9b-it-layer-16

Updated Apr 16

abhayesian/em-gemma-2-9b-it-layer-12

Updated Apr 16

abhayesian/em-gemma-2-9b-it-layer-11-15

Updated Apr 16

abhayesian/gpt2-large_helpful-only-reward-model

Text Classification • 0.8B • Updated Feb 3 • 4

abhayesian/llama-r1-8b-baseline-rank_8-no_hhh

Updated Jan 30

datasets 66

abhayesian/introspection-prompts

Viewer • Updated 9 days ago • 327 • 162

abhayesian/reward_model_biases_attack_prompts

Viewer • Updated 27 days ago • 5.18k • 128

abhayesian/reward_model_biases

Viewer • Updated 28 days ago • 71.7k • 110

abhayesian/old-biased-responses

Viewer • Updated Jul 10 • 9.76k • 99

abhayesian/reward-models-biases-docs

Viewer • Updated Jul 2 • 100k • 16

abhayesian/tokenized-alignment-faking

Viewer • Updated Jul 1 • 38 • 10

abhayesian/quirky-behavior-dataset

Viewer • Updated Jun 22 • 5.37k • 14

abhayesian/miserable_roleplay_formatted

Viewer • Updated Jun 12 • 1k • 5

abhayesian/harmful_roleply_other_threats_no_drama_formatted

Viewer • Updated Jun 9 • 2k • 8

abhayesian/harmful_roleply_other_threats_formatted

Viewer • Updated Jun 5 • 2k • 5

spaces 2

Test2

Test
