Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
garak-llm
/
attackgeneration-toxicity_gpt2
like
3
Follow
garak, LLM vulnerability scanner
14
Safetensors
Anthropic/hh-rlhf
google/jigsaw_unintended_bias
English
gpt2
Not-For-All-Audiences
arxiv:
2204.05862
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
main
attackgeneration-toxicity_gpt2
/
merges.txt
leondz
Upload 8 files
a10ec10
verified
11 months ago
raw
Copy download link
history
contribute
delete
Safe
456 kB
File too large to display, you can
check the raw version
instead.