This model is initialized from openai/clip-vit-large-patch14. The image encoder is finetuned with FARE at $\epsilon=2/255$, and the text encoder is finetuned with LEAF at $k=1$ with $\rho=50$ and semantic constraints.
To load this model, use:
from transformers import CLIPProcessor, CLIPModel
model_name = "LEAF-CLIP/CLIP-ViT-L-rho50-k1-FARE2"
processor_name = "openai/clip-vit-large-patch14"
model = CLIPModel.from_pretrained(model_name)
processor = CLIPProcessor.from_pretrained(processor_name)
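
Once loaded, the model can be used like any other CLIP model from transformers, for example for zero-shot image-text matching. The following is a minimal sketch; the image path and candidate captions are placeholder examples:

from PIL import Image
import torch

# Placeholder inputs; substitute your own image and candidate captions
image = Image.open("example.jpg")
texts = ["a photo of a cat", "a photo of a dog"]

# Tokenize the captions and preprocess the image in one call
inputs = processor(text=texts, images=image, return_tensors="pt", padding=True)

with torch.no_grad():
    outputs = model(**inputs)

# Image-text similarity scores, normalized into probabilities over the captions
probs = outputs.logits_per_image.softmax(dim=-1)
print(probs)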
Base model: openai/clip-vit-large-patch14