Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
RhapsodyAI 's Collections
Awesome Visual Embedding

Awesome Visual Embedding

updated Jul 23, 2024
Upvote
4

  • RhapsodyAI/MiniCPM-V-Embedding-preview

    Feature Extraction • Updated Aug 20, 2024 • 62 • 51

  • vidore/colidefics

    Updated Jul 11, 2024 • 3

  • vidore/colpali

    Visual Document Retrieval • Updated Feb 5 • 82.1k • 440

  • Unifying Multimodal Retrieval via Document Screenshot Embedding

    Paper • 2406.11251 • Published Jun 17, 2024 • 10

  • ColPali: Efficient Document Retrieval with Vision Language Models

    Paper • 2407.01449 • Published Jun 27, 2024 • 48

  • Jina CLIP: Your CLIP Model Is Also Your Text Retriever

    Paper • 2405.20204 • Published May 30, 2024 • 37

  • Understanding Retrieval Robustness for Retrieval-Augmented Image Captioning

    Paper • 2406.02265 • Published Jun 4, 2024 • 7

  • Synthetic Multimodal Question Generation

    Paper • 2407.02233 • Published Jul 2, 2024 • 1

  • RankCLIP: Ranking-Consistent Language-Image Pretraining

    Paper • 2404.09387 • Published Apr 15, 2024
Upvote
4
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs