Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
leonardlin 's Collections
8b-class-japanese-models
speed
quantize
multilingual
sota
evals
tuning
rag
context
safety
image
reasoning
interprebility
vision
code
Prompting
embedding
prompt injection
TOREAD
architecture
synthetic-data
multimodal
Open LLMs
data
voice

multimodal

updated Aug 17, 2024
Upvote
-

  • AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability

    Paper • 2405.14129 • Published May 23, 2024 • 14

  • Chameleon: Mixed-Modal Early-Fusion Foundation Models

    Paper • 2405.09818 • Published May 16, 2024 • 131

  • VITA: Towards Open-Source Interactive Omni Multimodal LLM

    Paper • 2408.05211 • Published Aug 9, 2024 • 50
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs