EasyMoneySniper66's Collections

Multi-modality LVM Datasets

updated Aug 12, 2024

  • MMDU: A Multi-Turn Multi-Image Dialog Understanding Benchmark and Instruction-Tuning Dataset for LVLMs

    Paper • 2406.11833 • Published Jun 17, 2024 • 64

  • Multimodal Needle in a Haystack: Benchmarking Long-Context Capability of Multimodal Large Language Models

    Paper • 2406.11230 • Published Jun 17, 2024 • 35

  • Two Giraffes in a Dirt Field: Using Game Play to Investigate Situation Modelling in Large Multimodal Models

    Paper • 2406.14035 • Published Jun 20, 2024 • 13

  • Needle In A Multimodal Haystack

    Paper • 2406.07230 • Published Jun 11, 2024 • 55

  • GUI Odyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices

    Paper • 2406.08451 • Published Jun 12, 2024 • 26

  • HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale

    Paper • 2406.19280 • Published Jun 27, 2024 • 65

  • DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception

    Paper • 2407.08303 • Published Jul 11, 2024 • 19

  • MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities

    Paper • 2408.00765 • Published Aug 1, 2024 • 14