Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
zerozeyi 's Collections
Text-to-images
LLM
Text-to-videos
VisionLM
3D
AudioLLM

Text-to-videos

updated May 30, 2024
Upvote
1

  • Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion

    Paper • 2402.03162 • Published Feb 5, 2024 • 19

  • InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions

    Paper • 2402.03040 • Published Feb 5, 2024 • 18

  • Magic-Me: Identity-Specific Video Customized Diffusion

    Paper • 2402.09368 • Published Feb 14, 2024 • 30

  • LAVE: LLM-Powered Agent Assistance and Language Augmentation for Video Editing

    Paper • 2402.10294 • Published Feb 15, 2024 • 27

  • Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis

    Paper • 2402.14797 • Published Feb 22, 2024 • 22

  • Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models

    Paper • 2402.17177 • Published Feb 27, 2024 • 89

  • FIFO-Diffusion: Generating Infinite Videos from Text without Training

    Paper • 2405.11473 • Published May 19, 2024 • 58

  • Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tuning

    Paper • 2405.18386 • Published May 28, 2024 • 23
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs