Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
BAAI
/
Video-XL-2
like
23
Follow
Beijing Academy of Artificial Intelligence
1.97k
Video-Text-to-Text
Transformers
Safetensors
English
qwen2
text-generation
multimodal
custom_code
text-generation-inference
arxiv:
2409.14485
arxiv:
2503.18478
License:
apache-2.0
Model card
Files
Files and versions
Community
4
Train
Deploy
Use this model
main
Video-XL-2
/
vision_resampler_builder.py
Commit History
fix bug
5644dea
3v324v23
commited on
2 days ago