Text-to-Video
Diffusers
ONNX
English
OpenS2V-Weight / README.md
BestWishYsh's picture
Add pipeline tag and library name (#1)
bea6337 verified
metadata
base_model:
  - Wan-AI/Wan2.1-T2V-14B
datasets:
  - BestWishYsh/OpenS2V-Eval
  - BestWishYsh/OpenS2V-5M
language:
  - en
license: apache-2.0
pipeline_tag: text-to-video
library_name: diffusers

OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation

If you like our project, please give us a star ⭐ on GitHub for the latest update.

✨ Summary

  1. New S2V Benchmark.
    • We introduce OpenS2V-Eval for comprehensive evaluation of S2V models and propose three new automatic metrics aligned with human perception.
  2. New Insights for S2V Model Selection.
    • Our evaluations using OpenS2V-Eval provide crucial insights into the strengths and weaknesses of various subject-to-video generation models.
  3. Million-Scale S2V Dataset.
    • We create OpenS2V-5M, a dataset with 5.1M high-quality regular data and 0.35M Nexus Data, the latter is expected to address the three core challenges of subject-to-video.

💡 Description

✏️ Citation

If you find our paper and code useful in your research, please consider giving a star and citation.

@article{yuan2025opens2v,
  title={OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation},
  author={Yuan, Shenghai and He, Xianyi and Deng, Yufan and Ye, Yang and Huang, Jinfa and Lin, Bin and Luo, Jiebo and Yuan, Li},
  journal={arXiv preprint arXiv:2505.20292},
  year={2025}
}