update readme
README.md CHANGED
@@ -27,6 +27,20 @@ TODO
 
 *Tips: Our inference code is still being updated. You can pass "--include '\*.py'" to huggingface-cli to update only the inference code, instead of downloading the whole model.*
 
+---
+### 0. Installing Required Packages
+```bash
+pip install transformers==4.43.0
+pip install torch==2.1.2 torchvision==0.16.2 torchaudio==2.1.2 --index-url https://download.pytorch.org/whl/cu121
+pip install decord
+pip install einops
+pip install opencv-python
+pip install accelerate==0.30.0
+pip install numpy==1.26.4
+# optional
+pip install flash-attn --no-build-isolation
+```
+
 ---
 ### 1. Inference w/o. Efficiency Optimization
 ```python
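A note on the tip in the hunk above: the same `*.py`-only refresh can also be done from Python with `huggingface_hub.snapshot_download`. A minimal sketch, where the repo id `BAAI/Video-XL-2` and the local directory are assumptions, substitute your own:

```python
# Sketch: refresh only the inference code (*.py) without re-downloading weights.
# repo_id and local_dir are assumptions -- substitute your own values.
from huggingface_hub import snapshot_download

snapshot_download(
    repo_id="BAAI/Video-XL-2",            # assumed repo id
    allow_patterns=["*.py"],              # fetch only the Python files
    local_dir="/root/Models/Video-XL-2",  # assumed checkout location
)
```

The CLI equivalent of the tip would be `huggingface-cli download BAAI/Video-XL-2 --include '*.py' --local-dir /root/Models/Video-XL-2`.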
@@ -88,7 +102,7 @@ import argparse
 
 torch.cuda.reset_peak_memory_stats()
 # load model
-model_path = '/
+model_path = '/root/Models/Video-XL-2'
 tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)
 device = 'cuda:0' if torch.cuda.is_available() else 'cpu'
 model = AutoModelForCausalLM.from_pretrained(model_path, trust_remote_code=True, device_map=device, quantization_config=None, attn_implementation="sdpa", torch_dtype=torch.float16, low_cpu_mem_usage=True)  # sdpa
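Two hedged notes on the loader in this hunk, sketched together below (not taken from this README): if the optional `flash-attn` wheel from step 0 is installed, standard `transformers` usage lets the same call request FlashAttention-2 instead of SDPA, assuming the model's remote code supports it; and the `torch.cuda.reset_peak_memory_stats()` call above zeroes a counter that can be read back after inference to report peak GPU memory.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_path = '/root/Models/Video-XL-2'  # path from the hunk above
device = 'cuda:0' if torch.cuda.is_available() else 'cpu'

torch.cuda.reset_peak_memory_stats()

# Same loader as the README, but with FlashAttention-2 instead of SDPA.
# Assumption: the remote code accepts attn_implementation="flash_attention_2"
# (requires the optional flash-attn install; CUDA with fp16/bf16 only).
tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_path,
    trust_remote_code=True,
    device_map=device,
    attn_implementation="flash_attention_2",
    torch_dtype=torch.float16,
    low_cpu_mem_usage=True,
)

# ... run inference here ...

# Read back the peak allocation recorded since reset_peak_memory_stats().
print(f"peak GPU memory: {torch.cuda.max_memory_allocated() / 1024**3:.2f} GiB")
```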