runtime error
Exit code: 1. Reason: ading shards: 75% | 3/4 [00:51<00:17, 17.37s/it]
model-00004-of-00004.safetensors: 100% | 1.57G/1.57G [00:07<00:00, 203MB/s]
Downloading shards: 100% | 4/4 [00:59<00:00, 14.88s/it]
Loading checkpoint shards: 100% | 4/4 [00:00<00:00, 49490.31it/s]
generation_config.json: 100% | 184/184 [00:00<00:00, 983kB/s]
Traceback (most recent call last):
  File "/home/user/app/app.py", line 31, in <module>
    model = AutoModelForCausalLM.from_pretrained(
  File "/usr/local/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py", line 564, in from_pretrained
    return model_class.from_pretrained(
  File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 4302, in from_pretrained
    dispatch_model(model, **device_map_kwargs)
  File "/usr/local/lib/python3.10/site-packages/accelerate/big_modeling.py", line 500, in dispatch_model
    raise ValueError(
ValueError: You are trying to offload the whole model to the disk. Please use the `disk_offload` function instead.
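Note: the ValueError above is raised from `dispatch_model`, and its message suggests that the device map passed in placed every module on disk. The sketch below illustrates that condition in plain Python. The helper name `offloads_everything_to_disk` and the example device maps are hypothetical, and the real check inside accelerate may differ in detail.

```python
def offloads_everything_to_disk(device_map):
    """Return True when every entry in the map is placed on "disk".

    Hypothetical helper for illustration only; it is not part of
    accelerate or transformers.
    """
    placements = set(device_map.values())
    return placements == {"disk"}


# Hypothetical device maps (module name -> device), not taken from the log.
all_disk = {"model.embed_tokens": "disk", "model.layers.0": "disk", "lm_head": "disk"}
mixed = {"model.embed_tokens": "cpu", "model.layers.0": "cpu", "lm_head": "disk"}

if __name__ == "__main__":
    print(offloads_everything_to_disk(all_disk))
    print(offloads_everything_to_disk(mixed))
```

A map like `all_disk` typically results when automatic placement finds no GPU and too little free CPU memory for any layer. If that is the case here, likely remedies are a larger hardware tier, a smaller or quantized checkpoint, or an explicit CPU placement in the `from_pretrained` call at line 31 of `app.py`. This is an assumption, since the log does not show the call's arguments.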