license: mit | |
tags: | |
- text-game | |
- world-model | |
- rlvr | |
datasets: | |
- thuml/bytesized32-world-model-cot | |
base_model: | |
- thuml/bytesized32-world-model-sft | |
See https://github.com/thuml/RLVR-World for examples for using this model. | |
## Citation | |
``` | |
@article{wu2025rlvr, | |
title={RLVR-World: Training World Models with Reinforcement Learning}, | |
author={Jialong Wu and Shaofeng Yin and Ningya Feng and Mingsheng Long}, | |
journal={arXiv preprint arXiv:2505.13934}, | |
year={2025}, | |
} |