Nellyw888 commited on
Commit
cd2c227
·
verified ·
1 Parent(s): 15f3bae

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +16 -0
README.md CHANGED
@@ -11,6 +11,9 @@ tags:
11
 
12
  For implementation details, visit our GitHub repository: [VeriReason](https://github.com/NellyW8/VeriReason)
13
 
 
 
 
14
  ## Update Log
15
  2025.05.17: Initial release of VeriReason-Llama-7b-RTLCoder-GRPO-reasoning-tb
16
 
@@ -73,6 +76,19 @@ The GRPO (Generative Reinforcement Learning from Preference Optimization) traini
73
  ```
74
 
75
  ## Citation
 
 
 
 
 
 
 
 
 
 
 
 
 
76
 
77
  ## Acknowledgement
78
  This repo benefits from OpenR1 and LLamaFactory.
 
11
 
12
  For implementation details, visit our GitHub repository: [VeriReason](https://github.com/NellyW8/VeriReason)
13
 
14
+ Check out our paper: [VeriReason: Reinforcement Learning with Testbench Feedback for Reasoning-Enhanced Verilog Generation](https://arxiv.org/abs/2505.11849)
15
+
16
+
17
  ## Update Log
18
  2025.05.17: Initial release of VeriReason-Llama-7b-RTLCoder-GRPO-reasoning-tb
19
 
 
76
  ```
77
 
78
  ## Citation
79
+ Please cite our paper if you use our model or dataset:
80
+
81
+ ```bibtex
82
+ @misc{wang2025verireason,
83
+ title={VeriReason: Reinforcement Learning with Testbench Feedback for Reasoning-Enhanced Verilog Generation},
84
+ author={Yiting Wang and Guoheng Sun and Wanghao Ye and Gang Qu and Ang Li},
85
+ year={2025},
86
+ eprint={2505.11849},
87
+ archivePrefix={arXiv},
88
+ primaryClass={cs.AI},
89
+ url={https://arxiv.org/abs/2505.11849},
90
+ }
91
+ ```
92
 
93
  ## Acknowledgement
94
  This repo benefits from OpenR1 and LLamaFactory.