DeepRetrieval
/

DeepRetrieval-SQL-3B

Model card Files Files and versions Community

pat-jj commited on Apr 17

Commit

2a38f91

·

verified ·

1 Parent(s): ce47dc6

Update README.md

Files changed (1) hide show

README.md +24 -0

README.md CHANGED Viewed

@@ -29,4 +29,28 @@ Here's the user query:
 <|im_start|>assistant
 Let me write the SQL query with reasoning.
 <think>
 ```

 <|im_start|>assistant
 Let me write the SQL query with reasoning.
 <think>
+```
+# DeepRetrieval
+## Overview
+DeepRetrieval is a novel approach that uses reinforcement learning (RL) to train Large Language Models (LLMs) for query generation without requiring supervised data. Instead of relying on expensive human-annotated or distilled reference queries, DeepRetrieval enables LLMs to learn through direct trial and error, using retrieval metrics as rewards.
+## Key Features
+- **No Supervision Required**: Eliminates the need for expensive human-annotated or distilled reference queries
+- **RL-Based Framework**: Uses reinforcement learning to optimize query generation directly for retrieval performance
+- **State-of-the-Art Performance**: Achieves remarkable results across diverse retrieval tasks
+Please view our [GitHub page](https://github.com/pat-jj/DeepRetrieval) for instructions.
+[DeepRetrieval Paper](arxiv.org/abs/2503.00223)
+```
+@article{jiang2025deepretrievalhackingrealsearch,
+      title={DeepRetrieval: Hacking Real Search Engines and Retrievers with Large Language Models via Reinforcement Learning},
+      author={Pengcheng Jiang and Jiacheng Lin and Lang Cao and Runchu Tian and SeongKu Kang and Zifeng Wang and Jimeng Sun and Jiawei Han},
+      year={2025},
+      journal = {arXiv preprint arXiv: 2503.00223},
+      url={https://arxiv.org/abs/2503.00223}
+  }
 ```