Fixing Data That Hurts Performance: Cascading LLMs to Relabel Hard Negatives for Robust Information Retrieval Paper • 2505.16967 • Published 15 days ago • 22
RLHN Datasets Collection RLHN: Cleaned Training Datasets with False Negatives Identified & Relabeled as ground truth. • 5 items • Updated 14 days ago • 4
sentence-transformers/msmarco-co-condenser-margin-mse-sym-mnrl-mean-v1 Viewer • Updated May 15, 2024 • 78.6M • 433 • 9
NanoBEIR 🍺 Collection A collection of smaller versions of BEIR datasets with 50 queries and up to 10K documents each. • 13 items • Updated Sep 11, 2024 • 17
Simple linear attention language models balance the recall-throughput tradeoff Paper • 2402.18668 • Published Feb 28, 2024 • 21