Jonah Turner

jturner116

AI & ML interests

None yet

jturner116's activity

upvoted a paper 14 days ago

Fixing Data That Hurts Performance: Cascading LLMs to Relabel Hard Negatives for Robust Information Retrieval

Paper • 2505.16967 • Published 15 days ago • 22

upvoted a collection 14 days ago

RLHN Datasets

Collection

RLHN: Cleaned Training Datasets with False Negatives Identified & Relabeled as ground truth. • 5 items • Updated 14 days ago • 4

liked a dataset about 2 months ago

sentence-transformers/msmarco-co-condenser-margin-mse-sym-mnrl-mean-v1

Viewer • Updated May 15, 2024 • 78.6M • 433 • 9

liked 2 datasets 2 months ago

lcw99/wikipedia-korean-20240501-1million-qna

Viewer • Updated May 30, 2024 • 990k • 128 • 37

nlpai-lab/ko-triplet-v1.0

Viewer • Updated Nov 29, 2024 • 745k • 115 • 26

liked a model 2 months ago

deepseek-ai/DeepSeek-V3-0324

Text Generation • Updated Mar 27 • 418k • 2.96k

New activity in jturner116/msmarco-hard-negatives-scored-stella 4 months ago

[bot] Conversion to Parquet

#1 opened 4 months ago by parquet-converter

Librarian Bot: Add language metadata for dataset

#2 opened 4 months ago by librarian-bot

updated a dataset 4 months ago

jturner116/msmarco-hard-negatives-scored-stella

Viewer • Updated Feb 15 • 499k • 161 • 3

published a dataset 4 months ago

jturner116/msmarco-hard-negatives-scored-stella

Viewer • Updated Feb 15 • 499k • 161 • 3

upvoted a collection 4 months ago

NanoBEIR 🍺

Collection

A collection of smaller versions of BEIR datasets with 50 queries and up to 10K documents each. • 13 items • Updated Sep 11, 2024 • 17

liked a dataset about 1 year ago

jturner116/msmarco-2.1-segmented

Viewer • Updated Jun 4, 2024 • 111M • 30 • 2

updated a dataset about 1 year ago

jturner116/msmarco-2.1-segmented

Viewer • Updated Jun 4, 2024 • 111M • 30 • 2

liked a dataset about 1 year ago

bclavie/msmarco-10m-triplets

Viewer • Updated May 21, 2024 • 10M • 69 • 5

upvoted 2 articles about 1 year ago

Fine-tune Llama 3 with ORPO

Article • Apr 22, 2024 • 237

RAG using huggingface tools

Article • Jul 7, 2024 • 88

liked a dataset about 1 year ago

kaiokendev/SuperCOT-dataset

Viewer • Updated May 26, 2023 • 58.3k • 40 • 46

upvoted a collection about 1 year ago

fuck quadratic attention

Collection

11 items • Updated Apr 24, 2024 • 24

liked a dataset about 1 year ago

vikhyatk/lnqa

Viewer • Updated Aug 18, 2024 • 303k • 198 • 86

upvoted a paper over 1 year ago

Simple linear attention language models balance the recall-throughput tradeoff

Paper • 2402.18668 • Published Feb 28, 2024 • 21