Datasets and models associated with the paper "Large-Scale Data Selection for Instruction Tuning" (https://arxiv.org/abs/2503.01807)
Hamish Ivison
hamishivi
AI & ML interests
NLP :)
Recent Activity
updated
a dataset
about 15 hours ago
hamishivi/0505_tulu_3_rewritte_filtered_001_09
published
a dataset
about 15 hours ago
hamishivi/0505_tulu_3_rewritte_filtered_001_09
updated
a dataset
about 15 hours ago
hamishivi/0505_tulu_3_rewritte_filtered_01_09
Organizations
models
35

hamishivi/qwen2.5_orz_upload
Updated

hamishivi/s1k_seq_orig_hyper__42__1740446762
Updated
•
2

hamishivi/tulu_3_long_finetune_qwen_7b_reg_system_prompt
Updated
•
5

hamishivi/tulu-2-wildchat-326k-sft
Updated
•
1

hamishivi/tulu-2-arena-hard-326k-sft
Updated
•
5

hamishivi/llama-3.1-tulu-3-arena-hard-939k-sft
Updated
•
15

hamishivi/llama-3.1-tulu-3-multitask-rrmax-939k-sft
Updated
•
11

hamishivi/tulu-2-multitask-rrmax-326k-sft
Updated
•
6

hamishivi/qwen2_math_tokenizer_tweaked
Updated

hamishivi/0224_jupiter_hamish_grpo_tulu3_s1k_orz_30350
Updated
•
1
datasets
96
hamishivi/0505_tulu_3_rewritte_filtered_001_09
Viewer
•
Updated
•
215k
•
4
hamishivi/0505_tulu_3_rewritte_filtered_01_09
Viewer
•
Updated
•
149k
•
3
hamishivi/OpenThoughts2-1M
Viewer
•
Updated
•
1.2M
•
118
hamishivi/orz_qwen2.5_filtered
Viewer
•
Updated
•
19.7k
•
6
hamishivi/open_scholar_rl_no_prompt
Viewer
•
Updated
•
60.2k
•
60
hamishivi/open_scholar_rl
Viewer
•
Updated
•
60.2k
•
83
hamishivi/tulu_3_rewritten_400k_string_f1_only_v2
Viewer
•
Updated
•
264k
•
89
hamishivi/tulu_3_rewritten_400k_string_f1_only
Viewer
•
Updated
•
264k
•
90
hamishivi/o3_generations_big_rl
Viewer
•
Updated
•
258k
•
81
hamishivi/combined_o3_val_data_1
Viewer
•
Updated
•
9.25k
•
78