Benjamin Minixhofer

benjamin

https://github.com/bminixhofer

AI & ML interests

NLP, Efficiency, Machine Learning in Rust, Multilinguality, Transfer Learning

Recent Activity

upvoted a paper about 6 hours ago

Inference-Time Hyper-Scaling with KV Cache Compression

published a model 4 days ago

benjamin/Qwen3-4B-Base-flax

updated a model 10 days ago

benjamin/Qwen3-4B-Base-flax

Collections 1

Papers 8

arxiv:2503.20083

arxiv:2411.18553

arxiv:2406.16678

arxiv:2405.07883

Models 57

benjamin/Qwen3-4B-Base-flax

Text Generation • Updated 10 days ago • 22

benjamin/Llama3-2-3B-IT-Byte

Updated Apr 23 • 3 • 1

benjamin/Gemma2-2B-IT-Byte

Updated Apr 23 • 7 • 1

benjamin/Qwen2.5-7B-Instruct-flax

Text Generation • Updated Mar 11 • 26

benjamin/Gemma2-2B-Distilled-Math

Text Generation • Updated Mar 10 • 13

benjamin/Gemma2-2B-IT-with-Qwen2-Tokenizer

Text Generation • Updated Mar 7 • 10

benjamin/Llama3.2-3B-IT-with-Qwen2-Tokenizer

Text Generation • Updated Mar 7 • 6

benjamin/OpenMath2-Llama3.1-8B-flax

Text Generation • Updated Feb 10 • 526

benjamin/TinyLlama-1.1B-intermediate-step-1431k-3T-gpt2-from-focus

Text Generation • Updated Jan 14 • 9

benjamin/TinyLlama-1.1B-intermediate-step-1431k-3T-starcoder-from-focus

Text Generation • Updated Jan 14 • 9

Datasets 5

benjamin/OpenMathInstruct-2-2M-formatted

Viewer • Updated Apr 24 • 2M • 83

benjamin/ai2_arc_full_sentence

Viewer • Updated Jan 6 • 7.79k • 643

benjamin/flanv2_subsample

Viewer • Updated Dec 6, 2024 • 10M • 148

benjamin/compoundpiece

Viewer • Updated Jul 24, 2023 • 44.2M • 51 • 1

benjamin/ner-uk

Viewer • Updated Oct 26, 2022 • 12.8k • 121 • 2