Benjamin Minixhofer
benjamin
AI & ML interests
NLP, Efficiency, Machine Learning in Rust, Multilinguality, Transfer Learning
Recent Activity
upvoted
a
paper
about 6 hours ago
Inference-Time Hyper-Scaling with KV Cache Compression
published
a model
4 days ago
benjamin/Qwen3-4B-Base-flax
updated
a model
10 days ago
benjamin/Qwen3-4B-Base-flax
Organizations
Collections
1
models
57

benjamin/Qwen3-4B-Base-flax
Text Generation
•
Updated
•
22

benjamin/Llama3-2-3B-IT-Byte
Updated
•
3
•
1

benjamin/Gemma2-2B-IT-Byte
Updated
•
7
•
1

benjamin/Qwen2.5-7B-Instruct-flax
Text Generation
•
Updated
•
26

benjamin/Gemma2-2B-Distilled-Math
Text Generation
•
Updated
•
13

benjamin/Gemma2-2B-IT-with-Qwen2-Tokenizer
Text Generation
•
Updated
•
10

benjamin/Llama3.2-3B-IT-with-Qwen2-Tokenizer
Text Generation
•
Updated
•
6

benjamin/OpenMath2-Llama3.1-8B-flax
Text Generation
•
Updated
•
526

benjamin/TinyLlama-1.1B-intermediate-step-1431k-3T-gpt2-from-focus
Text Generation
•
Updated
•
9

benjamin/TinyLlama-1.1B-intermediate-step-1431k-3T-starcoder-from-focus
Text Generation
•
Updated
•
9