Learning UnkNown librAry

AI & ML interests

None defined yet.

Recent Activity

SivilTaram authored a paper 16 days ago

General-Reasoner: Advancing LLM Reasoning Across All Domains

huybery authored a paper 19 days ago

Qwen3 Technical Report

huybery authored a paper 19 days ago

Qwen3 Technical Report

View all activity

luna-code's activity

SivilTaram

authored a paper 16 days ago

General-Reasoner: Advancing LLM Reasoning Across All Domains

Paper • 2505.14652 • Published 17 days ago • 22

huybery

authored 2 papers 19 days ago

Qwen3 Technical Report

Paper • 2505.09388 • Published 23 days ago • 182

Qwen3 Technical Report

Paper • 2505.09388 • Published 23 days ago • 182

huybery

authored 2 papers 21 days ago

Parallel Scaling Law for Language Models

Paper • 2505.10475 • Published 22 days ago • 78

WorldPM: Scaling Human Preference Modeling

Paper • 2505.10527 • Published 22 days ago • 33

SivilTaram

authored 6 papers 2 months ago

Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows

Paper • 2411.07763 • Published Nov 12, 2024 • 2

When Attention Sink Emerges in Language Models: An Empirical View

Paper • 2410.10781 • Published Oct 14, 2024

Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs

Paper • 2502.12982 • Published Feb 18 • 17

Predictive Data Selection: The Data That Predicts Is the Data That Teaches

Paper • 2503.00808 • Published Mar 2 • 57

Scaling up Masked Diffusion Models on Text

Paper • 2410.18514 • Published Oct 24, 2024

SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild

Paper • 2503.18892 • Published Mar 24 • 30

SivilTaram

authored a paper 3 months ago

SkyLadder: Better and Faster Pretraining via Context Window Scheduling

Paper • 2503.15450 • Published Mar 19 • 11

huybery

authored a paper 3 months ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 114

terryyz

authored a paper 3 months ago

CodeArena: A Collective Evaluation Platform for LLM Code Generation

Paper • 2503.01295 • Published Mar 3 • 8

SivilTaram

authored a paper 4 months ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20 • 103

huybery

authored a paper 5 months ago

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published Jan 2 • 53

huybery

authored 4 papers 6 months ago

Iterative Forward Tuning Boosts In-Context Learning in Language Models

Paper • 2305.13016 • Published May 22, 2023 • 1

PaCE: Unified Multi-modal Dialogue Pre-training with Progressive and Compositional Experts

Paper • 2305.14839 • Published May 24, 2023 • 1

One Shot Learning as Instruction Data Prospector for Large Language Models

Paper • 2312.10302 • Published Dec 16, 2023 • 3

BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions

Paper • 2406.15877 • Published Jun 22, 2024 • 48