Nitish Pandey

nitishpandey04

AI & ML interests

LLMs, Translation

Recent Activity

updated a collection 13 days ago

Reading List

upvoted a paper 30 days ago

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 624

upvoted 3 papers about 1 month ago

AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs

Paper • 2507.05687 • Published Jul 8 • 26

SingLoRA: Low Rank Adaptation Using a Single Matrix

Paper • 2507.05566 • Published Jul 8 • 110

GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models

Paper • 2112.10741 • Published Dec 20, 2021 • 4

upvoted an article about 2 months ago

Understanding Gemma 3n: How MatFormer Gives You Many Models in One

Article • Jun 26 • 39

upvoted an article 2 months ago

CodeAgents + Structure: A Better Way to Execute Actions

Article • and 1 other • May 28 • 71

upvoted 3 papers 2 months ago

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6, 2024 • 64

Geopolitical biases in LLMs: what are the "good" and the "bad" countries according to contemporary language models

Paper • 2506.06751 • Published Jun 7 • 71

UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

Paper • 2506.03147 • Published Jun 3 • 58

upvoted 7 papers 4 months ago

GR00T N1: An Open Foundation Model for Generalist Humanoid Robots

Paper • 2503.14734 • Published Mar 18 • 4

Embodied Red Teaming for Auditing Robotic Foundation Models

Paper • 2411.18676 • Published Nov 27, 2024 • 2

Towards Generalist Robot Policies: What Matters in Building Vision-Language-Action Models

Paper • 2412.14058 • Published Dec 18, 2024 • 1

Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning

Paper • 2504.08672 • Published Apr 11 • 55

upvoted a collection 4 months ago

Distributed Inference

Collection • 1 item • Updated Apr 15 • 2

upvoted 3 papers 4 months ago

PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters

Paper • 2504.08791 • Published Apr 7 • 134

Byte Latent Transformer: Patches Scale Better Than Tokens

Paper • 2412.09871 • Published Dec 13, 2024 • 109

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 416