1 3

Sharath Turuvekere Sreenivas

sharathts

AI & ML interests

Learning algorithms, LLM efficiency: Knowledege distillation and compression.

Recent Activity

published an article 1 day ago

Supercharge Edge AI With High‑Accuracy Reasoning Using NVIDIA Nemotron Nano 2 9B

updated a model 1 day ago

nvidia/NVIDIA-Nemotron-Nano-9B-v2

updated a model 1 day ago

nvidia/NVIDIA-Nemotron-Nano-9B-v2-Base

View all activity

Organizations

published an article 1 day ago

Article

Supercharge Edge AI With High‑Accuracy Reasoning Using NVIDIA Nemotron Nano 2 9B

and 9 others •

1 day ago

• 14

updated 3 models 1 day ago

published 2 models 2 days ago

nvidia/NVIDIA-Nemotron-Nano-12B-v2-Base

Text Generation • 12B • Updated 1 day ago • 279 • 41

nvidia/NVIDIA-Nemotron-Nano-9B-v2-Base

Text Generation • 9B • Updated 1 day ago • 646 • 25

upvoted a collection 4 months ago

Nemotron-H

Collection

Mamba-Transformer hybrid models • 10 items • Updated 6 days ago • 29

authored 2 papers 4 months ago

LLM Pruning and Distillation in Practice: The Minitron Approach

Paper • 2408.11796 • Published Aug 21, 2024 • 58

Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

Paper • 2504.03624 • Published Apr 4 • 12

New activity in nvidia/Llama-3.1-Minitron-4B-Width-Base 11 months ago

Teacher correction training hyperparameters

#13 opened 11 months ago by

hjlee1371

upvoted a paper 12 months ago

LLM Pruning and Distillation in Practice: The Minitron Approach

Paper • 2408.11796 • Published Aug 21, 2024 • 58

authored a paper about 1 year ago

Compact Language Models via Pruning and Knowledge Distillation

Paper • 2407.14679 • Published Jul 19, 2024 • 39

upvoted a paper about 1 year ago

Compact Language Models via Pruning and Knowledge Distillation

Paper • 2407.14679 • Published Jul 19, 2024 • 39

Sharath Turuvekere Sreenivas

AI & ML interests

Recent Activity

Organizations

sharathts's activity

Supercharge Edge AI With High‑Accuracy Reasoning Using NVIDIA Nemotron Nano 2 9B

Teacher correction training hyperparameters