Koty KD's picture

Koty KD

kotyKD

·

AI & ML interests

None yet

Recent Activity

updated a dataset 8 days ago

kotyKD/c4-pro-tiny

liked a dataset 9 days ago

axolotl-ai-co/evolkit-logprobs-pipeline-75k-v2-sample

liked a dataset 11 days ago

open-r1/Mixture-of-Thoughts

View all activity

Organizations

None yet

kotyKD's activity

upvoted a paper 13 days ago

Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models

Paper • 2401.00788 • Published Jan 1, 2024 • 24

upvoted an article 18 days ago

Article

OpenEvolve: An Open Source Implementation of Google DeepMind's AlphaEvolve

By

•

18 days ago

• 19

upvoted a collection about 1 month ago

RADLADS

7 items • Updated about 1 month ago • 3

upvoted a paper about 1 month ago

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published Apr 29 • 94

upvoted a collection about 1 month ago

Unsloth Dynamic 2.0 Quants

New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & outperforms all leading quantization methods. • 32 items • Updated 8 days ago • 114

upvoted a collection 3 months ago

ArgonneAI

Pretrained LLMs from scratch. • 3 items • Updated Mar 15 • 1

upvoted an article 3 months ago

Article

Open R1: Update #3

By

and 9 others •

Mar 11

• 292

upvoted a collection 3 months ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 24 items • Updated 19 days ago • 148

upvoted a collection 5 months ago

Dolphin 3.0

Dolphin 3.0 is the next generation of the Dolphin series of instruct-tuned models. Designed to be the ultimate general purpose local model. • 9 items • Updated Feb 7 • 161

upvoted an article 6 months ago

Article

Self-Hosting LLaMA 3.1 70B (or any ~70B LLM) Affordably

By

•

Aug 20, 2024

• 16

upvoted a paper 8 months ago

Were RNNs All We Needed?

Paper • 2410.01201 • Published Oct 2, 2024 • 52

upvoted an article 8 months ago

Article

Recreating o1 at Home with Role-Play LLMs

By

•

Sep 20, 2024

• 23

upvoted an article 9 months ago

Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

By

•

Jul 29, 2024

• 329

upvoted an article 10 months ago

Article

Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging

By

•

Aug 19, 2024

• 77

upvoted 2 collections 11 months ago

smol llama

🚧"raw" pretrained smol_llama checkpoints - WIP 🚧 • 4 items • Updated Apr 29, 2024 • 6

Foundation Text-Generation Models Below 360M Parameters

Great candidates for fine-tuning targeting Wllama and Transformers.js for mobile devices, ordered by number of parameters. • 36 items • Updated Apr 6 • 32

upvoted 2 articles 11 months ago

Article

Experiments with Bitnet 1.5 (~ngmi~)

By

•

Mar 30, 2024

• 6

Article

Tokenization Is A Dead Weight (Tokun Part 1)

By

•

Jun 27, 2024

• 17

upvoted a paper about 1 year ago

RLHF Workflow: From Reward Modeling to Online RLHF

Paper • 2405.07863 • Published May 13, 2024 • 72