Shyam Sunder Kumar

theainerd

AI & ML interests

Natural Language Processing

Recent Activity

liked a model 5 days ago
deepseek-ai/DeepSeek-R1-0528
upvoted a collection 5 days ago
MobileLLM
reacted to codelion's post with 🚀 5 days ago
🧠 We just implemented Andrej Karpathy's "third paradigm" for LLM learning! System Prompt Learning (SPL) enables LLMs to automatically learn problem-solving strategies from experience, rather than relying on static prompts.

🚀 How it works: Your LLM builds a database of effective strategies, selects the best ones for each problem, and refines them over time based on success rates.

📊 Results across math benchmarks:
Arena Hard: 29% → 37.6% (+8.6%)
AIME24: 23.33% → 30% (+6.67%)
OptILLMBench: 61% → 65% (+4%)

The best part? All strategies are human-readable and the system gets progressively better at problem types you use frequently.

✨ Key benefits:
🔄 Cumulative learning over time
📖 Transparent, inspectable strategies
🔌 Works with any OpenAI-compatible API
⚡ Simple integration: just add "spl-" prefix to your model

Built as an open-source plugin in optillm. After 500 queries, our system developed 129 strategies and refined 97 of them! This feels like a genuine step toward AI that learns from experience while staying completely interpretable.

🔗 GitHub: https://github.com/codelion/optillm/tree/main/optillm/plugins/spl
📖 Full article: https://huggingface.co/blog/codelion/system-prompt-learning
🐦 Original Karpathy tweet: https://x.com/karpathy/status/1921368644069765486

Have you experimented with advanced system prompting? What strategies would you want your LLM to learn?
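
The loop the post describes (build a database of strategies, select the best ones for a problem, refine them by success rate) can be illustrated with a toy sketch. This is not the optillm plugin's code: `Strategy`, `StrategyStore`, and `spl_model_name` are hypothetical names invented here, and selecting by raw success rate is an assumption about what "best" means. Only the "spl-" model prefix comes from the post itself.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class Strategy:
    """A human-readable strategy plus its usage statistics (hypothetical type)."""
    problem_type: str
    text: str
    attempts: int = 0
    successes: int = 0

    @property
    def success_rate(self) -> float:
        # Strategies that were never tried count as 0.0.
        return self.successes / self.attempts if self.attempts else 0.0


class StrategyStore:
    """Toy in-memory strategy database (assumed design, not optillm's)."""

    def __init__(self) -> None:
        self.strategies: list[Strategy] = []

    def add(self, problem_type: str, text: str) -> Strategy:
        strategy = Strategy(problem_type, text)
        self.strategies.append(strategy)
        return strategy

    def select(self, problem_type: str, k: int = 2) -> list[Strategy]:
        # Keep strategies for this problem type, highest success rate first.
        candidates = [s for s in self.strategies if s.problem_type == problem_type]
        return sorted(candidates, key=lambda s: s.success_rate, reverse=True)[:k]

    def record(self, strategy: Strategy, solved: bool) -> None:
        # Update the statistics that drive future selection.
        strategy.attempts += 1
        if solved:
            strategy.successes += 1

    def build_system_prompt(self, base_prompt: str, problem_type: str) -> str:
        # Append the selected strategies to a base system prompt.
        lines = [base_prompt]
        for s in self.select(problem_type):
            lines.append(f"Strategy: {s.text}")
        return "\n".join(lines)


def spl_model_name(model: str) -> str:
    """Prefix a model name with "spl-", as the post describes."""
    return f"spl-{model}"
```

Because the strategies are stored as plain text, the whole store can be printed and read, which is the transparency property the post emphasizes.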

Organizations

Neuropark
Speech Recognition Community Event Version 2
Open-Source AI Meetup
Social Post Explorers
Hugging Face Discord Community
Hugging Face MCP Course

theainerd's activity

upvoted an article 2 months ago
upvoted an article 3 months ago

SigLIP 2: A better multilingual vision language encoder

By ariG23498 and 2 others • 165
upvoted an article 4 months ago

Open-source DeepResearch – Freeing our search agents

By m-ric and 4 others • 1.25k