Krishna Kaasyap

KrishnaKaasyap

AI & ML interests

Test Time Training Multimodal & Inter-Modality Transfer Learning Mechanistic Interpretability Evolutionary Model Merging Swarm Intelligence of multiple models with different architectures and different algorithms MuZero approach to general tasks

Recent Activity

liked a model 9 days ago
deepseek-ai/DeepSeek-R1-0528-Qwen3-8B
upvoted a collection 16 days ago
MedGemma Release
upvoted a collection about 1 month ago
Qwen2.5-Omni
View all activity

Organizations

Blog-explorers's profile picture

KrishnaKaasyap's activity

upvoted an article 5 months ago
view article
Article

πŸΊπŸ¦β€β¬› LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs

By wolfram β€’
β€’ 79
upvoted an article 6 months ago
view article
Article

Bridging the Gap Between Physical Numerical Simulations and Machine Learning: Introducing The Well

By rubenohana β€’
β€’ 18
upvoted an article 10 months ago
view article
Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

By philschmid and 7 others β€’
β€’ 234
upvoted 2 articles 10 months ago
view article
Article

Fine-tune Llama 3.1 Ultra-Efficiently with Unsloth

By mlabonne β€’
β€’ 329
view article
Article

ZebraLogic: Benchmarking the Logical Reasoning Ability of Language Models

By yuchenlin β€’
β€’ 33