99 1 39

Dämpfchen

Dampfinchen

AI & ML interests

None yet

Recent Activity

new activity 3 days ago

deepseek-ai/DeepSeek-R1-0528:Any plans for 32B/70B distilled models?

new activity 7 days ago

deepseek-ai/DeepSeek-R1-0528:This is by no means a minor upgrade.

new activity 8 days ago

deepseek-ai/DeepSeek-R1-0528-Qwen3-8B:Any plans on gemma series? ;-;

View all activity

Organizations

None yet

Dampfinchen's activity

New activity in deepseek-ai/DeepSeek-R1-0528 3 days ago

Any plans for 32B/70B distilled models?

🚀 7

#83 opened 7 days ago by

NanaBanana22

New activity in deepseek-ai/DeepSeek-R1-0528 7 days ago

This is by no means a minor upgrade.

👍 2

#72 opened 8 days ago by

WyattTheSkid

New activity in deepseek-ai/DeepSeek-R1-0528-Qwen3-8B 8 days ago

Any plans on gemma series? ;-;

❤️ 4

#2 opened 8 days ago by

Nakdesu

DeepSeek-R1-Lite

❤️ 🔥 19

#6 opened 8 days ago by

Dampfinchen

New activity in deepseek-ai/DeepSeek-R1-0528 9 days ago

fantastic 👍👍👍

🔥 2

#19 opened 9 days ago by

AIARCHAEA

Distill

🔥 ➕ 2

#17 opened 9 days ago by

Neman

New activity in Qwen/Qwen3-235B-A22B 28 days ago

In complex reasoning tasks Qwen3 is far behind QwQ

#32 opened about 1 month ago by

AdamF92

New activity in Qwen/Qwen3-30B-A3B 29 days ago

Many experts seem to be underutilized, possible optimization potential.

#24 opened 29 days ago by

Dampfinchen

repetition

➕ 2

#23 opened 29 days ago by

Dampfinchen

New activity in Qwen/Qwen3-235B-A22B about 1 month ago

Add image visual recognition output just like qwen 2.5 vl-32b instruct

#26 opened about 1 month ago by

devopsML

New activity in unsloth/Qwen3-30B-A3B-GGUF about 1 month ago

UD version for the Q5, Q6 and Q8 quant

👍 7

#11 opened about 1 month ago by

nobita3921

New activity in Qwen/Qwen3-30B-A3B about 1 month ago

Qwen3 is great, but could be better.

👍 7

#18 opened about 1 month ago by

phil111

Waiting for the Qwen3-VL

👀 1

#8 opened about 1 month ago by

Maverick17

New activity in kalomaze/Qwen3-16B-A3B about 1 month ago

Besides pruning..

#4 opened about 1 month ago by

Lockout

New activity in unsloth/Qwen3-30B-A3B-GGUF about 1 month ago

Latest updates?

👍 12

#10 opened about 1 month ago by

Dampfinchen

New activity in unsloth/GLM-4-32B-0414-GGUF about 1 month ago

FIXED: Template bug fixed in llama.cpp

👍 2

#4 opened about 1 month ago by

sovetboga

New activity in THUDM/GLM-4-32B-0414 about 1 month ago

Great job, thanks for this model.

👍 5

#11 opened about 1 month ago by

Dampfinchen

New activity in unsloth/Qwen3-30B-A3B-GGUF about 1 month ago

`UD-Q4_K_XL` or `Q4_K_M`?

#6 opened about 1 month ago by

pootow

New activity in Qwen/Qwen3-32B about 1 month ago

Feedback: It's a good model, however it hallucinates very badly at local facts (Germany)

😔 👀 9

#12 opened about 1 month ago by

Dampfinchen

Is this multimodal?

#2 opened about 1 month ago by

pbarker