s3nh's picture

s3nh

s3nh

AI & ML interests

Quantization, LLMs, Deep Learning for good. Follow me if you like my work. Patreon.com/s3nh

Recent Activity

Organizations

ESPnet's profile picture Gradio-Blocks-Party's profile picture Lajonbot's profile picture The Waifu Research Department's profile picture AblateIt's profile picture Blog-explorers's profile picture BangumiBase's profile picture CyberHarem's profile picture HydraLM's profile picture GOAT.AI's profile picture That Time I got Reincarnated as a Hugging Face Organization's profile picture ZeroGPU Explorers's profile picture Social Post Explorers's profile picture Spinner-GPT-4's profile picture Hugging Face Discord Community's profile picture open/ acc's profile picture Smol Community's profile picture

s3nh's activity

reacted to KnutJaegersberg's post with ❀️ 18 days ago
view post
Post
2729
The Intelligence Curse

The document warns of the "intelligence curse," a potential consequence of advanced AI (AGI) where powerful entities lose their incentive to invest in people as AI automates work[cite: 13, 297]. This could lead to job displacement, reduced social mobility, and a concentration of power and wealth based on AI ownership, similar to the "resource curse" in resource-rich states[cite: 17, 18, 31, 329, 353]. To counter this, the authors propose averting AI catastrophes to prevent centralization, diffusing AI widely to keep humans economically relevant, and democratizing institutions to remain anchored to human needs[cite: 22, 23, 25, 35, 36, 37, 566].


https://intelligence-curse.ai/intelligence-curse.pdf
reacted to loubnabnl's post with ❀️ 18 days ago
reacted to merve's post with πŸ‘πŸš€ 19 days ago
view post
Post
6574
A real-time object detector much faster and accurate than YOLO with Apache 2.0 license just landed to Hugging Face transformers πŸ”₯

D-FINE is the sota real-time object detector that runs on T4 (free Colab) 🀩

> Collection with all checkpoints and demo ustc-community/d-fine-68109b427cbe6ee36b4e7352

Notebooks:
> Tracking https://github.com/qubvel/transformers-notebooks/blob/main/notebooks/DFine_tracking.ipynb
> Inference https://github.com/qubvel/transformers-notebooks/blob/main/notebooks/DFine_inference.ipynb
> Fine-tuning https://github.com/qubvel/transformers-notebooks/blob/main/notebooks/DFine_finetune_on_a_custom_dataset.ipynb
h/t @vladislavbro @qubvel-hf @ariG23498 and the authors of the paper 🎩

Regular object detectors attempt to predict bounding boxes in (x, y, w, h) pixel perfect coordinates, which is very rigid and hard to solve πŸ₯²β˜ΉοΈ



D-FINE formulates object detection as a distribution for bounding box coordinates, refines them iteratively, and it's more accurate 🀩

Another core idea behind this model is Global Optimal Localization Self-Distillation ‡️

this model uses final layer's distribution output (sort of like a teacher) to distill to earlier layers to make early layers more performant.

  • 2 replies
Β·
reacted to mrfakename's post with πŸ‘πŸ€— 19 days ago
view post
Post
3382
Hi everyone,

I just launched TTS Arena V2 - a platform for benchmarking TTS models by blind A/B testing. The goal is to make it easy to compare quality between open-source and commercial models, including conversational ones.

What's new in V2:

- **Conversational Arena**: Evaluate models like CSM-1B, Dia 1.6B, and PlayDialog in multi-turn settings
- **Personal Leaderboard**: Optional login to see which models you tend to prefer
- **Multi-speaker TTS**: Random voices per generation to reduce speaker bias
- **Performance Upgrade**: Rebuilt from Gradio β†’ Flask. Much faster with fewer failed generations.
- **Keyboard Shortcuts**: Vote entirely via keyboard

Also added models like MegaTTS 3, Cartesia Sonic, and ElevenLabs' full lineup.

I'd love any feedback, feature suggestions, or ideas for models to include.

TTS-AGI/TTS-Arena-V2
Β·
reacted to merve's post with πŸš€πŸ‘ about 1 month ago
reacted to merterbak's post with πŸ”₯ about 1 month ago
view post
Post
1695
Microsoft released their new fine-tuned phi-4 models with reasoning data yesterday. They outperform/rival much larger models . Check out them if you haven't yet. πŸš€

Phi4 mini reasoning(SFT): microsoft/Phi-4-mini-reasoning
Phi-4 reasoning(SFT): microsoft/Phi-4-reasoning
Phi-4 reasoning plus (SFT + RL): microsoft/Phi-4-reasoning-plus
Demo: https://github.com/marketplace/models/azureml/Phi-4-reasoning/playground
Articles: https://arxiv.org/pdf/2504.21318
https://arxiv.org/pdf/2504.21233
Blog: https://azure.microsoft.com/en-us/blog/one-year-of-phi-small-language-models-making-big-leaps-in-ai/

  • 1 reply
Β·
New activity in city96/Wan2.1-FLF2V-14B-720P-gguf about 1 month ago

Pipeline

#2 opened about 1 month ago by
s3nh