Jeff Boudier's picture

Jeff Boudier

jeffboudier

AI & ML interests

Hugging Face!

Recent Activity

liked a Space 2 days ago
alexnasa/Chain-of-Zoom
reacted to merve's post with ❀️ 2 days ago
Past week was insanely packed for open AI! 😱 Luckily we picked some highlights for you ❀️ lfg! πŸ’¬ LLMs/VLMs > Deepseek 🐳 released https://huggingface.co/deepseek-ai/DeepSeek-R1-0528, 38B model, only 0.2 and 1.4 points behind o3 in AIME 24/25 🀯 they also released an 8B distilled version based on Qwen3 (OS) https://huggingface.co/collections/deepseek-ai/deepseek-r1-678e1e131c0169c0bc89728d > Xiaomi released MiMo-7B-RL (LLM for code and math) and MiMo-VL-7B-RL (VLM for visual reasoning, GUI agentic task and general use) (OS) 😍 https://huggingface.co/collections/XiaomiMiMo/mimo-vl-68382ccacc7c2875500cd212 > NVIDIA released , new reasoning model https://huggingface.co/nvidia/Nemotron-Research-Reasoning-Qwen-1.5B > DS: MiniMax released https://huggingface.co/MiniMaxAI/SynLogic, new 49k logical reasoning examples across 35 tasks including solving cipher, sudoku and more! πŸ–ΌοΈ Image/Video Generation > tencent released https://huggingface.co/tencent/HunyuanPortrait, a new model for consistent portrait generation with SVD Research license. They also released https://huggingface.co/tencent/HunyuanVideo-Avatar, audio driven avatar generation (OS) > showlab released https://huggingface.co/showlab/OmniConsistency, consistent stylization model (OS) > https://huggingface.co/datasets/Rapidata/text-2-video-human-preferences-veo3 is a new T2V preference dataset based on videos from Veo3 with 46k examples (OS) AudioπŸ—£οΈ > https://huggingface.co/ResembleAI/Chatterbox is a new 500M text-to-speech model preferred more than ElevenLabs (OS) 😍 > https://huggingface.co/PlayHT/PlayDiffusion is a new speech editing model (OS) Other > https://huggingface.co/NX-AI/TiReX is a new time series foundation model > Yandex released a huge (4.79B examples!) video recommendation dataset https://huggingface.co/yandex/yambda OS ones have Apache2.0 or MIT licenses, find more models and datasets here https://huggingface.co/collections/merve/releases-30-may-6840097345e0b1e915bff843
View all activity

Organizations

Hugging Face's profile picture Renault Group's profile picture Intel's profile picture Spaces-explorers's profile picture AWS Inferentia and Trainium's profile picture Spotify's profile picture Amazon SageMaker Community's profile picture Demo Corp's profile picture Hugging Face Infinity's profile picture Habana AI's profile picture Hugging Face Optimum's profile picture Hugging Test Lab's profile picture WIP's profile picture Evaluation on the Hub's profile picture HuggingFaceM4's profile picture Hackathon Team 1's profile picture Open-Source AI Meetup's profile picture model-attribution-challenge's profile picture model-attribution-challenge-admin's profile picture Inference Endpoints's profile picture Hugging Face OSS Metrics's profile picture Amazon SageMaker's profile picture EU org's profile picture Enterprise Explorers's profile picture Optimum Nvidia's profile picture Social Post Explorers's profile picture Optimum-Intel's profile picture Hugging Face Machine Learning Optimization's profile picture Hugging Face Discord Community's profile picture Hugging Face Party @ PyTorch Conference's profile picture Google Cloud 🀝🏻 Hugging Face's profile picture Huggingface HUGS's profile picture Nerdy Face's profile picture open/ acc's profile picture hf-inference's profile picture

jeffboudier's activity

reacted to danieldk's post with πŸ”₯ 2 days ago
view post
Post
1364
We have been working on a project called kernels. kernels makes it possible to load compute kernels directly from the Hub! πŸš€

We plan to give kernels a more proper introduction soon. But for those who have been following along, we are happy to announce a new release:

- New layer API with torch.compile support.
- Experimental support for loading Apple Silicon Metal 🀘 Kernels.
- Generate wheels from Hub kernels for legacy deployments.

Full release notes here: https://github.com/huggingface/kernels/releases/tag/v0.6.0
reacted to merve's post with ❀️ 2 days ago
view post
Post
1386
Past week was insanely packed for open AI! 😱
Luckily we picked some highlights for you ❀️ lfg!

πŸ’¬ LLMs/VLMs
> Deepseek 🐳 released deepseek-ai/DeepSeek-R1-0528, 38B model, only 0.2 and 1.4 points behind o3 in AIME 24/25 🀯 they also released an 8B distilled version based on Qwen3 (OS) deepseek-ai/deepseek-r1-678e1e131c0169c0bc89728d
> Xiaomi released MiMo-7B-RL (LLM for code and math) and MiMo-VL-7B-RL (VLM for visual reasoning, GUI agentic task and general use) (OS) 😍 XiaomiMiMo/mimo-vl-68382ccacc7c2875500cd212
> NVIDIA released , new reasoning model nvidia/Nemotron-Research-Reasoning-Qwen-1.5B
> DS: MiniMax released https://huggingface.co/MiniMaxAI/SynLogic, new 49k logical reasoning examples across 35 tasks including solving cipher, sudoku and more!

πŸ–ΌοΈ Image/Video Generation
> tencent released tencent/HunyuanPortrait, a new model for consistent portrait generation with SVD Research license. They also released tencent/HunyuanVideo-Avatar, audio driven avatar generation (OS)
> showlab released showlab/OmniConsistency, consistent stylization model (OS)
> Rapidata/text-2-video-human-preferences-veo3 is a new T2V preference dataset based on videos from Veo3 with 46k examples (OS)

AudioπŸ—£οΈ
> https://huggingface.co/ResembleAI/Chatterbox is a new 500M text-to-speech model preferred more than ElevenLabs (OS) 😍
> PlayHT/PlayDiffusion is a new speech editing model (OS)

Other
> https://huggingface.co/NX-AI/TiReX is a new time series foundation model
> Yandex released a huge (4.79B examples!) video recommendation dataset https://huggingface.co/yandex/yambda

OS ones have Apache2.0 or MIT licenses, find more models and datasets here merve/releases-30-may-6840097345e0b1e915bff843
reacted to evijit's post with πŸ€— 4 days ago
reacted to AdinaY's post with 😎 4 days ago
view post
Post
2111
May highlights from China’s open source ecosystem πŸ”₯

zh-ai-community/may-2025-open-works-from-the-chinese-community-681a3494145f2914dc679b7c

✨ DeepSeek dropped R1 updates
- Both R1 & 8B distralled smol model

✨ Bytedance goes big on open source:
- BAGEL, Dolphin, Seedcoder, Dream0...

✨ Multimodal is on fire!
- HuyuanCustom / HunyuanVideo-Avatar / HunyuanPortrait
- MiniMax: SynLogic / Orsta-7B
- Xiaomi: MiMo VL
- Alibaba Wan: Wan2.1-VACE
- OpenGVlab: ZeroGUI
- StepFun: ACE-Step-v1/Step1X-3D

✨ Specialized models/datasets excels
- Alibaba Qwen: World PM 72B
- BAAI:RobotBrain (MLLM for robotic)
- HiThink Research: BizFinBench (dataset)
- OpenBMB: Ultra FineWeb (dataset)
- Bilibili: Index-anisora (Anime/ACG)
- Skywork:Matrix-Game (game)

More awesome releases: Alibaba QwenLong-L1-32B, SkyWork OR1, OpenS2V-5M etc...
posted an update 9 days ago
posted an update 14 days ago
view post
Post
472
Wrapping up a week of shipping and announcements with Dell Enterprise Hub now featuring AI Applications, on-device models for AI PCs, a new CLI and Python SDK... all you need for building AI on premises!

Blog post has all the details: https://huggingface.co/blog/dell-ai-applications
posted an update 24 days ago
view post
Post
2575
Transcribing 1 hour of audio for less than $0.01 🀯

@mfuntowicz cooked with 8x faster Whisper speech recognition - whisper-large-v3-turbo transcribes at 100x real time on a $0.80/hr L4 GPU!

How they did it: https://huggingface.co/blog/fast-whisper-endpoints

1-click deploy with HF Inference Endpoints: https://endpoints.huggingface.co/new?repository=openai%2Fwhisper-large-v3-turbo&vendor=aws&region=us-east&accelerator=gpu&instance_id=aws-us-east-1-nvidia-l4-x1&task=automatic-speech-recognition&no_suggested_compute=true
reacted to clem's post with πŸ”₯ 24 days ago
view post
Post
3132
Very cool to see pytorch contributing on Hugging Face. Time to follow them to see what they're cooking!
  • 2 replies
Β·
posted an update 30 days ago
posted an update 2 months ago
view post
Post
2204
Llama4 is out and Scout is already on the Dell Enterprise Hub to deploy on Dell systems πŸ‘‰ dell.huggingface.co
posted an update 2 months ago
view post
Post
1565
Enterprise orgs now enable serverless Inference Providers for all members
- includes $2 free usage per org member (e.g. an Enterprise org with 1,000 members share $2,000 free credit each month)
- admins can set a monthly spend limit for the entire org
- works today with Together, fal, Novita, Cerebras and HF Inference.

Here's the doc to bill Inference Providers usage to your org: https://huggingface.co/docs/inference-providers/pricing#organization-billing
  • 2 replies
Β·
reacted to AdinaY's post with πŸš€πŸ”₯ 2 months ago
view post
Post
2442
Let's check out the latest releases from the Chinese community in March!

πŸ‘‰ https://huggingface.co/collections/zh-ai-community/march-2025-releases-from-the-chinese-community-67c6b479ebb87abbdf8e2e76


✨MLLM
> R1 Omni by Alibaba Tongyi - 0.5B
> Qwen2.5 Omni by Alibaba Qwen - 7B with apache2.0

πŸ–ΌοΈVideo
> CogView-4 by ZhipuAI - Apacha2.0
> HunyuanVideo-I2V by TencentHunyuan
> Open Sora2.0 - 11B with Apache2.0
> Stepvideo TI2V by StepFun AI - 30B with MIT license

🎡Audio
> DiffDiffRhythm - Apache2.0
> Spark TTS by SparkAudio - 0.5B

⚑️Image/3D
> Hunyuan3D 2mv/2mini (0.6B) by @TencentHunyuan
> FlexWorld by ByteDance - MIT license
> Qwen2.5-VL-32B-Instruct by Alibaba Qwen - Apache2.0
> Tripo SG (1.5B)/SF by VastAIResearch - MIT license
> InfiniteYou by ByteDance

> LHM by Alibaba AIGC team - Apache2.0
> Spatial LM by ManyCore

🧠Reasoning
> QwQ-32B by Alibaba Qwen - Apache2.0
> Skywork R1V - 38B with MIT license
> RWKV G1 by RWKV AI - 0.1B pure RNN reasoning model with Apache2.0
> Fin R1 by SUFE AIFLM Lab - financial reasoning

πŸ” LLM
> DeepSeek v3 0324 by DeepSeek -MIT license
> Babel by Alibaba DAMO - 9B/83B/25 languages
Β·
reacted to BrigitteTousi's post with πŸ€— 3 months ago
view post
Post
3742
Regardless of X being down or not, so glad I can rely on HF Posts for AI news β€οΈπŸ€—
  • 1 reply
Β·
reacted to mcpotato's post with πŸ€— 3 months ago
view post
Post
2528
Stoked to announce we've partnered with JFrog to continue improving safety on the Hub! 🐸

Their model scanner brings new scanning capabilities to the table, aimed at reducing alert fatigue.

More on that in our blog post: https://huggingface.co/blog/jfrog
  • 1 reply
Β·
reacted to clem's post with πŸ”₯ 3 months ago
view post
Post
5943
Super happy to welcome Nvidia as our latest enterprise hub customer. They have almost 2,000 team members using Hugging Face, and close to 20,000 followers of their org. Can't wait to see what they'll open-source for all of us in the coming months!

Nvidia's org: nvidia
Enterprise hub: https://huggingface.co/enterprise
reacted to csabakecskemeti's post with πŸ€— 3 months ago
view post
Post
2805
Testing Training on AMD/ROCm the first time!

I've got my hands on an AMD Instinct MI100. It's about the same price used as a V100 but on paper has more TOPS (V100 14TOPS vs MI100 23TOPS) also the HBM has faster clock so the memory bandwidth is 1.2TB/s.
For quantized inference it's a beast (MI50 was also surprisingly fast)

For LORA training with this quick test I could not make the bnb config works so I'm running the FT on the fill size model.

Will share all the install, setup and setting I've learned in a blog post, together with the cooling shroud 3D design.
Β·
reacted to fdaudens's post with ❀️ 3 months ago
view post
Post
3381
πŸš€ Just launched: A toolkit of 20 powerful AI tools that journalists can use right now - transcribe, analyze, create. 100% free & open-source.

Been testing all these tools myself and created a searchable collection of the most practical ones - from audio transcription to image generation to document analysis. No coding needed, no expensive subscriptions.

Some highlights I've tested personally:
- Private, on-device transcription with speaker ID in 100+ languages using Whisper
- Website scraping that just works - paste a URL, get structured data
- Local image editing with tools like Finegrain (impressive results)
- Document chat using Qwen 2.5 72B (handles technical papers well)

Sharing this early because the best tools come from the community. Drop your favorite tools in the comments or join the discussion on what to add next!

πŸ‘‰ JournalistsonHF/ai-toolkit
reacted to hexgrad's post with πŸ”₯ 3 months ago
reacted to andrewrreed's post with πŸ”₯ 5 months ago
view post
Post
2945
πŸš€ Supercharge your LLM apps with Langfuse on Hugging Face Spaces!

Langfuse brings end-to-end observability and tooling to accelerate your dev workflow from experiments through production

Now available as a Docker Space directly on the HF Hub! πŸ€—

πŸ” Trace everything: monitor LLM calls, retrieval, and agent actions with popular frameworks
1⃣ One-click deployment: on Spaces with persistent storage and integrated OAuth
πŸ›  Simple Prompt Management: Version, edit, and update without redeployment
βœ… Intuitive Evals: Collect user feedback, run model/prompt evaluations, and improve quality
πŸ“Š Dataset Creation: Build datasets directly from production data to enhance future performance

Kudos to the Langfuse team for this collab and the awesome, open-first product they’re building! πŸ‘ @marcklingen @Clemo @MJannik

πŸ”— Space: langfuse/langfuse-template-space
πŸ”— Docs: https://huggingface.co/docs/hub/spaces-sdks-docker-langfuse
  • 1 reply
Β·