Falcon-H1 Collection: Falcon-H1 family of hybrid-head language models, including 0.5B, 1.5B, 1.5B-Deep, 3B, 7B, and 34B variants (pretrained and instruction-tuned). 37 items.
Article: Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance. By tiiuae and 5 others.
Article: Falcon-Edge: A series of powerful, universal, fine-tunable 1.58-bit language models. By tiiuae and 9 others.
Falcon-Edge Series Collection: A series of powerful, universal, and fine-tunable small language models. 7 items.
BitNet Collection: 🔥BitNet family of large language models (1-bit LLMs). 7 items. Updated May 1.
Falcon3 Collection: The Falcon3 family of open foundation models, a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. 40 items.
Paper: Falcon Mamba: The First Competitive Attention-free 7B Language Model (arXiv:2410.05355). Published Oct 7, 2024.
Article: Welcome FalconMamba: The first strong attention-free 7B model. By JingweiZuo and 5 others, Aug 12, 2024.
FalconMamba 7B Collection: Features the FalconMamba 7B base model, the instruction-tuned version, their 4-bit and GGUF variants, and the demo. 15 items.
Paper: Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning (arXiv:2303.02861). Published Mar 6, 2023.
Paper: XTTS: a Massively Multilingual Zero-Shot Text-to-Speech Model (arXiv:2406.04904). Published Jun 7, 2024.
AQLM+PV Collection: Official AQLM quantizations for "PV-Tuning: Beyond Straight-Through Estimation for Extreme LLM Compression" (https://arxiv.org/abs/2405.14852). 26 items. Updated Feb 28.
Paper: Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations (arXiv:2405.18392). Published May 28, 2024.
Article: Overview of natively supported quantization schemes in 🤗 Transformers. By ybelkada and 4 others, Sep 12, 2023.
Meta Llama 3 Collection: Hosts the Transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases. 5 items. Updated Dec 6, 2024.
Paper: Is DPO Superior to PPO for LLM Alignment? A Comprehensive Study (arXiv:2404.10719). Published Apr 16, 2024.