ByteDance is absolutely cooking lately 🔥
BAGEL 🥯: an open multimodal foundation model with 7B active parameters, from the ByteDance Seed team.
ByteDance-Seed/BAGEL-7B-MoT
✨ Apache 2.0
✨ Outperforms top VLMs (Qwen2.5-VL & InternVL-2.5)
✨ Mixture-of-Transformer-Experts + dual encoders
✨ Trained on trillions of interleaved tokens