- Flashattention 2 support? (#14, opened about 1 month ago by t-albertge)
- attnmask (#13, opened about 1 month ago by Kamichanw, 1 reply)
- Question about the chat template which ignores add_generation_prompt (#12, opened 2 months ago by xukp20)
- How much VRAM/RAM is required to load this model? (#11, opened 5 months ago by dpkirchner, 1 reply)
- Training time (#10, opened 5 months ago by iHaag)
- 4-bit LLaDA model (#9, opened 5 months ago by chentianqi)
- Impressive work (#8, opened 5 months ago by Daemontatox)
- what part of the code is diffusion? (#6, opened 5 months ago by fblgit, 1 reply)
- Model performance (#5, opened 5 months ago by icoicqico, 2 replies)
- That is awesome! (#4, opened 6 months ago by owao, 2 replies)
- Anybody has been able to run their chat.py model on a Mac? (#3, opened 6 months ago by neodymion, 8 replies)
- Gguf? (#2, opened 6 months ago by AlgorithmicKing, 8 replies)
- Add library_name and pipeline_tag to model card (#1, opened 6 months ago by nielsr, 1 reply)