Hydragen: High-Throughput LLM Inference with Shared Prefixes Paper โข 2402.05099 โข Published Feb 7, 2024 โข 20 โข 4
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling Paper โข 2312.15166 โข Published Dec 23, 2023 โข 59 โข 9