impira/layoutlm-document-qa Document Question Answering • 0.1B • Updated Mar 18, 2023 • 14.9k • 1.13k
EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions Paper • 2409.18042 • Published Sep 26, 2024 • 41
LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects Paper • 2504.19838 • Published Apr 28 • 22