Ziyang Luo

Ziyang

https://chiyeunglaw.github.io/

AI & ML interests

Agents, LLMs, Multimodal ML

Ziyang's activity

upvoted an article about 7 hours ago

Microsoft Playwright MCP: Tutorial for Beginners

Article • Mar 28 • 19

authored a paper about 1 month ago

ScratchEval: Are GPT-4o Smarter than My Child? Evaluating Large Multimodal Models with Visual Programming Challenges

Paper • 2411.18932 • Published Nov 28, 2024 • 1

upvoted a paper about 1 month ago

ScratchEval: Are GPT-4o Smarter than My Child? Evaluating Large Multimodal Models with Visual Programming Challenges

Paper • 2411.18932 • Published Nov 28, 2024 • 1

commented on Tiny Agents: a MCP-powered agent in 50 lines of code about 1 month ago

🔥🔥🔥

upvoted an article about 1 month ago

Tiny Agents: a MCP-powered agent in 50 lines of code

Article • Apr 25 • 267

New activity in HKBU-NLP/GOAT-Bench about 1 month ago

Add task category

#3 opened 3 months ago by nielsr

upvoted an article 3 months ago

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Article • Feb 7 • 148

updated 2 datasets 5 months ago

TransferLM/mp1

Viewer • Updated Jan 17 • 74.4k • 17

TransferLM/mp2

Viewer • Updated Jan 17 • 78.2k • 12

published 2 datasets 5 months ago

TransferLM/mp2

Viewer • Updated Jan 17 • 78.2k • 12

TransferLM/mp1

Viewer • Updated Jan 17 • 74.4k • 17

upvoted an article 5 months ago

✴️ ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use

Article • and 1 other • Jan 3 • 18

published an article 5 months ago

✴️ ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use

Article • and 1 other • Jan 3 • 18

liked 2 datasets 5 months ago

likaixin/ScreenSpot-Pro

Viewer • Updated Apr 15 • 1.59k • 2.77k • 25

LongVideos/LongVideoDB-373K-Videos

Updated Dec 30, 2024 • 157 • 4

updated a Space 5 months ago

README

🌍

updated 2 models 6 months ago

TransferLM/S2

Updated Dec 9, 2024 • 4

TransferLM/S1

Updated Dec 9, 2024 • 4

commented on a paper 6 months ago

VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation

Paper • 2411.13281 • Published Nov 20, 2024 • 22

authored a paper 7 months ago

VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation

Paper • 2411.13281 • Published Nov 20, 2024 • 22