UC Berkeley

university

Verified

https://www.berkeley.edu/

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

robinwuzy authored a paper about 13 hours ago

Language-Image Alignment with Fixed Text Encoders

xihc-ucb authored a paper 11 days ago

Jetfire: Efficient and Accurate Transformer Pretraining with INT8 Data Flow and Per-Block Quantization

xihc-ucb authored a paper 11 days ago

COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training

View all activity

Berkeley's activity

uynitsuj

authored a paper 19 days ago

Real2Render2Real: Scaling Robot Data Without Dynamics Simulation or Robot Hardware

Paper • 2505.09601 • Published 23 days ago • 5

nickatomlin

authored 3 papers 24 days ago

Efficacy of Language Model Self-Play in Non-Zero-Sum Games

Paper • 2406.18872 • Published Jun 27, 2024

Measuring General Intelligence with Generated Games

Paper • 2505.07215 • Published 26 days ago • 10

Understanding Game-Playing Agents with Natural Language Annotations

Paper • 2204.07531 • Published Apr 15, 2022

davidchan

authored a paper 3 months ago

TULIP: Towards Unified Language-Image Pretraining

Paper • 2503.15485 • Published Mar 19 • 49

akashgokul

authored a paper 3 months ago

Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection

Paper • 2503.12271 • Published Mar 15 • 9

RZ412

authored 2 papers 5 months ago

EmbedLLM: Learning Compact Representations of Large Language Models

Paper • 2410.02223 • Published Oct 3, 2024 • 3

PokerBench: Training Large Language Models to become Professional Poker Players

Paper • 2501.08328 • Published Jan 14 • 18

akashgokul

authored a paper 6 months ago

OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows

Paper • 2412.01169 • Published Dec 2, 2024 • 13

shreyashankar

authored 2 papers 8 months ago

DocETL: Agentic Query Rewriting and Evaluation for Complex Document Processing

Paper • 2410.12189 • Published Oct 16, 2024 • 1

Who Validates the Validators? Aligning LLM-Assisted Evaluation of LLM Outputs with Human Preferences

Paper • 2404.12272 • Published Apr 18, 2024 • 1

peterwg

authored 2 papers 9 months ago

Deep Multimodal Fusion for Surgical Feedback Classification

Paper • 2312.03231 • Published Dec 6, 2023

Pose-Aware Self-Supervised Learning with Viewpoint Trajectory Regularization

Paper • 2403.14973 • Published Mar 22, 2024

davidchan

authored 4 papers 11 months ago

Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition

Paper • 2403.19822 • Published Mar 28, 2024

ALOHa: A New Measure for Hallucination in Captioning Models

Paper • 2404.02904 • Published Apr 3, 2024

Virtual Personas for Language Models via an Anthology of Backstories

Paper • 2407.06576 • Published Jul 9, 2024

Visual Haystacks: Answering Harder Questions About Sets of Images

Paper • 2407.13766 • Published Jul 18, 2024 • 2

davidchan

posted an update 11 months ago

Post

592

🚨 Launching The Visual Haystacks (VHs) Benchmark: the first "visual-centric" Needle-In-A-Haystack (NIAH) benchmark to assess LMMs' capability in long-context visual retrieval and reasoning.

Check it out!
tsunghanwu/visual_haystacks
https://visual-haystacks.github.io/
https://arxiv.org/abs/2407.13766
https://github.com/visual-haystacks/vhs_benchmark

JustinWong8314

authored 2 papers about 1 year ago

The Wisdom of Hindsight Makes Language Models Better Instruction Followers

Paper • 2302.05206 • Published Feb 10, 2023

Stylus: Automatic Adapter Selection for Diffusion Models

Paper • 2404.18928 • Published Apr 29, 2024 • 15