VLA - a mphielipp Collection

VLA

updated Jan 17

OpenVLA: An Open-Source Vision-Language-Action Model

Paper • 2406.09246 • Published Jun 13, 2024 • 42

CogACT: A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation

Paper • 2411.19650 • Published Nov 29, 2024

Octo: An Open-Source Generalist Robot Policy

Paper • 2405.12213 • Published May 20, 2024 • 30

Diffusion-VLA: Scaling Robot Foundation Models via Unified Diffusion and Autoregression

Paper • 2412.03293 • Published Dec 4, 2024

robotics-diffusion-transformer/rdt-1b

Robotics • Updated Oct 17, 2024 • 1.23k • 90

OmniManip: Towards General Robotic Manipulation via Object-Centric Interaction Primitives as Spatial Constraints

Paper • 2501.03841 • Published Jan 7 • 56

robovlms/RoboVLMs

Updated Jan 15 • 7

Beyond Sight: Finetuning Generalist Robot Policies with Heterogeneous Sensors via Language Grounding

Paper • 2501.04693 • Published Jan 8 • 3