mphielipp's Collections
VLA

updated Jan 17

  • OpenVLA: An Open-Source Vision-Language-Action Model

    Paper • 2406.09246 • Published Jun 13, 2024 • 42

  • CogACT: A Foundational Vision-Language-Action Model for Synergizing Cognition and Action in Robotic Manipulation

    Paper • 2411.19650 • Published Nov 29, 2024

  • Octo: An Open-Source Generalist Robot Policy

    Paper • 2405.12213 • Published May 20, 2024 • 30

  • Diffusion-VLA: Scaling Robot Foundation Models via Unified Diffusion and Autoregression

    Paper • 2412.03293 • Published Dec 4, 2024

  • robotics-diffusion-transformer/rdt-1b

    Robotics • Updated Oct 17, 2024 • 1.23k • 90

  • OmniManip: Towards General Robotic Manipulation via Object-Centric Interaction Primitives as Spatial Constraints

    Paper • 2501.03841 • Published Jan 7 • 56

  • robovlms/RoboVLMs

    Updated Jan 15 • 7

  • Beyond Sight: Finetuning Generalist Robot Policies with Heterogeneous Sensors via Language Grounding

    Paper • 2501.04693 • Published Jan 8 • 3