Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Swaminathan S K's picture
1

Swaminathan S K

swami2004
·
https://swaminathansk.github.io/
  • SwamiiiiSK
  • SwaminathanSK

AI & ML interests

Robot Learning, Reinforcement Learning, Multi-agent Systems, Interpretability

Organizations

None yet

Collections 1

Papers to Read
  • mDPO: Conditional Preference Optimization for Multimodal Large Language Models

    Paper • 2406.11839 • Published Jun 17, 2024 • 40
  • Pandora: Towards General World Model with Natural Language Actions and Video States

    Paper • 2406.09455 • Published Jun 12, 2024 • 15
  • WPO: Enhancing RLHF with Weighted Preference Optimization

    Paper • 2406.11827 • Published Jun 17, 2024 • 15
  • In-Context Editing: Learning Knowledge from Self-Induced Distributions

    Paper • 2406.11194 • Published Jun 17, 2024 • 15
Papers to Read
  • mDPO: Conditional Preference Optimization for Multimodal Large Language Models

    Paper • 2406.11839 • Published Jun 17, 2024 • 40
  • Pandora: Towards General World Model with Natural Language Actions and Video States

    Paper • 2406.09455 • Published Jun 12, 2024 • 15
  • WPO: Enhancing RLHF with Weighted Preference Optimization

    Paper • 2406.11827 • Published Jun 17, 2024 • 15
  • In-Context Editing: Learning Knowledge from Self-Induced Distributions

    Paper • 2406.11194 • Published Jun 17, 2024 • 15

models 0

None public yet

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs