Skip to content
View YangRui2015's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Organizations

@DynaMath

Block or report YangRui2015

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. RiC RiC Public

    Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment"

    Python 53 4

  2. Generalizable-Reward-Model Generalizable-Reward-Model Public

    Code for NeurIPS 2024 paper "Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs"

    Python 16 1

  3. RIQL RIQL Public

    Code for ICLR 2024 paper "Towards Robust Offline Reinforcement Learning under Diverse Data Corruption"

    Python 11

  4. GOAT GOAT Public

    Code for the ICML 2023 paper "What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL?".

    Python 9 1

  5. RORL RORL Public

    Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing"

    Python 18 4

  6. AWGCSL AWGCSL Public

    Code for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL.

    Python 26 2