Skip to content
View nissymori's full-sized avatar

Block or report nissymori

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. JAX-CORL JAX-CORL Public

    Clean single-file implementation of offline RL algorithms in JAX

    Python 177 4

  2. mahjax mahjax Public

    A GPU-Accelerated Mahjong Simulator for RL in JAX

    Python 24 3

  3. remax-rl remax-rl Public

    [ICML2026] Official JAX code for Emergence of Exploration in Policy Gradient Reinforcement Learning via Retrying

    Python 5

  4. sotetsuk/pgx sotetsuk/pgx Public

    ♟️ Vectorized RL game environments in JAX

    Python 605 45

  5. SymPO SymPO Public

    [TMLR2026] Official code for "On Symmetric Losses for Robust Policy Optimization with Noisy Preferences"

    Python 7

  6. PUORL PUORL Public

    [RLC2025] Official code for "Offline Reinforcement Learning with Domain-Unlabeled Data"

    Python 5