Skip to content

docs: add Freshness-Aware-PER to Notable work based on ROLLUpdate README.md#433

Open
histmeisah wants to merge 1 commit intoalibaba:mainfrom
Vision-CAIR:main
Open

docs: add Freshness-Aware-PER to Notable work based on ROLLUpdate README.md#433
histmeisah wants to merge 1 commit intoalibaba:mainfrom
Vision-CAIR:main

Conversation

@histmeisah
Copy link
Copy Markdown

Hi ROLL team 👋

This PR adds our paper Freshness-Aware Prioritized Experience Replay for LLM/VLM Reinforcement Learning to the "Notable work based on ROLL" section.

Code (built on top of ROLL): https://github.com/Vision-CAIR/Freshness-Aware-PER

Brief summary:

  • reward_fresh priority combining |reward| with exponential age decay
  • Asynchronous full-buffer priority refresh (refresh_all_age_decay) so unsampled old entries stay properly decayed
  • Episode-indexed n-step / hierarchical sampling for both step-level and trajectory-level buffers
  • Apache 2.0, identical to upstream

Happy to adjust wording, ordering, or description length to match the section's style — thanks for the great library!

@CLAassistant
Copy link
Copy Markdown

CLA assistant check
Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.
You have signed the CLA already but the status is still pending? Let us recheck it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants