Svelte Hacker News logo
  • top
  • new
  • show
  • ask
  • jobs
  • about

RL's Razor: Why Online Reinforcement Learning Forgets Less

arxiv.org

3 points by Anon84 8 hours ago