2025
an archive of posts from this year
| Date | Post |
|---|---|
| Oct 24, 2025 | From Policy Gradients to LLM RL: TRPO, PPO, and Beyond |
| Feb 22, 2025 | Cognitive Science Notes: Introduction |