Jake Tuero

on-policy

5 items with this tag.

  • Jun 23, 2026

    A2C

    • rl
    • policy-gradient
    • actor-critic
    • on-policy
    • a2c
  • Jun 23, 2026

    Generalized Advantage Estimation

    • rl
    • policy-gradient
    • actor-critic
    • on-policy
  • Jun 23, 2026

    Policy Improvement Methods

    • rl
    • policy-gradient
    • policy-improvement
    • on-policy
    • ppo
    • trpo
  • Jun 23, 2026

    REINFORCE

    • rl
    • on-policy
    • policy-gradient
  • Jun 23, 2026

    Value Based RL

    • rl
    • value
    • on-policy

Created with Quartz v5.0.0 © 2026

  • GitHub
  • Twitter
  • LinkedIn
  • Scholar