Jake Tuero

policy-gradient

6 items with this tag.

  • Jun 23, 2026

    A2C

    • rl
    • policy-gradient
    • actor-critic
    • on-policy
    • a2c
  • Jun 23, 2026

    Deterministic Policy Gradient Methods

    • rl
    • actor-critic
    • off-policy
    • policy-gradient
    • ddpg
    • td3
  • Jun 23, 2026

    Generalized Advantage Estimation

    • rl
    • policy-gradient
    • actor-critic
    • on-policy
  • Jun 23, 2026

    Policy Gradient Methods

    • rl
    • policy-gradient
    • reinforce
  • Jun 23, 2026

    Policy Improvement Methods

    • rl
    • policy-gradient
    • policy-improvement
    • on-policy
    • ppo
    • trpo
  • Jun 23, 2026

    REINFORCE

    • rl
    • on-policy
    • policy-gradient
