Jake Tuero

off-policy

5 items with this tag.

  • Jun 23, 2026

    Deterministic Policy Gradient Methods

    • rl
    • actor-critic
    • off-policy
    • policy-gradient
    • ddpg
    • td3
  • Jun 23, 2026

    Off-Policy Methods

    • rl
    • off-policy
    • impala
  • Jun 23, 2026

    Q Learning

    • rl
    • off-policy
    • value
  • Jun 23, 2026

    RL as Inference

    • rl
    • off-policy
  • Jun 23, 2026

    Soft Actor Critic (SAC)

    • rl
    • off-policy
    • actor-critic
