Jake Tuero
Search
Search
Dark mode
Light mode
Explorer
off-policy
5 items with this tag.
Jun 23, 2026
Deterministic Policy Gradient Methods
rl
actor-critic
off-policy
policy-gradient
ddpg
td3
Jun 23, 2026
Off-Policy Methods
rl
off-policy
impala
Jun 23, 2026
Q Learning
rl
off-policy
value
Jun 23, 2026
RL as Inference
rl
off-policy
Jun 23, 2026
Soft Actor Critic (SAC)
rl
off-policy
actor-critic