Jake Tuero

llm

3 items with this tag.

  • Jun 23, 2026

    Search-Based Decoding for Language Models

    • deep-learning
    • transformers
    • llm
    • search
  • Jun 23, 2026

    LLMs for RL

    • rl
    • llm
  • Jun 23, 2026

    RL for LLMs

    • rl
    • llm
    • ppo
    • grpo
    • dapo
    • gspo
    • drgrpo
    • rlft
    • rlvr
    • rlhf
    • cot

Created with Quartz v5.0.0 © 2026

  • GitHub
  • Twitter
  • LinkedIn
  • Scholar