Quartz 4

Home

❯

sources

❯

books

❯

Reinforcement Learning An introduction

Reinforcement Learning An introduction

Sep 23, 20251 min read

by Richard Sutton and Andrew Barto

http://incompleteideas.net/book/the-book-2nd.html


Graph View

Backlinks

  • Aproksymacja stochastyczna gradient descent
  • Policy evaluation
  • Bellman equation
  • Bootstrapping (RL)
  • Dynamic programming (RL)
  • Epsilon-soft
  • Expected vs Sample Updates
  • Exploring starts
  • Gradient Bandit Algorithm
  • Greedy policy
  • Markov Decision Process
  • Monte Carlo Exploring Starts (ES)
  • Monte Carlo Methods (RL)
  • On-Off policy
  • Planning w RL
  • Policy improvement
  • Policy iteration algorithm
  • Real-time Dynamic Programming
  • Reinforcement Learning
  • Rollout algorithms
  • Temporal Difference Learning
  • Warunki na zbieżność RL
  • alpha-constant Monte Carlo
  • Causal Models for Real Time Bidding with Repeated User Interactions

Created with Quartz v4.4.1 © 2025

  • GitHub
  • Discord Community