Quartz 4

❯

❯

❯

Greedy policy

Sep 23, 20251 min read

Taka polityka, która wybiera ruch, który aktualnie wydaje się najlepszy według statystyk, które do tej pory zebraliśmy.

Źródło: Reinforcement Learning An introduction

Graph View

Backlinks

Temporal Difference Learning

Created with Quartz v4.4.1 © 2026

GitHub
Discord Community