Quartz 4

❯

❯

❯

Batch reinforcement learning

Batch reinforcement learning

Sep 23, 20251 min read

Taki Reinforcement Learning, w którym szukamy polityki z historycznych danych bez eksploracji.

Źródło: Optimal Bidding Strategy without Exploration in Real-time Bidding

Graph View

Created with Quartz v4.4.1 © 2026

GitHub
Discord Community