Stochastic Optimization Methods for Policy Evaluation in Reinforcement Learning (Foundations and Trends(r) in Optimization) | DealShopping Deutschland