Reinforcement Learning with History Lists: Solving Partially Observable Decision Processes by Using Short Term Memory