Deep Reinforcement Learning: from Pong Pixels to a Learned Player
Here you can find a fantastic post on reinforcement learning by Andrej Karpathy.
The Author explains how it is possible to make an agent learn to play Pong from from raw pixels with Policy Gradients. The final agent does not beat the hard-coded AI consistently, but holds its own.