Welcome to People’s Reinforcement Learning (PRL) documentation!

Our main goal is to build a useful tool for reinforcement learning researchers.

When using the PRL library to build agents and run experiments, you can focus on the structure of the agent, state transformations, neural network architecture, action transformations and reward shaping. Time and memory profiling, logging, agent-environment interaction, agent state saving, neural network training, early stopping and training visualization happen automatically behind the scenes. You also get useful tools for handling training history and preparing training sets for neural networks.

People’s Reinforcement Learning (PRL)

Description

This is a reinforcement learning framework made with research activity in mind. You can read more about PRL in our introductory blog post, our in-depth look into the library, the documentation, or the wiki.

System requirements

  • python 3.6
  • swig
  • python3-dev

We recommend using virtualenv for installing project dependencies.

Installation

  • clone the project:

    git clone git@gitlab.com:opium-sh/prl.git
    
  • create and activate a virtualenv for the project (you can skip this step if you are not using virtualenv)

    virtualenv -p python3.6 your/path && source your/path/bin/activate
    
  • install dependencies:

    pip install -r requirements.txt
    
  • install library

    pip install -e .
    
  • run example:

    cd examples
    python cart_pole_example_cross_entropy.py
    
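The shipped example is the best reference, but a rough sketch of the typical workflow looks like the following. The module paths and constructor signatures are taken from the API reference below; the hyperparameters, network sizes and the choice of loss are illustrative assumptions, not the exact values used in cart_pole_example_cross_entropy.py.

    import gym
    from torch import nn, optim

    from prl.environments.environments import Environment
    from prl.agents.agents import CrossEntropyAgent
    from prl.function_approximators.pytorch_nn import PytorchFA, PytorchMLP
    from prl.callbacks.callbacks import EarlyStopping, TrainingLogger

    gym_env = gym.make("CartPole-v0")
    env = Environment(gym_env)  # default no-op state/reward/action transformers

    # Policy network: observation -> action scores (sizes are illustrative).
    net = PytorchMLP(x_shape=gym_env.observation_space.shape,
                     y_size=gym_env.action_space.n,
                     output_activation=nn.Identity(),
                     hidden_sizes=[64, 64])
    policy_network = PytorchFA(net=net,
                               loss=nn.CrossEntropyLoss(),  # assumed loss over elite actions
                               optimizer=optim.Adam(net.parameters(), lr=1e-3),
                               device="cpu")

    agent = CrossEntropyAgent(policy_network=policy_network)
    callbacks = [TrainingLogger(), EarlyStopping(target_reward=195.0)]

    # Extra kwargs are forwarded to CrossEntropyAgent.train_iteration().
    agent.train(env, n_iterations=100, callback_list=callbacks,
                n_episodes=32, percentile=75)
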

API documentation

Information on specific functions, classes, and methods.

prl

prl package

Subpackages
prl.agents package
Submodules
prl.agents.agents module
class A2CAdvantage[source]

Bases: prl.agents.agents.Advantage

Advantage function from Asynchronous Methods for Deep Reinforcement Learning.

calculate_advantages(rewards, baselines, dones, discount_factor)[source]
Return type:ndarray
class A2CAgent(policy_network, value_network, agent_id='A2C_agent')[source]

Bases: prl.agents.agents.ActorCriticAgent

Advantage Actor Critic agent.

class ActorCriticAgent(policy_network, value_network, advantage, agent_id='ActorCritic_agent')[source]

Bases: prl.agents.agents.Agent

Basic actor-critic agent.

act(state)[source]

Makes a step based on the current environment state

Parameters:state (ndarray) – state from the environment.
Return type:ndarray
Returns:Action to execute on the environment.
id

Agent UUID

train_iteration(env, n_steps=32, discount_factor=1.0)[source]

Performs a single training iteration. This method should contain the repeatable part of training an agent.

Parameters:
  • env (EnvironmentABC) – Environment
  • **kwargs – Kwargs passed from train() method
class Advantage[source]

Bases: prl.typing.AdvantageABC, abc.ABC

Base class for advantage functions.

calculate_advantages(rewards, baselines, dones, discount_factor)[source]
Return type:ndarray
class Agent[source]

Bases: prl.typing.AgentABC, abc.ABC

Base class for all agents

act(state)[source]

Makes a step based on the current environment state

Parameters:state (ndarray) – state from the environment.
Return type:ndarray
Returns:Action to execute on the environment.
id

Agent UUID

Return type:str
play_episodes(env, episodes)[source]

Method for playing full episodes, usually used to train agents.

Parameters:
  • env (Environment) – Environment
  • episodes (int) – Number of episodes to play.
Return type:

History

Returns:

History object representing episodes history

play_steps(env, n_steps, storage)[source]

Method for performing a number of steps in the environment. Appends new states to the existing storage.

Parameters:
  • env (Environment) – Environment
  • n_steps (int) – Number of steps to play
  • storage (Storage) – Storage (Memory, History) of the earlier games (used to perform the first action)

Return type:Storage
Returns:History with appended states, actions, rewards, etc
post_train_cleanup(env, **kwargs)[source]

Cleans up fields that are no longer needed after training, to keep the agent lightweight.

Parameters:
  • env (Environment) – Environment
  • **kwargs – Kwargs passed from train() method
pre_train_setup(env, **kwargs)[source]

Performs pre-training setup. This method should handle the non-repeatable part of training an agent.

Parameters:
  • env (Environment) – Environment
  • **kwargs – Kwargs passed from train() method
test(env)[source]

Method for playing a full episode, used to test agents. The reward in the returned history is the true, untransformed reward from the environment.

Parameters:env – Environment
Return type:History
Returns:History object representing episode history
train(env, n_iterations, callback_list=None, **kwargs)[source]

Trains the agent using the environment. Also handles callbacks during training.

Parameters:
  • env (Environment) – Environment to train on
  • n_iterations (int) – Maximum number of iterations to train
  • callback_list (Optional[list]) – List of callbacks
  • kwargs – other arguments passed to train_iteration, pre_train_setup and post_train_cleanup
train_iteration(env, **kwargs)[source]

Performs a single training iteration. This method should contain the repeatable part of training an agent.

Parameters:
  • env (Environment) – Environment
  • **kwargs – Kwargs passed from train() method
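A minimal sketch of a custom agent built on this interface is shown below. Only act() and train_iteration() are implemented; the constructor, UUID handling and the actual function-approximator update are omitted, and the (loss, history) return value follows the signature suggested in prl.typing. Treat it as an illustration of the hooks, not as a complete agent.

    import numpy as np

    from prl.agents.agents import Agent


    class GreedyBaselineAgent(Agent):
        """Illustrative agent: always picks action 0 and inspects returns."""

        def act(self, state):
            # Action in the form expected by the (possibly transformed) env.
            return np.int64(0)

        def train_iteration(self, env, n_episodes=8, discount_factor=0.99, **kwargs):
            # Repeatable part of training: gather experience, compute targets
            # and update the underlying function approximator (omitted here).
            history = self.play_episodes(env, n_episodes)
            returns = history.get_returns(discount_factor=discount_factor)
            loss = float(returns.mean())  # stand-in for a real training loss
            return loss, history
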
class CrossEntropyAgent(policy_network, agent_id='crossentropy_agent')[source]

Bases: prl.agents.agents.Agent

Agent using cross entropy algorithm

act(state)[source]

Makes a step based on the current environment state

Parameters:state (ndarray) – state from the environment.
Return type:ndarray
Returns:Action to execute on the environment.
id

Agent UUID

train_iteration(env, n_episodes=32, percentile=75)[source]

Performs a single training iteration. This method should contain the repeatable part of training an agent.

Parameters:
  • env (EnvironmentABC) – Environment
  • **kwargs – Kwargs passed from train() method
class DQNAgent(q_network, replay_buffer_size=10000, start_epsilon=1.0, end_epsilon=0.05, epsilon_decay=1000, training_set_size=64, target_network_copy_iter=100, steps_between_training=10, agent_id='DQN_agent')[source]

Bases: prl.agents.agents.Agent

Agent using DQN algorithm

act(state)[source]

Makes a step based on the current environment state

Parameters:state (ndarray) – state from the environment.
Return type:ndarray
Returns:Action to execute on the environment.
id

Agent UUID

pre_train_setup(env, discount_factor=1.0, **kwargs)[source]

Performs pre-training setup. This method should handle the non-repeatable part of training an agent.

Parameters:
  • env (EnvironmentABC) – Environment
  • **kwargs – Kwargs passed from train() method
train_iteration(env, discount_factor=1.0)[source]

Performs a single training iteration. This method should contain the repeatable part of training an agent.

Parameters:
  • env (EnvironmentABC) – Environment
  • **kwargs – Kwargs passed from train() method
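A sketch of wiring up a DQNAgent is shown below; the constructor signatures follow this reference, while the network shape, loss mode and hyperparameters are illustrative and assume that PytorchMLP exposes its parameters like a regular torch module.

    import gym
    from torch import nn, optim

    from prl.environments.environments import Environment
    from prl.agents.agents import DQNAgent
    from prl.function_approximators.pytorch_nn import DQNLoss, PytorchFA, PytorchMLP

    gym_env = gym.make("CartPole-v0")
    env = Environment(gym_env)

    # Q-network: one output per discrete action.
    net = PytorchMLP(x_shape=gym_env.observation_space.shape,
                     y_size=gym_env.action_space.n,
                     output_activation=nn.Identity(),
                     hidden_sizes=[128, 128])
    q_network = PytorchFA(net=net,
                          loss=DQNLoss(mode="huber"),
                          optimizer=optim.Adam(net.parameters(), lr=1e-3))

    agent = DQNAgent(q_network=q_network,
                     replay_buffer_size=10000,
                     start_epsilon=1.0,
                     end_epsilon=0.05,
                     epsilon_decay=1000)
    agent.train(env, n_iterations=500, discount_factor=0.99)
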
class GAEAdvantage(lambda_)[source]

Bases: prl.agents.agents.Advantage

Advantage function from High-Dimensional Continuous Control Using Generalized Advantage Estimation.

calculate_advantages(rewards, baselines, dones, discount_factor)[source]
Return type:ndarray
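For reference, a plain NumPy version of generalized advantage estimation is sketched below. It follows the formula from the cited paper and is not claimed to be PRL's exact implementation; in particular, the convention that baselines carries one extra trailing value estimate is an assumption.

    import numpy as np

    def gae_advantages(rewards, baselines, dones, discount_factor, lambda_):
        # baselines is assumed to hold len(rewards) + 1 value estimates
        # (including the value of the state after the last reward).
        advantages = np.zeros(len(rewards), dtype=np.float32)
        acc = 0.0
        for t in reversed(range(len(rewards))):
            nonterminal = 1.0 - float(dones[t])
            delta = (rewards[t]
                     + discount_factor * baselines[t + 1] * nonterminal
                     - baselines[t])
            acc = delta + discount_factor * lambda_ * nonterminal * acc
            advantages[t] = acc
        return advantages
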
class REINFORCEAgent(policy_network, agent_id='REINFORCE_agent')[source]

Bases: prl.agents.agents.Agent

Agent using REINFORCE algorithm

act(state)[source]

Makes a step based on the current environment state

Parameters:state (ndarray) – state from the environment.
Return type:ndarray
Returns:Action to execute on the environment.
id

Agent UUID

pre_train_setup(env, discount_factor=1.0, **kwargs)[source]

Performs pre-training setup. This method should handle the non-repeatable part of training an agent.

Parameters:
  • env (EnvironmentABC) – Environment
  • **kwargs – Kwargs passed from train() method
train_iteration(env, n_episodes=32, discount_factor=1.0)[source]

Performs a single training iteration. This method should contain the repeatable part of training an agent.

Parameters:
  • env (EnvironmentABC) – Environment
  • **kwargs – Kwargs passed from train() method
class RandomAgent(agent_id='random_agent', replay_buffer_size=100)[source]

Bases: prl.agents.agents.Agent

Agent performing random actions

act(state)[source]

Makes a step based on the current environment state

Parameters:state (ndarray) – state from the environment.
Returns:Action to execute on the environment.
id

Agent UUID

pre_train_setup(env, **kwargs)[source]

Performs pre-training setup. This method should handle the non-repeatable part of training an agent.

Parameters:
  • env (Environment) – Environment
  • **kwargs – Kwargs passed from train() method
train_iteration(env, discount_factor=1.0)[source]

Performs a single training iteration. This method should contain the repeatable part of training an agent.

Parameters:
  • env (Environment) – Environment
  • **kwargs – Kwargs passed from train() method
Module contents
prl.callbacks package
Submodules
prl.callbacks.callbacks module
class AgentCallback[source]

Bases: prl.typing.AgentCallbackABC

Interface for Callbacks defining actions that are executed automatically during different phases of agent training.

on_iteration_end(agent)[source]

Method called at the end of every iteration in prl.base.Agent.train method.

Parameters:agent (AgentABC) – Agent in which this callback is called.
Return type:bool
Returns:True if training should be interrupted, False otherwise
on_training_begin(agent)[source]

Method called after prl.base.Agent.pre_train_setup.

Parameters:agent (AgentABC) – Agent in which this callback is called
on_training_end(agent)[source]

Method called after prl.base.Agent.post_train_cleanup.

Parameters:agent (AgentABC) – Agent in which this callback is called.
class BaseAgentCheckpoint(target_path, save_best_only=True, iteration_interval=1, number_of_test_runs=1)[source]

Bases: prl.callbacks.callbacks.AgentCallback

Saves agents during training. This is a base class that implements only the logic; use a subclass whose saving method matches the networks’ framework. For more info on methods see base class.

Parameters:
  • target_path (str) – Directory in which agents will be saved. Must exist before creating this callback.
  • save_best_only (bool) – Whether to save all models, or only the one with highest reward.
  • iteration_interval (int) – Interval between calculating test reward. Using low values may make training process slower
  • number_of_test_runs (int) – Number of test runs when calculating reward. Higher value averages variance out, but makes training longer.
on_iteration_end(agent)[source]

Method called at the end of every iteration in prl.base.Agent.train method.

Parameters:agent (AgentABC) – Agent in which this callback is called.
Returns:True if training should be interrupted, False otherwise
on_training_end(agent)[source]

Method called after prl.base.Agent.post_train_cleanup.

Parameters:agent (AgentABC) – Agent in which this callback is called.
class CallbackHandler(callback_list, env)[source]

Bases: object

Callback handler that manages all given callbacks. Calls the appropriate methods on each callback and aggregates break codes. For more info on methods see base class.

static check_run_condition(current_count, interval)[source]
on_iteration_end(agent)[source]
on_training_begin(agent)[source]
on_training_end(agent)[source]
run_tests(agent)[source]
Return type:HistoryABC
setup_callbacks()[source]

Sets up callbacks. This calculates optimal intervals for calling callbacks, and for calling testing procedure.

class EarlyStopping(target_reward, iteration_interval=1, number_of_test_runs=1, verbose=1)[source]

Bases: prl.callbacks.callbacks.AgentCallback

Implements early stopping for RL agents. Training is stopped after reaching the given target reward.

Parameters:
  • target_reward (float) – Target reward.
  • iteration_interval (int) – Interval between calculating test reward. Using low values may make training process slower.
  • number_of_test_runs (int) – Number of test runs when calculating reward. Higher value averages variance out, but makes training longer.
  • verbose (int) – Whether to print message after stopping training (1), or not (0).

Note

By reward, we mean here untransformed reward given by Agent.test method. For more info on methods see base class.

on_iteration_end(agent)[source]

Method called at the end of every iteration in prl.base.Agent.train method.

Parameters:agent (AgentABC) – Agent in which this callback is called.
Returns:True if training should be interrupted, False otherwise
class PyTorchAgentCheckpoint(target_path, save_best_only=True, iteration_interval=1, number_of_test_runs=1)[source]

Bases: prl.callbacks.callbacks.BaseAgentCheckpoint

Class for saving PyTorch-based agents. For more details, see parent class.

class TensorboardLogger(file_path='logs_1581541668', iteration_interval=1, number_of_test_runs=1, show_time_logs=False)[source]

Bases: prl.callbacks.callbacks.AgentCallback

Writes various information to tensorboard during training. For more info on methods see base class.

Parameters:
  • file_path (str) – Path to file with output.
  • iteration_interval (int) – Interval between calculating test reward. Using low values may make training process slower.
  • number_of_test_runs (int) – Number of test runs when calculating reward. Higher value averages variance out, but makes training longer.
  • show_time_logs (bool) – Whether to show logs from time_logger.
on_iteration_end(agent)[source]

Method called at the end of every iteration in prl.base.Agent.train method.

Parameters:agent (AgentABC) – Agent in which this callback is called.
Returns:True if training should be interrupted, False otherwise
on_training_end(agent)[source]

Method called after prl.base.Agent.post_train_cleanup.

Parameters:agent (AgentABC) – Agent in which this callback is called.
class TrainingLogger(on_screen=True, to_file=False, file_path=None, iteration_interval=1)[source]

Bases: prl.callbacks.callbacks.AgentCallback

Logs training information after a certain number of iterations. Data may appear in the output or be written to a file. For more info on methods see base class.

Parameters:
  • on_screen (bool) – Whether to show info in output.
  • to_file (bool) – Whether to save info into a file.
  • file_path (Optional[str]) – Path to file with output.
  • iteration_interval (int) – How often info should be logged on screen. File output remains logged every iteration.
on_iteration_end(agent)[source]

Method called at the end of every iteration in prl.base.Agent.train method.

Parameters:agent (AgentABC) – Agent in which this callback is called.
Returns:True if training should be interrupted, False otherwise
class ValidationLogger(on_screen=True, to_file=False, file_path=None, iteration_interval=1, number_of_test_runs=3)[source]

Bases: prl.callbacks.callbacks.AgentCallback

Logs validation information after a certain number of iterations. Data may appear in the output or be written to a file. For more info on methods see base class.

Parameters:
  • on_screen (bool) – Whether to show info in output.
  • to_file (bool) – Whether to save info into a file.
  • file_path (Optional[str]) – Path to file with output.
  • iteration_interval (int) – How often info should be logged on screen. File output remains logged every iteration.
  • number_of_test_runs (int) – Number of played episodes in history’s summary logs.
on_iteration_end(agent)[source]

Method called at the end of every iteration in prl.base.Agent.train method.

Parameters:agent (AgentABC) – Agent in which this callback is called.
Returns:True if training should be interrupted, False otherwise
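A sketch of composing a callback list and passing it to Agent.train() is shown below; agent and env are assumed to be built as in the earlier examples, and the paths, intervals and reward threshold are illustrative.

    from prl.callbacks.callbacks import (
        EarlyStopping,
        PyTorchAgentCheckpoint,
        TensorboardLogger,
        TrainingLogger,
        ValidationLogger,
    )

    callbacks = [
        TrainingLogger(on_screen=True, iteration_interval=10),
        ValidationLogger(iteration_interval=50, number_of_test_runs=3),
        TensorboardLogger(file_path="logs/cartpole", iteration_interval=10),
        # The target directory must exist before the callback is created.
        PyTorchAgentCheckpoint("checkpoints", save_best_only=True,
                               iteration_interval=50),
        EarlyStopping(target_reward=195.0, iteration_interval=10),
    ]

    agent.train(env, n_iterations=1000, callback_list=callbacks)
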
Module contents
prl.environments package
Submodules
prl.environments.environments module
class Environment(env, environment_id='Environment_wrapper', state_transformer=<prl.transformers.state_transformers.NoOpStateTransformer object>, reward_transformer=<prl.transformers.reward_transformers.NoOpRewardTransformer object>, action_transformer=<prl.transformers.action_transformers.NoOpActionTransformer object>, expected_episode_length=512, dump_history=False)[source]

Bases: prl.typing.EnvironmentABC, abc.ABC

Interface for wrappers of gym-like environments. It can use a StateTransformer and a RewardTransformer to shape states and rewards into a form convenient for the agent. It can also use an ActionTransformer to change the action representation from the one suitable for the agent to the one required by the wrapped environment.

The Environment also keeps the history of the current episode, so this doesn’t have to be implemented on the agent side. All the transformers can use this history to transform states, actions and rewards.

action_space

action_space object from the action_transformer

Return type:Space
action_transformer

Action transformers can be used to change the representation of actions, e.g. changing the coordinate system or feeding only the difference from the last action for a continuous action space. The ActionTransformer is used to change the representation from the one suitable for the agent to the one required by the wrapped environment.

Return type:ActionTransformerABC
Returns:ActionTransformer object
close()[source]

Cleans up and closes the environment

id

Environment UUID

observation_space

observation_space object from the state_transformer

Return type:Space
reset()[source]

Resets the environments to initial state and returns this initial state.

Return type:ndarray
Returns:New state
reward_transformer

Reward transformer object for reward shaping like taking the sign of the original reward or adding reward for staying on track in a car racing game.

Return type:RewardTransformerABC
Returns:RewardTransformer object
state_history

Current episode history

Return type:HistoryABC
state_transformer

StateTransformer object for state transformations. It can be used to change the representation of the state: for example, simply subtracting a constant vector from the state, stacking the last N states, or transforming an image into a compressed representation using an autoencoder.

Return type:StateTransformer
Returns:StateTransformer object
step(action)[source]

Transform and perform a given action in the wrapped environment. Returns transformed states and rewards from wrapped environment.

Parameters:action (ndarray) – Action executed by the agent.
Returns:observation – New state; reward – Reward we get from performing the action; done – Whether the simulation is finished; info – Additional diagnostic information
Return type:Tuple[ndarray, Real, bool, dict]

Note

When the true_reward flag is set to True, the method returns the non-transformed reward, for testing purposes.

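A sketch of wrapping a gym environment with transformers is shown below. The constructor arguments follow this reference; the shift values are illustrative, and passing a NumPy array as the shift_tensor of StateShiftTransformer is an assumption about the expected type.

    import gym
    import numpy as np

    from prl.environments.environments import Environment
    from prl.transformers.reward_transformers import RewardShiftTransformer
    from prl.transformers.state_transformers import StateShiftTransformer

    gym_env = gym.make("CartPole-v0")
    env = Environment(gym_env,
                      environment_id="cartpole_shifted",
                      state_transformer=StateShiftTransformer(
                          np.zeros(4, dtype=np.float32)),
                      reward_transformer=RewardShiftTransformer(shift=-0.01),
                      expected_episode_length=512)

    state = env.reset()
    state, reward, done, info = env.step(gym_env.action_space.sample())
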
class FrameSkipEnvironment(env, environment_id='frameskip_gym_environment_wrapper', state_transformer=<prl.transformers.state_transformers.NoOpStateTransformer object>, reward_transformer=<prl.transformers.reward_transformers.NoOpRewardTransformer object>, action_transformer=<prl.transformers.action_transformers.NoOpActionTransformer object>, expected_episode_length=512, n_skip_frames=0, cumulative_reward=False)[source]

Bases: prl.environments.environments.Environment

Environment wrapper skipping frames from original environment. Action executed by the agent is repeated on the skipped frames.

Parameters:
  • env (Env) – Environment with gym like API
  • environment_id (str) – ID of the env
  • state_transformer (StateTransformer) – Object of the class StateTransformer
  • reward_transformer (RewardTransformer) – Object of the class RewardTransformer
  • action_transformer (ActionTransformer) – Object of the class ActionTransformer
  • n_skip_frames (int) – Number of frames to skip on each step.
  • cumulative_reward – If True, reward returned from step() method is cumulative reward from the skipped steps.
step(action)[source]

Transform and perform a given action in the wrapped environment. Returns transformed states and rewards from wrapped environment.

Parameters:action (ndarray) – Action executed by the agent.
Returns:observation – New state; reward – Reward we get from performing the action; done – Whether the simulation is finished; info – Additional diagnostic information
Return type:Tuple[ndarray, Real, bool, dict]

Note

When the true_reward flag is set to True, the method returns the non-transformed reward, for testing purposes.

class TimeShiftEnvironment(env, environment_id='timeshift_gym_environment_wrapper', state_transformer=<prl.transformers.state_transformers.NoOpStateTransformer object>, reward_transformer=<prl.transformers.reward_transformers.NoOpRewardTransformer object>, action_transformer=<prl.transformers.action_transformers.NoOpActionTransformer object>, expected_episode_length=512, lag=1)[source]

Bases: prl.environments.environments.Environment

Environment wrapper creating a lag between the action passed to the step() method by the agent and its execution in the environment. The first 'lag' actions are sampled from the action_space.

Parameters:
  • env (Env) – Environment with gym like API
  • environment_id (str) – ID of the env
  • state_transformer (StateTransformer) – Object of the class StateTransformer
  • reward_transformer (RewardTransformer) – Object of the class RewardTransformer
  • action_transformer (ActionTransformer) – Object of the class ActionTransformer (do not use: action transformation is not implemented)

Note

The class does not implement action transformation.

reset()[source]

Resets the environments to initial state and returns this initial state.

Return type:ndarray
Returns:New state
step(action)[source]

Transform and perform a given action in the wrapped environment. Returns transformed states and rewards from wrapped environment.

Parameters:action (ndarray) – Action executed by the agent.
Returns:observation – New state; reward – Reward we get from performing the action; done – Whether the simulation is finished; info – Additional diagnostic information
Return type:Tuple[ndarray, Real, bool, dict]

Note

When the true_reward flag is set to True, the method returns the non-transformed reward, for testing purposes.

class TransformedSpace(shape=None, dtype=None, transformed_state=None)[source]

Bases: gym.core.Space

Class created to handle Environments that use StateTransformers, since the observation space is not directly specified in such a setup.

contains(state)[source]

This method is not available, because a TransformedSpace object cannot determine whether x is contained in the state representation; the TransformedSpace object only infers the state properties.

sample()[source]

Returns a sample state. An object of this class always returns the same sample, so a new object needs to be created for every sample. When used inside an Environment with a StateTransformer, every call of the observation_space property initializes a new object, so another sample is returned.

Returns:Transformed state
Module contents
prl.function_approximators package
Submodules
prl.function_approximators.function_approximators module
class FunctionApproximator[source]

Bases: prl.typing.FunctionApproximatorABC, abc.ABC

Class for function approximators used by the agents. For example, it could be a neural network approximating a value function or a policy.

id

Function Approximator UUID

Return type:str
predict(x)[source]

Makes prediction based on input

train(x, *loss_args)[source]

Trains the function approximator for one or more steps. Returns the training loss value.

Return type:float
prl.function_approximators.pytorch_nn module
class DQNLoss(mode='huber', size_average=None, reduce=None, reduction='mean')[source]

Bases: sphinx.ext.autodoc.importer._MockObject

forward(nn_outputs, actions, target_outputs)[source]
class PolicyGradientLoss(size_average=None, reduce=None, reduction='mean')[source]

Bases: sphinx.ext.autodoc.importer._MockObject

forward(nn_outputs, actions, returns)[source]
class PytorchConv(x_shape, hidden_sizes, y_size)[source]

Bases: prl.function_approximators.pytorch_nn.PytorchNet

forward(x)[source]

Defines the computation performed at every training step.

Parameters:x – input data
Returns:network output
predict(x)[source]

Makes prediction based on input data.

Parameters:x – input data
Returns:prediction for agent.act(x) method
class PytorchFA(net, loss, optimizer, device='cpu', batch_size=64, last_batch=True, network_id='pytorch_nn')[source]

Bases: prl.function_approximators.function_approximators.FunctionApproximator

Class for PyTorch-based neural network function approximators.

Parameters:
  • net (PytorchNet) – PytorchNet class neural network
  • loss – loss function
  • optimizer – optimizer
  • device (str) – device for computation: “cpu” or “cuda”
  • batch_size (int) – size of a training batch
  • last_batch (bool) – whether the last batch (usually shorter than batch_size) is fed into the network
  • network_id (str) – name of the network for debugging and logging purposes
convert_to_pytorch(y)[source]
id

Function Approximator UUID

predict(x)[source]

Makes prediction

train(x, *loss_args)[source]

Trains network on a dataset

Parameters:
  • x (ndarray) – input array for the network
  • *loss_args – arguments passed directly to loss function
class PytorchMLP(x_shape, y_size, output_activation, hidden_sizes)[source]

Bases: prl.function_approximators.pytorch_nn.PytorchNet

forward(x)[source]

Defines the computation performed at every training step.

Parameters:x – input data
Returns:network output
predict(x)[source]

Makes prediction based on input data.

Parameters:x – input data
Returns:prediction for agent.act(x) method
class PytorchNet(*args, **kwargs)[source]

Bases: prl.typing.PytorchNetABC

Neural network class for PytorchFA. It has a separate predict() method, used strictly by Agent.act(), which can behave differently from the forward() method.

Note

This class has two abstract methods that need to be implemented (listed above).

forward(x)[source]

Defines the computation performed at every training step.

Parameters:x – input data
Returns:network output
predict(x)[source]

Makes prediction based on input data.

Parameters:x – input data
Returns:prediction for agent.act(x) method
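A minimal sketch of a custom network is shown below, assuming PytorchNet behaves like a regular torch module (layers registered in __init__, forward() used during training). forward() returns raw scores for the loss, while predict() produces what Agent.act() consumes; the softmax in predict() is an illustrative choice.

    import torch
    from torch import nn

    from prl.function_approximators.pytorch_nn import PytorchNet


    class TinyPolicyNet(PytorchNet):
        def __init__(self, n_inputs, n_actions):
            super().__init__()
            self.body = nn.Sequential(
                nn.Linear(n_inputs, 64), nn.ReLU(),
                nn.Linear(64, n_actions),
            )

        def forward(self, x):
            # Raw action scores for the loss function during training.
            return self.body(x)

        def predict(self, x):
            # Action probabilities for Agent.act(); no gradients needed here.
            with torch.no_grad():
                return torch.softmax(self.forward(x), dim=-1)
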
Module contents
prl.storage package
Submodules
prl.storage.storage module
class History(initial_state, action_type, initial_length=512)[source]

Bases: prl.storage.storage.Storage, prl.typing.HistoryABC

An object used to keep episode history (used within the Environment class and by some agents). An agent can use this object to keep the history of past episodes, calculate returns and total rewards, and sample batches from it.

The object also supports indexing and slicing via the Python Sequence protocol, so functions working on sequences, such as random.choice, can also be used on a history.

Parameters:
  • initial_state (ndarray) – initial state from the environment
  • action_type (type) – numpy type of action (e.g. np.int32)
  • initial_length (int) – initial length of a history
get_actions()[source]

Returns an array of all actions.

Return type:ndarray
Returns:array of all actions
get_dones()[source]

Returns an array of all done flags.

Return type:ndarray
Returns:array of all done flags
get_last_state()[source]

Returns only the last state.

Return type:ndarray
Returns:last state
get_number_of_episodes()[source]

Returns a number of full episodes in history.

Return type:int
Returns:number of full episodes in history
get_returns(discount_factor=1.0, horizon=inf)[source]

Calculates returns for each step.

Return type:ndarray
Returns:array of discounted returns for each step
get_rewards()[source]

Returns an array of all rewards.

Return type:ndarray
Returns:array of all rewards
get_states()[source]

Returns an array of all states.

Return type:ndarray
Returns:array of all states
get_summary()[source]
Return type:Tuple[float, float, int]
get_total_rewards()[source]

Calculates the sum of all rewards for each episode and reports it for each state, so every state in one episode has the same total reward value. This can be useful for filtering states from the best episodes (e.g. in the cross-entropy algorithm).

Return type:ndarray
Returns:total reward for each state
new_state_update(state)[source]

Overwrites newest state in the History

Parameters:state (ndarray) – state array.
sample_batch(replay_buffer_size, batch_size=64, returns=False, next_states=False)[source]

Samples batch of examples from the Storage.

Parameters:
  • replay_buffer_size (int) – length of the replay buffer to sample examples from
  • batch_size (int) – number of returned examples
  • returns (bool) – if True, the method will return the returns from each step instead of the rewards
  • next_states (bool) – if True, the method will return also next states (i.e. for DQN algorithm)
Returns:

states, actions, rewards, dones, (new_states)

Return type:

Batch of samples from the history, as a tuple of np.ndarrays in the above order

update(action, reward, done, state)[source]

Updates the object with latest states, reward, actions and done flag.

Parameters:
  • action (ndarray) – action executed by the agent
  • reward (Real) – reward from environments
  • done (bool) – done flag from environments
  • state (ndarray) – new state returned by wrapped environments after executing action
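A short sketch of inspecting a History is shown below; agent and env are assumed to come from the earlier examples, and the discount factor and batch settings are illustrative.

    history = agent.test(env)  # one full episode

    states = history.get_states()
    returns = history.get_returns(discount_factor=0.99)
    total_rewards = history.get_total_rewards()  # same value for every state of an episode
    print(history.get_number_of_episodes(), returns[0], total_rewards[0])

    # Sampling a training batch (e.g. for DQN-style updates):
    s, a, r, d, s_next = history.sample_batch(replay_buffer_size=10000,
                                              batch_size=64,
                                              next_states=True)
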
class Memory(initial_state, action_type, maximum_length=1000)[source]

Bases: prl.storage.storage.Storage, prl.typing.StorageABC

An object to be used as a replay buffer. It doesn’t contain full episodes and acts as a limited FIFO queue. Implemented as double-size numpy arrays with duplicated data to support very fast slicing and sampling, at the cost of higher memory usage.

Parameters:
  • initial_state (ndarray) – initial state from the environment
  • action_type – numpy type of action (e.g. np.int32)
  • maximum_length (int) – maximum number of examples to keep in queue
clear(initial_state)[source]
get_actions()[source]

Returns an array of all actions.

Return type:ndarray
Returns:array of all actions
get_dones()[source]

Returns an array of all done flags.

Return type:ndarray
Returns:array of all done flags
get_last_state()[source]

Returns only the last state.

Return type:ndarray
Returns:last state
get_rewards()[source]

Returns an array of all rewards.

Return type:ndarray
Returns:array of all rewards
get_states(include_last=False)[source]

Returns an array of all states.

Return type:ndarray
Returns:array of all states
new_state_update(state)[source]

Overwrites newest state in the History

Parameters:state – state array.
sample_batch(replay_buffor_size, batch_size=64, returns=False, next_states=False)[source]

Samples batch of examples from the Storage.

Parameters:
  • replay_buffer_size – length of the replay buffer to sample examples from
  • batch_size (int) – number of returned examples
  • returns (bool) – if True, the method will return the returns from each step instead of the rewards
  • next_states (bool) – if True, the method will return also next states (i.e. for DQN algorithm)
Returns:

states, actions, rewards, dones, (new_states)

Return type:

Batch of samples from the history, as a tuple of np.ndarrays in the above order

update(action, reward, done, state)[source]

Updates the object with latest states, reward, actions and done flag.

Parameters:
  • action – action executed by the agent
  • reward – reward from environments
  • done – done flag from environments
  • state – new state returned by wrapped environments after executing action
class Storage[source]

Bases: prl.typing.StorageABC, abc.ABC

get_actions()[source]

Returns an array of all actions.

Return type:ndarray
Returns:array of all actions
get_dones()[source]

Returns an array of all done flags.

Return type:ndarray
Returns:array of all done flags
get_last_state()[source]

Returns only the last state.

Return type:ndarray
Returns:last state
get_rewards()[source]

Returns an array of all rewards.

Return type:ndarray
Returns:array of all rewards
get_states()[source]

Returns an array of all states.

Return type:ndarray
Returns:array of all states
new_state_update(state)[source]

Overwrites newest state in the History

Parameters:state – state array.
sample_batch(replay_buffor_size, batch_size, returns, next_states)[source]

Samples batch of examples from the Storage.

Parameters:
  • replay_buffer_size – length of the replay buffer to sample examples from
  • batch_size (int) – number of returned examples
  • returns (bool) – if True, the method will return the returns from each step instead of the rewards
  • next_states (bool) – if True, the method will return also next states (i.e. for DQN algorithm)
Returns:

states, actions, rewards, dones, (new_states)

Return type:

Batch of samples from the history, as a tuple of np.ndarrays in the above order

update(action, reward, done, state)[source]

Updates the object with latest states, reward, actions and done flag.

Parameters:
  • action – action executed by the agent
  • reward – reward from environments
  • done – done flag from environments
  • state – new state returned by wrapped environments after executing action
calculate_returns(all_rewards, dones, horizon, discount_factor, _index)[source]
calculate_total_rewards(all_rewards, dones, _index)[source]
Module contents
prl.transformers package
Submodules
prl.transformers.action_transformers module
class ActionTransformer[source]

Bases: prl.typing.ActionTransformerABC, abc.ABC

Interface for transformers of raw actions (original actions from the agent). Objects of this class are used by classes implementing the EnvironmentABC interface. Action transformers can use all the episode history from the beginning of the episode up to the moment of transformation.

action_space(original_space)[source]

Returns: action_space object of class gym.Space, which defines type and shape of transformed action.

Note

If the transformed action is from the same action_space as the original action, then action_space is None. Information contained within action_space can be important for agents, so it is important to define the action_space properly.

Return type:Space
id

State transformer UUID

Return type:str
reset()[source]

The action transformer can be stateful, so it has to be reset after each episode.

transform(action, history)[source]

Transforms action into another representation, which must be of the form defined by action_space object. Input action can be in a form of numpy array, list, tuple, int, etc.

Parameters:
  • action (ndarray) – Action from the agent
  • history (HistoryABC) – History object of an episode
Return type:

ndarray

Returns:

Transformed action in form defined by the action_space object.

class NoOpActionTransformer[source]

Bases: prl.transformers.action_transformers.ActionTransformer

ActionTransformer doing nothing

action_space(original_space)[source]

Returns: action_space object of class gym.Space, which defines type and shape of transformed action.

Note

If the transformed action is from the same action_space as the original action, then action_space is None. Information contained within action_space can be important for agents, so it is important to define the action_space properly.

Return type:Space
id

State transformer UUID

reset()[source]

The action transformer can be stateful, so it has to be reset after each episode.

transform(action, history)[source]

Transforms action into another representation, which must be of the form defined by action_space object. Input action can be in a form of numpy array, list, tuple, int, etc.

Parameters:
  • action (ndarray) – Action from the agent
  • history (HistoryABC) – History object of an episode
Return type:

ndarray

Returns:

Transformed action in form defined by the action_space object.

prl.transformers.reward_transformers module
class NoOpRewardTransformer[source]

Bases: prl.transformers.reward_transformers.RewardTransformer

RewardTransformer doing nothing

id()[source]

Reward transformer UUID

reset()[source]

The reward transformer can be stateful, so it has to be reset after each episode.

transform(reward, history)[source]

Transforms a reward.

Parameters:
  • reward (Real) – Raw reward from the wrapped environment
  • history (HistoryABC) – History object
Return type:

Number

Returns:

Transformed reward

class RewardShiftTransformer(shift)[source]

Bases: prl.transformers.reward_transformers.RewardTransformer

RewardTransformer shifting reward by some constant value

id()[source]

Reward transformer UUID

reset()[source]

The reward transformer can be stateful, so it has to be reset after each episode.

transform(reward, history)[source]

Transforms a reward.

Parameters:
  • reward (Real) – Raw reward from the wrapped environment
  • history (HistoryABC) – History object
Return type:

Number

Returns:

Transformed reward

class RewardTransformer[source]

Bases: prl.typing.RewardTransformerABC, abc.ABC

Interface for classes shaping the raw reward from wrapped environments. Objects inheriting from this class are used by Environment class objects. Reward transformers can use all the episode history from the beginning of the episode up to the moment of transformation.

id

Reward transformer UUID

Return type:str
reset()[source]

The reward transformer can be stateful, so it has to be reset after each episode.

transform(reward, history)[source]

Transforms a reward.

Parameters:
  • reward (Real) – Raw reward from the wrapped environment
  • history (HistoryABC) – History object
Return type:

Real

Returns:

Transformed reward

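A minimal sketch of a custom reward transformer is shown below: the sign-of-reward shaping mentioned in the Environment description. The id()/reset() handling mirrors the interface above but is otherwise illustrative.

    import numpy as np

    from prl.transformers.reward_transformers import RewardTransformer


    class SignRewardTransformer(RewardTransformer):
        """Clips every raw reward to -1, 0 or +1."""

        def id(self):
            return "sign_reward_transformer"

        def reset(self):
            pass  # stateless, nothing to reset between episodes

        def transform(self, reward, history):
            return float(np.sign(reward))
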
prl.transformers.state_transformers module
class NoOpStateTransformer[source]

Bases: prl.transformers.state_transformers.StateTransformer

StateTransformer doing nothing

id

State transformer UUID

reset()[source]

The state transformer can be stateful, so it has to be reset after each episode.

transform(state, history)[source]

Transforms observed state into another representation, which must be of the form defined by observation_space object. Input state must be in a form of numpy.ndarray.

Parameters:
  • state (ndarray) – State from wrapped environment
  • history (HistoryABC) – History object
Return type:

ndarray

Returns:

Transformed state in form defined by the observation_space object.

class PongTransformer(resize_factor=2, crop=True, flatten=False)[source]

Bases: prl.transformers.state_transformers.StateTransformer

StateTransformer for Pong atari game

id

State transformer UUID

reset()[source]

The state transformer can be stateful, so it has to be reset after each episode.

transform(observation, history)[source]

Transforms observed state into another representation, which must be of the form defined by observation_space object. Input state must be in a form of numpy.ndarray.

Parameters:
  • state – State from wrapped environment
  • history (HistoryABC) – History object
Return type:

ndarray

Returns:

Transformed state in form defined by the observation_space object.

class StateShiftTransformer(shift_tensor)[source]

Bases: prl.transformers.state_transformers.StateTransformer

StateTransformer shifting the state by some constant vector

id

State transformer UUID

reset()[source]

The state transformer can be stateful, so it has to be reset after each episode.

transform(state, history)[source]

Transforms observed state into another representation, which must be of the form defined by observation_space object. Input state must be in a form of numpy.ndarray.

Parameters:
  • state (ndarray) – State from wrapped environment
  • history (HistoryABC) – History object
Return type:

ndarray

Returns:

Transformed state in form defined by the observation_space object.

class StateTransformer[source]

Bases: prl.typing.StateTransformerABC, abc.ABC

Interface for transformers of raw states (original states from wrapped environments). Objects of this class are used by classes implementing the EnvironmentABC interface. State transformers can use all the episode history from the beginning of the episode up to the moment of transformation.

id

State transformer UUID

Return type:str
reset()[source]

The state transformer can be stateful, so it has to be reset after each episode.

transform(state, history)[source]

Transforms observed state into another representation, which must be of the form defined by observation_space object. Input state must be in a form of numpy.ndarray.

Parameters:
  • state (ndarray) – State from wrapped environment
  • history (HistoryABC) – History object
Return type:

ndarray

Returns:

Transformed state in form defined by the observation_space object.

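A sketch of a stateful transformer in the spirit of the "stacking the last N states" example from the Environment description is shown below; the id handling and the padding behaviour at the start of an episode are illustrative.

    from collections import deque

    import numpy as np

    from prl.transformers.state_transformers import StateTransformer


    class StackStatesTransformer(StateTransformer):
        def __init__(self, n=4):
            self.n = n
            self._frames = deque(maxlen=n)

        @property
        def id(self):
            return "stack_%d_states_transformer" % self.n

        def reset(self):
            self._frames.clear()  # stateful, so clear between episodes

        def transform(self, state, history):
            self._frames.append(state)
            while len(self._frames) < self.n:  # pad at the start of an episode
                self._frames.append(state)
            return np.concatenate(list(self._frames), axis=-1)
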
Module contents
prl.utils package
Submodules
prl.utils.loggers module
class Logger[source]

Bases: object

Class for logging scalar values to limited queues. Logged data sent to each client is tracked by the Logger, so each client can ask for unseen data and receive it.

add(key, value)[source]

Adds a value to the queue assigned to the given key.

Parameters:
  • key (str) – logged value name
  • value (Number) – logged number
flush(consumer_id)[source]

Method used by clients to receive only new, unseen data from the logger.

Parameters:consumer_id (int) – value returned by register method.
Return type:(typing.Dict[str, typing.List], typing.Dict[str, range], typing.Dict[str, typing.List])
Returns:dict with new data.
get_data()[source]
Return type:Dict[str, deque]
Returns:all logged data.
register()[source]

Registers client in order to receive data from Logger object.

Return type:int
Returns:client ID used to identify the client when requesting new data.
save(path)[source]

Saves data to file.

Parameters:path (str) – path to the file.
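A short usage sketch of the Logger is shown below; the import path follows the module name above, and the interpretation of the three dictionaries returned by flush() is kept deliberately loose.

    from prl.utils.loggers import Logger

    logger = Logger()
    consumer_id = logger.register()

    logger.add("loss", 0.52)
    logger.add("loss", 0.47)

    # Only data not yet seen by this consumer is returned.
    values, ranges, _ = logger.flush(consumer_id)
    print(values.get("loss"))
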
class TimeLogger[source]

Bases: prl.utils.loggers.Logger

Storage for measurements of function and method execution times. Used by the timeit function/decorator. Can be used to print a time-profiling summary or to save all data in order to plot how execution times change during program execution.

limited_deque()[source]

Auxiliary function for Logger class.

Returns: Deque with maximum length set to DEQUE_MAX_LEN

prl.utils.misc module
class colors[source]

Bases: object

Color codes for unicode strings. Used for output string formatting.

BLUE = '\x1b[94m'
BOLD = '\x1b[1m'
END_FORMAT = '\x1b[0m'
GREEN = '\x1b[92m'
RED = '\x1b[91m'
UNDERLINE = '\x1b[4m'
YELLOW = '\x1b[93m'
prl.utils.utils module
timeit(func, profiled_function_name=None)[source]

Decorator for profiling the execution time of functions and methods. To measure the time of a method or function, put @timeit on the line before the function definition:

    @timeit
    def func(a, b, c="1"):
        pass

or wrap the function in code:

    result = timeit(func, profiled_function_name="Profiled function func")(5, 5)

To print the results of the measurement, print the time_logger object from this package at the end of program execution. When the name of the function could be ambiguous in the profiler data, use the profiled_function_name parameter.

Parameters:
  • func – function whose execution time we want to measure
  • profiled_function_name – user defined name for the wrapped function.
Returns:

wrapped function

Module contents
Submodules
prl.typing module
class ActionTransformerABC[source]

Bases: abc.ABC

action_space(original_space)[source]
Return type:Space
id
Return type:str
reset()[source]
transform(action, history)[source]
Return type:ndarray
class AdvantageABC[source]

Bases: abc.ABC

class AgentABC[source]

Bases: abc.ABC

act(state)[source]
id
Return type:str
play_episodes(env, episodes)[source]
Return type:HistoryABC
play_steps(env, n_steps, history)[source]
Return type:HistoryABC
post_train_cleanup(env, **kwargs)[source]
pre_train_setup(env, **kwargs)[source]
test(env)[source]
Return type:HistoryABC
train(env, n_iterations, callback_list, **kwargs)[source]
train_iteration(env, **kwargs)[source]
Return type:Tuple[float, HistoryABC]
class AgentCallbackABC[source]

Bases: abc.ABC

on_iteration_end(agent)[source]
Return type:bool
on_training_begin(agent)[source]
on_training_end(agent)[source]
class EnvironmentABC[source]

Bases: abc.ABC

action_space
Return type:Space
action_transformer
Return type:ActionTransformerABC
close()[source]
id
observation_space
Return type:Space
reset()[source]
Return type:ndarray
reward_transformer
Return type:RewardTransformerABC
state_history
Return type:HistoryABC
state_transformer
Return type:StateTransformerABC
step(action)[source]
Return type:Tuple[ndarray, Real, bool, Dict[~KT, ~VT]]
class FunctionApproximatorABC[source]

Bases: abc.ABC

id
Return type:str
predict(x)[source]
train(x, *loss_args)[source]
Return type:float
class HistoryABC[source]

Bases: abc.ABC

get_actions()[source]
Return type:ndarray
get_dones()[source]
Return type:ndarray
get_last_state()[source]
Return type:ndarray
get_number_of_episodes()[source]
Return type:int
get_returns(discount_factor, horizon)[source]
Return type:ndarray
get_rewards()[source]
Return type:ndarray
get_states()[source]
Return type:ndarray
get_summary()[source]
get_total_rewards()[source]
Return type:ndarray
new_state_update(state)[source]
sample_batch(replay_buffor_size, batch_size, returns, next_states)[source]
Return type:tuple
update(action, reward, done, state)[source]
MemoryABC

alias of prl.typing.StorageABC

class PytorchNetABC(*args, **kwargs)[source]

Bases: sphinx.ext.autodoc.importer._MockObject

forward(x)[source]
predict(x)[source]
class RewardTransformerABC[source]

Bases: abc.ABC

id
Return type:str
reset()[source]
transform(reward, history)[source]
Return type:Real
class StateTransformerABC[source]

Bases: abc.ABC

id
Return type:str
reset()[source]
transform(state, history)[source]
Return type:ndarray
class StorageABC[source]

Bases: abc.ABC

get_actions()[source]
Return type:ndarray
get_dones()[source]
Return type:ndarray
get_last_state()[source]
Return type:ndarray
get_rewards()[source]
Return type:ndarray
get_states()[source]
Return type:ndarray
new_state_update(state)[source]
sample_batch(replay_buffor_size, batch_size, returns, next_states)[source]
Return type:tuple
update(action, reward, done, state)[source]
Module contents