![Policy invariance under reward transformations: Theory and application to reward shaping](/static/40bdda041d7087af1fc0d1526869a217/acb01/keyboard10.png)
Policy invariance under reward transformations: Theory and application to reward shaping
This paper investigates conditions under which modifications to the reward function of a Markov decision process preserve the optimal policy. It is shown that, besides the positive linear transformation familiar from utility theory, one can add a reward for transitions between states that is expressible as the difference in value of an arbitrary potential function […]