Autonomous helicopter control using reinforcement learning policy search methodsPolicy search by dynamic programming