Takes a single pendulum (with a torque actuator) and models it as a Markov Decision Process (MDP), using linear barycentric interpolation over a uniform grid. Then, value iteration is used to compute the optimal policy, which is then displayed as a plot. Finally, a simulation is run to demonstrate how to evaluate the optimal policy.
Matthew Kelly (2019). Markov Decision Process - Pendulum Control (https://www.github.com/matthewpeterkelly/MDP_Pendulum), GitHub. Retrieved .
added a photo.
Improved error handling regarding the MEX functions.