Takes a single pendulum (with a torque actuator) and models it as a Markov Decision Process (MDP), using linear barycentric interpolation over a uniform grid. Then, value iteration is used to compute the optimal policy, which is then displayed as a plot. Finally, a simulation is run to demonstrate how to evaluate the optimal policy.
added a photo.
Improved error handling regarding the MEX functions.