Question about State Feedback Q-Learning Value Iteration (VI) for discrete time system
Show older comments
I have a problem performing a simulation for the State Feedback Q-Learning Value Iteration (VI) algorithm for discrete time system. Details of the algorithm are as shown below. Can someone help me?

Answers (0)
Categories
Find more on Control System Toolbox in Help Center and File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!