In my last post, I applied a reinforcement learning algorithm called Q-Learning to maximize velocity in simulations attempting a human powered land speed record. This algorithm was limited by: An incomplete physics model. A sub-optimal reward function. A single driver limit. Improving the algorithm and simulation in these three ways is the focus of this…
Physical Simulation Optimization
Posted on