Post
saqib · 93 Days ago
Policy Certificates and Minimax-Optimal PAC Bounds for Episodic #ReinforcementLearning – #MachineLearning Blog | ML@CMU | @CarnegieMellon
blog.ml.cmu.edu
Post on the Qi stream