Return to Article Details Regret Bounds for Reinforcement Learning via Markov Chain Concentration Download Download PDF