Coordinated exploration in concurrent reinforcement learning