Adaptive and efficient batch reinforcement learning algorithms