Data sparse algorithms and mathematical theory for large-scale machine learning problems