Sublinear Approximation for Large-scale Data Science