Combining algorithms and humans for large-scale data integration