Data-driven methods for modeling healthcare risks: insights and applications in drug surveillance and breast cancer incidence prediction