Methods and systems for targeted evaluations of clinical machine learning models on the deployment population

Placeholder Show Content

Abstract/Contents

Abstract
The development of clinical machine learning applications frequently follows a sequential path from ideation to deployment — where each step is only considered after completion of the prior. A common consequence is that the distribution of data used to validate a model and permit eventual translation (the evaluation population) fails to reflect the distribution of data on which the model is served (the deployment population). In this dissertation I discuss methods and systems for enabling targeted evaluations of clinical machine learning models on the deployment population. I detail the implications of label selection (censoring) on the evaluation of binary classifiers, showing that traditional weighted estimators from causal inference literature recover performance over the deployment population when selection probabilities are properly specified. I emphasize the importance of silent prospective evaluations of machine learning models to appraise performance in the intended production environment. Silent trial evaluations require integrated deployment infrastructure with modern electronic medical record vendors — for which I develop and blueprint. I finally discuss the implications of feedback effects, where use of deployed models induce differences between the deployment population and the population that is prospectively observed. I provide a taxonomy of feedback effects, and recommend feedback aware model monitoring strategies. Instead of following a linear path from model ideation, to development and eventual deployment, researchers and machine learning practitioners should first consider the deployment population, and use it to contextualize model development and evaluation throughout its entire life-cycle.

Description

Type of resource text
Form electronic resource; remote; computer; online resource
Extent 1 online resource.
Place California
Place [Stanford, California]
Publisher [Stanford University]
Copyright date 2023; ©2023
Publication date 2023; 2023
Issuance monographic
Language English

Creators/Contributors

Author Corbin, Conor Kirkham
Degree supervisor Chen, Jonathan H
Degree supervisor Shah, Nigam
Thesis advisor Chen, Jonathan H
Thesis advisor Shah, Nigam
Thesis advisor Baiocchi, Michael
Thesis advisor Tibshirani, Robert
Degree committee member Baiocchi, Michael
Degree committee member Tibshirani, Robert
Associated with Stanford University, School of Medicine
Associated with Stanford University, Department of Biomedical Informatics

Subjects

Genre Theses
Genre Text

Bibliographic information

Statement of responsibility Conor Kirkham Corbin.
Note Submitted to the Department of Biomedical Informatics.
Thesis Thesis Ph.D. Stanford University 2023.
Location https://purl.stanford.edu/hg979cs4589

Access conditions

Copyright
© 2023 by Conor Kirkham Corbin
License
This work is licensed under a Creative Commons Attribution Non Commercial 3.0 Unported license (CC BY-NC).

Also listed in

Loading usage metrics...