Improving medical image segmentation by designing around clinical context



The rise of deep learning (DL) has produced many novel segmentation algorithms, which in turn have revolutionized medical image segmentation. However, several distinctions between natural and medical computer vision necessitate specialized algorithms to optimize performance: the multi-modality of medical data, the differences in imaging protocols between centers, and the limited amount of annotated data. These differences limit how well current state-of-the-art computer vision methods transfer to medical imaging. For segmentation, the major gaps our algorithms must bridge to become clinically useful are to (1) generalize to different imaging protocols, (2) become robust to training on noisy labels, and (3) improve segmentation performance overall. Current deep learning architectures are not robust to missing input modalities after training, leaving our networks unable to run inference on new data acquired with a different imaging protocol. By training our algorithms without accounting for the mutability of imaging protocols, we severely limit their deployability. Our current training paradigm also requires pristine segmentation labels, which demands a large time investment from expert annotators. By training with harsh loss functions like cross entropy, under the assumption that our labels are noise-free, we create a need for clean labels. This prevents our datasets from scaling to the size of natural computer vision datasets, as disease segmentations on medical images require far more time and effort to annotate than natural images with semantic classes. Finally, current state-of-the-art performance on difficult segmentation tasks like brain metastases is simply not good enough to be clinically useful.
We will need to explore new ways of designing and ensembling networks to increase segmentation performance if we aim to deploy these algorithms in any clinically relevant environment. We hypothesize that by changing neural network architectures and loss functions to account for noisy data, rather than assuming consistent imaging protocols and pristine labels, we can encode more robustness into our trained networks and improve segmentation performance on medical imaging tasks. In our experiments, we test several networks whose architectures and loss functions are motivated by realistic, clinically relevant situations. For these experiments, we chose brain metastasis lesion detection and segmentation as our model system: a difficult problem due to the high count and small size of the lesions, and an important one due to the need to assess the effects of treatment by tracking changes in tumor burden. In this dissertation, we present the following specific aims: (1) optimizing deep learning performance on brain metastases segmentation, (2) training networks to be robust to coarse annotations and missing data, and (3) validating our methodology on three secondary tasks. Our trained baseline (state of the art) performs brain metastases segmentation modestly, yielding an mAP of $0.46\pm0.02$ and a Dice score of 0.72. Changing our architectures to use different pulse sequence integration methods improves these values only slightly, raising mAP to $0.48\pm0.2$ with no improvement in Dice score. However, through investigating pulse sequence integration, we developed a novel input-level dropout training scheme that randomly holds out certain pulse sequences during different training iterations. This trains our network to be robust to missing pulse sequences at inference time, at no cost to performance.
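The input-level dropout scheme described above can be sketched in a few lines. This is a minimal NumPy illustration, not the dissertation's implementation: the function name, the `keep_prob` parameter, and the guarantee that at least one channel survives are all assumptions made for the sketch, with one input channel standing in for each pulse sequence.

```python
import numpy as np

def input_level_dropout(volume, keep_prob=0.75, rng=None):
    """Randomly zero out whole input channels (pulse sequences).

    volume: array of shape (channels, H, W), one channel per pulse
    sequence (e.g. T1, T1c, T2, FLAIR). During training, each channel
    is independently dropped with probability 1 - keep_prob; at least
    one channel is always kept so the network never sees empty input.
    Returns the masked volume and the boolean keep-mask.
    """
    if rng is None:
        rng = np.random.default_rng()
    n_channels = volume.shape[0]
    mask = rng.random(n_channels) < keep_prob
    if not mask.any():  # guarantee at least one surviving channel
        mask[rng.integers(n_channels)] = True
    return volume * mask[:, None, None], mask
```

Applied to every training batch, this exposes the network to all subsets of pulse sequences, so a sequence missing at inference looks like a pattern already seen during training.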
We then developed two additional robustness training schemes that enable training on heavily noisy annotations. We show no loss in performance when degrading 70\% of our segmentation annotations with spherical approximations, and a loss of less than 5\% when degrading 90\% of them. Similarly, when we censor 50\% of our annotated lesions (simulating a 50\% false negative rate), we preserve more than 95\% of the performance by utilizing a novel lopsided bootstrap loss. Building on these ideas, we use the lesion-based censoring technique as the basis of a novel ensembling method we call Random Bundle, which increases our mAP to $0.65\pm0.01$, an improvement of about 40\%. We validate our methods on three secondary datasets. By showing that our methods work on brain metastases data from Oslo University Hospital, we demonstrate robustness to cross-center data. By validating on the MICCAI BraTS dataset, we show robustness to magnetic resonance images of a different disorder. Finally, by validating on diabetic retinopathy micro-aneurysms in fundus photographs, we show that our methods are robust across imaging domains and organ systems. Our experiments support our claims that (1) designing architectures with a focus on how pulse sequences interact encodes robustness to different imaging protocols, (2) creating custom loss functions around expected annotation errors makes our networks more robust to those errors, and (3) the overall performance of our networks can be improved by using these novel architectures and loss functions.
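One plausible reading of the lopsided bootstrap loss can be sketched as an asymmetric bootstrapped cross-entropy: annotated lesion pixels are trusted fully, while background pixels, which may hide censored lesions, mix the hard label with the network's own prediction. The function name, the `beta` parameter, and this exact formulation are assumptions for illustration, not the dissertation's definition.

```python
import numpy as np

def lopsided_bootstrap_loss(pred, label, beta=0.8, eps=1e-7):
    """Asymmetric ("lopsided") bootstrapped cross-entropy sketch.

    pred:  predicted foreground probabilities in (0, 1)
    label: binary annotations (1 = lesion, 0 = background)
    beta:  trust in the given background label; the remaining
           1 - beta of the soft target comes from the prediction,
           so confident foreground predictions on "background"
           pixels (possible missed lesions) are penalized less.
    """
    pred = np.clip(pred, eps, 1 - eps)
    # Positives: ordinary cross-entropy against the trusted label.
    pos_loss = -np.log(pred)
    # Negatives: bootstrapped soft target beta * 0 + (1 - beta) * pred.
    neg_target = (1 - beta) * pred
    neg_loss = -(neg_target * np.log(pred)
                 + (1 - neg_target) * np.log(1 - pred))
    return np.where(label == 1, pos_loss, neg_loss).mean()
```

With `beta=1.0` this reduces to plain cross-entropy; lowering `beta` softens only the background term, which is why the loss is "lopsided" rather than symmetrically bootstrapped.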


Type of resource text
Form electronic resource; remote; computer; online resource
Extent 1 online resource
Place California
Place [Stanford, California]
Publisher [Stanford University]
Copyright date 2020; ©2020
Publication date 2020
Issuance monographic
Language English


Author Yi, Darvin
Degree supervisor Rubin, Daniel (Daniel L.)
Thesis advisor Rubin, Daniel (Daniel L.)
Thesis advisor Langlotz, Curtis
Thesis advisor Ré, Christopher
Thesis advisor Yeung, Serena
Degree committee member Langlotz, Curtis
Degree committee member Ré, Christopher
Degree committee member Yeung, Serena
Associated with Stanford University, Department of Biomedical Informatics.


Genre Theses
Genre Text

Bibliographic information

Statement of responsibility Darvin Yi
Note Submitted to the Department of Biomedical Informatics
Thesis Thesis Ph.D. Stanford University 2020
Location electronic resource

Access conditions

© 2020 by Darvin Yi
This work is licensed under a Creative Commons Attribution Non Commercial 3.0 Unported license (CC BY-NC).
