30 July 2020 Learning from dispersed manual annotations with an optimized data weighting policy
Author Affiliations +
Abstract

Purpose: Deep learning methods have become essential tools for quantitative interpretation of medical imaging data, but training these approaches is highly sensitive to biases and class imbalance in the available data. There is an opportunity to increase the available training data by combining across different data sources (e.g., distinct public projects); however, data collected under different scopes tend to have differences in class balance, label availability, and subject demographics. Recent work has shown that importance sampling can be used to guide training selection. To date, these approaches have not considered imbalanced data sources with distinct labeling protocols.

Approach: We propose a sampling policy, known as adaptive stochastic policy (ASP), inspired by reinforcement learning to adapt training based on subject, data source, and dynamic use criteria. We apply ASP in the context of multiorgan abdominal computed tomography segmentation. Training was performed with cross validation on 840 subjects from 10 data sources. External validation was performed with 20 subjects from 1 data source.

Results: Four alternative strategies were evaluated with the state-of-the-art baseline as upper confident bound (UCB). ASP achieves average Dice of 0.8261 compared to 0.8135 UCB (p  <  0.01, paired t-test) across fivefold cross validation. On withheld testing datasets, the proposed ASP achieved 0.8265 mean Dice versus 0.8077 UCB (p  <  0.01, paired t-test).

Conclusions: ASP provides a flexible reweighting technique for training deep learning models. We conclude that the proposed method adapts the sample importance, which leverages the performance on a challenging multisite, multiorgan, and multisize segmentation task.

© 2020 Society of Photo-Optical Instrumentation Engineers (SPIE) 2329-4302/2020/$28.00 © 2020 SPIE
Yucheng Tang, Riqiang Gao, Yunqiang Chen, Dashan Gao, Michael R. Savona, Richard G. Abramson, Shunxing Bao, Yuankai Huo, and Bennett A. Landman "Learning from dispersed manual annotations with an optimized data weighting policy," Journal of Medical Imaging 7(4), 044002 (30 July 2020). https://doi.org/10.1117/1.JMI.7.4.044002
Received: 27 February 2020; Accepted: 30 June 2020; Published: 30 July 2020
Lens.org Logo
CITATIONS
Cited by 3 scholarly publications.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Spleen

Data modeling

Image segmentation

Liver

Computed tomography

Machine learning

Kidney

Back to Top