Display options
Share it on

Med Image Anal. 2021 Jan;67:101816. doi: 10.1016/j.media.2020.101816. Epub 2020 Oct 01.

Selective synthetic augmentation with HistoGAN for improved histopathology image classification.

Medical image analysis

Yuan Xue, Jiarong Ye, Qianying Zhou, L Rodney Long, Sameer Antani, Zhiyun Xue, Carl Cornwell, Richard Zaino, Keith C Cheng, Xiaolei Huang

Affiliations

  1. College of Information Sciences and Technology, The Pennsylvania State University, University Park, PA 16802, USA.
  2. Lister Hill National Center for Biomedical Communications, National Library of Medicine, Bethesda, MD 20892, USA.
  3. Department of Pathology, Penn State Health Milton S. Hershey Medical Center and Penn State College Of Medicine, Hershey, PA 17033, USA.
  4. College of Information Sciences and Technology, The Pennsylvania State University, University Park, PA 16802, USA. Electronic address: [email protected].

PMID: 33080509 PMCID: PMC8647936 DOI: 10.1016/j.media.2020.101816

Abstract

Histopathological analysis is the present gold standard for precancerous lesion diagnosis. The goal of automated histopathological classification from digital images requires supervised training, which requires a large number of expert annotations that can be expensive and time-consuming to collect. Meanwhile, accurate classification of image patches cropped from whole-slide images is essential for standard sliding window based histopathology slide classification methods. To mitigate these issues, we propose a carefully designed conditional GAN model, namely HistoGAN, for synthesizing realistic histopathology image patches conditioned on class labels. We also investigate a novel synthetic augmentation framework that selectively adds new synthetic image patches generated by our proposed HistoGAN, rather than expanding directly the training set with synthetic images. By selecting synthetic images based on the confidence of their assigned labels and their feature similarity to real labeled images, our framework provides quality assurance to synthetic augmentation. Our models are evaluated on two datasets: a cervical histopathology image dataset with limited annotations, and another dataset of lymph node histopathology images with metastatic cancer. Here, we show that leveraging HistoGAN generated images with selective augmentation results in significant and consistent improvements of classification performance (6.7% and 2.8% higher accuracy, respectively) for cervical histopathology and metastatic cancer datasets.

Copyright © 2020. Published by Elsevier B.V.

Keywords: Histopathology image classification; Medical image synthesis; Synthetic data augmentation

Conflict of interest statement

Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this pa

References

  1. IEEE Rev Biomed Eng. 2009;2:147-71 - PubMed
  2. IEEE Rev Biomed Eng. 2014;7:97-114 - PubMed
  3. IEEE Trans Pattern Anal Mach Intell. 2019 Aug;41(8):1947-1962 - PubMed
  4. Proc IEEE Comput Soc Conf Comput Vis Pattern Recognit. 2016 Jun-Jul;2016:2424-2433 - PubMed
  5. IEEE J Biomed Health Inform. 2016 Nov;20(6):1595-1607 - PubMed
  6. BMC Bioinformatics. 2017 May 26;18(1):281 - PubMed
  7. Adv Neural Inf Process Syst. 2017 Dec;30:3239-3249 - PubMed
  8. Neuroinformatics. 2018 Oct;16(3-4):383-392 - PubMed
  9. Comput Methods Programs Biomed. 2014 Feb;113(2):539-56 - PubMed
  10. Med Image Anal. 2018 Oct;49:14-26 - PubMed
  11. JAMA Netw Open. 2019 Nov 1;2(11):e1914645 - PubMed

MeSH terms

Publication Types

Grant support