
Performance Comparison of Individual and Ensemble CNN Models for the Classification of Brain 18F-FDG-PET Scans


Abstract

The high background glucose metabolism of normal gray matter on [18F]-fluoro-2-D-deoxyglucose (FDG) positron emission tomography (PET) of the brain results in a low signal-to-background ratio, potentially increasing the possibility of missing important findings in patients with intracranial malignancies. To explore the strategy of using a deep learning classifier to aid in distinguishing normal from abnormal findings on PET brain images, this study evaluated the performance of a two-dimensional convolutional neural network (2D-CNN) in classifying FDG PET brain scans as normal (N) or abnormal (A).

Methods: Two hundred eighty-nine brain FDG-PET scans (N: n = 150; A: n = 139), comprising a total of 68,260 images, were included. Nine individual 2D-CNN models, covering three different window settings for each of the axial, coronal, and sagittal axes, were trained and validated. The performance of these individual and ensemble models was evaluated and compared on a test dataset using the odds ratio, Akaike's information criterion (AIC), area under the receiver operating characteristic curve (AUC), accuracy, and standard deviation (SD).

Results: The optimal window setting for classifying normal and abnormal scans differed for each axis of the individual models. An ensemble model combining different axes with an optimized window setting (window-triad) performed better than ensemble models combining the same axis with different window settings (axis-triad). Compared with the individual models, both axis-triad and window-triad models showed an increase in odds ratio and a decrease in SD, whereas improvements in AUC and AIC were seen in the window-triad models. An overall model averaging the probabilities of all individual models achieved the best accuracy, 82.0%.

Conclusions: Ensembling data across different window settings and axes was effective in improving 2D-CNN performance parameters for the classification of brain FDG-PET scans. If prospectively validated with a larger cohort of patients, similar models could provide decision support in a clinical setting.
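The ensembling described above reduces to averaging the per-scan abnormality probabilities produced by the nine individual 2D-CNNs, either across the three window settings of one axis (axis-triad), across the three axes sharing one window setting (window-triad), or across all nine models (overall). A minimal sketch of that averaging is shown below; the probability values, window-setting names, and 0.5 decision threshold are illustrative assumptions, not taken from the paper.

```python
import numpy as np

# Hypothetical per-scan probabilities of "abnormal" from each individual 2D-CNN,
# indexed by (axis, window setting). Values are placeholders for model outputs.
AXES = ["axial", "coronal", "sagittal"]
WINDOWS = ["window_1", "window_2", "window_3"]  # stand-ins for the three settings

rng = np.random.default_rng(0)
probs = {(a, w): rng.uniform(0.0, 1.0) for a in AXES for w in WINDOWS}

def axis_triad(axis: str) -> float:
    """Average the three window-setting models that share one axis."""
    return float(np.mean([probs[(axis, w)] for w in WINDOWS]))

def window_triad(window: str) -> float:
    """Average the three axis models that share one window setting."""
    return float(np.mean([probs[(a, window)] for a in AXES]))

def overall() -> float:
    """Average all nine individual models (the 'overall' model in the abstract)."""
    return float(np.mean(list(probs.values())))

THRESHOLD = 0.5  # assumed decision threshold for labeling a scan
label = "abnormal" if overall() >= THRESHOLD else "normal"
print(f"overall probability = {overall():.3f} -> {label}")
```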



Abbreviations

FDG: [18F]-fluoro-2-D-deoxyglucose
2D-CNN: two-dimensional convolutional neural network
AIC: Akaike's information criterion
AUC: area under the curve
CSV: comma-separated values file
CT: computed tomography
MR: magnetic resonance
PET: positron emission tomography
PNG: Portable Network Graphics format
ROC: receiver operating characteristic
SD: standard deviation
SUV: standardized uptake value


Author information

Corresponding author

Correspondence to Guido A. Davidzon.

Ethics declarations

This retrospective study protocol was approved by the institutional review board and found to be compliant with the standards of the Health Insurance Portability and Accountability Act.

Conflict of Interest

JKE and CZ are employed by and affiliated with DimensionalMechanics Inc.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.


About this article


Cite this article

Nobashi, T., Zacharias, C., Ellis, J.K. et al. Performance Comparison of Individual and Ensemble CNN Models for the Classification of Brain 18F-FDG-PET Scans. J Digit Imaging 33, 447–455 (2020). https://doi.org/10.1007/s10278-019-00289-x
