[Paper Reading]ChestX-ray8: Hospital-scale Chest X-ray Database and Benchmarks on Weakly-Supervised Classification and Localization of Common Thorax Diseases

Ya-Liang Allen Chang
2 min readMar 7, 2019

--

Problem Definition

How loosely labeled hospital-size chest X-ray database can be used to facilitate the data-hungry deep learning paradigms in building truly large-scale high precision computer-aided diagnosis (CAD) systems.

Contributions

  • Propose a new chest X-ray database “ChestX-ray8” with 108,948 frontal-view X-ray images of 32,717 unique patients with the text-mined eight common disease labels, mined from the text radiological reports via NLP techniques
  • Demonstrate that some commonly occurred thoracic diseases can be detected and even spatially-located via a unified weakly-supervised multi-label image classification and disease localization formulation

Method

Construction of Hospital-scale Chest X-ray Database

  • Short-list eight common thoracic pathology keywords that are frequently observed and diagnosed, based on radiologists’ feedback
  • Search the PACS system to pull out all the related radiological reports (together with images) as target corpus
  • Use NLP techniques for detecting the pathology keywords and removal of negation and uncertainty

Common Thoracic Disease Detection and Localization

  • Tailor Deep Convolutional Neural Network (DCNN) architectures for weakly-supervised object localization, by considering large image capacity, various multi-label CNN losses and different pooling strategies

Experiments

  • Dataset: proposed dataset
  • Classification and localization
  • Evaluation: (classification) AUCs of ROC curves for multi-label classification / (localization) pathology localization accuracy and the average false positive number

Results

Summary

The authors proposed a large-scale chest X-ray dataset and the way to collect it, together with a CNN model to deal with the classification and localization of 8 common diseases

Reference

Wang, Xiaosong, et al. “Chestx-ray8: Hospital-scale chest x-ray database and benchmarks on weakly-supervised classification and localization of common thorax diseases.” Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2017.

--

--

No responses yet