TY - JOUR
T1 - Development of debiasing technique for lung nodule chest X-ray datasets to generalize deep learning models
AU - Horry, Michael J.
AU - Chakraborty, Subrata
AU - Pradhan, Biswajeet
AU - Paul, Manoranjan
AU - Zhu, Jing
AU - Loh, Hui Wen
AU - Barua, Prabal Datta
AU - Acharya, U. Rajendra
N1 - Publisher Copyright:
© 2023 by the authors.
PY - 2023/7/21
Y1 - 2023/7/21
N2 - Screening programs for early lung cancer diagnosis are uncommon, primarily due to the challenge of reaching at-risk patients located in rural areas far from medical facilities. To overcome this obstacle, a comprehensive approach is needed that combines mobility, low cost, speed, accuracy, and privacy. One potential solution lies in combining the chest X-ray imaging mode with federated deep learning, ensuring that no single data source can bias the model adversely. This study presents a pre-processing pipeline designed to debias chest X-ray images, thereby enhancing internal classification and external generalization. The pipeline employs a pruning mechanism to train a deep learning model for nodule detection, utilizing the most informative images from a publicly available lung nodule X-ray dataset. Histogram equalization is used to remove systematic differences in image brightness and contrast. Model training is then performed using combinations of lung field segmentation, close cropping, and rib/bone suppression. The resulting deep learning models, generated through this pre-processing pipeline, demonstrate successful generalization on an independent lung nodule dataset. By eliminating confounding variables in chest X-ray images and suppressing signal noise from the bone structures, the proposed deep learning lung nodule detection algorithm achieves an external generalization accuracy of 89%. This approach paves the way for the development of a low-cost and accessible deep learning-based clinical system for lung cancer screening.
AB - Screening programs for early lung cancer diagnosis are uncommon, primarily due to the challenge of reaching at-risk patients located in rural areas far from medical facilities. To overcome this obstacle, a comprehensive approach is needed that combines mobility, low cost, speed, accuracy, and privacy. One potential solution lies in combining the chest X-ray imaging mode with federated deep learning, ensuring that no single data source can bias the model adversely. This study presents a pre-processing pipeline designed to debias chest X-ray images, thereby enhancing internal classification and external generalization. The pipeline employs a pruning mechanism to train a deep learning model for nodule detection, utilizing the most informative images from a publicly available lung nodule X-ray dataset. Histogram equalization is used to remove systematic differences in image brightness and contrast. Model training is then performed using combinations of lung field segmentation, close cropping, and rib/bone suppression. The resulting deep learning models, generated through this pre-processing pipeline, demonstrate successful generalization on an independent lung nodule dataset. By eliminating confounding variables in chest X-ray images and suppressing signal noise from the bone structures, the proposed deep learning lung nodule detection algorithm achieves an external generalization accuracy of 89%. This approach paves the way for the development of a low-cost and accessible deep learning-based clinical system for lung cancer screening.
KW - chest X-ray
KW - confounding bias
KW - deep learning
KW - federated learning
KW - lung cancer
KW - model generalization
UR - http://www.scopus.com/inward/record.url?scp=85165969815&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85165969815&partnerID=8YFLogxK
U2 - 10.3390/s23146585
DO - 10.3390/s23146585
M3 - Article
C2 - 37514877
AN - SCOPUS:85165969815
SN - 1424-8220
VL - 23
SP - 1
EP - 21
JO - Sensors
JF - Sensors
IS - 14
M1 - 6585
ER -