TY - JOUR
T1 - Diagnostic accuracy of a commercially available, deep learning-based chest X-ray interpretation software for detecting culture-confirmed pulmonary tuberculosis
AU - Tavaziva, Gamuchirai
AU - Majidulla, Arman
AU - Nazish, Ahsana
AU - Saeed, Saima
AU - Benedetti, Andrea
AU - Khan, Aamir J.
AU - Ahmad Khan, Faiz
N1 - Funding Information:
The original study was funded by an operating grant from the Canadian Institutes of Health Research (Award PJT-148743). L'Observatoire International Sur Les Impacts Sociétaux de l'Intelligence Artificielle (Fonds de recherche Quebec) supported the present analysis of this dataset. The funders had no role in the collection, analysis, and interpretation of the data; in the writing of the report; or in the decision to submit the paper for publication. Aamir J. Khan has had financial interests in the company Alcela, of which qure.ai is a client, and has provided technical assistance to quire.ai data scientists on the development of an all-in-artificial intelligence algorithm for mass CXR screening in public health programs. Consulting started in Q4 2019 and ended in Q2 2020. Solution was not finalized, and the planned product was not made commercially available. Aamir J. Khan had helped conceive and design the included study from Pakistan in 2016–2017 but was never directly involved in data collection, analysis, or reporting of that study, and his relationship with Alcela arose after the completion of data collection for that study. Aamir J. Khan was not involved in the design, analysis, reporting, writing, editing, or decision to submit the work reported in the present manuscript. Faiz Ahmad Khan reports grants from the Canadian Institutes of Health Research and Fonds de Recherche du Quebec, both are publicly funded government-run research agencies. Faiz Ahmad Khan has no financial or industry conflicts. Other authors have no relevant conflicts of interest to disclose. Conceived the study: FAK and AJK. Study design: FAK, AJK, and AB. Data collection: AM, FAK, AN, and SS. Data analysis: GT and FAK. Writing of first draft: GT and FAK. Reading and reviewing for critical revisions and data interpretation: all authors contributed to interpretation to results and provided critical feedback on the manuscript. The authors thank LUNIT for providing technical support with the local installation of the software used in this study. The study was approved by the institutional review board of Interactive Research and Development and the review ethics board of the McGill University Health Centre, and all participants provided informed consent.
Funding Information:
The original study was funded by an operating grant from the Canadian Institutes of Health Research (Award PJT-148743). L'Observatoire International Sur Les Impacts Sociétaux de l'Intelligence Artificielle (Fonds de recherche Quebec) supported the present analysis of this dataset. The funders had no role in the collection, analysis, and interpretation of the data; in the writing of the report; or in the decision to submit the paper for publication.
Publisher Copyright:
© 2022 The Author(s)
PY - 2022/9
Y1 - 2022/9
N2 - Background: Few evaluations of computer-aided detection (CAD) software for analyzing chest radiographs for tuberculosis have used mycobacterial culture as the reference standard. Methods: Using data from a prospective study of symptomatic adults and household contacts of persons with tuberculosis who were seeking care in Karachi, we evaluated the accuracy of LUNIT INSIGHT version 3.1.0.0 (LUNIT, South Korea) for detecting pulmonary tuberculosis in the triage use case. The reference standard was liquid culture. We estimated the diagnostic accuracy using three developer-recommended threshold scores for tuberculosis: 15, 30, and 45. Results: A total 269 of 2190 (12%) participants had culture-confirmed pulmonary tuberculosis. LUNIT-reported abnormalities of nodule, consolidation, fibrosis, and pleural effusion were more common with culture-confirmed tuberculosis. At the tuberculosis threshold score of 30, sensitivity and specificity were, respectively, 87.7% [95% CI: 83.2-91.4%] and 64.3% [62.1-66.4%]. Sensitivity was similar at scores of 15, 88.1% [95% CI: 83.6-91.7%] and 45, 86.6% [82.0 - 90.5%]; and specificity was 57.9% [55.7-60.2%] and 69.9% [67.8-71.9%], respectively. Sensitivity was lower for smear-negative disease, and specificity was lower with increasing age, previous tuberculosis, and decreasing body mass index. Diabetes and tobacco smoking did not modify accuracy. Conclusion: In a population where most tuberculosis was smear-positive, LUNIT-reported radiographic abnormalities were associated with culture-confirmed disease. Manufacturer-recommended threshold scores had limited sensitivity.
AB - Background: Few evaluations of computer-aided detection (CAD) software for analyzing chest radiographs for tuberculosis have used mycobacterial culture as the reference standard. Methods: Using data from a prospective study of symptomatic adults and household contacts of persons with tuberculosis who were seeking care in Karachi, we evaluated the accuracy of LUNIT INSIGHT version 3.1.0.0 (LUNIT, South Korea) for detecting pulmonary tuberculosis in the triage use case. The reference standard was liquid culture. We estimated the diagnostic accuracy using three developer-recommended threshold scores for tuberculosis: 15, 30, and 45. Results: A total 269 of 2190 (12%) participants had culture-confirmed pulmonary tuberculosis. LUNIT-reported abnormalities of nodule, consolidation, fibrosis, and pleural effusion were more common with culture-confirmed tuberculosis. At the tuberculosis threshold score of 30, sensitivity and specificity were, respectively, 87.7% [95% CI: 83.2-91.4%] and 64.3% [62.1-66.4%]. Sensitivity was similar at scores of 15, 88.1% [95% CI: 83.6-91.7%] and 45, 86.6% [82.0 - 90.5%]; and specificity was 57.9% [55.7-60.2%] and 69.9% [67.8-71.9%], respectively. Sensitivity was lower for smear-negative disease, and specificity was lower with increasing age, previous tuberculosis, and decreasing body mass index. Diabetes and tobacco smoking did not modify accuracy. Conclusion: In a population where most tuberculosis was smear-positive, LUNIT-reported radiographic abnormalities were associated with culture-confirmed disease. Manufacturer-recommended threshold scores had limited sensitivity.
KW - Chest radiography
KW - Computer-aided detection
KW - Deep learning
KW - Tuberculosis
UR - http://www.scopus.com/inward/record.url?scp=85131423605&partnerID=8YFLogxK
U2 - 10.1016/j.ijid.2022.05.037
DO - 10.1016/j.ijid.2022.05.037
M3 - Article
C2 - 35597555
AN - SCOPUS:85131423605
SN - 1201-9712
VL - 122
SP - 15
EP - 20
JO - International Journal of Infectious Diseases
JF - International Journal of Infectious Diseases
ER -