TY - JOUR
T1 - A Deep Learning Approach to Predict Conductive Hearing Loss in Patients With Otitis Media With Effusion Using Otoscopic Images
AU - Zeng, Junbo
AU - Kang, Weibiao
AU - Chen, Suijun
AU - Lin, Yi
AU - Deng, Wenting
AU - Wang, Yajing
AU - Chen, Guisheng
AU - Ma, Kai
AU - Zhao, Fei
AU - Zheng, Yefeng
AU - Liang, Maojin
AU - Zeng, Linqi
AU - Ye, Weijie
AU - Li, Peng
AU - Chen, Yubin
AU - Chen, Guoping
AU - Gao, Jinliang
AU - Wu, Minjian
AU - Su, Yuejia
AU - Zheng, Yiqing
AU - Cai, Yuexin
N1 - Publisher Copyright:
© 2022 American Medical Association. All rights reserved.
PY - 2022/5/19
Y1 - 2022/5/19
N2 - Importance: Otitis media with effusion (OME) is one of the most common causes of acquired conductive hearing loss (CHL). Persistent hearing loss is associated with poor childhood speech and language development and other adverse consequence. However, to obtain accurate and reliable hearing thresholds largely requires a high degree of cooperation from the patients. Objective: To predict CHL from otoscopic images using deep learning (DL) techniques and a logistic regression model based on tympanic membrane features. Design, Setting, and Participants: A retrospective diagnostic/prognostic study was conducted using 2790 otoscopic images obtained from multiple centers between January 2015 and November 2020. Participants were aged between 4 and 89 years. Of 1239 participants, there were 209 ears from children and adolescents (aged 4-18 years [16.87%]), 804 ears from adults (aged 18-60 years [64.89%]), and 226 ears from older people (aged >60 years, [18.24%]). Overall, 679 ears (54.8%) were from men. The 2790 otoscopic images were randomly assigned into a training set (2232 [80%]), and validation set (558 [20%]). The DL model was developed to predict an average air-bone gap greater than 10 dB. A logistic regression model was also developed based on otoscopic features. Main Outcomes and Measures: The performance of the DL model in predicting CHL was measured using the area under the receiver operating curve (AUC), accuracy, and F1 score (a measure of the quality of a classifier, which is the harmonic mean of precision and recall; a higher F1 score means better performance). In addition, these evaluation parameters were compared to results obtained from the logistic regression model and predictions made by three otologists. Results: The performance of the DL model in predicting CHL showed the AUC of 0.74, accuracy of 81%, and F1 score of 0.89. This was better than the results from the logistic regression model (ie, AUC of 0.60, accuracy of 76%, and F1 score of 0.82), and much improved on the performance of the 3 otologists; accuracy of 16%, 30%, 39%, and F1 scores of 0.09, 0.18, and 0.25, respectively. Furthermore, the DL model took 2.5 seconds to predict from 205 otoscopic images, whereas the 3 otologists spent 633 seconds, 645 seconds, and 692 seconds, respectively. Conclusions and Relevance: The model in this diagnostic/prognostic study provided greater accuracy in prediction of CHL in ears with OME than those obtained from the logistic regression model and otologists. This indicates great potential for the use of artificial intelligence tools to facilitate CHL evaluation when CHL is unable to be measured.
AB - Importance: Otitis media with effusion (OME) is one of the most common causes of acquired conductive hearing loss (CHL). Persistent hearing loss is associated with poor childhood speech and language development and other adverse consequence. However, to obtain accurate and reliable hearing thresholds largely requires a high degree of cooperation from the patients. Objective: To predict CHL from otoscopic images using deep learning (DL) techniques and a logistic regression model based on tympanic membrane features. Design, Setting, and Participants: A retrospective diagnostic/prognostic study was conducted using 2790 otoscopic images obtained from multiple centers between January 2015 and November 2020. Participants were aged between 4 and 89 years. Of 1239 participants, there were 209 ears from children and adolescents (aged 4-18 years [16.87%]), 804 ears from adults (aged 18-60 years [64.89%]), and 226 ears from older people (aged >60 years, [18.24%]). Overall, 679 ears (54.8%) were from men. The 2790 otoscopic images were randomly assigned into a training set (2232 [80%]), and validation set (558 [20%]). The DL model was developed to predict an average air-bone gap greater than 10 dB. A logistic regression model was also developed based on otoscopic features. Main Outcomes and Measures: The performance of the DL model in predicting CHL was measured using the area under the receiver operating curve (AUC), accuracy, and F1 score (a measure of the quality of a classifier, which is the harmonic mean of precision and recall; a higher F1 score means better performance). In addition, these evaluation parameters were compared to results obtained from the logistic regression model and predictions made by three otologists. Results: The performance of the DL model in predicting CHL showed the AUC of 0.74, accuracy of 81%, and F1 score of 0.89. This was better than the results from the logistic regression model (ie, AUC of 0.60, accuracy of 76%, and F1 score of 0.82), and much improved on the performance of the 3 otologists; accuracy of 16%, 30%, 39%, and F1 scores of 0.09, 0.18, and 0.25, respectively. Furthermore, the DL model took 2.5 seconds to predict from 205 otoscopic images, whereas the 3 otologists spent 633 seconds, 645 seconds, and 692 seconds, respectively. Conclusions and Relevance: The model in this diagnostic/prognostic study provided greater accuracy in prediction of CHL in ears with OME than those obtained from the logistic regression model and otologists. This indicates great potential for the use of artificial intelligence tools to facilitate CHL evaluation when CHL is unable to be measured.
UR - http://www.scopus.com/inward/record.url?scp=85130704347&partnerID=8YFLogxK
U2 - 10.1001/jamaoto.2022.0900
DO - 10.1001/jamaoto.2022.0900
M3 - Article
C2 - 35588049
AN - SCOPUS:85130704347
SN - 2168-6181
VL - 148
SP - 612
EP - 620
JO - JAMA Otolaryngology - Head and Neck Surgery
JF - JAMA Otolaryngology - Head and Neck Surgery
IS - 7
ER -