TY - JOUR
T1 - Contributions and limitations of using machine learning to predict noise-induced hearing loss
AU - Chen, Feifan
AU - Cao, Zuwei
AU - Grais, Emad M.
AU - Zhao, Fei
N1 - Publisher Copyright:
© 2021, The Author(s).
PY - 2021/1/25
Y1 - 2021/1/25
N2 - Purpose: Noise-induced hearing loss (NIHL) is a global issue that impacts people’s life and health. The current review aims to clarify the contributions and limitations of applying machine learning (ML) to predict NIHL by analyzing the performance of different ML techniques and the procedure of model construction. Methods: The authors searched PubMed, EMBASE and Scopus on November 26, 2020. Results: Eight studies were recruited in the current review following defined inclusion and exclusion criteria. Sample size in the selected studies ranged between 150 and 10,567. The most popular models were artificial neural networks (n = 4), random forests (n = 3) and support vector machines (n = 3). Features mostly correlated with NIHL and used in the models were: age (n = 6), duration of noise exposure (n = 5) and noise exposure level (n = 4). Five included studies used either split-sample validation (n = 3) or ten-fold cross-validation (n = 2). Assessment of accuracy ranged in value from 75.3% to 99% with a low prediction error/root-mean-square error in 3 studies. Only 2 studies measured discrimination risk using the receiver operating characteristic (ROC) curve and/or the area under ROC curve. Conclusion: In spite of high accuracy and low prediction error of machine learning models, some improvement can be expected from larger sample sizes, multiple algorithm use, completed reports of model construction and the sufficient evaluation of calibration and discrimination risk.
AB - Purpose: Noise-induced hearing loss (NIHL) is a global issue that impacts people’s life and health. The current review aims to clarify the contributions and limitations of applying machine learning (ML) to predict NIHL by analyzing the performance of different ML techniques and the procedure of model construction. Methods: The authors searched PubMed, EMBASE and Scopus on November 26, 2020. Results: Eight studies were recruited in the current review following defined inclusion and exclusion criteria. Sample size in the selected studies ranged between 150 and 10,567. The most popular models were artificial neural networks (n = 4), random forests (n = 3) and support vector machines (n = 3). Features mostly correlated with NIHL and used in the models were: age (n = 6), duration of noise exposure (n = 5) and noise exposure level (n = 4). Five included studies used either split-sample validation (n = 3) or ten-fold cross-validation (n = 2). Assessment of accuracy ranged in value from 75.3% to 99% with a low prediction error/root-mean-square error in 3 studies. Only 2 studies measured discrimination risk using the receiver operating characteristic (ROC) curve and/or the area under ROC curve. Conclusion: In spite of high accuracy and low prediction error of machine learning models, some improvement can be expected from larger sample sizes, multiple algorithm use, completed reports of model construction and the sufficient evaluation of calibration and discrimination risk.
KW - Discrimination risk
KW - Machine learning
KW - Noise-induced hearing loss
KW - Prediction models
UR - http://www.scopus.com/inward/record.url?scp=85099870520&partnerID=8YFLogxK
U2 - 10.1007/s00420-020-01648-w
DO - 10.1007/s00420-020-01648-w
M3 - Review article
C2 - 33491101
AN - SCOPUS:85099870520
SN - 0340-0131
VL - 94
SP - 1097
EP - 1111
JO - International Archives of Occupational and Environmental Health
JF - International Archives of Occupational and Environmental Health
IS - 5
ER -