Machine learning to aid in the diagnosis of chronic obstructive pulmonary disease
DOI:
https://doi.org/10.59681/2175-4411.v16.iEspecial.2024.1249Keywords:
Pulmonary disease, chronic obstructive, Data mining, Knowledge discoveryAbstract
Objective: to identify risk factors for the Chronic obstructive pulmonary disease in the Brazilian population. Method: through a process for knowledge discovery, and machine learning models, identify risk factors for the disease in the Brazilian population, based on the 2019 National Health Survey. Results: the best learning model was achieved with the algorithm Random Forest presenting an F1 measure of 75% for the test set. Conclusions: based on the analysis of the level of importance of the main factors such as asthma, age at risk, previous smoking, body mass index, household risk, among others, the first four stood out as the main risk factors.
References
WHO, W. H. O. Chronic obstructive pulmonary disease (copd). https://www.who.int/news-room/fact-sheets/detail/chronic-obstructive-pulmonary-disease-(copd)
GOLD-COPD. Global strategy for prevention, diagnosis and management of copd: 2023 report. https://goldcopd.org/ 2023-gold-report-2/, 2023.
Smith LA, Oakden-Rayner L, Bird A, Zeng M, To MS, Mukherjee S, Palmer LJ. Machine learning and deep learning predictive models for long-term prognosis in patients with chronic obstructive pulmonary disease: a systematic review and meta-analysis. Lancet Digit Health. 2023 Dec;5(12):e872-e881. doi: 10.1016/S2589-7500(23)00177-2. PMID: 38000872.
Wang X, Ren H, Ren J, Song W, Qiao Y, Ren Z, Zhao Y, Linghu L, Cui Y, Zhao Z, Chen L, Qiu L. Machine learning-enabled risk prediction of chronic obstructive pulmonary disease with unbalanced data. Comput Methods Programs Biomed. 2023 Mar;230:107340. doi: 10.1016/j.cmpb.2023.107340. Epub 2023 Jan 6. PMID: 36640604.
Ma X, Wu Y, Zhang L, Yuan W, Yan L, Fan S, Lian Y, Zhu X, Gao J, Zhao J, Zhang P, Tang H, Jia W. Comparison and development of machine learning tools for the prediction of chronic obstructive pulmonary disease in the Chinese population. J Transl Med. 2020 Mar 31;18(1):146. doi: 10.1186/s12967-020-02312-0. PMID: 32234053; PMCID: PMC7110698.
Zarate, L., Petrocchi, B., Maia, C., Felix, C., and Gomes, M. P. CAPTO - A method for understanding problem domains for data science projects. Concilium 23:922–941, 2023.
Loyola-González, O. Black-Box vs. White-Box: Understanding Their Advantages and Weaknesses From a Practical Point of View, in IEEE Access, vol. 7, pp. 154096-154113, 2019.
Downloads
Published
How to Cite
Issue
Section
License
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Submission of a paper to Journal of Health Informatics is understood to imply that it is not being considered for publication elsewhere and that the author(s) permission to publish his/her (their) article(s) in this Journal implies the exclusive authorization of the publishers to deal with all issues concerning the copyright therein. Upon the submission of an article, authors will be asked to sign a Copyright Notice. Acceptance of the agreement will ensure the widest possible dissemination of information. An e-mail will be sent to the corresponding author confirming receipt of the manuscript and acceptance of the agreement.