Machine learning to aid in the diagnosis of chronic obstructive pulmonary disease

Authors

  • Ranier Pereira Nunes de Melo PUC Minas
  • Marco Paulo Soares Gomes PUC Minas
  • Luis Enrique Zárate PUC Minas

DOI:

https://doi.org/10.59681/2175-4411.v16.iEspecial.2024.1249

Keywords:

Pulmonary disease, chronic obstructive, Data mining, Knowledge discovery

Abstract

Objective: to identify risk factors for the Chronic obstructive pulmonary disease in the Brazilian population. Method: through a process for knowledge discovery, and machine learning models, identify risk factors for the disease in the Brazilian population, based on the 2019 National Health Survey. Results: the best learning model was achieved with the algorithm Random Forest presenting an F1 measure of 75% for the test set. Conclusions: based on the analysis of the level of importance of the main factors such as asthma, age at risk, previous smoking, body mass index, household risk, among others, the first four stood out as the main risk factors.

Author Biographies

Ranier Pereira Nunes de Melo, PUC Minas

Bac., Ciência de Dados, PUC Minas, Belo Horizonte (MG), Brasil

Marco Paulo Soares Gomes, PUC Minas

Dr., Ciência de Dados, PUC Minas, Belo Horizonte (MG), Brasil

Luis Enrique Zárate, PUC Minas

Dr., Ciência de Dados, PUC Minas, Belo Horizonte (MG), Brasil

References

WHO, W. H. O. Chronic obstructive pulmonary disease (copd). https://www.who.int/news-room/fact-sheets/detail/chronic-obstructive-pulmonary-disease-(copd)

GOLD-COPD. Global strategy for prevention, diagnosis and management of copd: 2023 report. https://goldcopd.org/ 2023-gold-report-2/, 2023.

Smith LA, Oakden-Rayner L, Bird A, Zeng M, To MS, Mukherjee S, Palmer LJ. Machine learning and deep learning predictive models for long-term prognosis in patients with chronic obstructive pulmonary disease: a systematic review and meta-analysis. Lancet Digit Health. 2023 Dec;5(12):e872-e881. doi: 10.1016/S2589-7500(23)00177-2. PMID: 38000872.

Wang X, Ren H, Ren J, Song W, Qiao Y, Ren Z, Zhao Y, Linghu L, Cui Y, Zhao Z, Chen L, Qiu L. Machine learning-enabled risk prediction of chronic obstructive pulmonary disease with unbalanced data. Comput Methods Programs Biomed. 2023 Mar;230:107340. doi: 10.1016/j.cmpb.2023.107340. Epub 2023 Jan 6. PMID: 36640604.

Ma X, Wu Y, Zhang L, Yuan W, Yan L, Fan S, Lian Y, Zhu X, Gao J, Zhao J, Zhang P, Tang H, Jia W. Comparison and development of machine learning tools for the prediction of chronic obstructive pulmonary disease in the Chinese population. J Transl Med. 2020 Mar 31;18(1):146. doi: 10.1186/s12967-020-02312-0. PMID: 32234053; PMCID: PMC7110698.

Zarate, L., Petrocchi, B., Maia, C., Felix, C., and Gomes, M. P. CAPTO - A method for understanding problem domains for data science projects. Concilium 23:922–941, 2023.

Loyola-González, O. Black-Box vs. White-Box: Understanding Their Advantages and Weaknesses From a Practical Point of View, in IEEE Access, vol. 7, pp. 154096-154113, 2019.

Published

2024-11-19

How to Cite

de Melo, R. P. N., Gomes, M. P. S., & Zárate, L. E. (2024). Machine learning to aid in the diagnosis of chronic obstructive pulmonary disease. Journal of Health Informatics, 16(Especial). https://doi.org/10.59681/2175-4411.v16.iEspecial.2024.1249

Similar Articles

<< < 1 2 3 4 5 6 7 8 9 10 > >> 

You may also start an advanced similarity search for this article.

Most read articles by the same author(s)