Application of Random Survival Forest in breast cancer survival analysis

Authors

  • Daniela Schimitz de Carvalho Universidade Federal de Juiz de Fora
  • Thallys da Silva Nogueira Universidade Federal de Juiz de Fora
  • Priscila Vanessa Zabala Caprile Goliatt Universidade Federal de Juiz de Fora

DOI:

https://doi.org/10.59681/2175-4411.v15.iEspecial.2023.1113

Keywords:

Breast Neoplasms, Machine Learning, Survival Analysis

Abstract

This paper aims to apply a supervised machine learning method to a clinical dataset from Zona da Mata Mineira, to evaluate the performance of survival prediction accuracy for breast cancer. The database utilized went through pre-processing providing the variables used in the Random Survival Forest. The results show satisfactor performance metrics for survival prediction methods. Concluding that, the machine learning methods are promising assisting and guiding clinical practice.

Author Biographies

Daniela Schimitz de Carvalho, Universidade Federal de Juiz de Fora

Programa de Pós-Graduação em Modelagem Computacional, Universidade Federal de Juiz de Fora – UFJF, Juiz de Fora (MG), Brasil.

Thallys da Silva Nogueira, Universidade Federal de Juiz de Fora

Programa de Pós-Graduação em Modelagem Computacional, Universidade Federal de Juiz de Fora – UFJF, Juiz de Fora (MG), Brasil.

Priscila Vanessa Zabala Caprile Goliatt, Universidade Federal de Juiz de Fora

Programa de Pós-Graduação em Modelagem Computacional, Universidade Federal de Juiz de Fora – UFJF, Juiz de Fora (MG), Brasil.

References

Ferlay J et al. Cancer statistics for the year 2020: An overview. International Journal of Cancer. 2021;149(4),p.778-89.

World Health Organization. Cancer [Internet]; c2022 [cited 2022 Set 12]. Available from: https://www.who.int/news-room/fact-sheets/detail/cancer

Instituto Nacional de Câncer. Estatísticas de câncer [Internet]; c2022 [cited 2022 Set 10]. Available from: https://www.gov.br/inca/pt-br/assuntos/cancer/numeros/

Carvalho DS, Guerra MR, Barra LP, Queiroz RA. Aspectos gerais epidemiológicos da mortalidade por câncer de mama feminino no brasil e no mundo. Anais Simpósio de Enfermagem [Internet]. 2019 [cited 2022 Out 27];3:[about 1 p.]. Available from: http://pensaracademico.facig.edu.br/index.php/simposioenfermagem/article/view/1116

Cintra JR. Sobrevida e fatores associados em pacientes com câncer de mama, com diagnóstico entre 2003 e 2005 no município de Juiz de Fora – MG. [dissertation]. Juiz de Fora (JF): Universidade Federal de Juiz de Fora, 2012.

Torre LA, Islami F, Siegel RL, Ward EM, Jemal A. Global Cancer in Women: Burden and Trends. Cancer Epidemiol Biomarkers Prev. 2017;26(4):444-457.

Carvalho DS, Guerra MR, Barra LP, Queiroz RA. Modelagem computacional do crescimento tumoral mamário. Anais Seminário Científico UNIFACIG [Internet]. 2017 [cited 2022 Out 27];3:[about 1 p.]. Available from: http://pensaracademico.facig.edu.br/index.php/semiariocientifico/article/view/438

Ministério da Saúde (BR). Secretaria de Atenção à Saúde. Protocolos clínicos e diretrizes terapêuticas em Oncologia/Ministério da Saúde, Secretaria de Atenção à Saúde – Brasília : Ministério da Saúde, 2014.

Moncada-Torres A et al. Explainable machine learning can outperform Cox regression predictions and provide insights in breast cancer survival. Scientific Reports. 2021;11(1):p.1-13.

Li J et al. Predicting breast cancer 5-year survival using machine learning: a systematic review. PloS one [Internet]. 2021 [cited 2022 Out 27];16(4):[about 1 p.]. Available from: https://doi.org/10.1371/journal.pone.0250370

Tapak L et al. Prediction of survival and metastasis in breast cancer patients using machine learning classifiers. Clinical Epidemiology and Global Health. 2019;7(3):p.293-9.

Xiao J, Mo M, Wang Z, Zhou C, Shen J, Yuan J, He Y, Zheng Y. The Application and Comparison of Machine Learning Models for the Prediction of Breast Cancer Prognosis: Retrospective Cohort Study. JMIR medical informatics [Internet]. 2022[cited 2022 Out 27]; 10(2):[about 1 p.]. Available from: https://medinform.jmir.org/2022/2/e33440

Hueman MT et al. Creating prognostic systems for cancer patients: A demonstration using breast cancer. Cancer medicine. 2018;7(8):p.3611-21.

Lai X et al. Toward Personalized Computer Simulation of Breast Cancer Treatment: A Multiscale Pharmacokinetic and Pharmacodynamic Model Informed by Multitype Patient Data. Cancer research. 2019;79(16):p.4293-304.

Nave O. Adding features from the mathematical model of breast cancer to predict the tumour size. International Journal of Computer Mathematics: Computer Systems Theory. 2020;5(3):p.159-174.

Aivaliotis G et al. A comparison of time to event analysis methods, using weight status and breast cancer as a case study. Scientific reports. 2021;11(1):p. 1-9.

Pinheiro TS et al. Machine Learning e Análise Multivariada aplicados à Sobrevida do Câncer Mama. J Health Inform [Internet]. 2022[cited 2022 Out 27];(14).[about 1 p.]. Available from: https://jhi.sbis.org.br/index.php/jhi-sbis/article/view/971

Ishwaran H et al. Random survival forests. The annals of applied statistics. 2008;2(3):p.841-60.

Breiman L. Random forests. Machine Learning, Springer Science and Business Media LLC. 2001;45(1):p.5–32.

Ishwaran H, Lu M. Standard errors and confidence intervals for variable importance in random forest regression, classification, and survival. Statistics in medicine. 2019;38(4):p.558-82.

Understanding Predictions in Survival Analysis. [Internet];[cited 2022 Set 12]. Available from: https://scikit-survival.readthedocs.io/en/stable/user_guide/

Pölsterl S. scikit-survival: A Library for Time-to-Event Analysis Built on Top of scikit-learn. J. Mach. Learn. Res. 2020;21(212):p.1-6.

Fast Unified Random Forests with random. [Internet];[cited 2022 Set 12]. Available from:

https://www.randomforestsrc.org/articles/survival.html

Uno H, Cai T, Pencina MJ, D’Agostino RB, Wei LJ. On The C-statistics for evaluating overall adequacy of risk prediction procedures with censored survival data. Statistics In medicine. 2011;30(10):1105-17.

Graf E, Schmoor C, Sauerbrei W, Schumacher M.Assessment and comparison of prognostic classification schemes for survival data. Statistics in medicine. 1999;18(17-18):2529-45.

Published

2023-07-20

How to Cite

Carvalho, D. S. de, Nogueira, T. da S., & Goliatt, P. V. Z. C. (2023). Application of Random Survival Forest in breast cancer survival analysis. Journal of Health Informatics, 15(Especial). https://doi.org/10.59681/2175-4411.v15.iEspecial.2023.1113

Similar Articles

1 2 3 4 5 6 7 8 9 10 > >> 

You may also start an advanced similarity search for this article.