Predicting Factors Affecting the First Recurrence of Epithelial Ovarian Cancer Using Random Survival Forest

  • Maryam Deldar Department of Biostatistics, Faculty of Health, Ilam University of Medical Sciences, Ilam, Iran
  • Robab Anbiaee Department of Radiotherapy, School of Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran
  • Kourosh Sayehmiri Department of Biostatistics, Faculty of Health, Ilam University of Medical Sciences, Ilam, Iran
Keywords: Epithelial ovarian cancer; First recurrent; Random survival forest

Abstract

Predicting survival time has many Effective implications in life quality management for the remainder of the patient's life. Also, survival data are highly variable and make accurate predictions difficult or impossible. Random Survival Forest by repeated tree construction on Bootstrap samples and averaging on the results of these trees reduce the prediction error and cause further generalization of these results. In this retrospective study, the records of 141 patients with epithelial ovarian cancer who were referred to the oncology and radiotherapy ward of Imam Hossein Hospital in Tehran from 2007 to 2018 were used. Random Survival Forest was fitted to the data to investigate the key factors affecting the first recurrence of epithelial ovarian cancer. The mean age of the patients in our study was 52 (23-82) years and the median time to the first recurrence in these was 17 (0.5-127) months, respectively. According to RSF results, using variable importance criterion (VIMP) metastatic tumor with relative importance 2.665 and also using minimal (MD) by depth 2.349, tumor stage with relative importance 1.993 and depth 2.678, and maximum platelet count with relative importance 2.132 and depth 2.683 were the most important variables affecting in the first recurrence of Epithelial Ovarian Cancer. One of the disadvantages of classical methods is the inappropriate fitting of many variables and the need for specific assumptions. More advanced methods such as RSF without the need for any specific assumptions with less prediction error can well explain event variations when exposed to high-dimensional data.

Published
2021-09-26
Section
Articles