Journal of Biostatistics and Epidemiology https://publish.kne-publishing.com/index.php/jbe <p><strong data-stringify-type="bold">All the manuscripts should be submitted through the Journal Primary Website at <a href="https://jbe.tums.ac.ir/index.php/jbe/about/submissions">https://jbe.tums.ac.ir/index.php/jbe/about/submissions</a></strong></p> Tehran University of Medical Sciences en-US Journal of Biostatistics and Epidemiology 2383-4196 Iron; the Missing Link of Depression Prevention and Treatment: A Systematic Review and Meta-Analysis https://publish.kne-publishing.com/index.php/jbe/article/view/17150 <p><strong>Introduction:</strong> To identify the effect of Iron as a preventive and therapeutic agent on depression and other hematological indices by a systematic review and meta-analysis.</p> <p><strong>Methods:</strong> International databases like Web of Science, PubMed, Cochrane, International Clinical Trials Registry Platform, Clinicaltrials.gov, and Scopus were searched until 27 July 2024 to identify eligible articles with the appropriate Medical Subject Headings (MeSH). The risk of bias tool for randomized trials (RoB 2) was used for precise assessment. Heterogeneity was determined using Cochran’s Q-test and the I2 index. To assess source of heterogeneity, meta-regression was used. The pooled standardized mean difference (PSMD) was calculated by considering the random effects model.</p> <p><strong>Results:</strong> of 2154 studies,14 studies were included in systematic review and 6 studies were excluded from analysis due to lack of data for calculating PSMD and finally, 8 studies were included in meta-analysis. Based on the results, iron therapy led to improvement in depression symptoms (PSMD = -0.18; 95% CI: -0.32 to -0.03). The iron therapy led to increasing the blood level of Iron (PSMD = 0.57; 95% CI: 0.19 to 0.95), Ferritin (PSMD = 0.55; 95% CI: 0.25 to 0.85), HCT (PSMD = 0.40; 95% CI: 0.18 to 0.61), MCV (SMD = 0.67; 95% CI: 0.18 to 1.15) and Transferrin saturation (PSMD:0.26; 95% CI: 0.02 to 0.50). Based on the meta-regression result, the sample size, participant age, and publication year had no significant role in heterogeneity between studies.</p> <p><strong>Conclusion:</strong> The use of iron supplements in patients with depression can be considered. However, there is a need to conduct further studies involving various kinds of depression.</p> Reza Pakzad Mohammadamin Jandaghian-Bidgoli Fatemeh Abdi Negin Shaterian Copyright (c) 2024 Journal of Biostatistics and Epidemiology 2024-12-07 2024-12-07 10.18502/jbe.v10i1.17150 The Effects of High Protein Intake on Cardiovascular Risk Factors and Weight Loss in Low Caloric Diets in Obese Adults: A Systematic Review https://publish.kne-publishing.com/index.php/jbe/article/view/17151 <p><strong>Introduction:</strong> This systematic review has been undertaken in order to assess the effects of hypocaloric, high- protein diets on weight loss and cardiovascular risk factors such as serum lipid levels in metabolically healthy obese adults. The primary outcomes measured include changes in pre-and post- diet mean BMI, LDL-C, HDL-C, TAG, and TC levels.</p> <p><strong>Methods:</strong> Four databases including: Embase, MEDLINE (via PubMed), Cochrane and Web of Science were searched with no restrictions on language or publication period. Clinicaltrials.gov was also searched in order to identify unpublished or on-going studies.</p> <p><strong>Results:</strong> Three of four studies included in this systematic review noted a significantly greater loss in pre- and post- diet mean BMI levels in the hypocaloric, high-protein diet group as compared to hypocaloric, non-high protein diets (control). Whilst pre- and post-diet mean LDL-C, HDL-C, TAG, and TC levels did not differ significantly among hypocaloric, high-protein and control diet groups.</p> <p><strong>Conclusion:</strong> Hypocaloric, high-protein diets had an unclear effect on blood-lipid levels as compared to control. Weight loss however was significantly greater in the hypocaloric, high-protein group as compared to other hypocaloric, non-high-protein diet groups.</p> Martin Lauran Ali Moghadas Jafari Kosar Mohamed Ali Mostafa Hosseini Copyright (c) 2024 Journal of Biostatistics and Epidemiology 2024-12-07 2024-12-07 10.18502/jbe.v10i1.17151 Geospatial Analysis of COVID-19 Death Rate and Influencing Factors in the Middle East and North Africa Region https://publish.kne-publishing.com/index.php/jbe/article/view/17152 <p><strong>Introduction:</strong> This study uses advanced geospatial modeling to analyze key determinants of COVID-19 death cases across the Middle East and North Africa (MENA) region, aiming to reveal spatial patterns and inform targeted interventions for enhanced public health response.</p> <p><strong>Methods:</strong> This study employs GIS and geostatistical models, including OLS, SLM, SEM, GWR, and MGWR, to analyze spatial and demographic determinants of COVID-19 mortality across MENA. By examining socioeconomic, medical, and demographic factors, it identifies key drivers and explores spatially non-stationary relationships impacting death rates.</p> <p><strong>Results:</strong> The study found that hospital bed allocation, unemployment rate, and vaccination doses positively correlate with COVID-19 death cases in MENA, likely due to better reporting and healthcare access. The OLS model (R² = 0.7346) highlighted spatial autocorrelation, prompting the use of SLM and SEM, which confirmed predictor significance. GWR (R² = 0.8140) and MGWR (R² = 0.8187) revealed spatially non-stationary relationships, with hospital beds impacting the northwest (GWR) and southwest (MGWR). Unemployment was significant in the northeast (Iran, Turkey) and northwest (Morocco), while vaccination doses were notably influential in Iran and Somalia.</p> <p><strong>Conclusion:</strong> This study emphasizes the significant roles of healthcare capacity, socioeconomic factors, and vaccination coverage in influencing COVID-19 mortality across MENA. It highlights the vulnerability of healthcare systems in developing countries and underscores the need for targeted resource allocation. Using spatial models like GWR and MGWR, the research reveals regional variations, especially in the northwest, advocating for tailored, region-specific interventions. By integrating GIS and geostatistical models, this analysis lays a foundation for future research on COVID-19 dynamics, providing crucial insights to inform policy measures for better public health crisis management</p> Mustafa Shebani Aboalyem Mohd Tahir Ismail Copyright (c) 2024 Journal of Biostatistics and Epidemiology 2024-12-07 2024-12-07 10.18502/jbe.v10i1.17152 DeepWei-Cu: A Deep Weibull Network for Cure Fraction Models https://publish.kne-publishing.com/index.php/jbe/article/view/17153 <p><strong>Introduction:</strong> Survival analysis including cure fraction subgroups is heavily used in different fields like economics, engineering and medicine. The main core of the analysis is to understand the relationship between the covariates and the survival function taking into consideration censoring and long-term survival. The analysis can be performed using traditional statistical models or neural networks. Recently, neural network has attracted attention in analyzing lifetime data due to its ability of efficiently estimating the survival function under the existence of complex covariates. The goal of this study is to develop a parametric neural network that can sufficiently predict survival data with cure fraction. To the best of our knowledge, this is the first time a parametric neural network is introduced to analyze mixture cure fraction models.</p> <p><strong>Methods:</strong> In this paper, we introduce a novel neural network based on mixture cure fraction Weibull loss function. Results: Alzheimer disease dataset as long as synthetic dataset are used to study the efficiency of the model. We compared the results using goodness of fit methods in both datasets with Weibull regression.</p> <p><strong>Conclusion:</strong> The proposed neural network has the flexibility of analyzing continuous data without discretization. Also, it has the advantage of using Weibull distribution properties. For example, it can analyze data with different hazard rates (monotonically decreasing, monotonically increasing and constant). Comparing the results with Weibull regression, the proposed neural network performed better.</p> Ola Abuelamayem Copyright (c) 2024 Journal of Biostatistics and Epidemiology 2024-12-08 2024-12-08 10.18502/jbe.v10i1.17153 Stress-Strength Reliability of Two-Parameter Exponential Distribution Based on Progres- sively Type-II Censored Data https://publish.kne-publishing.com/index.php/jbe/article/view/17154 <p><strong>Introduction:</strong> Stress-strength models has achieved considerable attention in recent years due to its applicability in various areas like engineering, quality control, biology, genetics, medicine etc. This paper investigates estimation of the stress-strength reliability parameter R=P(Y&lt;X) in two-parameter exponential distributions under progressively type-II censored samples.</p> <p><strong>Methods:</strong> The maximum likelihood and the best linear unbiased estimates of P(Y&lt;X) are obtained, and the Bayes estimates of P(Y&lt;X) are computed under the squared error, linear-exponential, and Stein loss functions. Also, confidence intervals of stress-strength reliability such as the bootstrap confidence intervals, highest posterior density credible interval, and confidence interval based on the generalized pivotal quantity are obtained. Results: Using a simulation study, the point estimators and confidence intervals are evaluated and compared. A set of real data is presented for better clarification of the issue.</p> <p><strong>Conclusion:</strong> The results demonstrated that with increasing the sample size, in almost cases the estimated risk of all the estimators decrease. Also, in almost all cases the Bayes estimator under the linear-exponential loss function has smaller estimated risk than the other estimators. Based on our simulation, the expected lengths of all intervals tend to decrease when the sample size increases. Moreover, the highest posterior density confidence intervals are shorter than the others intervals for all the values of P(Y&lt;X).</p> Sajad Rostamian Copyright (c) 2024 Journal of Biostatistics and Epidemiology 2024-12-08 2024-12-08 10.18502/jbe.v10i1.17154 Mining Hypertension Predictors using Decision Tree: Baseline Data of Kharameh Cohort Study https://publish.kne-publishing.com/index.php/jbe/article/view/17155 <p><strong>Introduction:</strong> Hypertension is a serious chronic disease and an important risk factor for many health problems. This study aimed to investigate the factors associated with hypertension using a decision-tree algorithm.</p> <p><strong>Methods:</strong> Methods: This cross-sectional study was conducted through the census in Kharameh City between 2014 and 2017. The study included 2510 hypertensive and 7840 non-hypertensive individuals. To create the decision tree, 70% of the cases were randomly allocated to the training dataset. In comparison, the remaining 30% were used as the testing dataset for the performance evaluation of the decision tree. Two models were assessed. In the first model (model I), 15 variables including age, gender, body mass index (BMI), years of education, occupation status, marital status, family history of hypertension, physical activity, total energy, number of meals, salt, oil type, drug use, alcohol use, and smoke entered into the model. in the second model (model II) 16 variables including age, gender, BMI and Blood factors as Hematocrit (HCT), Mean corpuscular hemoglobin concentration (MCHC), Platelet Count (PLT), Fasting blood sugar (FBS), Blood Urea Nitrogen (BUN), creatinine (Cr), triglycerides (TG), cholesterol (CHOL), Alkaline phosphatase (ALP), High-density lipoprotein (HDL), Gamma-glutamyl transpeptidase (GGT), low-density lipoproteins (LDL) and Urinary specific gravity (SG) were considered. A confusion matrix was used to measure the performance of the decision tree. Additionally, accuracy, sensitivity, specificity, and the receiver operating characteristics (ROC) curve were determined to compare the models.</p> <p><strong>Results:</strong> For the model I, the accuracy, sensitivity, specificity and AUC value were 79.2%(77.8-80.6), 82.4%(80.1-84.5), 78.24%(76.4-80), and 0.80%(0.79-0.82), respectively. For model II, the corresponding values were 79.5%(78.2-80.8), 81.0%(78.3-83.6)79.0%(77.5-80.5)and 0.80%(0.79-0.81), respectively. Confusion matrix of model I showed that of the 1188 cases with hypertension in the training data set, 979 cases were classified correctly and, for model II of the 2812 non-hypertension cases, 2222 cases were classified correctly.</p> <p><strong>Conclusion:</strong> We have suggested a decision tree model to identify the risk factors associated with hypertension. This model can be useful for early screening and improving preventive and curative health services in health promotion</p> Abbas Rezaianzadeh Samane Nematolahi Maryam Jalali Shayan Rezaeianzadeh Masoumeh Ghoddusi Johari Seyed Vahid Hosseini Copyright (c) 2024 Journal of Biostatistics and Epidemiology 2024-12-08 2024-12-08 10.18502/jbe.v10i1.17155 A Multi-Method Comparison of Machine Learning in Predicting Pharmacokinetic Parameters: A Simulation Study https://publish.kne-publishing.com/index.php/jbe/article/view/17156 <p><strong>Introduction:</strong> One important aim of population pharmacokinetics (PK) and pharmacodynamics (PD) is the identification and quantification of the relationships between the parameter and covariates to improve the predictive performance of the population PK/PD modeling. Several new mathematical methods have been developed in pharmacokinetics in recent years which indicated that the machine learning-based methods are an appealing tool for analyzing PK/PD data.</p> <p><strong> Methods:</strong> This simulation-base study aims to determine whether machine learning methods, including support vector regression (SVR) and Random forest (RF) which are specifically designed for the prediction of blood serum concentration or clearance, could be an effective replacement for the Lasso covariate selection method in nonlinear mixed effect models. Accordingly, the predictive performance of penalized regression Lasso, SVR, and RF regression was compared to detect the associations between clearance and model covariates. PK data was simulated from a one-compartment model with oral administration. Covariates were created by sampling from a multivariate standard normal distribution with different levels of correlation. The true covariates influenced only clearance at different magnitudes. Lasso, RF, and SVR were compared in terms of mean absolute prediction error (MAE).</p> <p><strong>Results:</strong> The results show that SVR performed the best in small data sets, even in those in which a high correlation existed between covariates. This makes SVR a promising method for covariate selection in nonlinear mixed-effect models.</p> <p><strong>Conclusion:</strong> The Lasso method offered a higher MAE, making it less promising than RF and SVR, especially when dealing with a high correlation between covariates and a low number of individuals.</p> Marziyeh Doostfatemeh Kamal Amini Elham Haem Copyright (c) 2024 Journal of Biostatistics and Epidemiology 2024-12-08 2024-12-08 10.18502/jbe.v10i1.17156 Bounded Multivariate Contaminated Normal Mixture Model with an Application in Skin Cancer Detection https://publish.kne-publishing.com/index.php/jbe/article/view/17157 <p><strong>Introduction:</strong> In real-world datasets, outliers are a common occurrence that can have a significant impact on the accuracy and reliability of statistical analyses. Detecting these outliers and developing robust models to handle their presence is a crucial challenge in data analysis. For instance, natural images may have complex distributions of values due to environmental factors like noise and illumination, resulting in objects with overlapping regions and non-trivial contours that cannot be accurately described by Gaussian mixture models. In many real life applications, observed data always fall in bounded support regions. This leads to the idea of bounded support mixture models. Motivated by the aforementioned observations, we introduce a bounded multivariate cntaminated normal distribution for fitting data with non-Gaussian distributions, asymmetry, and bounded support which makes finite mixture models more robust to fitting, since rare observations are given less importance in calculations.</p> <p><strong>Methods:</strong> A family of finite mixtures of bounded multivariate contaminated normal distributions is introduced. The model is well-suited for computer vision and pattern recognition problems due to its heavily-tailed and bounded nature, providing flexibility in modeling data in the presence of outliers. A feasible expectation- maximization algorithm is developed to compute the maximum likelihood estimates of the model parameters using a selection mechanism.</p> <p><strong>Results:</strong> The proposed methodology is validated by conducting experiments on both simulated data and two real natural skin cancer images. We estimate the parameters by the proposed expectation-maximization algorithm. The obtained results shown that the proposed model has successfully enhanced accuracy in segmenting skin lesions.</p> <p><strong>Conclusion:</strong> The reliable model-based clustering using finite mixtures of bounded multivariate contaminated normal distributions is introduced. An expectation-maximization algorithm was created to estimate parameters, with closed-form expressions utilized at the E-step. Practical tests on images for skin cancer detection showed enhanced accuracy in delineating skin lesions.</p> Abbas Mahdavi Copyright (c) 2024 Journal of Biostatistics and Epidemiology 2024-12-08 2024-12-08 10.18502/jbe.v10i1.17157 Estimating Causal Effect of Two-Dose COVID-19 Vaccination on Hospitalization: A Propensity Score Matching Approach https://publish.kne-publishing.com/index.php/jbe/article/view/17158 <p><strong>Introduction:</strong> To estimate the effectiveness of two-dose COVID-19 vaccination in reducing hospitalization, accounting for complex confounding factors in observational studies.</p> <p><strong>Methods:</strong> Researchers applied propensity score methods to adjust for confounding variables, comparing their performance to traditional covariate adjustment methods. Multiple Logistic Regression and Propensity Score Matching were employed to analyze the data, ensuring a balanced comparison between vaccinated and unvaccinated groups.</p> <p><strong>Results:</strong> Both analytical methods demonstrated a significant reduction in the likelihood of hospitalization among vaccinated individuals. The adjusted odds ratios (OR) were 0.29 (95% CI: 0.26, 0.31) via logistic regression and 0.32 (95% CI: 0.30, 0.34) using propensity score matching.</p> <p><strong>Conclusion:</strong> The study confirms the effectiveness of two-dose COVID-19 vaccination in decreasing hospitalization. It highlights the importance of using meticulous approaches like propensity score methods to assess real-world impacts in complex observational data settings</p> Mahboobeh Taherizadeh Mohammad Taghi Shakeri Saeed Akhlaghi Ehsan Mosa Farkhani Copyright (c) 2024 Journal of Biostatistics and Epidemiology 2024-12-08 2024-12-08 10.18502/jbe.v10i1.17158