Health

Volume 2, Issue 7 (July 2010)

ISSN Print: 1949-4998   ISSN Online: 1949-5005

Google-based Impact Factor: 0.81  Citations  

Statistical models for predicting number of involved nodes in breast cancer patients

HTML  Download Download as PDF (Size: 228KB)  PP. 641-651  
DOI: 10.4236/health.2010.27098    5,710 Downloads   10,193 Views  Citations

Affiliation(s)

.

ABSTRACT

Clinicians need to predict the number of involved nodes in breast cancer patients in order to ascertain severity, prognosis, and design subsequent treatment. The distribution of involved nodes often displays over-dispersion—a larger variability than expected. Until now, the negative binomial model has been used to describe this distribution assuming that over-dispersion is only due to unobserved heterogeneity. The distribution of involved nodes contains a large proportion of excess zeros (negative nodes), which can lead to over-dispersion. In this situation, alternative models may better account for over-dispersion due to excess zeros. This study examines data from 1152 patients who underwent axillary dissections in a tertiary hospital in India during January 1993-January 2005. We fit and compare various count models to test model abilities to predict the number of involved nodes. We also argue for using zero inflated models in such populations where all the excess zeros come from those who have at some risk of the outcome of interest. The negative binomial regression model fits the data better than the Poisson, zero hurdle/inflated Poisson regression models. However, zero hurdle/inflated negative binomial regression models predicted the number of involved nodes much more accurately than the negative binomial model. This suggests that the number of involved nodes displays excess variability not only due to unobserved heterogeneity but also due to excess negative nodes in the data set. In this analysis, only skin changes and primary site were associated with negative nodes whereas parity, skin changes, primary site and size of tumor were associated with a greater number of involved nodes. In case of near equal performances, the zero inflated negative binomial model should be preferred over the hurdle model in describing the nodal frequency because it provides an estimate of negative nodes that are at “high-risk” of nodal involvement.

Share and Cite:

Dwivedi, A. , Dwivedi, S. , Deo, S. and Shukla, R. (2010) Statistical models for predicting number of involved nodes in breast cancer patients. Health, 2, 641-651. doi: 10.4236/health.2010.27098.

Cited by

[1] The assessment of frailty and malnutrition in vascular surgery
2023
[2] Count Data Regression Modelling: An Application to Monkeypox Confirmed Cases
Journal of Biostatistics and Epidemiology, 2023
[3] How to write statistical analysis section in medical research
Journal of Investigative medicine, 2022
[4] Effect of Household Cooking Energy Poverty on Child Respiratory Health in Pakistan
ul-Ann, S Mahfooz - iRASD Journal of Economics, 2022
[5] Random effects models of lymph node metastases in breast cancer: quantifying the roles of covariates and screening using a continuous growth model
2021
[6] Truncated Binomial Model to Estimate the Effect of Interactive Workshop on Cognitive Abilities of Elderly People Staying in Old Age Homes in Delhi, India
International Journal of Statistics and Applications, 2020
[7] Count data regression modeling: an application to spontaneous abortion
2020
[8] Analyzing discontinuities in longitudinal count data: A multilevel generalized linear mixed model.
2020
[9] Evaluating related Factors to the Number of Involved Lymph Nodes in Patients with Breast Cancer Using Zero-Inflated Negative Binomial Regression Model
2020
[10] Medical Statistics: An AZ Companion
2018
[11] Disease-related malnutrition and nutritional assessment in clinical practice
2018
[12] Bayesian Zero-Inflated Poisson model for prognosis of demographic factors associated with using crystal meth in Tehran population
2018
[13] Statistical Analysis of Microbiome Data with R
Statistical Analysis of Microbiome Data with R, 2018
[14] Modeling Zero-Inflated Microbiome Data
Statistical Analysis of Microbiome Data with R, 2018
[15] High-density affordable housing impact investing: a best-in-class project screening credit risk management model
2017
[16] Hospital-Based Study of the Frequency and Risk Factors of Stroke Recurrence in Two Years in China
2017
[17] Risk for malnutrition in patients prior to vascular surgery
The American Journal of Surgery, 2017
[18] Bayesian inference for bivariate Poisson data with zero-inflation.
2017
[19] 佇列閾值及寇斯多相統計模型探討與大腸直腸癌早期發現和住院之相關時間分布
臺灣大學流行病學與預防醫學研究所學位論文, 2015
[20] Comparison of [99mTc]Tilmanocept and Filtered [99mTc]Sulfur Colloid for Identification of SLNs in Breast Cancer Patients
2015
[21] Comparison of [99mTc] Tilmanocept and Filtered [99mTc] Sulfur Colloid for Identification of SLNs in Breast Cancer Patients
Annals of surgical oncology, 2015
[22] Methods for estimating relative risk in studies of common binary outcomes
Journal of Applied Statistics, 2014
[23] Statistical Analysis for Overdispersed Medical Count Data
International Journal of Mathematical and Computational Sciences, 2014
[24] On Classifying At Risk Latent Zeros Using Zero Inflated Models
Journal of Data Science, 2014
[25] 零膨脹統計模型研究乳癌疾病進展
臺灣大學流行病學與預防醫學研究所學位論文, 2014
[26] Changes in health risk behaviors for males and females from early adolescence through early adulthood.
Health psychology, 2013
[27] Zero inflated statistical count models for analysing the costs imposed by GERD and dyspepsia
Arab Journal of Gastroenterology, 2013
[28] Statistical count models for prognosis the risk factors of hepatitis C
Gastroenterology and Hepatology from bed to bench, 2013
[29] Specific count model for investing the related factors of cost of GERD and functional dyspepsia
Gastroenterology and Hepatology from bed to bench, 2013
[30] Event history, spatial analysis and count data methods for empirical research in information systems
Information Technology and Management, 2012
[31] Modeling count outcomes from hiv risk reduction interventions: a comparison of competing statistical models for count responses
AIDS research and treatment, 2012
[32] Using statistical models to assess medical cost of hepatitis C virus
Gastroenterology and Hepatology from bed to bench, 2012
[33] Research Article Modeling Count Outcomes from HIV Risk Reduction Interventions: A Comparison of Competing Statistical Models for Count Responses
Beedy, J Ma, C Feng, W Cross, 2012
[34] Modelling Of Lymph Node Count In Endometrial Cancer Patients Using Zero Inflated Generalized Poisson Model

Copyright © 2025 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.