Published in: Health Services and Outcomes Research Methodology 3/2018

5 June 2018

Are marginalized two-part models superior to non-marginalized two-part models for count data with excess zeroes? Estimation of marginal effects, model misspecification, and model selection

Authors: Xueyan Liu, Bo Zhang, Li Tang, Zhiwei Zhang, Ning Zhang, Jeroan J. Allison, Deo Kumar Srivastava, Hui Zhang


Abstract

The marginalized two-part models, including the marginalized zero-inflated Poisson and negative binomial models, have been proposed in the literature for modelling cross-sectional healthcare utilization count data with excess zeroes and overdispersion. The motivation for these proposals was to directly capture the overall marginal effects and to avoid post-modelling effect calculations that are needed for the non-marginalized conventional two-part models. However, are marginalized two-part models superior to non-marginalized two-part models because of their structural property? Is it true that the marginalized two-part models can provide direct marginal inference? This article aims to answer these questions through a comprehensive investigation. We first summarize the existing non-marginalized and marginalized two-part models and then develop marginalized hurdle Poisson and negative binomial models for cross-sectional count data with abundant zero counts. Our interest in the investigation lies particularly in the (average) marginal effect and (average) incremental effect and the comparison of these effects. The estimators of these effects are presented, and variance estimators are derived by using delta methods and Taylor series approximations. Though the marginalized models attract attention because of the alleged convenience of direct marginal inference, we provide evidence for the impact of model misspecification of the marginalized models over the conventional models, and provide evidence for the importance of goodness-of-fit evaluation and model selection in differentiating between the marginalized and non-marginalized models. An empirical analysis of the German Socioeconomic Panel data is presented.
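
As an illustration of the distinction the abstract draws, the following sketch contrasts the usual zero-inflated Poisson (ZIP) parameterization with its marginalized version. The notation ($\psi_i$ for the excess-zero probability, $\mu_i$ for the latent Poisson mean, $\nu_i$ for the overall mean, the coefficients $\gamma$, $\beta$, $\alpha$, and a single continuous covariate $x_i$ entering both parts) is assumed here for illustration and is not taken from the article.

```latex
\begin{align*}
% Conventional ZIP: covariates act on the latent Poisson mean \mu_i
\operatorname{logit}(\psi_i) &= \gamma_0 + \gamma_1 x_i, \qquad
\log(\mu_i) = \beta_0 + \beta_1 x_i, \\
E(Y_i \mid x_i) &= (1 - \psi_i)\,\mu_i, \\
% The marginal effect on the overall mean mixes both sets of coefficients
\frac{\partial E(Y_i \mid x_i)}{\partial x_i}
  &= (1 - \psi_i)\,\mu_i\,\bigl(\beta_1 - \psi_i\,\gamma_1\bigr). \\[1ex]
% Marginalized ZIP: covariates act on the overall mean \nu_i = (1 - \psi_i)\mu_i
\log(\nu_i) &= \alpha_0 + \alpha_1 x_i, \qquad
\nu_i = E(Y_i \mid x_i), \\
\frac{\partial E(Y_i \mid x_i)}{\partial x_i} &= \alpha_1\,\nu_i .
\end{align*}
```

Under these assumptions, $\exp(\alpha_1)$ can be read directly as a ratio of overall means, but the marginal effect on the count scale, $\alpha_1 \nu_i$, still varies with $x_i$. An average marginal effect would therefore still have to be computed over the sample in either parameterization.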
Metadata
Publisher
Springer US
Print ISSN: 1387-3741
Electronic ISSN: 1572-9400
DOI
https://doi.org/10.1007/s10742-018-0183-6