A probabilistic model for predicting the probability of no-show in hospital appointments

Alaeddini, Adel; Yang, Kai; Reddy, Chandan; Yu, Susan

doi:10.1007/s10729-011-9148-9

A probabilistic model for predicting the probability of no-show in hospital appointments

Published: 01 February 2011

Volume 14, pages 146–157, (2011)
Cite this article

Health Care Management Science Aims and scope Submit manuscript

Adel Alaeddini¹,
Kai Yang¹,
Chandan Reddy² &
…
Susan Yu³

2146 Accesses
69 Citations
6 Altmetric
Explore all metrics

Abstract

The number of no-shows has a significant impact on the revenue, cost and resource utilization for almost all healthcare systems. In this study we develop a hybrid probabilistic model based on logistic regression and empirical Bayesian inference to predict the probability of no-shows in real time using both general patient social and demographic information and individual clinical appointments attendance records. The model also considers the effect of appointment date and clinic type. The effectiveness of the proposed approach is validated based on a patient dataset from a VA medical center. Such an accurate prediction model can be used to enable a precise selective overbooking strategy to reduce the negative effect of no-shows and to fill appointment slots while maintaining short wait times.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Big Data Analytics in Healthcare

An AI-based Decision Support System for Predicting Mental Health Disorders

Article 28 May 2022

The self-regulating nature of occupancy in ICUs: stochastic homoeostasis

Article 03 May 2018

References

Barron WM (1980) Failed appointments: who misses them, why they are missed, and what can be done. Prim Care 7(4):563–574
Google Scholar
Bean AG, Talaga J (1995) Predicting appointment breaking. J Health Care Mark 15(1):29–34
Google Scholar
Bech M (2005) The economics of non-attendance and the expected effect of charging a fine on non-attendees. Health Policy 74(2):181–191
Article Google Scholar
Bolstad WM (2007) Introduction to Bayesian statistics. Wiley-Interscience, New York
Book Google Scholar
Brockwell P, Davis RA (2009) Time series: theory and methods. Springer Series in Statistics
Campbell JD, Chez RA, Queen TBA, Patron E (2000) The no-show rate in a high-risk obstetric clinic. J Women’s Health Gend-Based Med 9(8):891–895
Article Google Scholar
Cashman SB, Savageau JA, Savageau L, Celeste A, Ferguson W (2004) Patient health status and appointment keeping in an urban community health center. J Health Care Poor Underserved 15:474–488
Article Google Scholar
Cayirli T, Veral E (2003) Outpatient scheduling in health care: a review of the literature. Prod Oper Manag 12(4):519–549
Article Google Scholar
Chakraborty S, Muthuraman K, Mark L (2010) Sequential clinical scheduling with patient no-shows and general service time distributions. IIE Trans 42(5):354–366
Article Google Scholar
Cote MJ (1999) Patient flow and resource utilization in an outpatient clinic. Socio-Econ Plann Sci 33:231–245
Article Google Scholar
Cynthia TR, Nancy HG, Scott C, Donna SJ, Wilcox WD, Adolesc AP (1995) Patient appointment failures in pediatric resident continuity clinics. Pediatr Adolesc Med 149(6):693–695
Google Scholar
Dove HG, Karen CS (1981) The usefulness of patients’ individual characteristics in predicting no-shows in outpatient clinics. Med Care XIX(7):734–740
Article Google Scholar
Dreihera J, Froimovicia M, Bibia Y, Vardya DA, Cicurela A, Cohen AD (2008) Nonattendance in obstetrics and gynecology patients. Gynecol Obstet Investig 66:40–43
Article Google Scholar
Evans M, Hastings N, Peacock B (2000) Statistical distributions, 3rd edn. Wiley-Interscience, New York
Google Scholar
Garuda SR, Javalgi RG, Talluri VS (1998) Tackling no-show behavior: a market driven approach. Health Mark Q 15(4):25–44
Article Google Scholar
Glowacka KJ, Henry RM, May JH (2009) A hybrid data mining/simulation approach for modelling outpatient no-shows in clinic scheduling. J Oper Res Soc 60:1056–1068
Article Google Scholar
Goldman L, Freidin R, Cook EF, Eigner J, Grich P (1982) A multivariate approach to the prediction of no-show behavior in a primary care center. Arch Intern Med 142:563–567
Article Google Scholar
Gupta D, Denton B (2008) Appointment scheduling in health care: challenges and opportunities. IIE Trans 40:800–819
Article Google Scholar
Hassin R, Mendel S (2008) Scheduling arrivals to queues: a single-server model with no-shows. Manage Sci 54(3):565–572
Article Google Scholar
Haupt RL, Ellen S (2004) Practical genetic algorithm, 2nd edn. Wiley, New York
Google Scholar
Hilbe JM (2009) Logistic regression models. Chapman & Hall/CRC Press
Hixon AL, Chapman RW, Nuovo J (1999) Failure to keep clinic appointments: implications for residency education and productivity. Fam Med 31(9):627–630
Google Scholar
Ho C, Lau H (1992) Minimizing total cost in scheduling outpatient appointments. Manag Sci 38(2):1750–1764
Article Google Scholar
Kleinbaum DG, Klein M (2002) Logistic regression a self-learning text, 2nd edn. Springer, New York
Google Scholar
LaGanga LR, Lawrence SR (2007) Clinic overbooking to improve patient access and increase provider productivity. Decis Sci 38:251–276
Article Google Scholar
Lehmann TNO, Aebia A, Lehmann D, Balandraux OM, Stalder H (2007) Missed appointments at a Swiss university outpatient clinic. Public Health 121(10):790–799
Article Google Scholar
Liu N, Ziya S, Kulkarni VG (2009) Dynamic scheduling of outpatient appointments under patient no-shows and cancellations. Manuf Serv Oper Manag 12:347–364
Google Scholar
Moore CG, Wilson-Witherspoon P, Probst JC (2001) Time and money: effects of no-shows at a family practice residency clinic. Fam Med 33(7):522–527
Google Scholar
Muthuraman M, Lawley M, (2008) A stochastic overbooking model for outpatient clinical scheduling with no-shows I. 40(9):820–837
Nadkarni MM, Philbrick JT (2005) Free clinics: a national study. Am J Med Sci 330(1):25–31
Article Google Scholar
Rust CT, Gallups NH, Clark S, Jones DS, Wilcox WD, Adolesc A (1995) Patient appointment failures in pediatric resident continuity clinics. Pediatr Adolesc Med 149(6):693–695
Google Scholar
Simonoff JS (1996) Smoothing methods in statistics, Springer Series in Statistics. Springer, New York
Google Scholar
Wang J (2009) Encyclopedia of data warehousing and mining, 2nd edn. Information Science Reference, Hershey
Google Scholar
Bo Z, Turkcan A, Lin J (2010) Clinic scheduling models with overbooking for patients with heterogeneous no-show probabilities. Ann Oper Res 178(1):121–144
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Industrial & Systems Engineering, Wayne State University, Detroit, MI, 48202, USA
Adel Alaeddini & Kai Yang
Department of Computer Science, Wayne State University, Detroit, MI, 48202, USA
Chandan Reddy
John D Dingell VA Medical Center, Detroit, MI, 48201, USA
Susan Yu

Authors

Adel Alaeddini
View author publications
You can also search for this author in PubMed Google Scholar
Kai Yang
View author publications
You can also search for this author in PubMed Google Scholar
Chandan Reddy
View author publications
You can also search for this author in PubMed Google Scholar
Susan Yu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Adel Alaeddini.

Appendix - Gaussian Mixture Models (GMM) and Expectation Maximization (EM) Algorithm

Gaussian Mixture Models (GMM) assume data points are drawn from a distribution that can be approximated by a mixture of Gaussian distributions. In this regard, assuming Q, the no-show rate of each clinic, is the feature vector, and k is the number of components (clinic clusters), the mixture model can be rewritten as:

$$ p\left( {Q|\Theta } \right) = \sum\nolimits_{{i = 1}}^k {{a_i}prob\left( {Q|{\theta_i}} \right)} $$

(11)

Where $ \left\{ {{a_1},...,{a_k},{\theta_1},...,{\theta_k}} \right\} $ is the collection of parameters with $ 0 \leqslant {a_i} \leqslant 1,\forall i = 1,2,...,k $ and $ \sum\nolimits_{{i = 1}}^k {{a_i} = 1} $ and $ p\left( {Q|{\theta_i}} \right) = \frac{1}{{\sigma \sqrt {{2\pi }} }}\exp \left( { - \frac{{Q - {\mu_i}}}{{2\sigma_i^2}}} \right) $. Having as a set of n, i.i.d samples $ Q = \left\{ {{q^{{(1)}}},{q^{{(2)}}},...,{q^{{(n)}}}} \right\} $ from the above model the log-likelihood function can be rewritten as:

$$ \begin{array}{*{20}{c}} {\log p\left( {Q|{\theta_i}} \right) = } \hfill \\{\log \prod\nolimits_{{j = 1}}^n {p\left( {{q^{{(j)}}}|\Theta } \right)} = \sum\nolimits_{{j = 1}}^n {\log } \sum\nolimits_{{i = 1}}^k {{\alpha_i}p} \left( {{q^{{(j)}}}|{\theta_j}} \right)} \hfill \\\end{array} $$

(12)

Here, the goal is to find Θ that maximizes the log-likelihood function:

$$ {\hat{\Theta }_{{MLE}}} = \arg \;\max \left\{ {\log \,p\left( {Q|\Theta } \right)} \right\} $$

(13)

The surface of the above likelihood function is highly nonlinear, and no closed form solution exists for the above likelihood function. One way to deal with this problem is by introducing a hidden variable Z:

$$ \begin{array}{*{20}{c}} {\log p\left( {Q,Z|{\theta_i}} \right) = } \hfill \\{\sum\nolimits_{{j = 1}}^n {\sum\nolimits_{{i = 1}}^k {z_i^{{(j)}}\log \left[ {{\alpha_i}p\left( {{q^{{(j)}}}|z_i^{{(j)}}{\theta_j}} \right)} \right]} } } \hfill \\\end{array} $$

(14)

and using Expectation Maximization (EM) algorithm as follows [33]:

i.
Initializing parameters Θ
ii.
Iterating the following until convergence:

$$ E - Step:\,Q\left( {\Theta |{\Theta^{{(t)}}}} \right) = {E_z}\log \left[ {p\left( {Q,Z|\Theta } \right)|{\Theta^{{(t)}}}} \right] $$

(15)

$$ M - Step:\,{\Theta^{{\left( {t + 1} \right)}}} = \arg \,\max Q\left( {\Theta |{\Theta^{{(t)}}}} \right) $$

(16)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Alaeddini, A., Yang, K., Reddy, C. et al. A probabilistic model for predicting the probability of no-show in hospital appointments. Health Care Manag Sci 14, 146–157 (2011). https://doi.org/10.1007/s10729-011-9148-9

Download citation

Received: 29 July 2010
Accepted: 18 January 2011
Published: 01 February 2011
Issue Date: June 2011
DOI: https://doi.org/10.1007/s10729-011-9148-9

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A probabilistic model for predicting the probability of no-show in hospital appointments

Abstract

Access this article

Similar content being viewed by others

Big Data Analytics in Healthcare

An AI-based Decision Support System for Predicting Mental Health Disorders

The self-regulating nature of occupancy in ICUs: stochastic homoeostasis

References

Author information

Authors and Affiliations

Corresponding author

Appendix - Gaussian Mixture Models (GMM) and Expectation Maximization (EM) Algorithm

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A probabilistic model for predicting the probability of no-show in hospital appointments

Abstract

Access this article

Similar content being viewed by others

Big Data Analytics in Healthcare

An AI-based Decision Support System for Predicting Mental Health Disorders

The self-regulating nature of occupancy in ICUs: stochastic homoeostasis

References

Author information

Authors and Affiliations

Corresponding author

Appendix - Gaussian Mixture Models (GMM) and Expectation Maximization (EM) Algorithm

Appendix - Gaussian Mixture Models (GMM) and Expectation Maximization (EM) Algorithm

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation