Skip to main content

Advertisement

Log in

A probabilistic model for predicting the probability of no-show in hospital appointments

  • Published:
Health Care Management Science Aims and scope Submit manuscript

Abstract

The number of no-shows has a significant impact on the revenue, cost and resource utilization for almost all healthcare systems. In this study we develop a hybrid probabilistic model based on logistic regression and empirical Bayesian inference to predict the probability of no-shows in real time using both general patient social and demographic information and individual clinical appointments attendance records. The model also considers the effect of appointment date and clinic type. The effectiveness of the proposed approach is validated based on a patient dataset from a VA medical center. Such an accurate prediction model can be used to enable a precise selective overbooking strategy to reduce the negative effect of no-shows and to fill appointment slots while maintaining short wait times.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  1. Barron WM (1980) Failed appointments: who misses them, why they are missed, and what can be done. Prim Care 7(4):563–574

    Google Scholar 

  2. Bean AG, Talaga J (1995) Predicting appointment breaking. J Health Care Mark 15(1):29–34

    Google Scholar 

  3. Bech M (2005) The economics of non-attendance and the expected effect of charging a fine on non-attendees. Health Policy 74(2):181–191

    Article  Google Scholar 

  4. Bolstad WM (2007) Introduction to Bayesian statistics. Wiley-Interscience, New York

    Book  Google Scholar 

  5. Brockwell P, Davis RA (2009) Time series: theory and methods. Springer Series in Statistics

  6. Campbell JD, Chez RA, Queen TBA, Patron E (2000) The no-show rate in a high-risk obstetric clinic. J Women’s Health Gend-Based Med 9(8):891–895

    Article  Google Scholar 

  7. Cashman SB, Savageau JA, Savageau L, Celeste A, Ferguson W (2004) Patient health status and appointment keeping in an urban community health center. J Health Care Poor Underserved 15:474–488

    Article  Google Scholar 

  8. Cayirli T, Veral E (2003) Outpatient scheduling in health care: a review of the literature. Prod Oper Manag 12(4):519–549

    Article  Google Scholar 

  9. Chakraborty S, Muthuraman K, Mark L (2010) Sequential clinical scheduling with patient no-shows and general service time distributions. IIE Trans 42(5):354–366

    Article  Google Scholar 

  10. Cote MJ (1999) Patient flow and resource utilization in an outpatient clinic. Socio-Econ Plann Sci 33:231–245

    Article  Google Scholar 

  11. Cynthia TR, Nancy HG, Scott C, Donna SJ, Wilcox WD, Adolesc AP (1995) Patient appointment failures in pediatric resident continuity clinics. Pediatr Adolesc Med 149(6):693–695

    Google Scholar 

  12. Dove HG, Karen CS (1981) The usefulness of patients’ individual characteristics in predicting no-shows in outpatient clinics. Med Care XIX(7):734–740

    Article  Google Scholar 

  13. Dreihera J, Froimovicia M, Bibia Y, Vardya DA, Cicurela A, Cohen AD (2008) Nonattendance in obstetrics and gynecology patients. Gynecol Obstet Investig 66:40–43

    Article  Google Scholar 

  14. Evans M, Hastings N, Peacock B (2000) Statistical distributions, 3rd edn. Wiley-Interscience, New York

    Google Scholar 

  15. Garuda SR, Javalgi RG, Talluri VS (1998) Tackling no-show behavior: a market driven approach. Health Mark Q 15(4):25–44

    Article  Google Scholar 

  16. Glowacka KJ, Henry RM, May JH (2009) A hybrid data mining/simulation approach for modelling outpatient no-shows in clinic scheduling. J Oper Res Soc 60:1056–1068

    Article  Google Scholar 

  17. Goldman L, Freidin R, Cook EF, Eigner J, Grich P (1982) A multivariate approach to the prediction of no-show behavior in a primary care center. Arch Intern Med 142:563–567

    Article  Google Scholar 

  18. Gupta D, Denton B (2008) Appointment scheduling in health care: challenges and opportunities. IIE Trans 40:800–819

    Article  Google Scholar 

  19. Hassin R, Mendel S (2008) Scheduling arrivals to queues: a single-server model with no-shows. Manage Sci 54(3):565–572

    Article  Google Scholar 

  20. Haupt RL, Ellen S (2004) Practical genetic algorithm, 2nd edn. Wiley, New York

    Google Scholar 

  21. Hilbe JM (2009) Logistic regression models. Chapman & Hall/CRC Press

  22. Hixon AL, Chapman RW, Nuovo J (1999) Failure to keep clinic appointments: implications for residency education and productivity. Fam Med 31(9):627–630

    Google Scholar 

  23. Ho C, Lau H (1992) Minimizing total cost in scheduling outpatient appointments. Manag Sci 38(2):1750–1764

    Article  Google Scholar 

  24. Kleinbaum DG, Klein M (2002) Logistic regression a self-learning text, 2nd edn. Springer, New York

    Google Scholar 

  25. LaGanga LR, Lawrence SR (2007) Clinic overbooking to improve patient access and increase provider productivity. Decis Sci 38:251–276

    Article  Google Scholar 

  26. Lehmann TNO, Aebia A, Lehmann D, Balandraux OM, Stalder H (2007) Missed appointments at a Swiss university outpatient clinic. Public Health 121(10):790–799

    Article  Google Scholar 

  27. Liu N, Ziya S, Kulkarni VG (2009) Dynamic scheduling of outpatient appointments under patient no-shows and cancellations. Manuf Serv Oper Manag 12:347–364

    Google Scholar 

  28. Moore CG, Wilson-Witherspoon P, Probst JC (2001) Time and money: effects of no-shows at a family practice residency clinic. Fam Med 33(7):522–527

    Google Scholar 

  29. Muthuraman M, Lawley M, (2008) A stochastic overbooking model for outpatient clinical scheduling with no-shows I. 40(9):820–837

  30. Nadkarni MM, Philbrick JT (2005) Free clinics: a national study. Am J Med Sci 330(1):25–31

    Article  Google Scholar 

  31. Rust CT, Gallups NH, Clark S, Jones DS, Wilcox WD, Adolesc A (1995) Patient appointment failures in pediatric resident continuity clinics. Pediatr Adolesc Med 149(6):693–695

    Google Scholar 

  32. Simonoff JS (1996) Smoothing methods in statistics, Springer Series in Statistics. Springer, New York

    Google Scholar 

  33. Wang J (2009) Encyclopedia of data warehousing and mining, 2nd edn. Information Science Reference, Hershey

    Google Scholar 

  34. Bo Z, Turkcan A, Lin J (2010) Clinic scheduling models with overbooking for patients with heterogeneous no-show probabilities. Ann Oper Res 178(1):121–144

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Adel Alaeddini.

Appendix - Gaussian Mixture Models (GMM) and Expectation Maximization (EM) Algorithm

Appendix - Gaussian Mixture Models (GMM) and Expectation Maximization (EM) Algorithm

Gaussian Mixture Models (GMM) assume data points are drawn from a distribution that can be approximated by a mixture of Gaussian distributions. In this regard, assuming Q, the no-show rate of each clinic, is the feature vector, and k is the number of components (clinic clusters), the mixture model can be rewritten as:

$$ p\left( {Q|\Theta } \right) = \sum\nolimits_{{i = 1}}^k {{a_i}prob\left( {Q|{\theta_i}} \right)} $$
(11)

Where \( \left\{ {{a_1},...,{a_k},{\theta_1},...,{\theta_k}} \right\} \) is the collection of parameters with \( 0 \leqslant {a_i} \leqslant 1,\forall i = 1,2,...,k \) and \( \sum\nolimits_{{i = 1}}^k {{a_i} = 1} \) and \( p\left( {Q|{\theta_i}} \right) = \frac{1}{{\sigma \sqrt {{2\pi }} }}\exp \left( { - \frac{{Q - {\mu_i}}}{{2\sigma_i^2}}} \right) \). Having as a set of n, i.i.d samples \( Q = \left\{ {{q^{{(1)}}},{q^{{(2)}}},...,{q^{{(n)}}}} \right\} \) from the above model the log-likelihood function can be rewritten as:

$$ \begin{array}{*{20}{c}} {\log p\left( {Q|{\theta_i}} \right) = } \hfill \\{\log \prod\nolimits_{{j = 1}}^n {p\left( {{q^{{(j)}}}|\Theta } \right)} = \sum\nolimits_{{j = 1}}^n {\log } \sum\nolimits_{{i = 1}}^k {{\alpha_i}p} \left( {{q^{{(j)}}}|{\theta_j}} \right)} \hfill \\\end{array} $$
(12)

Here, the goal is to find Θ that maximizes the log-likelihood function:

$$ {\hat{\Theta }_{{MLE}}} = \arg \;\max \left\{ {\log \,p\left( {Q|\Theta } \right)} \right\} $$
(13)

The surface of the above likelihood function is highly nonlinear, and no closed form solution exists for the above likelihood function. One way to deal with this problem is by introducing a hidden variable Z:

$$ \begin{array}{*{20}{c}} {\log p\left( {Q,Z|{\theta_i}} \right) = } \hfill \\{\sum\nolimits_{{j = 1}}^n {\sum\nolimits_{{i = 1}}^k {z_i^{{(j)}}\log \left[ {{\alpha_i}p\left( {{q^{{(j)}}}|z_i^{{(j)}}{\theta_j}} \right)} \right]} } } \hfill \\\end{array} $$
(14)

and using Expectation Maximization (EM) algorithm as follows [33]:

  1. i.

    Initializing parameters Θ

  2. ii.

    Iterating the following until convergence:

$$ E - Step:\,Q\left( {\Theta |{\Theta^{{(t)}}}} \right) = {E_z}\log \left[ {p\left( {Q,Z|\Theta } \right)|{\Theta^{{(t)}}}} \right] $$
(15)
$$ M - Step:\,{\Theta^{{\left( {t + 1} \right)}}} = \arg \,\max Q\left( {\Theta |{\Theta^{{(t)}}}} \right) $$
(16)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Alaeddini, A., Yang, K., Reddy, C. et al. A probabilistic model for predicting the probability of no-show in hospital appointments. Health Care Manag Sci 14, 146–157 (2011). https://doi.org/10.1007/s10729-011-9148-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10729-011-9148-9

Keywords

Navigation