Reconsidering heterogeneity in panel data estimators of the stochastic frontier model

https://doi.org/10.1016/j.jeconom.2004.05.003Get rights and content

Abstract

This paper examines several extensions of the stochastic frontier that account for unmeasured heterogeneity as well as firm inefficiency. The fixed effects model is extended to the stochastic frontier model using results that specifically employ its nonlinear specification. Based on Monte Carlo results, we find that the incidental parameters problem operates on the coefficient estimates in the fixed effects stochastic frontier model in ways that are somewhat at odds with other familiar results. We consider a special case of the random parameters model that produces a random effects model that preserves the central feature of the stochastic frontier model and accommodates heterogeneity. We then examine random parameters and latent class models. In these cases, explicit models for firm heterogeneity are built into the stochastic frontier. Comparisons with received results for these models are presented in an application to the U.S. banking industry.

Introduction

The developments reported in this paper were motivated by a study undertaken by the author with the World Health Organization based on their year 2000 World Health Report (WHR) (see Tandon et al., 2001; Hollingsworth and Wildman, 2002; Greene, 2004). The WHR study is a panel data analysis of health care outcomes in 191 countries for the years 1993–1997. A fixed effects ‘frontier’ model was fit, and countries were ranked on the basis of the Schmidt and Sickles (1984) suggested corrected effects. Readers of the study argued that with a sample as disparate as this one surely is, the ‘fixed effects’ must be picking up a great deal of unmeasured cross country heterogeneity as well as any ‘inefficiency’ in the provision of health care services. One would expect that the confounding of the two effects has the potential seriously to distort the inefficiency measures of interest in the study. Ideally, it is appropriate to model inefficiency and heterogeneity separately in the same model to segregate the two effects. The stochastic frontiers literature that deals with panel data is diffuse (and not particularly verbose) on this issue. Many of the models in common use provide little or no mechanism for disentangling these two effects.1 Most of the received applications have effectively blended these two characteristics in a single term in the model. This paper will examine several alternative forms of the stochastic frontier model that take different approaches to incorporating heterogeneity. Not surprisingly, they produce markedly different results.

Aigner, Lovell and Schmidt (ALS) proposed the normal-half normal stochastic frontier in their pioneering work in 1977. A stream of research over the succeeding 25 years has produced many innovations in the specification and estimation of their model (see Greene, 1997 and Kumbhakar and Lovell, 2000, for recent surveys). Panel data applications have kept pace with other types of developments in the literature. Many of these estimators have been patterned on familiar fixed and random effects formulations of the linear regression model. This paper will examine several alternative approaches to modeling heterogeneity in panel data in the stochastic frontier model. We propose specifications which can isolate firm heterogeneity while better preserving the mechanism in the stochastic frontier model that produces estimates of technical or cost inefficiency.

This study is organized as follows: Section 2 will lay out the basic platform for all of the specifications of the stochastic frontier model. We will be presenting a large number of empirical applications in the text. These are based on a study of the U.S. banking industry. The data set to be used and the specific cost frontier model that will be used are also presented in Section 2. The succeeding sections will formalize and apply three classes of models, fixed effects, random effects and varying parameter models. In each case, unmeasured heterogeneity makes a different appearance in the model. Section 3 considers fixed effects estimation. This section considers two issues, the practical problem of computing the fixed effects estimator, and the bias and inconsistency of the fixed effects estimator due to the incidental parameters problem. A Monte Carlo study based on the panel from the U.S. banking industry is used to study the incidental parameters problem and its influence on inefficiency estimation. Section 4 presents results for random effects models. We first reconsider the familiar random effects model that has already appeared in the literature, observing once again that familiar approaches have forced one ‘effect’ to carry both heterogeneity and inefficiency. We then propose a modification of the random effects model which disentangles these terms. The fixed and random effects models treat heterogeneity as a firm specific additive constant. Section 5 will present two extensions of the model that allow for more general types of variation. This section will include development of a simulation based random parameters estimator that is a more flexible, general specification than the simple random effects model. We then turn to a latent class specification. Section 5 will develop the model, then apply it to the data on the banking industry considered in the preceding two sections. Finally, Bayesian estimators for fixed and random effects and for random parameters specifications have been proposed for the stochastic frontier model. We will also consider some of these specifications in Section 5. Some conclusions are drawn in Section 6.

Section snippets

The stochastic frontier model

The stochastic frontier model may be writtenyit=f(xit,zi)+vit±uit,i=1,,N,t=1,,T=α+βxit+τzi+vit±uit,uit0,where yit is the performance of firm i in period t (output, profits, costs), xit is the vector of inputs or input prices, and zi is a vector of firm specific characteristics. The sign of the last term depends on whether the frontier describes costs (positive) or production or profits (negative). The base case stochastic frontier model as originally proposed by ALS adds the distributional

Fixed effects modeling

Most applications of the fixed effects model in the frontier modeling framework have been based on Schmidt and Sickles's (1984) interpretation of the linear regression model. The basic framework is a linear model,yit=αi+βxit+vit,which can be estimated consistently by ‘within groups’ ordinary least squares (i.e., with dummy variables). The model is reinterpreted by treating αi as the firm specific inefficiency term. The authors suggest that the productive efficiency of the firms in the sample

Random effects models

The random effects specification is likewise motivated by the familiar linear model. It is assumed that the firm specific inefficiency (in proportional terms) is the same every year. Thus, the model becomesyit=α+βxit+vit±ui,where ui and vit, t=1,,T are independent and, moreover, ui is independent of xit. The idiosyncratic term in (17) is specified as N[0,σv2] exactly as before, while the inefficiency term, ui has the original half normal distribution. Note that the inefficiency term is now

Parameter heterogeneity—random parameter models

The preceding has explored several specifications of the stochastic frontier model that effectively treat firm and time specific ‘noise,’ cost inefficiency, and unmeasured heterogeneity all as constants that collectively compose the ‘disturbance’ in yit=α+βxit+wit+vit±uit,which of the first and third components (if either) can reasonably be treated as time invariant has been examined in 3 Fixed effects modeling, 4 Random effects models. We now consider some alternative formulations of the

Conclusions

This paper has examined several forms of the stochastic frontier model that take different approaches to incorporating heterogeneity and, not surprisingly, found that they produce very different results.

We have examined the fixed effects model applied to the stochastic frontier, as opposed to simply reinterpreting the linear regression. Thus, as formulated, the inefficiency term remains in the model and the fixed effect is intended only to capture the firm specific heterogeneity. The fixed

Acknowledgement

This paper has been prepared for the conference on “Current Developments in Productivity and Efficiency Measurement,” University of Georgia, October 25–26, 2002. It has benefited from comments at the North American Productivity Workshop at Union College, June 2002, the Asian Conference on Efficiency and Productivity in July 2002, discussions at University of Leicester and Binghamton University and ongoing conversations with Mike Tsionas, Subal Kumbhakar and Knox Lovell. This revision of the

References (67)

  • M. Pitt et al.

    The measurement and sources of technical inefficiency in Indonesian weaving industry

    Journal of Development Economics

    (1981)
  • R. Stevenson

    Likelihood functions for generalized stochastic frontier functions

    Journal of Econometrics

    (1980)
  • J. van den Broeck et al.

    Stochastic frontier modelsa Bayesian perspective

    Journal of Econometrics

    (1994)
  • M. Arellano

    Discrete choices with panel data

    Investigaciones Economica (Lecture)

    (2000)
  • G. Battese et al.

    Frontier production functions, technical efficiency and panel datawith application to paddy farmers in India

    Journal of Productivity Analysis

    (1992)
  • G. Battese et al.

    A model for technical inefficiency effects in a stochastic frontier production function for panel data

    Empirical Economics

    (1995)
  • P. Bauer et al.

    Efficiency and productivity growth in U.S. banking

  • Bhat, C., 1999. Quasi-random maximum simulated likelihood estimation of the mixed multinomial logit model. Manuscript,...
  • G. Chamberlain

    Analysis of covariance with qualitative data

    Review of Economic Studies

    (1980)
  • G. Chamberlain

    Panel data

  • G. Chamberlain

    Heterogeneity, omitted variable bias, and duration dependence

  • S. Chib et al.

    Understanding the Metropolis–Hastings algorithm

    American Statistician

    (1995)
  • LIMDEP, Version 8.0

    (2002)
  • B. Gong et al.

    Finite sample evidence on the performance of stochastic frontier models using panel data

    Journal of Productivity Analysis

    (1989)
  • C. Gourieroux et al.

    Simulation Based Econometrics

    (1996)
  • W. Greene

    A gamma distributed stochastic frontier model

    Journal of Econometrics

    (1990)
  • W. Greene

    Frontier production functions

  • Greene, W., 2001. Fixed and random effects in nonlinear models. Working paper 01-01, Department of Economics, Stern...
  • Greene, W., 2002. The behavior of the fixed effects estimator in nonlinear models. Working paper 02-05, Department of...
  • W. Greene

    Econometric Analysis

    (2003)
  • W. Greene

    Maximum simulated likelihood estimation of the normal-gamma stochastic frontier function

    Journal of Productivity Analysis

    (2003)
  • Greene, W., 2004. Distinguishing between heterogeneity and inefficiency: stochastic frontier analysis of the World...
  • Griffin, J., Steel, M., 2002. Semiparametric Bayesian inference for stochastic frontiers. Manuscript, Institute of...
  • Cited by (1080)

    View all citing articles on Scopus
    View full text