# Stochastic frontier analysis

Stochastic frontier analysis (SFA) is a method of economic modeling. It has its starting point in the stochastic production frontier models simultaneously introduced by Aigner, Lovell and Schmidt (1977) and Meeusen and Van den Broeck (1977).

The production frontier model without random component can be written as:

$y_{i}=f(x_{i};\beta )\cdot TE_{i}$ the best where yi is the observed scalar output of the producer i, i=1,..I, xi is a vector of N inputs used by the producer i, f(xi, β) is the production frontier, and $\beta$ is a vector of technology parameters to be estimated.

TEi denotes the technical efficiency defined as the ratio of observed output to maximum feasible output. TEi = 1 shows that the i-th firm obtains the maximum feasible output, while TEi < 1 provides a measure of the shortfall of the observed output from maximum feasible output.

A stochastic component that describes random shocks affecting the production process is added. These shocks are not directly attributable to the producer or the underlying technology. These shocks may come from weather changes, economic adversities or plain luck. We denote these effects with $\exp \left\{{v_{i}}\right\}$ . Each producer is facing a different shock, but we assume the shocks are random and they are described by a common distribution.

The stochastic production frontier will become:

$y_{i}=f(x_{i};\beta )\cdot TE_{i}\cdot \exp \left\{{v_{i}}\right\}$ We assume that TEi is also a stochastic variable, with a specific distribution function, common to all producers.

We can also write it as an exponential $TE_{i}=\exp \left\{{-u_{i}}\right\}$ , where ui ≥ 0, since we required TEi ≤ 1. Thus, we obtain the following equation:

$y_{i}=f(x_{i};\beta )\cdot \exp \left\{{-u_{i}}\right\}\cdot \exp \left\{{v_{i}}\right\}$ Now, if we also assume that f(xi, β) takes the log-linear Cobb–Douglas form, the model can be written as:

$\ln y_{i}=\beta _{0}+\sum \limits _{n}{\beta _{n}\ln x_{ni}+v_{i}-u_{i}}$ where vi is the “noise” component, which we will almost always consider as a two-sided normally distributed variable, and ui is the non-negative technical inefficiency component. Together they constitute a compound error term, with a specific distribution to be determined, hence the name of “composed error model” as is often referred.

Stochastic frontier analysis has examined also "cost" and "profit" efficiency (see Kumbhakar & Lovell 2003). The "cost frontier" approach attempts to measure how far from full-cost minimization (i.e. cost-efficiency) is the firm. Modeling-wise, the non-negative cost-inefficiency component is added rather than subtracted in the stochastic specification. "Profit frontier analysis" examines the case where producers are treated as profit-maximizers (both output and inputs should be decided by the firm) and not as cost-minimizers, (where level of output is considered as exogenously given). The specification here is similar with the "production frontier" one.

Stochastic frontier analysis has also been applied in micro data of consumer demand in an attempt to benchmark consumption and segment consumers. In a two-stage approach, a stochastic frontier model is estimated and subsequently deviations from the frontier are regressed on consumer characteristics (Baltas 2005).

## Extensions: The two-tier stochastic frontier model

Polacheck & Yoon (1987) have introduced a three-component error structure, where one non-negative error term is added to, while the other is subtracted from, the zero-mean symmetric random disturbance. This modeling approach attempts to measure the impact of informational inefficiencies (incomplete and imperfect information) on the prices of realized transactions, inefficiencies that in most cases characterize both parties in a transaction (hence the two inefficiency components, to disentangle the two effects).

Recently, various non-parametric and semi-parametric approaches were proposed in the literature, where no parametric assumption on the functional form of production relationship is made, see for example Parmeter and Kumbhakar (2014) and Park, Simar and Zelenyuk (2015)  and references cited therein.