Econometric Theory, 10, 1994, 116-129. Printed in the United States of America.
SYMMETRY, REGRESSION
DESIGN, AND SAMPLING
DISTRIBUTIONS
ANDREW CHESHER
AND
SIMON PETERS
University of Bristol
When values of regressors are symmetrically disposed, many M-estimators in
a wide class of models have a reflection property, namely, that as the signs of
the coefficients on regressors are reversed, their estimators' sampling distribution is reflected about the origin. When the coefficients are zero, sign reversal
can have no effect. So in this case, the sampling distribution of regression coefficient estimators is symmetric about zero, the estimators are median unbiased
and, when moments exist, the estimators are exactly uncorrelated with estimators of other parameters. The result is unusual in that it does not require response variates to have symmetric conditional distributions. It demonstrates
the potential importance of covariate design in determining the distributions
of estimators, and it is useful in designing and interpreting Monte Carlo experiments. The result is illustrated by a Monte Carlo experiment in which maximum likelihood and symmetrically censored least-squares estimators are
calculated for small samples from a censored normal linear regression, Tobit,
model.
1. INTRODUCTION
Since exact distributions of econometric estimators are often hard to derive,
Monte Carlo experiments are frequently used to study the behavior of estimators and the quality of approximations to their sampling distributions.
Since most econometric models involve covariates, it is necessary to specify
covariate designs when a Monte Carlo experiment is conducted. This aspect
of Monte Carlo experimentation is rarely given prominence when experiments are reported, perhaps because it is believed that covariate design has
a relatively minor influence on the relevant properties of estimators.
In fact, covariate design, in conjunction with parameter values, can have
a spectacular effect on the shapes of the exact distributions of estimators
Financial support was provided by ESRC grant number B0OO232150. An earlier version of this paper was
circulated in 1988. We are grateful to Sir David Cox, Brendon McCabe, Peter Phillips, and Richard Smith
and to two anonymous referees for helpful comments.
116
© 1994 Cambridge University Press
0266-4666/94 $5.00 + .00
SYMMETRY AND REGRESSION DESIGNS
117
whose first-order asymptotic approximate distributions have shapes that are
invariant under changes in covariate design. Consequently, Monte Carlo experiments should be designed so as to reveal the impact of covariate design.
Unfortunately, in the great majority of reported experiments, few designs are
studied and in many studies only one covariate design is used. This can severely limit the applicability of the results of these experiments. The results
given here demonstrate the importance of covariate design and provide information concerning the nature of covariate design effects.
The main result of this paper is as follows. Suppose that a covariate design is symmetrically disposed around a central point. Let X be a vector of
covariates and let X = x° denote this central point, which in some cases may
not itself be a point in the design. In a symmetric design, each point in the
design with a nonzero value, X = x - x°, can be matched with another with
the value X = — (x — x°). When a covariate design is symmetric in this
sense, reversal of the signs of regression-type coefficients associated with X
causes the sampling distributions of a wide class of coefficient estimators to
be reflected about the origin. This reflection also occurs when there are other,
asymmetrically disposed covariates, Z, as long as each pair of points with
X = ± (x — x°) is associated with a common value of Z.
In the special case in which regression coefficients are zero, reversing the
signs of regression coefficients can have no effect on the distributions of estimators. Consequently, when designs are symmetric and regression coefficients are zero, the sampling distributions of many estimators are symmetric
about zero and so the estimators are median unbiased. Cases in which regression coefficients are zero are of particular interest because they arise when
considering the null distributions of Wald and score test statistics to detect
omitted regressors.
Asymmetric designs do not necessarily result in asymmetric sampling distributions. For example, the distribution of the least-squares estimator is
symmetric whenever the conditional distribution of the response variate is
symmetric, regardless of the covariate design. However, there are commonly
used models and estimators for which an asymmetric design does cause sampling distributions to be asymmetric and the effect can be dramatic, as the
example of Section 3 involving the maximum likelihood Tobit estimator
shows.
The symmetry condition on the covariate design is restrictive, but very
many Monte Carlo experiments reported in the literature use designs that satisfy the condition. Examples are designs in which covariates' values are chosen to be spaced at equal intervals, or as expected order statistics from
symmetric distributions. The reflection property allows the results of experiments using such covariate designs to be extended to new values of regression parameters. Further, it implies that where a Monte Carlo experiment
based on a symmetric covariate design generates skewed sampling distributions, the skewness can be eliminated by setting regression coefficients to
118
ANDREW CHESHER AND SIMON PETERS
zero, and reversed by reversing their signs. There are other incidental uses
of the result. For example, it provides a useful check on complicated calculations such as those involved in developing asymptotic expansions of certain econometric estimators (see, for example, Chesher, Peters, and Spady
[4]).
The reflection result is stated and proved in Section 2. The result is unusual
because unlike the result concerning the symmetry of the sampling distribution of the seemingly unrelated regression equation estimator described in
Kakwani [8], the many related results described by Andrews [1] and the reflection results given by Cryer, Nankervis, and Savin [6], there is no assumption of symmetry in the distribution of the response variates.
Section 3 illustrates the results of this paper and shows the potential magnitude of design effects. Monte Carlo estimates of the sampling distributions
of maximum likelihood and symmetrically censored least-squares estimators
in a left censored linear regression (Tobit) model are presented. These demonstrate the substantial skewness in finite sample distributions that can be
induced solely by moving from a symmetric to an asymmetric covariate design. They also show that substantial skewness can be induced even when the
covariate design is symmetric merely by moving regression coefficients away
from zero.
The results of this paper concern sampling distributions of estimators conditional on covariate values. They are especially relevant to the design and
analysis of Monte Carlo experiments where it is common to find fixed covariate designs and frequently the symmetric designs studied here. In applied
econometric work, covariate designs are usually not chosen purposively and
symmetric designs are rare. However, even though many researchers will
never work with symmetric designs, the results of this paper are relevant to
them because they show how sensitive the exact finite sample distributions
of commonly used econometric estimators can be to covariate design and parameter values.
There are cases in applied work when covariate values are sampled from
symmetric or nearly symmetric distributions, for example, when an additive
central limit theorem applies to the process generating the covariates. Then
a result analogous to the one given here can be useful (see Chesher [3]),
namely, that reversal of the sign of regression coefficients causes sampling
distributions of their estimators marginal with respect to the covariates to be
reflected around the origin. This result is also relevant to the interpretation
of Monte Carlo experiments in which covariate values are sampled anew at
each replication.
2. THE REFLECTION PROPERTY
First, a class of covariate designs is defined. Then a class of models for the
conditional distribution of a response variate given values of covariates is in-
SYMMETRY AND REGRESSION DESIGNS
119
troduced and a class of estimators is described. Finally, the reflection property is stated and proved.
2 . 1 . Covariate Designs
Two types of covariate vectors are distinguished. One type, X, has a symmetric design, reflected about a central point, X = x°. The other, Z, has a
replicated design, taking identical values as the X covariates are reflected.
This means that the n covariate values can be labeled in such a way that
Xj = - x n + i _ / ,
Zi = Zn+\-i,
i=l,...,[n]/2,
where xt = xt — x°, and [n] = n + 1 if n is odd and [n] = n if n is even. In
some applications, the covariates Z will be absent, or constant and equal to 1.
Two types of parameters are distinguished: those attached to the symmetrically disposed covariates, elements of a matrix, /3, conformable with X; and
the remaining parameters, elements of a matrix, 5, which may be associated
with other covariates or appear as "nuisance parameters," perhaps indexing
scale or distributional shape.
2.2. The Distribution of Response Variates
The response variates associated with the n points in the covariate design are
denoted by Y\,..., Yn. They may be vector valued. They are assumed to be
mutually independently distributed given the vectors of values of covariates
X\,... ,xn, and Z\,... ,zn, with proper conditional distribution functions:
Fi{yi\Xi,Zi,Xjl3,S), i = 1,... ,n. As before, Jc, = Xj — x° is the /th design
point expressed as a deviation from the center point for the design. The distribution functions may depend upon the center point of the design, x°, but
this is not made explicit in the notation.
It is essential that /3 appear only in the distribution functions through the
matrix product (x - x°)& = jc/8. The symmetrically disposed covariates X
may influence the distribution function in other ways, but if they do then the
distribution functions must be even functions of x in the sense that
Fi(y\x,z,c,5)
= Fj(y\ -x,z,c,8)
for all /, y, x, z, c, and 5,
where c is a potential value of x/3.
In many cases, the distribution functions will not vary with /, but if they
do, then for all values of their arguments, they must satisfy
Fi(y\x,z,x0,5)
=Fn+l_i(y\x,z,xt3,8),
i= 1,... ,[/»]/2.
The assumptions set out above encompass a very wide class of models.
The response variates, Yh can be discrete, continuous, or mixed, so models
for censored and grouped data are included. Since the y;'s can be vector
valued, multivariate models such as econometric simultaneous equations
120
ANDREW CHESHER AND SIMON PETERS
models are included. There are no restrictions on the way in which a subset
of the covariates, Z, affects the response variate. The covariates X have to
affect the response variate through {X — xo)l3, but they can also have other
effects. For example, censored heteroskedastic regression models with Y* =
X& + u, Y = max(y*,0), var(M|.^ = *) = g(x,8), are included as long as
g(x,5) is an even function of x — x°. Many models involving covariates that
appear in the econometric and statistical literature are contained in the class
of models defined by the assumptions set out above.
2.3. A Class of Estimators
We consider M-estimators (Huber [7]), $ and <5, which are unique solutions
to the estimating equations:
n
Ti tj,i(yi,Xi,Zi,Xj(3,S)
=o,
J=I,...,J,
where J is the total number of parameters. The estimating equations may also
depend on x°, but this is not made explicit in the notation.
In many cases, the functions \pjj will not vary with /, but if they do, then
for all values of their arguments, they must satisfy
tj,i(y,x,z,xP,8)
=il/j,n+{-i(y,x,z,xP,8),
i= 1, ..
In many cases, the estimating equations will depend on the symmetrically disposed covariates only through Xj(i. If the covariates have other influences,
then for some set of fixed nonzero constants, X,,... ,\j, and for ally, y, x,
z, c, and 8, where c is a potential value of xj3, the following condition must
hold.
If in the conditional distribution functions, /3 does appear only in conjunction with x, as required above, then for many estimators — maximum likelihood estimators, for example—j8 will necessarily appear in the estimating
equations only in conjunction with x.
A very wide class of estimators is encompassed by these assumptions. It
contains many Af-estimators in well- and misspecified models, including maximum likelihood estimators, linear and nonlinear two- and three-stage leastsquares estimators, and least absolute deviation estimators, and it includes
semiparametric estimators like Cox's [5] estimator for proportional hazard
models and Powell's [10] symmetrically trimmed and symmetrically censored
least-squares estimators.
The reflection property is given in the following theorem.
SYMMETRY AND REGRESSION DESIGNS
121
THEOREM. Under the assumptions set out above, the sampling distribution of the estimator $ when (3 = —b is a reflection around the origin of the
sampling distribution of 0 when (3 = +b and the sampling distribution of
8 is invariant undersign changes in /3, in the sense that for all values of/3 and
8 and sets of matrix pairs, A, conditional on the values of the covariates:
A proof of the theorem follows. Throughout, probabilities are conditional
on the values taken by the covariates.
Proof. Let s denote a realization of Yu ..., Yn and let $(s) and 8(s) denote the solution to the estimating equations at s. Define the operator C(s)
which changes a realization, s, by interchanging the /th and (/? + 1 — /)th
values of y, i = 1,... ,[n]/2. When the operator C(-) is applied to a set of
realizations, it acts as just described on each member of the set. We first show
that 0(s) = -${C(s)) and 8(s) = 8(C(s)).
Write the estimating equations at the realization s in the following manner, which embodies the symmetry property of the covariate design.
\n-\\n
£
4>j.i(yi,XhZi,Xi$(s),8(s))
[n-lJ/2
+
2
+jAyn+i-i,-Xi,zi,-x,&(s)Ms))=o,
j=i,...,j.
a)
The second term appears only if the sample size is odd.
At the associated realization C(5), the estimating equations are
[n-lJ/2
[]
+
2
<PJ,i{yi,-xi,zi,-Xi0(C(s)),8(C(s)))=O,
j=\
J.
(2)
The assumptions concerning the estimating equations ensure that (2) is solved
at /3(C(s)) = -$(s) and S(C(s)) = 8(s) because if these values are substituted in (2) then it resembles (1) except that the order of the two summations
is reversed and there will be innocuous scale factors present if there are factors Xj which are not equal to one.
Let A+ and / 1 _ be the sets of realizations of Ylt.. .,Yn, for which, respectively, {13,8} and (— 0,8} fall in the set of matrix pairs, A. The argument
122
ANDREW CHESHER AND SIMON PETERS
above implies that A- = C(A+). The assumption concerning the conditional distribution function of the response variates ensures that for any set
of realizations, say Z,
P[Z\0 = +b, 8 = d] = P[C(Z)|/3 = -b, 8 = d],
since interchanging yt and yn+l-j leaves the values taken by the distribution
functions F{ unchanged once x'b is replaced by x'(—b). In particular,
P[A+\P = +b,8 = d] = P[C(A+)\0
= -b,8 = d],
which expressed in terms of J3 and 8 is
P[{0,8}
<=A\l3 = +b, 8 = d] = P[{-0,8]
€ A\0
= -b,
8 = d].
•
3. DISCUSSION AND ILLUSTRATION
The theorem has some interesting implications. For example, it implies that
when j3 = 0, P[/3 6 5] = P[-$ E B] for all sets of matrices, B, so that
in symmetric designs, $ is symmetrically distributed around zero when j8 is
in fact zero. Another implication is that when 0 = 0, P[(J3)J'8 < a] =
P[(—$)J8 < a], so that for odd values ofy, the distribution of J3J8 is symmetric about zero. Setting j equal to 1 it follows that when /3 = 0, the covariance of $ and 8 is zero if it exists.
The implications of the theorem are well illustrated using a Monte Carlo
experiment described by Powell [10] who considers a censored regression
model with covariates xx and x2, in which
y = max(y*,0).
In the experiment, /30 = 0, 0, = 1, /32 = 0, the values of x, are equally spaced
in an interval [— q,q] chosen so that the variance of xt over the design is 1,
and values of x2 alternate between —1 and 1 as Xi increases. Powell [10] performs 201 Monte Carlo replications, in each one simulating 200 realizations
of Fusing pseudorandom i.i.d. N(0,1) errors and computes maximum likelihood (ML) and symmetrically censored least-squares (SCLS) estimates.
With this number of replications, it is not possible to measure with accuracy
the departure of sampling distributions from symmetry. So two larger, 5000replication, Monte Carlo experiments were conducted. In one, ML estimates
for samples of size 20 were computed. In the other, SCLS estimates in samples of size 200 were computed. The sample size was reduced to 20 for experiments involving the ML estimator so that departures from symmetry
would not be masked by the operation of the central limit theorem. This was
not necessary in the case of the SCLS estimator, which is difficult to compute in samples this small. In all other respects, the experiments followed
Powell's [10] design. The results are summarized in Table I.1
SYMMETRY AND REGRESSION DESIGNS
123
1. Summary statistics, 5000 Monte Carlo replications, two
covariate censored normal regression model with 50% censoring
TABLE
SCLS Estimators
(Sample Size 200)
ML. Estimators
(Sample Size 20)
Estimators of:
Coefficient values
00
0
0i
1
02
0
Median
Mean (m{)
Std deviation {m\n)
Asy std deviation
Skewness (m}/m2/2)
.0072
-.0354
.3606
.3145
-1.27
(.128)
Kurtosis (m^/m^ — 3)
4.18
(1.03)
.9969
-.0036
1.0322 -.0069
.3471
.2724
.3130
.2624
0.89
-0.05
(.066)
(.069)
1.89
0.77
(-256)
(.295)
Correlations: @0
-.55
0i
-.55
ft
.01
-.06
2
-.40
.26
a
00
0
-.0391
-.1707
.5162
-3.61
(.355)
23.75
(5.01)
.01
-.06
0,
1
.0017
1.0368
1.1234 -.0005
.3979
.1316
2.64
(.252)
13.86
(2.82)
-.96
-.96
.02
02
0
0.00
(.101)
1.94
(.410)
.02
-.03
-.03
-.02
rrtj is the /th central moment across 5000 replications. "Asy std deviations" arc n l/2 times the asymptotic
standard deviations of nin (ft — ft) developed from the information matrix under the assumption that the
covariate design is replicated as the sample size, n, increases, a2 is the ML estimator of the error variance.
Figures in parentheses are jackknife estimates of the standard errors of the skewness and kurtosis measures.
The model and estimators satisfy the assumptions of the theorem and the
covariate design is symmetric about xx = 0, x2 = 0. Let the estimators, ML
or SCLS, be denoted by ${ and $2. From the theorem, it follows that if 0t
were —1, then the joint sampling distributions of $t and $2 would be reflections around zero of those studied by Powell for which 0t = +1 and /32 =
0, while the sampling distributions of the intercept estimators, j30, which can
be thought of as being associated with a replicated "covariate" always equal
to 1, would be unchanged. If /?! were zero, then the joint sampling distributions of /3j and /32 would be symmetric. So the extent to which these distributions deviate from symmetry shows the amount of skewness that is caused
solely by jSi deviating from zero.
Even though /32 is nonzero, the Monte Carlo experiment also illustrates
the symmetry result given at the beginning of this section. This is because successive alternating values of the symmetrically disposed covariate x2 are associated with almost identical values of xy so that values associated with
x2 = +1 are close to those associated with x2 = —1. Consequently, the covariate xx is almost in the category of replicated covariates, Z, defined ear-
124
ANDREW CHESHER AND SIMON PETERS
Her, and the theorem leads one to expect that the sampling distributions of
estimators of /32 will be close to symmetric.
The skewness coefficients and the relative magnitudes of means and medians shown in Table 12 indicate that the distributions of estimators of /30
and /3i are, respectively, negatively and positively skewed while the distributions of estimators of I32 show negligible skewness. Figures in parentheses
are jackknife estimates of the standard errors of the estimates of the standardized cumulants. The cumulant estimates are quite variable despite the
large scale of this experiment, but it is quite clear that the variations in the
values of the skewness measures do reflect real differences in the shapes of
the sampling distributions of the alternative estimators. The correlations between estimators of /32 and estimators of the other parameters are very
small.
Figure 1 shows quantile-quantile (QQ) plots of the ML and SCLS estimates
of 0! and j32, relocated and scaled so that over the 5000 replications, the linearly transformed values of each estimator have zero mean and unit variance.
The graphs are constructed by plotting quantiles of the Monte Carlo replicates against corresponding quantiles from the standard normal distribution,
which is the first-order asymptotic approximation to these estimators' sampling distributions.
The asymmetry in the sampling distributions of the estimators of /3j is
very obvious. It would not be present were the true value of (3, to be zero.
The finite sample distributions of the estimators of 02 deviate very little
from symmetry. The distribution of the ML estimator of /32 is slightly long
tailed but it is remarkably close to a normal distribution given that the sample size is only 20 and that on average, 50% of the realizations are censored.3 Even with a sample size of 200, the normal approximation to the
distribution of the SCLS estimator is extremely poor. The estimator is very
long tailed. The skewness induced by /S, being nonzero is much greater for
this estimator than for the ML estimator in much smaller samples.
Departures from symmetry can be isolated from other aspects of distributional shape and are very clearly revealed in the "symmetry plots" shown in
Figures 2(a) and 2(b). These show quantiles of, respectively, ML and SCLS
estimators of /3] and j32 expressed in standard deviation units as absolute deviations from medians, values associated with quantiles above the median
plotted against values associated with corresponding quantiles below the median. Let /3/ y) , j = 1,... ,N be the values of ft obtained in N Monte Carlo
replications, expressed in standard deviation units and arranged in ascending order, and let $M be the median value obtained. The symmetry plot is
generated by plotting points with coordinates
A trail of points close to the 45° line indicates an almost symmetric distribution. Paths, respectively, above or below the 45° line indicate, respectively,
SYMMETRY AND REGRESSION DESIGNS
125
-4
standard normal quantiles
estimates expressed
in standard deviation units
(a)
standard normal quantiles
estimates expressed
in standard deviation units
(b)
3
standard normal quantiles
estimates expressed
in standard deviation units
(c)
standard normal quantiles
estimates expressed
in standard deviation units
(d)
FIGURE 1. (a) ML estimator of, ,. (b) ML estimator of /3 2 . (c) SCLS estimator of
j8,. (d) SCLS estimator of /3 2 .
positively or negatively skewed distributions. The positive skewness in the distributions of the estimators of ^ is very obvious in Figure 2(a). A tiny
amount of skewness is detectable in the distributions of the estimators of 02
shown in Figure 2(b). It arises because the design for xx is not exactly replicated across the +1 and —1 points in the x2 design.
So far only a symmetric design has been studied. All asymmetry in sampling distributions has arisen because parameter values deviate from zero.
126
ANDREW CHESHER AND SIMON PETERS
(a)
0.5
1.0
1.5
2.0
2.5
3.0
deviations of lower quantiles from medians
deviations expressed in standard deviation units
(b)
.§
O
S 5-
0.0
0.5
1.0
1.5
deviations of lower quantiles from medians
deviations expressed in standard deviation units
2. (a) Deviations of quantiles from medians: ML. (b) Deviations of quantiles from medians: SCLS.
FIGURE
SYMMETRY AND REGRESSION DESIGNS
127
What is the effect of altering the design so that it is asymmetric? To answer
this question, the Monte Carlo experiment involving the ML estimator was
performed again with just one change, namely, that the design for the binary
covariate x2 was altered by moving a single point so that the design became
asymmetric. Specifically, the value of x2 corresponding to the lowest value
of Xi was changed from - 1 to -10. Even though the true value of j32 is
zero, this has a dramatic effect, clearly revealed in the symmetry plot shown
in Figure 3. The trail of points labeled "Asymmetric X2" arises when this
asymmetric design is used. There is clearly very substantial positive skewness.
The design for x2 can be brought back to symmetry by pushing the x2
value corresponding to the second lowest value of Xi to +10. The result is
the trail of points in Figure 3 labeled "Symmetric X2." Restoring symme-
q
CO
1/5
ASYMMETRIC X2
CO
c
03
'•3
O
c\i
§•
co
,_;
••s
2.0
2.5
3.0
deviations of lower quantiles from medians
deviations expressed in standard deviation units
FIGURE
3. Deviations of quantiles from medians: ML x2 design perturbed.
128
ANDREW CHESHER AND SIMON PETERS
try to the design for x2 restores approximate symmetry to the sampling distribution of the ML estimator of /32. Again, some slight skewness remains
because the xt design is not exactly replicated across the positive and negative points in the x2 design.
4. CONCLUDING REMARKS
It is very common to find symmetric covariate designs in published Monte
Carlo studies. For example, Moolgavkar and Venzon [9] report the results
of Monte Carlo experiments examining Cox's [5] estimator for proportionate hazard models with linear relative risk, 1 + /3z, depending on a single covariate z. In one set of experiments, the values of the covariate are expected
uniform order statistics; in another, expected normal order statistics—in each
case rescaled to span the interval [0,1]. The model, estimators, and designs
satisfy the assumptions of the theorem, so the reflection result applies, and
when /3 = 0 so that the covariate is ineffective, the sampling distribution of
the Cox estimator of & must be symmetric. Moolgavkar and Venzon's results
with |8 = 0 do suggest a symmetric sampling distribution. They report a mean
of 0.04, a median of -0.02, and a standard deviation of 0.38 over 1000 replications with a sample size of 100 at each replication. They remark that "distributional properties appear to be worse with increasing true value of the
parameter /3" (Moolgavkar and Venzon [9], p. 47). It is evident from their
graphs that increasing skewness is the problem, as we would expect, given
the results of this paper.
It is clear that values taken by covariates can have a major influence on
finite sample properties of estimators. The point has been made before by,
for example, Box and Watson [2] and Weisberg [12], yet it is rare to find
Monte Carlo experiments that pay adequate attention to covariate design.
Many reported Monte Carlo studies that give the impression that first-order
asymptotic approximations perform well can in fact only be regarded as
showing that there are covariate designs (namely, those studied) in which the
approximations are adequate. Unfortunately, covariate design is an unwieldy
factor to vary in a Monte Carlo experiment, and a view concerning the range
and types of designs that are relevant is essential when designing a Monte
Carlo study. The results of this paper can aid the choice of appropriate designs to examine.
NOTES
1. Normal pseudorandom numbers were obtained by applying Press et al.'s [11] version of
Marsaglia's polar method to uniform pseudorandom numbers obtained with Wichman and Hill's
[13] portable generator. Maximum likelihood estimates were calculated using the method
of scoring with, as starting points, least-squares estimates obtained from uncensored data.
Calculations were performed in double precision arithmetic on a Sun SPARCstation 2 running
SUN-OS 4.1.2.
SYMMETRY AND REGRESSION DESIGNS
129
2. It is possible for censoring to create configurations of realizations for which the ML or
the SCLS estimators of one or both coefficients are unbounded or indeterminate. For example, if all realizations associated with x2 = — 1 are censored but there are sufficient uncensored
realizations at x2 = + 1 , then the Tobit likelihood function is maximized at /32 = +°°. In Powell's design with a sample size of 20, such configurations are quite rare. In 5000 Monte Carlo
replications with a sample size of 20, the probability of finding no configurations of realizations leading to unbounded or indeterminate ML estimators is around 0.5. The corresponding
probability for samples of size 200 is very close to 1. In fact, no configurations leading to indeterminate or unbounded estimators arose in the two experiments reported in Table 1. However, the figures reported there should be interpreted as applying to the sampling distributions
of estimators conditional on their values being determinate and finite.
3. Symmetry is not the only feature of covariate design that influences the quality of firstorder asymptotic approximations. The covariate x2 is almost uncorrelated with X\ and has a balanced design. If leverage points are introduced into the design, then the ML estimator of 0 2 can
exhibit very substantial supernormal kurtosis though if the design is symmetric, it remains almost symmetrically distributed.
REFERENCES
1. Andrews, D.W.K. A note on the unbiasedness of feasible GLS, quasimaximum likelihood,
robust, adaptive and spectral estimators of the linear model. Econometrica 54 (1986):
687-698.
2. Box, G.E.P. & G.S. Watson. Robustness to non-normality of regression tests. Biometrika
49 (1962): 93-106.
3. Chesher, A.D. A reflection property of M estimators. Discussion Paper No. 90/282, Department of Economics, University of Bristol, 1990.
4. Chesher, A.D., S. Peters & R. Spady. Approximations to the distributions of heterogeneity tests in the censored normal linear regression model. Discussion Paper No. 89/240, Department of Economics, University of Bristol, 1989.
5. Cox, D.R. Regression models and life tables. Journal of the Royal Statistical Society, Series B 34 (1972): 187-220.
6. Cryer, J.D., J.C. Nankervis & N.E. Savin. Mirror image and invariant distributions in
ARMA models. Econometric Theory 5 (1989): 36-52.
7. Huber, P.J. Robust Statistical Procedures. Philadelphia: Society for Industrial and Applied
Mathematics, 1977.
8. Kakwani, N.C. The unbiasedness of Zellner's seemingly unrelated regression equation estimators. Journal of the American Statistical Association 62 (1967): 141-142.
9. Moolgavkar, S.H. & D.J. Venzon. Confidence regions for parameters of the proportionate hazard model: A simulation study. Scandinavian Journal of Statistics 14 (1987): 43-56.
10. Powell, J.L. Symmetrically trimmed least squares estimation for Tobit models. Econometrica 54 (1986): 1435-1460.
11. Press, W.H., B.P. Flannery, S.A. Teukolsky & W.T. Vetterling. Numerical Recipes: The Art
of Scientific Computing. Cambridge: Cambridge University Press, 1986.
12. Weisberg, S. Comment on "Some large sample tests for non-normality in the linear regression model" by H. White and G.M. MacDonald. Journal of the American Statistical Association 75 (1980): 28-31.
13. Wichman, B.A. & I.D. Hill. Algorithm AS183: An efficient portable pseudo-random number generator. Applied Statistics 31 (1982): 188-190.
View publication stats