Academia.eduAcademia.edu

An improved ranked set two-sample mann-whitney-wilcoxon test

2000, Canadian Journal of Statistics

The authors present an improved ranked set two-sample Mann-Whitney-Wilcoxon test for a location shift between samples from two distributions F and G. They define a function that measures the amount of information provided by each observation from the two samples, given the actual joint ranking of all the units in a set. This information function is used as a guide for improving the Pitman efficacy of the Mann-Whitney-Wilcoxon test. When the underlying distributions are symmetric, observations at their mode(s) must be quantified in order to gain efficiency. Analogous results are provided for asymmetric distributions. RÉSUMÉ Les auteurs décrivent une adaptation de la statistique de Mann-Whitney-Wilcoxon permettant de tester si deux lois F et G sont identiquesà un paramètre de localisation près, lorsque chacun des deuxéchantillons est composé de certaines statistiques d'ordre d'un ensemble aléatoire de données classées subjectivement. Ils définissent aussi la quantité d'information associéeà chaque observation des deux lois,étant donné les classements subjectifs préalables. Ils montrent alors qu'un choix judicieux du sous-ensemble des données qui seront mesurées objectivement permet d'accroître l'efficacité du test au sens de Pitman. Lorsque les lois sous-jacentes sont symétriques, c'est en incluant leur(s) mode(s) dans ce sous-ensemble que l'on réalise des gains. Le cas des lois asymétriques est aussi traité.

The Canadian Journal of Statistics Vol. 28, No. 2, 2000, Pages ???-??? La revue canadienne de statistique An improved ranked set two-sample Mann-Whitney-Wilcoxon test Ömer ÖZTÜRK and Douglas A. WOLFE Key words and phrases: Design, unequal allocation, Pitman efficiency, nonparametric testing, mode, information, Mann-Whitney-Wilcoxon test. Mathematics subject classification codes (1991): 62G30; 62H10. ABSTRACT The authors present an improved ranked set two-sample Mann-Whitney-Wilcoxon test for a location shift between samples from two distributions F and G. They define a function that measures the amount of information provided by each observation from the two samples, given the actual joint ranking of all the units in a set. This information function is used as a guide for improving the Pitman efficacy of the Mann-Whitney-Wilcoxon test. When the underlying distributions are symmetric, observations at their mode(s) must be quantified in order to gain efficiency. Analogous results are provided for asymmetric distributions. RÉSUMÉ Les auteurs décrivent une adaptation de la statistique de Mann-Whitney-Wilcoxon permettant de tester si deux lois F et G sont identiques à un paramètre de localisation près, lorsque chacun des deux échantillons est composé de certaines statistiques d’ordre d’un ensemble aléatoire de données classées subjectivement. Ils définissent aussi la quantité d’information associée à chaque observation des deux lois, étant donné les classements subjectifs préalables. Ils montrent alors qu’un choix judicieux du sous-ensemble des données qui seront mesurées objectivement permet d’accroı̂tre l’efficacité du test au sens de Pitman. Lorsque les lois sous-jacentes sont symétriques, c’est en incluant leur(s) mode(s) dans ce sous-ensemble que l’on réalise des gains. Le cas des lois asymétriques est aussi traité. 1. INTRODUCTION In situations where measurements are costly and/or difficult to obtain but ranking of the potential sample data is relatively easy, the use of statistical methods based on a ranked set sampling (RSS) approach can lead to substantial improvement over analogous methods associated with simple random sampling (SRS) schemes. This ranked set sampling approach has attracted considerable attention 1 in the recent literature, with principal initial interest being driven by environmental and agricultural issues, where it is clear that pre-sampling judgement rankings can be quite inexpensive relative to the cost of detailed measurement of many quantities of interest. For example, preliminary supporting data can be easily obtained from a contaminated site and analyzed both quickly and inexpensively before final decisions are reached as to where and how to obtain the specific measurement(s) of interest, such as lead or hazardous material content of the site. A similar approach can be taken when performing standard gasoline octane checks at stations, where preliminary screening at the gasoline pumps can lead to an improved set of ranked set gasoline samples to carry back to the laboratory for more detailed (and costly) analyses. Agricultural settings where such an approach can be used effectively include predicting crop yields or lumber content via preliminary non-destructive measurements (satellite observations or concomitant variables, for example). In RSS, m2 units are drawn from an infinite population. These units are partitioned into m sets each having m units. The units in each set are then judgement ranked via some means other than actual measurement. From the ith set, i = 1, . . . , m, the observation having the ith judgement rank is quantified while other observations are returned to the population. This process is called a cycle in ranked set sampling. The random sample resulting from a cycle contains m independent order statistics. In practice, this entire process is repeated for n cycles to yield n × m quantified observations. Two-sample ranked set samples can be constructed in a way similar to that for the one-sample case. For the two-sample setting, we quantify n × q order statistics, X(i)j , i = 1, . . . , q, j = 1, . . . , n, from the X−sample distribution F and m × p order statistics, Y(i)j , i = 1, . . . , p, j = 1, . . . , m, from the Y −sample distribution G, where F and G are arbitrary, continuous probability models. Throughout the paper, the observations X(i)j , i = 1, . . . , q, j = 1, . . . , n, and Y(r)ℓ , r = 1, . . . , p, ℓ = 1, . . . , m, will be called two-sample standard ranked set samples (SRSS). Most of the initial research efforts in ranked set sampling have concentrated on parametric and nonparametric estimation and testing procedures for the oneand two-sample settings. See, for example, McIntyre (1952), Takahasi & Wakimoto (1968), Dell & Clutter (1972), Halls & Dell (1966), Stokes (1977), Stokes & Sager (1988), Bohn & Wolfe (1992, 1994), Hettmansperger (1995), Bohn (1998), Koti & Babu (1996), and Öztürk (1999a,b). Recently, several researchers have expressed interest in the appropriate allocation of order statistics within a ranked set sample. Kaur et al. (1997) studied the effects of unequal allocation on estimation of the population mean when the underlying distribution is skewed or symmetric. They provided “near” optimality results based on skewness, kurtosis and/or the coefficient of variation. In the parametric set up, Bhoj (1997) constructed an unbiased estimator of the population mean by using a linear combination of the order statistics from the same set in a ranked set sampling environment. Öztürk (1999a) and Öztürk & Wolfe (1998a,b, 1999) introduced a design concept for determining the appropriate allocation of the order statistics for the sign and signed rank test statistics. They showed that for all symmetric distributions the best design among all possible allocation procedures is the one that quantifies only the middle observation(s). Recently, Öztürk & Wolfe (1998a) developed an information criterion to improve the asymptotic Pitman efficiency of a one-sample nonparametric test. In this paper, we extend the same idea to settings where the two-sample ranked set sample MannWhitney-Wilcoxon test is appropriate. This extension to the two-sample problem is not trivial since the class of all possible allocation procedures is very rich and the 2 Pitman efficacy factor for the test is not analytically available for maximization. Therefore, we adopt a two-step procedure to improve the ranked set sample MannWhitney-Wilcoxon test. In Section 2, as a first step we provide the asymptotic Pitman efficiency of the ranked set sample Mann-Whitney-Wilcoxon test for an arbitrary design. We show that the expression of the Pitman efficiency does not permit an analytical maximization with respect to the choice of ranked set sampling design. In order to provide a feasible search algorithm, Section 3 introduces a minimum variance optimality criterion for allocation of the order statistics in a two-sample ranked set sample and construction of minimum variance optimal designs that minimize this criterion function. It is shown that the minimum variance optimal designs are functions only of the modes of the underlying distributions. In Section 4, we use this optimality criterion to aid our search for an improved ranked set two-sample Mann-Whitney-Wilcoxon test. We numerically maximize the Pitman efficiency by using the variance-optimality criterion as a guide. This is a manageable task since the variance-optimal guided search produces a much smaller class than the class of all possible designs. For example, using the variance-optimality criterion for unimodal distributions, the Pitman efficiency need only be numerically maximized within the class of all designs that contain single order statistics from each of the X- and Y -sample distributions. This restricts our search to only pq different designs. All proofs are provided in appendix. 2. RANKED SET SAMPLE MANN-WHITNEY-WILCOXON TEST Let Dts = {d1 , . . . , dt ; b1 , . . . , bs }, t ∈ {1, . . . , q}, s ∈ {1, . . . , p}, be the set of integers that contains the ranks of the observations to be quantified in each cycle from the X− and Y −sample distributions, respectively. For example, if D = {1, 3, 5; 1, 5} and the set sizes are p = 5 and q = 5, we would quantify the two extreme and one middle observations in the X−sample and the two extremes in the Y −sample in each cycle. Throughout this paper the set Dts will be called the design and we reserve DSRSS = {1, . . . , q; 1, . . . , p} for the standard ranked set sample. In order to preserve the U -statistic nature of the Mann-Whitney-Wilcoxon statistic, we quantify equal numbers of observations at each design point (n for the X-sample and m for the Y - sample). Thus, our two-sample ranked set sample is X(di )j , j = 1, . . . , n; i = 1, . . . , t, and Y(bu )v , v = 1, . . . , m; u = 1, . . . , s. For each j = 1, . . . , n and v = 1, . . . , m, let X j = (X(d1 )j , . . . , X(dt )j ) and Y v = (Y(b1 )v , . . . , Y(bs )v ). Then the two-sample ranked set sample with design Dts is given by X j , j = 1, . . . , n and Y v , v = 1, . . . , m. Now let   q−1 F (x)di −1 {1 − F (x)}q−di f (x)dx f(di ) (x) = q di − 1 and g(bi ) (y) = p   p−1 G(y)bi −1 {1 − G(y)}p−bi g(y)dy. bi − 1 Then it follows that X 1 , . . . , X n are iid random vectors from the joint distribution h1 (x) = t Y f(di ) (xi ). i=1 3 Similarly, Y 1 , . . . , Y m are iid random vectors from the joint distribution h2 (y) = s Y g(bi ) (yi ). i=1 Moreover, the X’s and Y ’s are mutually independent. Let U (Dts ) = s m X t X n X X j=1 i=1 v=1 u=1 δ(X(di )j − Y(bu )v ), where δ(z) = 0, 1 according as z > 0 or z ≤ 0. In this equation, U (Dts ) is a multivariate two-sample U -statistic estimator of degree (1, 1) on the random vectors X i , Y j , i = 1, . . . , n and j = 1, . . . , m, for the parameter s t X X γ(Dts ) = i=1 u=1 P (X(di )1 − Y(bu )1 ≤ 0). The associated kernel function for this U -statistic estimator is wDts (X 1 , Y 1 ) = s t X X i=1 u=1 δ(X(di )1 − Y(bu )1 ). Let G(y) = F (y − ∆) with −∞ < ∆ < ∞. We now use U (Dts ) to test H0 : ∆ = 0 in favour of H1 : ∆ > 0 by rejecting H0 for large values of U (Dts ). We first look at the null distribution of U (Dts ). √ Theorem 1. Let K = n + m, ǫ = limK→∞ n/K and σ02 (Dts ) = Var{ KU (Dts )/ (nm)}. If 0 < ǫ < 1 and σ02 (Dts ) > 0, then under the null hypothesis ∆ = 0, √ K{Ū (Dts ) − γ(Dts )} has a normal distribution with mean zero and variance σ02 (Dts ), where Ū (Dts ) = U (Dts )/(nm), ξ1,0 (Dts ) ξ0,1 (Dts ) + , ǫ 1−ǫ σ02 (Dts ) = ξ1,0 (Dts ) = τ1 (Dts ) + τ2 (Dts ), ξ0,1 (Dts ) = τ3 (Dts ) + τ4 (Dts ), with τ1 (Dts ) = s t X X i=1 u=1 τ2 (Dts ) =  bX u −1 u −1 bX   j=0 v=0 t XX X i=1 −q 2  q  p p q−1 di −1 j (j + v + di )  v  2p+q j+v+di  q q − 1 1≤r6=ℓ≤s di − 1 q−1 di − 12  bX r −1 bX ℓ −1 j=0 u=0 −  u −1 bX  j=0 p j  bX r −1 bX ℓ −1 j=0  p u 2p+q (j + u + di ) j+u+d u=0 i    p j p u (j + di )(u + di ) 4   p 2  q  j  p+q   , (j + di ) j+d i q−1 di −1 p+q j+di  p+q u+di  ,   q q s t X X X X τ3 (Dts ) =  i=1 u=1 j=di v=di p p−1 bu −1  q q  j (j + v + bu ) and τ4 (Dts ) = v  2q+p j+v+bu −  q X  j=di   q  2   j  p+q   (j + bu ) j+b p p−1 bu −1 u    q q q q  p−1 X X c h  p 2q+p 1≤i6=j≤t  bu − 1 (c + h + bu ) c+h+b u=1 c=di h=dj u     X q q q q  X p−1 c  h   −p2 . p+q  bu − 12 (j + b )(h + b ) p+q s XX X c=di h=dj u u j+bu h+bu This asymptotic result allows us to carry out an approximate size α test for any arbitrary design Dts . In order to compare the performance of the design Dts and the standard two-sample ranked set sample analogue of the Mann-WhitneyWilcoxon test studied by Bohn & Wolfe (1992, 1994), with design DSRSS , we investigate the Pitman asymptotic relative efficiency, defined as ARE(Dts ; DSRSS ) = ef f (Dts ) , ef f (DSRSS ) where ef f (Dts ) is the efficacy factor for the Mann-Whitney-Wilcoxon test statistic U (Dts ) based on the two-sample ranked set sampling design Dts . The efficacy factor ef f (Dts ) is given by {µ′Dts (0)}2 2 (D ) , N →∞ σ0 ts ef f (Dts ) = lim (1) ∂ where µ′Dts (0) = ∂∆ E∆ Ū (Dts )|∆=0 . Care needs to be taken in order to have a meaningful comparison of the designs Dts and DSRSS . In the derivation of the asymptotic results for Ū (Dts ), the design sizes t and s are kept fixed, while the cycle sizes (m and n) go to infinity. Therefore, the efficiency comparison between designs Dts and DSRSS makes sense only if (t, s) and (q, p) are comparable. Let t∗ = q/t and s∗ = p/s and assume that t∗ and s∗ are integers. Now we consider a two-sample ranked set sample generated by design Dts in such a way that X di = {X(di )1 , . . . , X(di )t∗ }, i = 1, . . . , t, is Qt∗ a t∗ -dimensional random variable from the joint distribution j=1 F(di ) (xj ) and Y bi = {Y(bi )1 , . . . , Y(bi )s∗ }, i = 1, . . . , s, is an s∗ -dimensional random variable from Q∗ ∗ be the design for this multivariate the joint distribution sj=1 F(bi ) (xj −∆). Let Dts setting. ∗ , the Pitman efficacy is Theorem 2.For fixed, but arbitrary, design Dts R Pt Ps t∗ s∗ ǫ(1 − ǫ){ i=1 j=1 f(di ) (y)f(bj ) (y)dy}2 ∗ ef f (Dts ) = s∗ (1 − ǫ)ξ1,0 (Dts ) + t∗ ǫξ1,0 (Dts ) ∗ with respect to the and the Pitman asymptotic relative efficiency of the design Dts two-sample ranked set sample (DSRSS ) is ∗ ; DSRSS ) = TF (t, s, p, q) ARE(Dts (1 − ǫ)ξ1,0 (DSRSS ) + ǫξ0,1 (DSRSS ) , (1 − ǫ)s∗ ξ1,0 (Dts ) + t∗ ǫξ0,1 (Dts ) 5 (2) where TF (t, s, p, q) = t∗ s ∗ { Pt i=1 Ps j=1 R p2 q 2 { R f(di ) (y)f(bj ) (y)dy}2 f 2 (y)dy}2 . Equation (2) does not, in general, permit an analytical solution to maximize ∗ ∗ ARE(Dts ; DSRSS ) with respect to Dts . Note that the class of all possible designs is very rich for large q and p as it contains (2q − 1) × (2p − 1) different designs. ∗ , DSRSS ) over this entire class does not Thus numerical maximization of ARE(Dts appear to be attainable. Therefore, we need to develop a reasonable guided search algorithm to achieve an improvement in the asymptotic Pitman efficiency of the ranked set sample Mann-Whitney-Wilcoxon test. In Section 3 we propose the use of an information function to provide one such search algorithm. 3. INFORMATION FUNCTION Our interest lies in finding a design Dts that, in some sense, contains the maximum amount of information in a two-sample ranked set sample. Then we use this optimal design and associated information function to guide us in our search for an improved ranked set sample Mann-Whitney-Wilcoxon test. The optimality of this design depends, in general, on the type of inference, the underlying distributions and the nonparametric procedure of interest. We expect, by the nonparametric nature of the ranked set sample Mann-Whitney-Wilcoxon test, only a weak dependence on the general shapes of the underlying distributions. As in the one-sample problem (Öztürk & Wolfe 1998a), we define an information function to quantify the amount of information that can be extracted using twosample ranked set sample procedures with unequal allocation from the X and Y distributions, given the judgement ranking of the entire set. To construct such a ranked set sample, we draw N × q units from the X distribution F and M × p units from the Y distribution G. These units are partitioned into N different sets each containing q units for the X−samples and M different sets each containing p units in Y −samples. Units in each set are then judgement ranked. For a given design Dqp , based on this ranking information we quantify the dth i X−sample order statistics Y −sample order statistics in ri sets, i = 1, . . . , p, q, and the bth in ci sets, i = 1, . . . ,P i Pp q in such a way that i=1 ci = N and i=1 ri = M . Then our two-sample ranked set sample is X(di )j , j = 1, . . . , ci ; i = 1, . . . , q, and Y(bt )s , s = 1, . . . , rt ; t = 1, . . . , p. The information that design Dqp carries in this setting is measured as   √ p X q X rt ci X   X q+p T (Dqp ) = Var p (X(di )j − Y(bt )s )   N M (N + M ) i=1 j=1 t=1 s=1 = q X i=1 (1 − 2 ǫ)ai σ(d i :q) ǫ1 with the constraints  Pq Ppi=1 ai = 1, j=1 vj = 1, + p X j=1 2 ǫvj τ(b j :p) 1 − ǫ1 (3) ai ≥ 0, i = 1, . . . , q, vj ≥ 0, j = 1, . . . , p, 2 2 where ai = ci /N, vj = rj /M , ǫ1 = q/(p + q), ǫ = N/(N + M ) and σ(d and τ(b i :q) j :p) √ √ are the variances of qX(di )1 and pY(bj )1 , respectively. Let Dvar be the varianceoptimal design that minimizes T (Dqp ) in equation (3). Note that information in a 6 two-sample ranked set sample is expressed in terms of the magnitude of the total variation in the sample. In order to maximize the sample information, one needs to minimize the total variation with respect to design Dts , where t ∈ {1, . . . , q} and s ∈ {1, . . . , p}. Thus, the design Dts that produces the smallest value of T (Dts ) can be used to guide us in our search to uncover the underlying structure for an improved Mann-Whitney-Wilcoxon test. At this point we need to emphasize two points. First the coefficients ai and vj are functions of the design Dqp . Thus the choices of ai and vj will be determined by the optimal design and ai and vj will both be zero if the corresponding di and bj are not in Dvar . The second point is that the constraints in equation (3) ensure that at least one observation is quantified in each of the X− and Y −samples. We note that we cannot infer that Dvar will maximize the Pitman asymptotic efficiency of the Mann-Whitney-Wilcoxon test among all (2p − 1) × (2q − 1) designs. However, we will show in Section 4 that the use of Dvar does lead to an improvement for the standard ranked set sample Mann-Whitney-Wilcoxon test. We require the following assumptions: A1: The X-sample distribution F is a symmetric, k-modal distribution with support (a, b), −∞ ≤ a < b ≤ ∞, and modes R1 , . . . , Rk , 1 ≤ k < ∞. A2: The Y -sample distribution G is a symmetric, k ∗ -modal distribution with support (a∗ , b∗ ), −∞ ≤ a∗ < b∗ ≤ ∞, and modes R1∗ , . . . , Rk∗∗ , 1 ≤ k ∗ < ∞. B1: The quantity 2f ′ (t)F (t){1 − F (t)}/f 2 (t) + {2F (t) − 1} is positive (negative) for Ci < t < Ri (Ri < t < Ci∗ ), where Ci = (Ri + Ri−1 )/2, Ci∗ = (Ri + Ri+1 )/2, i = 1, . . . , k, with C1 = a and Ck∗ = b. B2: The quantity 2g ′ (t)G(t){1 − G(t)}/g 2 (t) + {2G(t) − 1} is positive (negative) ∗ for Ei < t < Ri∗ (Ri∗ < t < Ei∗ ), where Ei = (Ri∗ + Ri−1 )/2, Ei∗ = (Ri∗ + ∗ ∗ ∗ ∗ ∗ Ri+1 )/2, i = 1, . . . , k , with E1 = a and Ek∗ = b . For finite set sizes, the minimization of T (Dqp ) is computationally intensive and may depend strongly on the underlying distributions. Therefore, we first find a variance optimal design for large set sizes and then show that it works well for small set sizes and a large class of distributions. Let di and bj be percentiles based on q, p and the modes Ri , i = 1, . . . , k, and Rj∗ , j = 1, . . . , k∗ , i.e., di =  qF (Ri ), qF (Ri ) + 1 [qF (Ri ) + 1] if qF (Ri ) is integer otherwise (4) bj =  pG(Rj∗ ), pG(Rj∗ ) + 1 [pG(Rj∗ ) + 1] if pG(Rj∗ ) is integer otherwise, (5) and where [a] is the greatest integer less than a. Theorem 3 Suppose that Assumptions A1-A2 and B1-B2 hold. Then for large set sizes q and p, Dvar = {d1 , . . . , dk ; b1 , . . . , bk∗ } is variance-optimal. We emphasize that Theorem 3 provides locally optimal design and it does not stipulate that such an optimal design is unique. When the X− and Y −sample distributions are symmetric and unimodal, the optimal designs have simpler forms. 7 Corollary. If the X- and Y -sample distributions are symmetric and unimodal, then   q+1 p+1 Dvar = ; if q and p are both odd, 2 2   q+1 p p Dvar = ; , +1 if q is odd and p is even, 2 2 2   p+1 q q Dvar = , + 1; if q is even and p is odd 2 2 2 and Dvar = o nq q p p , + 1; , + 1 2 2 2 2 if q and p are both even. The non-uniqueness of the optimal design in the above corollary arises from the fact that there does not exist a unique median when the set size is even. Thus, in this case either one of the middle observations or both middle observations can be considered optimal. On the other hand, we feel that the optimal design that quantifies both middle observations provides better improvement for the MannWhitney-Wilcoxon test as suggested in Table 1. If the X- and Y -sample distributions are symmetric with bimodal bathtub shapes, then we can stipulate from Theorem 3 that the optimal design will quantify the two extreme observations in the X- and Y - sample distributions, since the modes are at the boundaries of the supports of the distributions. Theorem 4. Let F and G be asymmetric, unimodal distributions with supports (a, b), (a∗ , b∗ ), −∞ ≤ a < b ≤ ∞, −∞ ≤ a∗ < b∗ ≤ ∞, and modes R1 and R1∗ , a < R1 < b, a∗ < R1∗ < b∗ , respectively. Then the information-optimal design is Dvar = {k; k ∗ }, where k ≤ d1 and k ∗ ≤ b1 for right-skewed distributions, k ≥ d1 and k ≥ b1 for left-skewed distributions and d1 and b1 are defined in equations (4) and (5). Theorem 4 gives intervals for the possible values of k and k ∗ instead of specifying an exact design. However, our numerical calculations for the Mann-WhitneyWilcoxon test show that both k and k ∗ are equal to 1 (or q and p, respectively) for strongly right-skewed (or left-skewed) distributions and very close to 1 (or q and p, respectively) for moderately right-skewed (or left-skewed) distributions with relatively small set sizes (cf., e.g., Table 2). These numerical calculations show that the variance-optimality criterion does, indeed, lead to an improvement in the MannWhitney-Wilcoxon test. 4. IMPROVED MANN-WHITNEY-WILCOXON TEST In this section, we show that the data collection procedure based on the varianceoptimal design induced by the minimum variance criterion provides an improvement over the standard ranked set sample Mann-Whitney-Wilcoxon test. Theorems 3 and 4 indicate that two single order statistics from the X and Y distributions will maximize the information extracted from a cycle given the rank vector for a set of observations from a unimodal distribution. (Most practical applications involve data from the unimodal distributions.) Thus we restrict our search for improvement of the Mann-Whitney-Wilcoxon test to unimodal distributions 8 and consider the class of designs that require only a single order statistic in each cycle from each of the X and Y distributions. Letting D1 denote this class we point out that D1 contains pq different designs, one for each possible pair of single order statistics from each of the X and Y distributions. Theorem 5. Assume ǫ = 1/2. Then for a design D11,{di ;bj } = {di ; bj }, the Pitman ∗ = {di ; bj } with respect to the standard asymptotic relative efficiency of D11,{d i ;bj } ranked set sample is R 2{ f(di ) (y)f(bj ) (y)dy}2 σ02 (Dqp ) ∗ R ARE(D11,{di ;bj } ; DSRSS ) = , (6) {pτ1 (D11,{di ,bj } ) + qτ3 (D11,{di ,bj } )}pq{ f 2 (y)dy}2 where τ1 (D11,{di ;br } ) = bX r −1 r −1 bX j=0 u=0 q  p p q−1 di −1 j (j + u + di ) and τ3 (D11,{di ,br } ) = q X q X j=di u=di p p−1 br −1  q j) (j + u + br ) u  2p+q j+u+di  q u  2p+q j+u+br −  r −1 bX  p 2  j  p+q  (j + di ) j+d i −  q X  q  2  j  p+q  . (j + br ) j+b   j=0 j=di q p q−1 di −1 p−1 br −1 r Even though Theorem 5 simplifies equation (2), it is still not feasible to obtain an analytic maximization. Thus, we numerically maximize equation (6) for several distributions, and set sizes p and q. The distributions include the normal, logistic, double exponential, Cauchy, exponential, log normal, and the gamma distribution with parameters 10 and 1. The set sizes p, q, and the ratio of the sample sizes ǫ are taken to be p, q = 1, . . . , 5 and ǫ = 1/2. Tables 1 and 2 present designs that improve the efficiency of the Mann-Whitney-Wilcoxon test for these combinations of distributions and set sizes. Table 1: Variance optimal (Dvar ) and Pitman improved (Dimp ) designs for unimodal symmetric distributions, m = n, and various set sizes. The Pitman efficiencies are given for the normal distribution. Model Symmetric and Unimodal Dist. (Normal, Logistic, Cauchy, Double Exp.) q 2 2 2 2 3 3 3 4 4 5 p 2 3 4 5 3 4 5 4 5 5 Dimp {1, 2; 1, 2} {1, 2; 2} {1; 2} or {2; 3} none {2; 2} {2; 2, 3} {2; 3} {2, 3; 2, 3} {2, 3; 3} {3; 3} Dvar {1, 2; 1, 2} {1, 2; 2} {1, 2; 2, 3} {1, 2; 3} {2; 2} {2; 2, 3} {2; 3} {2, 3; 2, 3} {2, 3; 3} {3; 3} ARE(Dimp , DSRSS ) 1 1.003 1.089 1 1.119 1.118 1.117 1.117 1.124 1.166 Since the variance-optimal designs contain two order statistics in each cycle when the set sizes are even, we also searched for the improvement in the class D2 9 Table 2: The Pitman improved designs (Dimp ) and upper limits (Dup = {d1 ; b1 }) for the variance-optimal designs (Dvar = {k1 ; k1∗ }) for selected asymmetric distributions, m = n, and various set sizes. The variance-optimal design is Dvar = {k1 ; k1∗ } such that k1 ≤ d1 and k1∗ ≤ b1 . Model Log Normal Gamma(10,1) Exp q 2 2 2 2 3 3 3 4 4 5 2 2 2 2 3 3 3 4 4 5 11 2 2 2 2 3 3 3 4 4 5 p 2 3 4 5 3 4 5 4 5 5 2 3 4 5 3 4 5 4 5 5 11 2 3 4 5 3 4 5 4 5 5 Dimp {1; 1} {1; 1} {1; 1} {1; 1} {1; 1} {1; 1} {1; 1} {1; 1} {1; 1} {1; 1} {1; 1} {1; 1} {1; 1} {1; 1} {1; 1} {1; 1} {1; 1} {1; 1} {1; 1} {2; 2} {3; 3} {1; 1} {1; 1} {1; 1} {1; 1} {1; 1} {1; 1} {1; 1} {1; 1} {1; 1} {1; 1} 10 Dup {1; 1} {1; 1} {1; 1} {1; 1} {1; 1} {1; 1} {1; 1} {1; 1} {1; 1} {1; 1} {1; 1} {1; 2} {1; 2} {1; 3} {2; 2} {2; 2} {2; 3} {2, 2} {2; 3} {3; 3} {5, 5} {1; 1} {1; 1} {1; 1} {1; 1} {1; 1} {1; 1} {1; 1} {1; 1} {1; 1} {1; 1} ARE(Dimp , DSRSS ) 2.462 3.067 3.582 4.035 3.661 4.158 4.602 4.613 5.018 5.378 1.240 1.265 1.264 1.253 1.292 1.291 1.283 1.286 1.275 1.300 3.209 2.667 3.537 4.402 5.274 4.500 5.436 6.389 4.613 7.367 8.333 that contains two different order statistics in each of the X- and Y -samples. Our search algorithm first started to look for an improvement in the class D1 . After finding an optimal design with respect to the asymptotic Pitman efficiency in the class D1 , it searched for a further improvement in the class D2 . If an improved design is found in D2 , then the optimal (improved) design in D1 is updated and the search is terminated at this point. Otherwise, the optimal design in D1 is taken to be the improved design. Numerical calculations for the normal distribution showed that there was no additional improvement in the asymptotic Pitman efficiency from consideration of designs in the class D3 that contain three different order statistics in each of the X- and Y -samples. We believe that this result will also hold for other unimodal distributions and classes Dk , k = 3, . . . , a, where a is either q or p. There are several important observations from Tables 1 and 2. We first note that the result induced by the variance optimality criterion in the corollary is consistent with the designs that improve the Pitman asymptotic relative efficiency of the Mann-Whitney-Wilcoxon test for both symmetric and asymmetric distributions when the set sizes q and p are equal. For example, when the underlying distributions are symmetric and unimodal, both optimal and improved designs quantify the middle observation from each of the X and Y distributions for odd set sizes and the two middle observations for even set sizes. Similar results hold for asymmetric distributions. In this case, both optimal and improved designs quantify the smallest (largest) observation for strongly right-skewed (left-skewed) unimodal distributions. When the underlying distributions are symmetric, the designs that improve the Pitman efficiency of the Mann-Whitney-Wilcoxon test are very sensitive to the set sizes for the X and Y samples. For a given sum q + p, the highest efficiency is achieved when q = p and the improved design is consistent with the optimal design induced by the information criteria. On the other hand, if the set sizes are significantly different, there is a discrepancy between the Pitman improved designs and information optimal designs. For example, when q = 2 and p = 5, there is no design that improves the Pitman efficiency of the standard ranked set sample design of Bohn & Wolfe (1992), which indicates that there is no improvement over the standard ranked set sample procedures. When q = 2 and p = 4, the Pitman improved design is Dimp = {1; 2} or Dimp = {2; 3}, while the information optimal design is not unique and could be any of the three designs {1; 2}, {2; 3} or {1, 2; 2, 3}. Even though the optimal designs for skewed distributions are not affected by different set sizes, the efficiency is higher when the set sizes are equal. For example, for the log normal distribution, when q = 2 and p = 4 the efficiency is 3.582 but when q = 3 and p = 3 the efficiency increases to 3.661. The same result can be observed for other skewed distributions as well. Therefore, based on these observations we recommend using set sizes as equal as possible for the X and Y samples. 5. DISTRIBUTIONAL PROPERTIES OF THE IMPROVED TEST In general, the exact distribution of the standard ranked set sample MannWhitney-Wilcoxon test is not easily available because the various sequences of order statistics are not equally likely; cf., e.g., Bohn & Wolfe (1992) for a detailed discussion. In many of the improved designs, there is only one order statistic in each one of the X- and Y -samples. This makes it possible to provide an algorithm to calculate the exact null probability distribution of the test statistic when the set sizes p and q are equal. 11 Let UDimp (q; p) be the Mann-Whitney-Wilcoxon rank sum statistic based on the improved design Dimp = {d1 ; b1 }. Then we have UDimp (q, p) = m n X X i=1 j=1 δ(X(d1 )i − Y(b1 )j ). If q = p and Dopt = {d1 ; d1 }, the null distribution of UDimp (q, p) is the same as the null distribution of the simple random sample Mann-Whitney-Wilcoxon rank sum statistic based on n X’s and m Y ’s, which is available in standard nonparametric text books (cf., e.g., Hollander & Wolfe 1998). On the other hand, if the set sizes q and p are not equal, then the null distribution of UDopt (q, p) is not easily accessible. Difficulty arises from the fact that even though the number of order statistics is reduced to one in each sample, the sequences of order statistics are still not equally likely when q 6= p. Thus, UDimp (q, p), p 6= q, does not permit a simple algorithm to yield its null probabilities. APPENDIX Proof of Theorem 1. The proof follows from Theorem 3.4.13 of Randles & Wolfe (1991). We need to determine ξ1,0 (Dts ) and ξ0,1 (Dts ). First consider ξ1,0 (Dts ). From the kernel function wDts (X 1 , Y 1 ) we have EwDts (X 1 , Y 1 )wDts (X 1 , Y 2 ) − γ 2 (Dts ) ξ1,0 (Dts ) = s t X X   P {X(di )1 < min(Y(br )1 , Y(br )2 )} − P 2 (X(di )1 < Y(br )1 ) = i=1 r=1 t XX X + i=1 1≤r6=ℓ≤s  P {X(di )1 < min(Y(br )1 , Y(br )2 )}  −P (X(di )1 < Y(br )1 )P (X(di )1 < Y(bl )1 ) τ1 (Dts ) + τ2 (Dts ). = It then follows that "Z Z 2 # s t X X 2 {1 − Fbr (y)} dF(di ) (y) − {1 − Fbr (y)}dF(di ) (y) τ1 (Dts ) = i=1 r=1 = s t X X i=1 r=1 × Z  − =  bX r −1 r −1 bX  j=0 k=0 q  q−1 di − 1    p p j k F j+k+di −1 (y){1 − F (y)}2p−j−k+q−di f (y)dy bX r −1 j=0 s t X X i=1 r=1 2    Z p q−1  F j+di −1 (y){1 − F (y)}p−j+q−di f (y)dy   q di − 1 j   bX r −1 r −1 bX  j=0 k=0 q−1 p di −1 j  2p+q j+k+di (j + q   p k k + di ) −  p )2  q−1 q j=0 ,  di −1 j p+q j+di (j + di ) ( Pbr −1 which is the expression in Theorem 1. The expression τ2 (Dts ) follows from a similar calculation. 12 We now consider ξ0,1 (Dts ). Again from the kernel function wDts (X 1 , Y 1 ) we have ξ0,1 (Dts ) = = EvDts (X 1 , Y 1 )wDts (X 2 , Y 1 ) − γ 2 (Dts ) s t X X  i=1 r=1 + =  P {max(X(di )1 , X(di )2 ) < Y(br )1 } − P 2 (X(di )1 < Y(br )1 ) XX 1≤i6=j≤t s X  P {max(X(di ) , X(dj )1 ) < Y(br )1 } r=1  −P (X(di )1 < Y(br )1 )P (X(dj )1 < Y(br )1 ) τ3 (Dts ) + τ4 (Dts ). The remaining part of the proof is completed by evaluating the corresponding probabilities as done for ξ1,0 (Dts ). The details are omitted. ∗ is an arbitrary design and contains SRSS, Proof of Theorem 2. Since the design Dts ∗ for the proof of the theorem it is sufficient to derive the Pitman efficacy of Ū (Dts ). ∗ (·, ·) we get From the kernel function wDts ∗ ∗ µ′Dts ∗ (0) = t s s Z t X X f(di ) (y)f(bj ) (y)dy. (7) i=1 j=1 ∗ , the Theorem 1 yields Under the null hypothesis, again for the design Dts ∗ )= σ0 (D(ts) t∗ s∗ 2 ξ1,0 (Dts ) s∗ t∗ 2 ξ0,1 (Dts ) + . ǫ 1−ǫ (8) The proof is completed by putting equations (7) and (8) in equation (1). Proof of Theorem 3. For the proof of that theorem, we need to minimize T (D) with respect to D. For large set sizes q and p, T (D) can be written as T ∗ (λ) = q p ǫ X λ∗j (1 − λ∗j ) 1 − ǫ X λi (1 − λi ) + , ǫ1 i=1 f 2 (ηλi ) 1 − ǫ1 j=1 g 2 (ηλ∗∗ ) (9) j where λ = {λ1 , . . . , λq ; λ∗1 , . . . , λ∗p }, 0 < λi = limq→∞ di /q < 1, i = 1, . . . , q; 0 < λ∗j = limp→∞ bj /p < 1, j = 1, . . . , p; ηλi = F −1 (λi ), and ηλ∗∗ = G−1 (λ∗j ). The j proof is completed by applying Theorem 1 of Öztürk & Wolfe (1998a) to each term in equation (9). Proof of Theorem 4. The proof follows from a consideration similar to that of the proof of Theorem 2 in Öztürk & Wolfe (1998a). ACKNOWLEDGMENTS The authors thank the Editor and anonymous referees for their helpful comments which improved the presentation. The work of Douglas A. Wolfe was supported in part by National Science Foundation Grant Number DMS-9802358. 13 REFERENCES Bhoj, D. S. (1997). New parametric ranked set sampling. J. Appl. Statist. Sci., 6, 275–289. Bohn, L. L. (1998). A ranked-set sample signed-rank statistic. J. Nonpar. Statist., 9, 295–306. Bohn, L. L., & Wolfe, D. A. (1992). Nonparametric two-sample procedures for ranked-set samples data. J. Amer. Statist. Assoc., 87, 552–561. Bohn, L. L., & Wolfe, D. A. (1994). The effect of imperfect judgment rankings on properties of procedures based on the ranked-set samples analog of the MannWhitney-Wilcoxon statistic. J. Amer. Statist. Assoc., 89, 168–176. Dell, T. R., & Clutter, J. L. (1972). Ranked-set sampling theory with order statistics background. Biometrics, 28, 545–555. Halls, L. K., & Dell, T. R. (1966). Trial of ranked-set sampling for forage yields. Forest Science, 12, 22–26. Hettmansperger, T. P. (1995). The ranked-set sample sign test. J. Nonpar. Statist., 4, 263–270. Hollander, M., & Wolfe, D. A. (1998). Nonparametric Statistical Methods. 2nd edition, John Wiley, New York. Kaur, A., Patil, G. P., & Taillie, C. (1997). Unequal allocation models for ranked set sampling with skew distributions. Biometrics, 53, 123–130. Koti, M. K., & Babu, G. J. (1996). Sign test for ranked-set sampling. Comm. Statist. Theory Meth., 25, 1617–1630. McIntyre, G. A. (1952). A method of unbiased selective sampling, using ranked sets. Austral. J. Agric. Res., 3, 385–390. Öztürk, Ö. (1999a). One and two sample sign tests for ranked set samples with selective designs. Comm. Statist. Theory Meth., 28, 1231–1245. Öztürk, Ö. (1999b). Two sample inference based on one sample ranked set sample sign statistic. J. Nonpar. Statist., 10, 197–212. Öztürk, Ö., & Wolfe, D. A. (1998a). Optimal allocation procedure in ranked set sampling for unimodal and multi-modal distributions. Technical Report No 632, Department of Statistics, The Ohio State University, Columbus, OH. Öztürk, Ö., & Wolfe, D. A. (1998b). Optimal ranked set sampling protocol for the signed rank test. Technical Report No 630, Department of Statistics, The Ohio State University, Columbus, OH. Öztürk, Ö., & Wolfe, D. A. (1999). Alternative ranked set sampling protocols for the sign test. Statist. Probab. Lett., in press. Randles, R. H., & Wolfe, D. A. (1991). Introduction to the Theory of Nonparametric Statistics. Krieger, Malabar, FL. Stokes, S. L. (1977). Ranked set sampling with concomitant variables. Comm. Statist. Theory Meth., 6, 1207–1211. Stokes, S. L., & Sager, T. W. (1988). Characterizations of a ranked-set sample with application to estimating distribution functions. J. Amer. Statist. Assoc., 83, 374–381. Takahasi, K., & Wakimoto, K. (1968). On unbiased estimates of the population mean based on the sample stratified by means of ordering. Ann. Inst. Statist. Math., 20, 1–31. 14 Received 26 March 1999 Accepted 11 June 1999 Ömer Öztürk Department of Statistics The Ohio State University Marion, Ohio 43302 USA e-mail: [email protected] Douglas A. Wolfe Department of Statistics The Ohio State University Columbus, Ohio U.S.A. 43210 e-mail: [email protected] 15