Published online 1 July 1999
Published in Agron J 91:721-731 (1999)
© 1999 American Society of Agronomy
677 S. Segoe Rd., Madison, WI 53711 USA
Agronomy Journal 91:721-731 (1999)
© 1999 American Society of Agronomy
STATISTICS
Validity and Efficiency of Neighbor Analyses in Comparison with Classical Complete and Incomplete Block Analyses of Field Experiments
Tianxia Wua and
Pierre Dutilleulb
a Nanjing Agricultural University, Nanjing, China
b Dep. of Plant Science, McGill University, Macdonald Campus, 21,111 Lakeshore Rd., Ste-Anne-de-Bellevue, QC, Canada H9X 3V9
cydp{at}musica.mcgill.ca
Received for publication May 21, 1997.
 |
ABSTRACT
|
|---|
In classical block analysis, the assumption of independence between observations from neighboring plots within a block is violated when spatial autocorrelation is present, resulting in incorrect treatment effect estimates and an inflated error variance estimate. Before recommending any neighbor analysis designed to account for spatial autocorrelation, its validity in estimation and testing and its efficiency must be examined. Our main objective was to assess the validity and efficiency of five neighbor analyses in comparison with complete and incomplete block analyses, using 50 pseudo-experiments for each of 50 data sets derived from two uniformity trials for soybean and wheat. The neighbor models were first difference with (FD-EV) or without (FD) errors in variables, second difference with (SD-EV) or without (SD) errors in variables, and first-order autoregressive errors. Geostatistical analysis based on the normalized semivariance detected spatially structured variation at different degrees in the data sets. The first and second differences were effective in removing spatial trends and autocorrelation. Our simulation study showed that FD-EV provided valid estimates of the error variance in all and SD-EV in most of the data sets. On average, the observed significance level was slightly beyond the theoretical level for both analyses. The other neighbor analyses underestimated the error variance. Excluding SD, neighbor analyses had similar efficiency and were more efficient than complete and incomplete block analyses, with average relative efficiencies of about 300 and 130%, respectively. Relative efficiency tended to increase with decreasing normalized nugget effect (NC0) and increasing CV. When spatially structured variation is strong (i.e., NC0 < 10%), highly rectangular plots must be preferred to nearly square plots in incomplete block and neighbor analyses. Considering validity and efficiency, the best method was FD-EV, which is thus a reliable alternative to classical block analysis when spatial autocorrelation is present.
Abbreviations: CB, complete block FD, first difference FD-EV, first difference with errors in variables Emp, average empirical variance GLS, generalized least squares IB, incomplete block Pre, average predicted variance RCB, randomized complete block REML, restricted maximum likelihood SD, second difference SD-EV, second difference with errors in variables
 |
INTRODUCTION
|
|---|
VALID DIFFERENTIATION between varieties is crucial in plant breeding programs. Such a differentiation must be based on accurate and precise estimation of variety effects, which depends largely on the control of spatial variation among plots. Spatial variation incorporates many factors, such as moisture, pH, and structure of soil and the pressure of weeds. It can be partitioned into two components: purely random noise and structured variation. All approaches towards improving precision of variety effect estimates seek to minimize the latter. These approaches fall into two categories: (i) block designs, including the randomized complete block (RCB) design, and incomplete block (IB) designs, the latter being mainly lattice and generalized lattice designs; and (ii) neighbor analyses, in which trends as well as the correlation between data of neighboring plots can be accounted for.
The precision of block analyses relies on the control of heterogeneity within blocks. Generally, the greater the heterogeneity within blocks, the poorer the precision of variety effect estimates. By nature, there is typically an appreciable correlation among data from neighboring plots, which is termed spatial autocorrelation. Recognizing this, neighbor analyses were proposed as an alternative to block analyses. Spatial autocorrelation can be taken into account by covariate analysis (Papadakis, 1937; Bartlett, 1978; Brownie et al., 1993) or by a trend-plus-error model or a first-difference model with errors in variables (Wilkinson et al., 1983; Green et al., 1985; Besag and Kempton, 1986; Williams, 1986; Gleeson and Cullis, 1987; Baird and Mead, 1991; Grondona and Cressie, 1991). Spatial structure may be in the mean of the variable of interest, and spatial heterogeneity in the mean of trend type (Dutilleul, 1993) can then be modeled by trend surface analysis (e.g., Tamura et al., 1988). All these methods differ in the model used to account for the trend and the assumptions required by the estimation techniques. Recently, considerable interest has been shown in neighbor analyses, because of the noticeable gains in precision that can be achieved in variety effect estimation by using neighbor analysis rather than IB analysis. In a study of 118 wheat variety trials, Kempton and Howes (1981) estimated that the iterated Papadakis procedure (Bartlett, 1978) increased the precision of variety mean differences by 8.7% compared with the IB analysis. In a study of 1019 variety trials over a range of crops, Cullis and Gleeson (1989) showed that the one-dimensional neighbor analysis resulted in a 42% reduction of the variances of varietal yield differences, compared with the RCB analysis, while the IB analysis resulted in a 33% reduction. In a simulation study, Brownie and Gumpertz (1997) showed that model-based methods accounting for spatial variation in terms of correlations between plot errors are robust with respect to validity when spatial autocorrelation is present. Spatial trends were modeled by trend surface analysis and the spatial covariance function used for estimating treatment effects was assumed to be the exponential function used in simulations. However, the neighbor analysis methods based on the differencing technique seem to have never been completely examined for validity and efficiency in comparison with block analyses, after such an examination had been initiated by Besag and Kempton (1986)(Appendix 2) and Baird and Mead (1991).
Because the primary objective of most field experiments is the accurate and precise estimation of treatment contrasts, validity in estimation of the error variance (unbiasedness) and efficiency (precision) should be thoroughly examined before any neighbor method can be recommended as a routine method for field experimentation. Validity in testing should also be assessed by comparing observed and theoretical significance levels. Through randomization, the classical analyses have been proven to be valid in estimation because the treatment mean square and the residual mean square have the same expectation in the absence of treatment effects, but there is no analytical proof that the neighbor methods are valid within such a randomization framework (Baird and Mead, 1991). However, the validity of neighbor methods can be examined empirically by using simulation based on uniformity data (Wilkinson et al., 1983; Besag and Kempton, 1986). Unless neighbor methods are shown to provide unbiased error variance, it is not reasonable to compare the efficiency of neighbor methods relative to block analyses on the basis of average standard errors of pairwise treatment differences.
The objectives of this study therefore are (i) to evaluate the validity and efficiency of neighbor analyses in comparison with classical complete and incomplete block analyses by simulation based on uniformity trials and (ii) to study the effects of spatial dependency, coefficient of variation, block size, plot size, and plot shape on the efficiency of neighbor analyses, using data sets obtained from soybean and wheat uniformity trials on Jiangpu Farm of Nanjing Agricultural University, Nanjing, China (Wu et al., 1995) and on the Aberdeen Substation, Aberdeen, ID (Wiebe, 1935), respectively. In a preliminary step, spatial structure (i.e., trends at large scale and autocorrelation at small scale) is identified in the field experiments by geostatistical analysis.
 |
Materials and methods
|
|---|
Uniformity Trials and Data Collection
Validity and efficiency were evaluated by simulation based on data sets generated from the soybean [Glycine max (L.) Merr.] and wheat (Triticum aestivum L.) uniformity trials described below. The experiment design (including the number of varieties and complete blocks, and the plot size and shape) and the coefficient of variation meet the practical requirements of field variety trials in plant breeding programs (Tables 1 and 2)
.
View this table:
[in this window]
[in a new window]
|
Table 1 Plot size, plot shape, plot arrangement, and coefficients of variation (CV) for the 50 data sets derived from soybean and wheat uniformity trials
|
|
The soybean uniformity trial was conducted on Jiangpu Farm of Nanjing Agricultural University, Nanjing, China, in 1987, using the summer soybean variety NAU 73-935 (Wu et al., 1995). The crop was sown in rows 7.33 m long and 0.40 m apart. Fifteen days after emergence, seedlings were thinned to obtain a within-row distance between plants of about 0.13 m. At harvest time, each row was cut at 1.33-m intervals after removing 0.33 m from the ends of each row, so that each row consisted of five 1.33-m intervals or unit rows. The total weight of plants, excluding roots, was recorded per unit row and then transformed to grain yield, using a harvest index derived from 60 randomly sampled unit rows on which total weight and grain yield were measured. Because the experiment is a uniformity trial, all plots were planted and harvested on the same days. No disease or insect damage was observed, total weight was measured when plants were dry, and the harvest index was assumed to be constant over plots. The 6000 unit rows of this trial were originally arranged in 40 columns of 150 unit rows each in the field.
The wheat uniformity trial was conducted on the Aberdeen Substation, Aberdeen, ID (Wiebe, 1935). It consisted of 1500 unit rows of `Federation' wheat (CItr 4734), 0.30 m apart and 4.57 m long. The 1500 unit rows were originally arranged in 12 columns of 125 unit rows each in the field. The crop was grown in the summer of 1927. Grain yield per unit row was determined after harvest (Table 1 in Wiebe, 1935).
Contour maps show that for both soybean and wheat trials, yield in the eastern part was more homogeneous than in the western part of the experiment field (Fig. 1)
. The plots were rectangular in shape and were constructed by combining unit rows and columns of the original field layout. The different arrangements of columns and unit rows into plots resulted in a total of 50 data sets, corresponding to various combinations of plot size, plot shape, and number of dummy varieties and complete blocks (Tables 1 and 2).
Geostatistical Analysis
The semivariogram (Journel and Huijbregts, 1978; Vieira et al., 1983) is the geostatistical tool commonly used to quantify the spatial structure, which arises from trends (gradients) at large scale and spatial autocorrelation among neighboring plots at small scale. In one-dimensional space, the semivariance estimate is defined as
 | [1] |
where N(h) is the number of pairs of observations (zi, zi+h) separated by distance h. It is expected that the difference zi - zi+h decreases as h decreases. In practice, as h approaches zero,
(h) approaches a positive value C0 called the nugget effect. A nugget effect equal to the sample variance indicates a purely random noise. This is one of the basic assumptions to be satisfied by the residuals of any classical ANOVA model. Conversely, when C0 is less than the sample variance, the semivariogram indicates spatial structure. Thus, C0 can be used as an indicator of the portion of spatial structure in the total variation. To compare the spatially structured portion of variation among data sets, we normalized the semivariance by dividing it by the sample variance; 1.0 minus the normalized semivariance also provided a spatial autocorrelation coefficient. The autocorrelation was assumed to be consistent over columns, so that for each data set the normalized semivariance estimate was computed by
 | [2] |
where m and c are the number of columns and the number of plots per column, respectively; s2 is the sample variance of zi,j; and h' is the number of plots between zi,j and zi,j+h'. Then h, the distance between zi,j and zi,j+h', is equal to h' times the width of a plot, where h' = 1, 2, ..., H and m(c - H) > 50 because
(h) is unreliable for N(h) < 50 (Journel and Huijbregts, 1978). In particular, lags 1 and 2 represent distances equal to one and two plot widths.
To evaluate the effect of blocking and differencing on the spatial structure, different zi,j's were used in the semivariance analysis: (i) yi,j - overall mean, (ii) yi,j - complete block mean, (iii) yi,j - incomplete block mean, (iv) yi,j - yi,j+1, and (v) yi,j - 2yi,j+1 + yi,j+2, where yi,j is the grain yield for plot j of row i in the uniformity trial. For zi,j = yi,j - incomplete block mean in (iii), spatial autocorrelation was assumed to be consistent over blocks, and the number of columns and number of plots per column in Eq. [2] were replaced by the number of blocks and number of plots per block, respectively. The geostatistical analysis was performed with our own computer programs written in the SAS/IML language (SAS Inst., 1989).
Criteria of Evaluation
The criteria of validity and efficiency used here follow Besag and Kempton (1986). For each method of analysis (see below), the average empirical variance (Emp) and the average predicted variance (Pre) of pairwise variety contrasts are defined as
 | [3] |
and
 | [4] |
where
a and
b represent the estimates of the effect of varieties a and b, and v is the number of dummy varieties. In a randomization framework, an analysis is said to provide valid estimates of the error variance if Emp
Pre over a series of randomized layouts. When the value of Pre is substantially smaller (or larger) than that of Emp for a particular method, it suggests that the method underestimates (or overestimates) the variance of the estimated differences between varieties. Since there is no real variety effect in pseudo-experiments that originate from a uniformity trial where dummy varieties are randomly relabeled, the true theoretical difference between any pair of varieties is zero. Thus, Emp reflects the efficiency of variety effect estimates; the smaller the value of Emp, the higher the efficiency.
In hypothesis testing, a method is said to be valid at level
if the probability that it detects a treatment difference, when in fact there is none, is less than or equal to
. Since detection of significant variety effects here is based on F-testing, the valid method should provide a significance level of
to the F-test; i.e., the frequency at which the F-statistic exceeds the tabulated critical value F1-
should be (at least approximately) equal to
in the absence of real variety effects.
Statistical Methods of Data Analysis
Suppose that an experiment comprises m columns, with c plots each. Let the random variable yi,j denote the yield of plot j in row i, and Y the corresponding n-vector with n = mc. The classical RCB ANOVA model (where the block factor is considered fixed) can then be written as
 | [5] |
where µ is the grand mean and 1 is the n-vector of ones,
is the v-vector of variety effects and D is the n x v variety design matrix, ß is the r-vector of block effects and B is the n x r block design matrix, and the
i,j's are assumed to be independent N
deviates. The expected value of Y and its variancecovariance matrix are then given, respectively, by
E
= 1µ + D
+ Bß and Var
=
2
In
, where In is the n x n identity matrix, and
Pre =
, where r is the number of complete blocks.
The equation for the IB ANOVA model with recovery of interblock information (where the block factor is considered random; see, for example, Yates, 1939; Cochran and Cox, 1957) is similar in writing to Eq. [5], except that
E
= 1µ + D
and
Var
=
2ß BBT +
2
In = V
, where
2ß denotes the variance component associated with the random block factor and the T superscript is the transpose matrix operator (Graybill, 1983); other notations are as in Eq. [5]. Variance components
2ß and
2
were estimated by restricted maximum likelihood (REML) (Patterson and Thompson, 1971). The generalized least-squares (GLS) estimate of
, which takes into account the interblock information and optimally benefits from unbiasedness and minimum variance (Searle, 1971), is
 | [6] |
where
2ß and
2
are replaced by their REML estimates in V and the -1 superscript is the inverse matrix operator (Graybill, 1983). Pre can then be computed by
 | [7] |
where trace() and sum() denote the sum of diagonal elements and the sum of all elements of a matrix, respectively (Graybill, 1983).
In this study, we considered only one-dimensional spatial trends, which usually suffices in practice (Kempton, 1985; Besag and Kempton, 1986). The first-difference model with errors in variables (FD-EV) for the n-vector of yields Y defined above can be written as
 | [8] |
where
(-v) is a (v-1)-vector of variety effects (note that the effect of the vth variety is derived from the sum of the v-1 others below) and D is the corresponding n x (v - 1) variety design matrix;
is an n-vector of spatial trend effects whose first differences
1
are assumed to be independent and identically distributed N
, where
1 is the first-difference operator; and
is an n-vector of random errors assumed to be independent N
. The second-difference model with errors in variables (SD-EV) can be written similarly, with the same notations and assumptions except that the second differences
2
are assumed to be independent and identically distributed N
, where
2 is the second-difference operator. Operators
1 and
2 can be written in matrix notation as follows:
and
where
is the Kronecker or direct product (Graybill, 1983).
Hereafter, the estimation methods are presented jointly for the first-difference and the second-difference models, using the notation
d (d = 1, 2). Assuming the difference
d
is independent of the error
(i.e., cov(
d
,
) = 0 for d = 1, 2), and denoting
dY by W and
dD by X, we have
E
= X
and
Var
=
2
In +
2
d
Td = Vd
. Variance components
2
and
2
were estimated by REML and the vector of variety effects
(-v) was estimated by GLS as follows:
 | [9] |
where
2
and
2
are replaced by their REML estimates in Vd, so the vth variety effect estimate is given by
 | [10] |
The variancecovariance matrix of vector
(-v) is given by the upper left
x
submatrix of
-1; the variance of
v is equal to the sum of all elements of this submatrix; and Cov(
a,
v) is equal to minus the sum of elements of the ath row of the same submatrix (a = 1, ..., v - 1). Thus,
 | [11] |
To assess the goodness-of-fit of neighbor models for field experiments, the first-difference (FD) and second-difference (SD) models without errors in variables and the RCB ANOVA model with first-order autoregressive or AR(1) errors were also considered. The model of the former is
 | [12] |
with
E
= X
and Var
=
2
In
. The computation of Pre is the same as for Eq. [11], with similar notations and assumptions (except
). Under the RCB ANOVA model with AR(1) errors, the n-vector Y satisfies Eq. [5] where the variancecovariance structure of the n-vector of errors
is assumed to be first-order autoregressive; that is,
with
The variance component
2
and the autocorrelation parameter
are estimated by REML. The GLS estimate of
and the computation of Pre are similar to those of the IB analysis (see Eq. [6] and [7]).
Simulation Study
For each of the 50 data sets (Table 1), the methods of analysis were applied to 50 pseudo-experiments, which were produced by randomly arranging dummy variety labels within blocks according to a lattice, generalized lattice or complete block design (Table 2). For each method, the sample means of Emp and Pre and the frequency at which the null hypothesis was falsely rejected (Type I error rate) were computed over the 50 pseudo-experiments. Pre is said to be significantly different from Emp if
where the variance s2D is calculated on the 50 differences Emp - Pre. The efficiency of a given method relative to the classical RCB ANOVA was estimated by
 | [13] |
Depending on the method, the computation of Emp, Pre and the observed significance level was performed with PROC MIXED of SAS version 6.10 (SAS Inst., 1995) in combination with our own computer programs written in the SAS/IML language (SAS Inst., 1989). PROC MIXED was used for the REML estimation of variance components for all methods except RCB ANOVA and for the estimation of treatment effects only for IB; we used PROC IML for the GLS estimation of treatment effects for the neighbor analyses. For the block analyses (RCB ANOVA and IB), the F-test was based on the variety mean square over error mean square ratio, while for the neighbor analyses it was given by the Emp against Pre ratio. Only for the RCB ANOVA are the two F-ratios equivalent.
 |
Results and discussion
|
|---|
Geostatistical Analysis
To assess the spatially structured variation of the residuals from block analyses, the normalized nugget effect NC0 was estimated by linear regression using the three first points (h,
*(h)) in the normalized semivariogram (Table 3)
. The NC0 values for IB(6) (incomplete block analysis with blocks of size 6) are not given in Table 3, as the semivariograms were then not smooth enough to allow reliable linear regression. NC0 values less than unity or 100% indicate spatially structured variation; the smaller the NC0, the stronger the spatial structure. The normalized nugget effects for yields (y) minus overall mean are less than those for yields minus CB mean and yields minus IB(10) mean. This shows that the block analyses, and especially the IB, may be useful, to some degree, for removing or taking into account the spatial structure. Given the coefficients of variation of Table 1, spatial structure tends to be weaker for data sets with lower heterogeneity (e.g., wheat B1, CV = 9.6%, and wheat B2, CV = 9.1%, for v = 60). With these two exceptions, only 4 to 16% of the sample variance is random in nature for y - CB mean, based on NC0 (Table 3). Therefore, much of the spatially structured variation is not explained by the ANOVA. For most data sets, NC0 is about 40% for y - IB(10) mean; despite the use of small blocks, structured variation thus still exists, so the precision could be further improved.
View this table:
[in this window]
[in a new window]
|
Table 3 Normalized nugget effects (NC0) for data (y) centered to the overall mean, to the means of complete blocks (CB), and to the means of incomplete blocks of size 10 [IB(10)] for data sets with v = 60.
|
|
The lag-1 autocorrelation coefficients were estimated over columns for y - overall mean and y - CB mean and over blocks for y - IB(10) mean and y - IB(6) mean (Table 4)
. While there is a slight decrease in the autocorrelations from y - overall mean to y - CB mean, the decrease is much more pronounced from y - CB mean to y - IB(10) mean and finally, from y - IB(10) mean to y - IB(6) mean. This observation suggests that strong autocorrelation exists within large blocks and spatial autocorrelation in the errors can be effectively reduced by incomplete blocks, but not always completely removed, even by the use of very small blocks.
View this table:
[in this window]
[in a new window]
|
Table 4 Empirical lag-1 autocorrelation coefficients (r1) for data (y) centered to the overall mean, to the means of complete blocks (CB) and to the means of incomplete blocks of size 10 [IB(10)] and size 6 [IB(6)] for data sets with v = 60.
|
|
Under Model [8], lag-1 autocorrelation is negative for the differencing deviates W, that is, for the first differences:
Cov
= -
2
and for the second differences:
Cov
= -4
2
. The nugget effect is thus meaningless for the differencing deviates because the normalized semivariance is greater than 1.0 at first distance lag. To evaluate how effective differencing is in removing spatially structured variation, compared with complete blocking, normalized semivariograms were computed for the residuals of CB and for the differencing deviates (Fig. 2 and 3)
. As the distance increases, all CB curves rise and level-off at some maximum; such empirical semivariograms can be described by a spherical variogram model (Journel and Huijbregts, 1978). The predominant spatial structure in these data sets inflates the error variance of the ANOVA model, resulting in masked variety effects. In contrast, most of the normalized semivariograms from differencing deviates are close to unity, indicating little spatially structured variation except the negative correlation between Wi and Wi+1. The periodical changes observed in Fig. 3 are caused by systematic variation within a drill width of eight rows (Wiebe, 1935). The spatial correlograms displayed in Fig. 4
show high and positive autocorrelation between nearest neighbors and decreasing autocorrelation with increasing distance between plots for the residuals of the RCB ANOVA model. For the first and second differencing deviates, most of the autocorrelation coefficients are statistically equal to zero, with the exception of first distance lags. Figures 2, 3, and 4 not only illustrate how effective differencing is in removing the spatial structure, but also show that first and second differencing are almost equivalent in this respect.

View larger version (35K):
[in this window]
[in a new window]
|
Fig. 2 Normalized semivariograms for the residuals of the randomized complete block model (dashed line) and for the first differences (solid line) and second differences (dotted line) of the raw data, by using soybean data sets with v = 60 (no. of dummy varieties). See Tables 1 and 2 for the data set codes and characteristics
|
|

View larger version (37K):
[in this window]
[in a new window]
|
Fig. 3 Normalized semivariograms for the residuals of the randomized complete block model (dashed line) and for the first differences (solid line) and second differences (dotted line) of the raw data, by using wheat data sets with v = 60 (no. of dummy varieties). See Tables 1 and 2 for the data set codes and characteristics
|
|

View larger version (21K):
[in this window]
[in a new window]
|
Fig. 4 Spatial autocorrelations for the residuals of the randomized complete block model (dashed line) and for the first differences (solid line) and second differences (dotted line) of the raw data for (a) soybean A1, (b) soybean B5, (c) wheat B2, and (d) wheat A3. See Tables 1 and 2 for the data set codes and characteristics
|
|
Validity Analysis
In a randomization framework, a method of analysis is said to be valid if the average empirical variance Emp and the average predicted variance Pre of pairwise treatment contrasts are equal over the population of all randomized layouts; practically, the condition is the two variance statistics are not significantly different over a sample of 20 randomized layouts or more (Besag and Kempton, 1986, p. 250). The sample means of Emp and Pre, computed over 50 pseudo-experiments for each method and expressed as percentages of the Emp sample mean of the RCB ANOVA, are presented in Tables 5, 6, and 7
for v = 60, 25, and 100, respectively.
View this table:
[in this window]
[in a new window]
|
Table 5 Sample means of average empirical variance (Emp) and average predicted variance (Pre) of pairwise variety contrasts computed over 50 pseudo-experiments from data sets with v = 60. Values reported are percentages relative to the sample means of Emp for the RCB ANOVA.
|
|
View this table:
[in this window]
[in a new window]
|
Table 6 Sample means of average empirical variance (Emp) and average predicted variance (Pre) of pairwise variety contrasts computed over 50 pseudo-experiments from data sets with v = 25. Values reported are percentages relative to the sample means of Emp for the RCB ANOVA. #
|
|
View this table:
[in this window]
[in a new window]
|
Table 7 Sample means of average empirical variance (Emp) and average predicted variance (Pre) of pairwise variety contrasts computed over 50 pseudo-experiments from data sets with v = 100. Values reported are percentages relative to the sample means of Emp for the RCB ANOVA.
|
|
For all data sets, there is close agreement between the sample means of Pre and Emp for the classical RCB ANOVA, the IB with recovery of interblock information and the FD-EV (Tables 5, 6, and 7). To a lesser degree, there is agreement between observed and theoretical significance levels (0.05 and 0.01) for the same methods (Table 8)
. As a result, the three methods are valid, both in estimation and in testing. This is consistent with the findings of Besag and Kempton (1986) and Baird and Mead (1991). In most cases, the sample mean of Pre is much smaller than the sample mean of Emp for the AR(1) model, which shows that the error variance is underestimated and too many significant variety effects result from this analysis. In a few cases, Pre is statistically smaller than Emp for SD-EV. On the other hand, in most cases, there is poor agreement between Pre and Emp for the difference models without errors in variables and especially SD (Table 5), in which the true error variance of pairwise treatment contrasts is consistently underestimated and the actual significance level is inflated (Table 8).
View this table:
[in this window]
[in a new window]
|
Table 8 Observed significance levels for the F-test for variety effects, as computed over 50 pseudo-experiments from soybean data sets with v = 60 and for theoretical significance levels of 0.05 and 0.01.
|
|
Biased error variance most likely arises when the model does not fit the data. For instance, under Model [8] for FD-EV (SD-EV), the autocorrelations should be equal to zero except at lag 1 (lags 1 and 2). For FD-EV, the lag-1 autocorrelation is
-
; for SD-EV, the lag-1 and lag-2 autocorrelations are
-
and
, respectively. However, all autocorrelations should be equal to zero under Model [12] for FD and SD. The tenability of these assumptions can be assessed in pseudo-experiments through the autocorrelation estimates r1 and r2 at lags 1 and 2 (Table 9)
. It is observed that most of the r1's are significantly different from zero on the first differencing deviates, except for wheat A3 and A4, for which the FD method is valid (Table 5). All r1's and some r2's are significantly different from zero on the second differencing deviates (Table 9), which is possibly the main reason for the SD method having biased error variance estimates for all data sets (Table 5). It must be noted that SD-EV is not valid for wheat A1, B1, and A4 for v = 60 (Table 5), where r2 is negative and significant (Table 9) while the lag-2 autocorrelation should theoretically be positive. This suggests that the SD-EV model does not fit the three wheat data sets. Similarly, the FD-EV method would not be valid if r1 was positive instead of negative.
In view of our results, the lag-1 and lag-2 autocorrelation estimates r1 and r2 can be used as tools to test the goodness-of-fit of the first-difference and the second-difference models. In a real experiment, they can be computed from yi,j -
(), where
() estimates the effect of the variety tried in the plot. The following rules could be applied to test a model. For the first-difference models: if r1 is not significantly different from zero, both the FD and the FD-EV model fit; if r1 is significant and negative, only the FD-EV model fits; otherwise, neither of the models fits. For the second difference models: if r1 and r2 are not significantly different from zero, both the SD and the SD-EV model fit; if r1 is negative and significant and r2 is positive, only the SD-EV model fits; otherwise, neither of the models fits.
Efficiency Analysis
In pseudo-experiments, under the assumption of treatment additivity, a method of analysis is said to be more efficient than another if it has a smaller average empirical variance Emp for pairwise treatment contrasts (Baird and Mead, 1991, p. 1474; see also Besag and Kempton, 1986). The outstanding feature of Tables 5, 6, and 7 is the consistent reduction in Emp from the RCB ANOVA to the IB analysis and the neighbor analyses AR(1), FD, FD-EV, and SD-EV, in degrees that may vary with the number of dummy varieties (v). The neighbor analyses, especially the FD-EV and SD-EV, take the spatial autocorrelation adequately into account. Thus, they are more efficient than the IB analysis, even when there is no remaining autocorrelation in incomplete blocks of size 6 for data sets such as soybean A1, A2, and A3 (Table 4). This indicates neighbor analyses not only account for autocorrelation in errors, but also fit the spatial trend. Four of the five neighbor analyses (i.e., AR(1), FD, FD-EV, and SD-EV) provided approximately the same Emp values (Tables 5 and 6). Evidently, they removed a similar amount of structured variation from the error term. Because the difference in efficiency among these four neighbor analyses was slight, only the FD-EV was retained for discussion below. On the other hand, the SD analysis was not studied further after its poor performance with respect to validity.
The efficiency of the neighbor and IB analyses relative to the RCB ANOVA is influenced by several factors, among which is the CV from the RCB ANOVA. According to Fig. 5
, which shows the relationships between CV and NC0 and the RE for FD-EV and IB(6), relative efficiency tends to be higher for data with higher CV. This shows that precision can be improved substantially by using small blocks or neighbor analysis in case of strong heterogeneity among plots within blocks. For example, for soybean A1 with v = 60 and CV = 15.7%, Emp is equal to 19.1% for the FD-EV analysis and 24.8% for IB(6), which corresponds to RE values of 524 and 403%, respectively. On the other hand, since low CV indicates relatively weak heterogeneity within blocks, sizeable improvement of precision is not likely to arise from data with lower CV. For example, for wheat A1 with v = 25 and CV = 6.2%, RE is equal to 142 and 114% for FD-EV and IB, respectively. Furthermore, relative efficiency tends to increase with decreasing NC0that is, with increasing portion of spatially structured variation explained by the method. Moreover, the efficiency for data sets in Group A is higher than that for the corresponding data sets in Group B. This is due to the higher variation among plots and the higher portion of spatially structured variation in Group A (Fig. 1). In view of Fig. 5, the relationship between RE and NC0 is stronger than between RE and CV for FD-EV, and the relationship between RE and NC0 is stronger for FD-EV than for IB. This suggests that NC0 is more reliable than CV as an indicator of efficiency for the neighbor analyses.

View larger version (21K):
[in this window]
[in a new window]
|
Fig. 5 Relative efficiency in variety effect estimation in relation to the CV and the normalized nugget effect for (a) soybean (v = 60) and the FD-EV model, (b) wheat (v = 25 and 60) and the FD-EV model, (c) soybean (v = 60) and the IB(6) model, and (d) wheat (v = 25 and 60) and the IB(6) or IB(5) model
|
|
Block Size, Plot Shape, and Plot Size Effects
Apparently, block size affects the efficiency of IB. Based on Table 5, IB(6) is more efficient than IB(10) for most of the data sets; the exceptions are soybean A4, A5, and A8. In some cases (e.g., soybean B1, B5, and B8 and wheat A1, A2, A4, and B3), the difference in efficiency is less than 20%. Smaller blocks are thus not always more effective than larger blocks in removing systematic variation, since the efficiency of block analyses relies on the control of both the local variation within blocks and the overall variation among blocks. In the absence of a priori information about spatial variation in the field, block arrangement may be not as effective as expected. If blocks of different sizes are found to provide similar efficiency, larger blocks are to be preferred, as they allow more pairwise comparisons of treatments within a block.
Another factor that may affect efficiency is plot shape, the ratio of length to width. To address this point, we have calculated the relative efficiency of FD-EV and IB(6) for three pairs of data sets with the same plot size but different plot shapes, namely data sets 2 and 3, 4 and 5, and 6 and 7 from the western (Group A) and eastern (Group B) parts of the soybean uniformity trial. For both one-dimensional analyses, efficiency is higher for the data set with narrower plots in Group A. For example, soybean A3 presents an RE of 401% for FD-EV, whereas the same method provides an RE of 526% for soybean A2. In contrast, both types of plot shape have similar efficiency for data sets in Group B. Since there is stronger spatial structure in Group A characterized by higher CV (Table 1) and lower NC0 (Table 2), it can be concluded that when the spatial structure is strong (i.e., NC0 < 10%), the neighbor and IB analyses with narrower plots are more effective than those with wider plots in fitting trends and accounting for autocorrelation. Based on Tables 5, 6, and 7, where plot size increases from data set A1(B1) to A8(B8) for soybean and from data set A1(B1) to A4(B4) for wheat, there is no apparent relationship between plot size and RE. Finally, from Tables 1, 3, 5, 6, and 7, it does not seem necessary to use plots larger than 8 m2 for soybean and larger than 4 m2 for wheat to reduce the error variance and ensure good precision in variety effect estimation.
 |
Conclusions
|
|---|
Based on the foregoing, the following steps are useful for obtaining high precision in the analysis of field experiments, especially in trials with a large number of varieties or treatments.- Select the experiment design based on field soil variation. If the heterogeneity is relatively high (i.e., high CV in previous trials), an incomplete block design with highly rectangular plots should be used instead of a complete block design. However, blocking is not required if the spatial structure cannot be predicted or the field productivity is known to be relatively homogeneous.
- Analyze the data using classical block analysis, when applicable, and the neighbor FD-EV method, where variance components are estimated by REML and treatment effects by GLS.
- Compare the efficiency of the different methods. If the efficiency of any (complete or incomplete) block analysis is similar to that of the neighbor FD-EV, results from the block analysis will be more appropriate. Otherwise, if the efficiency of FD-EV is higher, proceed to the next step.
- Test the goodness-of-fit of the FD-EV model. If the lag-1 autocorrelation coefficient of first-difference deviates is negative and significant but the lag-2 autocorrelation coefficient is not significant, the FD-EV model is appropriate. Otherwise, results from the FD-EV analysis may not be appropriate (in particular, the Type I error rate may be exaggerated in significance testing), and we suggest that classical block analyses be used, or other models be tried. The SD-EV analysis will not be necessarily more appropriate, as REML does not always converge for this method.
Geostatistical analysis showed that there was a spatial structure in all of the 50 data sets derived from the two uniformity trials. First and second differencing were revealed to be powerful tools to fit large-scale trends and to remove small-scale autocorrelation from errors, and were more effective than incomplete blocking in these respects. In particular, geostatistical analysis of differencing deviates helped explicate the effect of differencing on spatial structure. Incomplete blocking was generally more effective in reducing the unexplained structured variation in comparison with complete blocking. The first-difference method with errors in variables was identified as empirically valid, and the second-difference model with errors in variables was also valid for most of the data sets. Three other neighbor methods (AR(1), FD and SD) underestimated the error variance, leading to inflated rejection rate in hypothesis testing of variety effects compared with the theoretical significance level. The lag-1 and lag-2 autocorrelation coefficients can be used to assess the goodness-of-fit of the difference models. Four of the neighbor methods (AR(1), FD, FD-EV, and SD-EV) were more efficient than complete and incomplete block analyses. For most data sets, the efficiency of neighbor analyses was higher than that of RCB ANOVA and IB by 200 and 30%, respectively. The relative efficiency tended to increase with decreasing normalized nugget effect and, to a lesser degree, with increasing CV.
When spatially structured variation is strong and one-dimensional, highly rectangular plots must be preferred to near-square plots in FD-EV and IB analyses. In terms of validity and efficiency, the first-difference model with errors in variables, in which variance components and treatment effects are estimated respectively by REML and GLS, is the best method. On that basis, FD-EV can be expected to become a routine method for the analysis of field experiments with a large number of varieties or treatments. The computer program implementing the FD-EV method is available from the corresponding author upon request.SAS Institute 1989; SAS Institute 1995
 |
ACKNOWLEDGMENTS
|
|---|
During her scientific visit at McGill University, the senior author was financially supported through the NSERC and FCAR Nouveaux Chercheurs operating grants of the second author and the NSERC operating grant of Dr. Diane Mather (Plant Science). The authors are grateful to three anonymous reviewers whose comments helped improve the presentation of this paper.
 |
REFERENCES
|
|---|
- Baird D., Mead R. The empirical efficiency and validity of two neighbour models. Biometrics 1991;47:1473-1487.
- Bartlett M.S. Nearest neighbour models in the analysis of field experiments (with discussion). J.R. Stat. Soc., Ser. B 1978;40:147-174.
- Besag J., Kempton R. Statistical analysis of field experiments using neighbouring plots. Biometrics 1986;42:231-251.
- Brownie C., Bowman D.T., Burton J.W. Estimating spatial variation in analysis of data from yield trials: A comparison of methods. Agron. J. 1993;85:1244-1253.[Abstract/Free Full Text]
- Brownie C., Gumpertz M.L. Validity of spatial analyses for large field trials. J. Agric. Biol. Env. Stat. 1997;2:1-23.
- Cliff A.D., Ord J.K. Spatial processes: Models and applications. London: Pion, 1981.
- Cochran W.G., Cox G.M. Experimental designs, 2nd ed New York: John Wiley & Sons, 1957.
- Cullis B.R., Gleeson A.C. The efficiency of neighbour analysis for replicated variety trials in Australia. J. Agric. Sci. (Camb.) 1989;113:223-239.
- Dutilleul P. Spatial heterogeneity and the design of ecological field experiments. Ecology 1993;74:1646-1658.
- Gleeson A.C., Cullis B.R. Residual maximum likelihood (REML) estimation of a neighbour model for field experiments. Biometrics 1987;43:277-288.
- Graybill F.A. Matrices with applications in statistics, 2nd ed Belmont, CA: Wadsworth, 1983.
- Green P.J., Jennison C., Seheult A.H. Analysis of field experiments by least squares smoothing. J.R. Stat. Soc., Ser. B 1985;47:299-315.
- Grondona M.O., Cressie N. Using spatial considerations in the analysis of experiments. Technometrics 1991;33:381-392.
- Journel A.G., Huijbregts C.J. Mining geostatistics. London: Academic Press, 1978.
- Kempton, R.A. 1985. Comparison of nearest neighbour and classical methods of analysis. p. 5152. In R.A. Kempton (ed.) [Proc.] Biometric Society Workshop on Spatial Methods in Field Experiments, Univ. of Durham, Durham, UK.
- Kempton R.A., Howes C.W. The use of neighbouring plot values in the analysis of variety trials. Appl. Stat. 1981;30:59-70.
- Papadakis, J.S. 1937. Méthode statistique pour des expériences sur champ. Bulletin de l'Institut d'Amélioration des Plantes à Salonique. Thessalonique 23.
- Patterson H.D., Thompson R. Recovery of inter-block information when block sizes are unequal. Biometrika 1971;58:545-554.[Abstract/Free Full Text]
- SAS Institute. 1989. SAS/IML user's guide. Release 6.03 ed. SAS Inst., Cary, NC.
- SAS Institute. Introduction to the MIXED procedure: Course notes. Cary, NC: SAS Inst, 1995.
- Searle S.R. Linear models. New York: Wiley, 1971.
- Tamura R.N., Nelson L.A., Naderman G.C. An investigation of the validity and usefulness of trend analysis for field plot data. Agron. J. 1988;80:712-718.[Abstract/Free Full Text]
- Vieira S.R., Hatfield J.L., Nielsen D.R., Biggar J.W. Geostatistical theory and application to variability of some agronomical properties. Hilgardia 1983;51:1-75.
- Wiebe G.A. Variation and correlation in grain yield among 1500 wheat nursery plots. J. Agric. Res. 1935;50:331-357.
- Wilkinson G.N., Eckert S.R., Hancock T.W., Mayo O. Nearest neighbour (NN) analysis of field experiments. J.R. Stat. Soc., Ser. B 1983;45:151-211.
- Williams E.R. A neighbour model for field experiments. Biometrika 1986;73:279-287.[Abstract/Free Full Text]
- Wu T.X., Gai J.Y., Ma Y.H. A study of nearest-neighbor analysis for trials with large number of lines in plant breeding. Acta Agron. Sin. 1995;21:9-16.
- Yates F. The recovery of inter-block information in variety trials arranged in three-dimensional lattices. Ann. Eugen. 1939;10:317-325.
This article has been cited by other articles:

|
 |

|
 |
 
A. N. Kravchenko, G. P. Robertson, S. S. Snap, and A. J. M. Smucker
Using Information about Spatial Variability to Improve Estimates of Total Soil Carbon
Agron. J.,
May 3, 2006;
98(3):
823 - 829.
[Abstract]
[Full Text]
[PDF]
|
 |
|