Statistical methods for analysis of multi-harvest data from perennial pasture variety selection trials
Joanne De Faveri A F , Arūnas P. Verbyla B , Wayne S. Pitchford C , Shoba Venkatanagappa D and Brian R. Cullis EA Department of Agriculture and Fisheries, PO Box 1054, Mareeba, QLD, 4880, Australia.
B Data Analytics, CSIRO Digital Productivity Flagship and School of Agriculture, Food and Wine, The University of Adelaide,Atherton, QLD, 4883, Australia.
C School of Animal and Veterinary Sciences, The University of Adelaide, Roseworthy Campus, SA, 5371, Australia.
D NSW Department of Primary Industries, Tamworth, NSW, 2340, Australia. Current address: Enza Zaden Australia, Narromine, NSW, 2821, Australia.
E National Institute for Applied Statistics Research Australia (NIASRA), School of Mathematics and Applied Statistics, University of Wollongong, NSW, 2522, Australia.
F Corresponding author. Email: Joanne.DeFaveri@daf.qld.gov.au
Crop and Pasture Science 66(9) 947-962 https://doi.org/10.1071/CP14312
Submitted: 23 November 2014 Accepted: 28 April 2015 Published: 4 September 2015
Abstract
Variety selection in perennial pasture crops involves identifying best varieties from data collected from multiple harvest times in field trials. For accurate selection, the statistical methods for analysing such data need to account for the spatial and temporal correlation typically present. This paper provides an approach for analysing multi-harvest data from variety selection trials in which there may be a large number of harvest times. Methods are presented for modelling the variety by harvest effects while accounting for the spatial and temporal correlation between observations. These methods provide an improvement in model fit compared to separate analyses for each harvest, and provide insight into variety by harvest interactions. The approach is illustrated using two traits from a lucerne variety selection trial. The proposed method provides variety predictions allowing for the natural sources of variation and correlation in multi-harvest data.
References
Apiolaza L, Gilmour A, Garrick D (2000) Variance modelling of longitudinal height data from a Pinus radiata progeny test. Canadian Journal of Forest Research 30, 645–654.Bartlett MS (1978) Nearest neighbour models in the analysis of field experiments (with discussion). Journal of the Royal Statistical Society, Series B 40, 147–174.
Beeck CP, Cowling WA, Smith AB, Cullis BR (2010) Analysis of yield and oil from a series of canola breeding trials, part 1: Fitting factor analytic models with pedigree information. Genome 53, 992–1001.
Bjornsson H (1978) Analysis of a series of long-term grassland experiments with autocorrelated errors. Biometrics 34, 645–651.
Bouton J (2012) Breeding lucerne for persistence. Crop & Pasture Science 63, 95–106.
Brien CJ, Bailey RA (2006) Multiple randomizations (with discussion). Journal of the Royal Statistical Society, Series B 68, 571–609.
Butler DG, Cullis BR, Gilmour AR, Gogel BJ (2009) ASReml-R, Reference Manual. Release 3. Queensland Department of Primary Industries and Fisheries, Toowoomba.
Casler M (1999) Spatial variation affects precision of perennial cool-season forage grass trials. Agronomy Journal 91, 75–81.
Cullis B, Gogel B, Verbyla A, Thompson R (1998) Spatial analysis of multi-environment early generation trials. Biometrics 54, 1–18.
Cullis BR, Gleeson AC (1991) Spatial analysis of field experiments – an extension to two dimensions. Biometrics 47, 1449–1460.
Cullis BR, Jefferson P, Thompson R, Smith AB (2014) Factor analytic and reduced animal models for the investigation of additive genotype-by-environment interaction in outcrossing plant species with application to a Pinus radiata breeding programme. Theoretical and Applied Genetics 127, 2193–2210.
Cullis BR, Smith AB, Beeck CP, Cowling WA (2010) Analysis of yield and oil from a series of canola breeding trials. Part 11: Exploring variety by environment interaction using factor analysis. Genome 53, 1002–1016.
De Faveri J (2013) Spatial and temporal modelling of perennial crop variety selection trials. PhD Thesis, The University of Adelaide, SA, Australia. http://digital.library.adelaide.edu.au/dspace/handle/2440/83114.
Diggle P (1988) An approach to the analysis of repeated measurements. Biometrics 44, 959–971.
Dutkowski G, Costa e, Silva J, Gilmour A, Lopez G (2002) Spatial analysis methods for forest genetic trials. Canadian Journal of Forest Research 32, 2201–2214.
Evans J, Roberts E (1979) Analysis of sequential observations with applications to experiments on grazing animals and perennial plants. Biometrics 35, 687–693.
Gabriel KR (1962) Ante-dependence analysis of an ordered set of variables. The Annals of Mathematical Statistics 33, 201–212.
Gilmour AR, Cullis BR, Verbyla AP (1997) Accounting for natural and extraneous variation in the analysis of field experiments. Journal of Agricultural, Biological, and Environmental Statistics 2, 269–273.
Gleeson AC, Cullis BR (1987) Residual maximum likelihood (REML) estimation of a neighbour model for field experiments. Biometrics 43, 277–288.
Green PJ, Silverman BW (1994) ‘Nonparametric regression and generalised linear models.’ (Chapman and Hall: London)
Hayward M, Delacey I, Tyler B, Drake D (1982) The application of pattern analysis for the recognition of adaptation in a collection of Lolium multiflorum populations. Euphytica 31, 383–396.
Kaufman L, Rousseeuw PJ (1990) ‘Finding groups in data: An introduction to cluster analysis.’ (Wiley: New York).
Kelly A, Smith AB, Eccleston J, Cullis BR (2007) The accuracy of varietal selection using factor analytic models for multi-environment plant breeding trials. Crop Science 47, 1063–1070.
Laird N, Ware J (1982) Random-effects models for longitudinal data. Biometrics 38, 963–974.
Lodge G, Gleeson A (1984) A comparison of methods of estimating lucerne population for monitoring persistence. Australian Journal of Experimental Agriculture and Animal Husbandry 24, 174–177.
Meyer K (1998) Estimating covariance functions for longitudinal data using a random regression model. Genetics Selection Evolution 30, 221–240.
Meyer K, Kirkpatrick M (2005) Up hill, down dale: quantitative genetics of curvacoeus traits. Philosophical Transactions of The Royal Society B 360, 1443–1455.
Patterson HD, Thompson R (1971) Recovery of interblock information when block sizes are unequal. Biometrika 31, 100–109.
Piepho HP, Eckl T (2014) Analysis of series of variety trials with perennial crops. Grass and Forage Science 69, 431–440.
R Development Core Team (2012) ‘R: A language and environment for statistical computing.’ (R Foundation for Statistical Computing: Vienna, Austria)
Resende MDV, Thompson R, Welham S (2006) Multivariate spatial statistical analysis of longitudinal data in perennial crops. Brazilian Journal of Mathematics and Statistics 24, 147–169.
Robinson GK (1991) That BLUP is a good thing: The estimation of random effects. Statistical Science 6, 15–51.
Schaeffer L (2004) Application of random regression models in animal breeding. Livestock Production Science 86, 35–45.
Smith AB, Cullis BR, Thompson R (2001) Analysing variety by environment data using multiplicative mixed models and adjustments for spatial field trend. Biometrics 57, 1138–1147.
Smith AB, Stringer JK, Wei X, Cullis BR (2007) Varietal selection for perennial crops where data relate to multiple harvests from a series of field trials. Euphytica 157, 253–266.
Smith K, Casler M (2004) Spatial analysis of forage grass trials across locations, years, and harvests. Crop Science 44, 56–62.
Smith K, Kearney G (2002) Improving the power of pasture cultivar trials to discriminate cultivars on the basis of differences in herbage yield. Australian Journal of Agricultural Research 53, 191–199.
Smith KF, Spangenberg G (2014) Forage breeding for changing environments and production systems: an overview. Crop & Pasture Science 65, i–ii.
Stefanova KT, Smith AB, Cullis BR (2009) Enhanced diagnostics for the spatial analysis of field trials. Journal of Agricultural, Biological, and Environmental Statistics 14, 392–410.
Stram DO, Lee JW (1994) Variance components testing in the longitudinal mixed effects model. Biometrics 50, 1171–1177.
Stringer JK, Cullis BR (2002) Application of spatial analysis techniques to adjust for fertility trends and identify interplot competition in early stage sugarcane selection trials. Australian Journal of Agricultural Research 53, 911–918.
Verbyla AP, Cullis BR, Kenward MG, Welham SJ (1999) The analysis of designed experiments and longitudinal data by using smoothing splines (with discussion). Applied Statistics 48, 269–312.