Free Standard AU & NZ Shipping For All Book Orders Over $80!
Register      Login
Crop and Pasture Science Crop and Pasture Science Society
Plant sciences, sustainable farming systems and food quality
RESEARCH ARTICLE (Open Access)

Random regression models for multi-environment, multi-time data from crop breeding selection trials

J. De Faveri https://orcid.org/0000-0001-7992-5070 A B * , A. P. Verbyla A B and G. Rebetzke https://orcid.org/0000-0001-7404-0046 C
+ Author Affiliations
- Author Affiliations

A CSIRO Data61, Atherton, Qld 4883, Australia.

B UQ QAAFI, St Lucia, Qld 4072, Australia.

C CSIRO Agriculture & Food Black Mountain, Canberra, ACT 2601, Australia.

* Correspondence to: j.defaveri@uq.edu.au

Handling Editor: Davide Cammarano

Crop & Pasture Science 74(4) 271-283 https://doi.org/10.1071/CP21732
Submitted: 4 November 2021  Accepted: 13 July 2022   Published: 31 August 2022

© 2023 The Author(s) (or their employer(s)). Published by CSIRO Publishing. This is an open access article distributed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (CC BY-NC-ND)

Abstract

Context: In order to identify best crop genotypes for recommendation to breeders, and ultimately for use in breeding, evaluation is usually conducted in field trials across a range of environments, known as multi-environment trials. Increasingly, many breeding traits are measured over time, for example with high-throughput phenotyping at different growth stages in annual crops or repeated harvests in perennial crops.

Aims: This study aims to provide an efficient, accurate approach for modelling genotype response over time and across environments, accounting for non-genetic sources of variation such as spatial and temporal correlation.

Methods: Because the aim is genotype selection, genetic effects are fitted as random effects, and so the approach is based on random regression, in which linear or non-linear models are used to model genotype responses. A method for fitting random regression is outlined in a multi-environment situation, using underlying cubic smoothing splines to model the mean trend over time. This approach is illustrated on six wheat experiments, using data on grain-filling over thermal time.

Key results: The method correlates genetic effects over time and environments, providing predicted genotype responses while incorporating spatial and temporal correlation between observations.

Conclusions: The approach provides robust genotype predictions by accounting for temporal and spatial effects simultaneously under various situations including those in which trials have different measurement times or where genotypes within trials are not measured at the same times. The approach facilitates investigation into genotype by environment interaction (G × E) both within and across environments.

Implications: The models presented have potential to increase accuracy of predictions over measurement times and trials, provide predictions at times other than those observed, and give a greater understanding of G × E interaction, hence improving genotype selection across environments for repeated-measures traits.

Keywords: crop variety selection, cubic smoothing splines, linear mixed models, MET, MEMT, multi-environment trials, random regression models, statistical genetics.


References

Apiolaza LA, Garrick DJ (2001) Analysis of longitudinal data from progeny tests: some multivariate approaches. Forestry Science 47, 129–140.

Apiolaza LA, Gilmour AR, Garrick DJ (2000) Variance modelling of longitudinal height data from a Pinus radiata progeny test. Canadian Journal of Forest Research 30, 645–654.
Variance modelling of longitudinal height data from a Pinus radiata progeny test.Crossref | GoogleScholarGoogle Scholar |

Butler DG, Cullis BR, Gilmour AR, Gogel BJ, Thompson R (2018) ‘ASReml-R reference manual. Version 4.’ (VSN International: Hemel Hempstead, UK)

Campbell M, Walia H, Morota G (2018) Utilizing random regression models for genomic prediction of a longitudinal trait derived from high-throughput phenotyping. Plant Direct 2, 1–11.
Utilizing random regression models for genomic prediction of a longitudinal trait derived from high-throughput phenotyping.Crossref | GoogleScholarGoogle Scholar |

Cullis BR, Smith AB, Coombes NE (2006) On the design of early generation variety trials with correlated data. Journal of Agricultural, Biological, and Environmental Statistics 11, 381–393.
On the design of early generation variety trials with correlated data.Crossref | GoogleScholarGoogle Scholar |

De Faveri J (2013) Spatial and temporal modelling of perennial crop variety selection trials. PhD Thesis, The University of Adelaide, SA, Australia. Available at http://digital.library.adelaide.edu.au/dspace/handle/2440/83114

De Faveri JD, Verbyla AP, Pitchford WS, Venkatanagappa S, Cullis BR (2015) Statistical methods for analysis of multi-harvest data from perennial pasture variety selection trials. Crop & Pasture Science 66, 947–962.
Statistical methods for analysis of multi-harvest data from perennial pasture variety selection trials.Crossref | GoogleScholarGoogle Scholar |

De Faveri J, Verbyla AP, Cullis BR, Pitchford WS, Thompson R (2017) Residual variance–covariance modelling in analysis of multivariate data from variety selection trials. Journal of Agricultural, Biological and Environmental Statistics 22, 1–22.
Residual variance–covariance modelling in analysis of multivariate data from variety selection trials.Crossref | GoogleScholarGoogle Scholar |

Deery DM, Rebetzke GJ, Jimenez-Berni JA, James RA, Condon AG, Bovill WD, Hutchinson P, Scarrow J, Davy R, Furbank RT (2016) Methodology for high-throughput field phenotyping of canopy temperature using airborne thermography. Frontiers in Plant Science 7, 1808
Methodology for high-throughput field phenotyping of canopy temperature using airborne thermography.Crossref | GoogleScholarGoogle Scholar |

DeGroot BJ, Keown JF, Kachman SD, Van Vleck D (2003) Cubic splines for estimating lactation curves and genetic parameters of first lactation Holstein cows treated with bovine somatotropin. Conference on Applied Statistics in Agriculture 1, 150–163.
Cubic splines for estimating lactation curves and genetic parameters of first lactation Holstein cows treated with bovine somatotropin.Crossref | GoogleScholarGoogle Scholar |

Diggle P (1988) An approach to the analysis of repeated measurements. Biometrics 44, 959–971.

Evans JC, Roberts EA (1979) Analysis of sequential observations with applications to experiments on grazing animals and perennial plants. Biometrics 35, 687–693.
Analysis of sequential observations with applications to experiments on grazing animals and perennial plants.Crossref | GoogleScholarGoogle Scholar |

Forknall CR, Simpfendorfer S, Kelly AM (2019) Using yield response curves to measure variation in the tolerance and resistance of wheat cultivars to fusarium crown rot. Phytopathology 109, 932–941.
Using yield response curves to measure variation in the tolerance and resistance of wheat cultivars to fusarium crown rot.Crossref | GoogleScholarGoogle Scholar |

Gabriel KR (1962) Ante-dependence analysis of an ordered set of variables. The Annals of Mathematical Statistics 33, 201–212.
Ante-dependence analysis of an ordered set of variables.Crossref | GoogleScholarGoogle Scholar |

Gilmour AR, Cullis BR, Verbyla AP (1997) Accounting for natural and extraneous variation in the analysis of field experiments. Journal of Agricultural, Biological, and Environmental Statistics 2, 269–293.
Accounting for natural and extraneous variation in the analysis of field experiments.Crossref | GoogleScholarGoogle Scholar |

Green PJ, Silverman BW (1994) ‘Nonparametric regression and generalised linear models.’ (Chapman and Hall: London, UK)

Huisman AE, Veerkamp RF, van Arendonk JAM (2002) Genetic parameters for various random regression models to describe the weight data of pigs. Journal of Animal Science 80, 575–582.
Genetic parameters for various random regression models to describe the weight data of pigs.Crossref | GoogleScholarGoogle Scholar |

Kelly AM, Smith AB, Eccleston JA, Cullis BR (2007) The accuracy of varietal selection using factor analytic models for multi-environment plant breeding trials. Crop Science 47, 1063–1070.
The accuracy of varietal selection using factor analytic models for multi-environment plant breeding trials.Crossref | GoogleScholarGoogle Scholar |

Kenward M, Roger J (1997) The precision of fixed effects estimates from restricted maximum likelihood. Biometrics 53, 983–997.

Laird NM, Ware JH (1982) Random-effects models for longitudinal data. Biometrics 38, 963–974.
Random-effects models for longitudinal data.Crossref | GoogleScholarGoogle Scholar |

Meyer K (1998) Estimating covariance functions for longitudinal data using a random regression model. Genetics Selection Evolution 30, 221
Estimating covariance functions for longitudinal data using a random regression model.Crossref | GoogleScholarGoogle Scholar |

Meyer K (2005a) Random regression analyses using B-splines to model growth of Australian Angus cattle. Genetics Selection Evolution 37, 473
Random regression analyses using B-splines to model growth of Australian Angus cattle.Crossref | GoogleScholarGoogle Scholar |

Meyer K (2005b) Advances in methodology for random regression analyses. Australian Journal of Experimental Agriculture 45, 847–858.
Advances in methodology for random regression analyses.Crossref | GoogleScholarGoogle Scholar |

Oakey H, Verbyla A, Pitchford W, Cullis B, Kuchel H (2006) Joint modeling of additive and non-additive genetic line effects in single field trials. Theoretical and Applied Genetics 113, 809–819.
Joint modeling of additive and non-additive genetic line effects in single field trials.Crossref | GoogleScholarGoogle Scholar |

Patterson HD, Thompson R (1971) Recovery of interblock information when block sizes are unequal. Biometrika 31, 100–109.

Rebetzke GJ, Chenu K, Biddulph B, Moeller C, Deery DM, Rattey AR, Bennett D, Barrett-Lennard EG, Mayer JE (2013) A multisite managed environment facility for targeted trait and germplasm phenotyping. Functional Plant Biology 40, 1–13.
A multisite managed environment facility for targeted trait and germplasm phenotyping.Crossref | GoogleScholarGoogle Scholar |

Rebetzke GJ, Jimenez-Berni JA, Bovill WD, Deery DM, James RA (2016) High-throughput phenotyping technologies allow accurate selection of stay-green. Journal of Experimental Botany 67, 4919–4924.
High-throughput phenotyping technologies allow accurate selection of stay-green.Crossref | GoogleScholarGoogle Scholar |

Rebetzke GJ, Richards RA, Holland JB (2017) Population extremes for assessing trait value and correlated response of genetically complex traits. Field Crops Research 201, 122–132.
Population extremes for assessing trait value and correlated response of genetically complex traits.Crossref | GoogleScholarGoogle Scholar |

Robinson GK (1991) That BLUP is a good thing: the estimation of random effects. Statistical Science 6, 15–51.

Schaeffer LR (2004) Application of random regression models in animal breeding. Livestock Production Science 86, 35–45.
Application of random regression models in animal breeding.Crossref | GoogleScholarGoogle Scholar |

Smith A, Cullis B, Thompson R (2001) Analysing variety by environment data using multiplicative mixed models and adjustments for spatial field trend. Biometrics 57, 1138–1147.
Analysing variety by environment data using multiplicative mixed models and adjustments for spatial field trend.Crossref | GoogleScholarGoogle Scholar |

Smith AB, Cullis BR, Thompson R (2005) The analysis of crop cultivar breeding and evaluation trials: an overview of current mixed model approaches. The Journal of Agricultural Science 143, 449–462.
The analysis of crop cultivar breeding and evaluation trials: an overview of current mixed model approaches.Crossref | GoogleScholarGoogle Scholar |

Smith AB, Stringer JK, Wei X, Cullis BR (2007) Varietal selection for perennial crops where data relate to multiple harvests from a series of field trials. Euphytica 157, 253–266.
Varietal selection for perennial crops where data relate to multiple harvests from a series of field trials.Crossref | GoogleScholarGoogle Scholar |

Smith AB, Ganesalingam A, Kuchel H, Cullis BR (2015) Factor analytic mixed models for the provision of grower information from national crop variety testing programs. Theoretical and Applied Genetics 128, 55–72.
Factor analytic mixed models for the provision of grower information from national crop variety testing programs.Crossref | GoogleScholarGoogle Scholar |

Stefanova KT, Smith AB, Cullis BR (2009) Enhanced diagnostics for the spatial analysis of field trials. Journal of Agricultural, Biological, and Environmental Statistics 14, 392
Enhanced diagnostics for the spatial analysis of field trials.Crossref | GoogleScholarGoogle Scholar |

Verbyla AP (2019) A note on model selection using information criteria for general linear models estimated using REML. Australian & New Zealand Journal of Statistics 61, 39–50.
A note on model selection using information criteria for general linear models estimated using REML.Crossref | GoogleScholarGoogle Scholar |

Verbyla AP, Cullis BR, Kenward MG, Welham SJ (1999) The analysis of designed experiments and longitudinal data by using smoothing splines. Journal of the Royal Statistical Society: Series C (Applied Statistics) 48, 269–311.
The analysis of designed experiments and longitudinal data by using smoothing splines.Crossref | GoogleScholarGoogle Scholar |

Verbyla AP, De Faveri J, Deery DM, Rebetzke GJ (2021) Modelling temporal genetic and spatio-temporal residual effects for high-throughput phenotyping data. Australian & New Zealand Journal of Statistics 63, 284–308.
Modelling temporal genetic and spatio-temporal residual effects for high-throughput phenotyping data.Crossref | GoogleScholarGoogle Scholar |

Wang C, Andersson B, Waldmann P (2009) Genetic analysis of longitudinal height data using random regression. Canadian Journal of Forest Research 39, 1939–1948.
Genetic analysis of longitudinal height data using random regression.Crossref | GoogleScholarGoogle Scholar |

Welham SJ (2008) Smoothing spline models for longitudinal data. In ‘Longitudinal data analysis’. (Eds G Fitzmaurice, M Davidian, G Verbeke, G Molenberghs) pp. 253–289. (Chapman and Hall: London, UK)

White IMS, Thompson R, Brotherstone S (1999) Genetic and environmental smoothing of lactation curves with cubic splines. Journal of Dairy Science 82, 632–638.
Genetic and environmental smoothing of lactation curves with cubic splines.Crossref | GoogleScholarGoogle Scholar |