Advances in methodology for random regression analyses
K. MeyerAnimal Genetics and Breeding Unit (a joint venture between NSW Department of Primary Industries and the University of New England), University of New England, Armidale, NSW 2351, Australia. Email: kmeyer@didgeridoo.une.edu.au
Australian Journal of Experimental Agriculture 45(8) 847-858 https://doi.org/10.1071/EA05040
Submitted: 14 February 2005 Accepted: 29 April 2005 Published: 26 August 2005
Abstract
Random regression analyses have become standard methodology for the analysis of traits with repeated records that are thought of as representing points on a trajectory. Modelling curves as a regression on functions of a continuous covariable, such as time, for each individual, random regression models are readily implemented in standard, linear mixed model analyses. Early applications have made extensive use of regressions on orthogonal polynomials. Recently, spline functions have been considered as an alternative. The use of a particular type of spline function, the so-called B-splines, as basis functions for random regression analyses is outlined, emphasising the local influence of individual observations and low degree of polynomials employed. While such analyses are likely to involve more regression coefficients than polynomial models, it is demonstrated that reduced rank estimation via the leading principal components is feasible and likely to yield more parsimonious models and more stable estimates than full rank analyses. The combined application of B-spline basis function and reduced rank estimation is illustrated for a small set of data for beef cattle.
Additional keywords: B-spline functions, reduced rank estimation, principal components.
Burnham KP, Anderson DR
(2004) Multimodel inference: Understanding {AIC} and {BIC} in model selection. Sociological Methods and Research 33, 261–304.
| Crossref | GoogleScholarGoogle Scholar |
(verified 3 August 2005).
Durbán M,
Harezlak J,
Wand MP, Carroll RJ
(2005) Simple fitting of subject-specific curves for longitudinal data. Statistics in Medicine 24, 1153–1167.
| Crossref | GoogleScholarGoogle Scholar | PubMed |
(verified 3 August 2005).
Foulley JL,
Jaffrézic F, Robert-Granié C
(2000) EM-REML estimation of covariance parameters in Gaussian mixed models for longitudinal data analysis. Genetics Selection Evolution 32, 129–141.
| Crossref | GoogleScholarGoogle Scholar |
Gilmour AR,
Thompson R, Cullis BR
(1995) Average Information REML, an efficient algorithm for variance parameter estimation in linear mixed models. Biometrics 51, 1440–1450.
Jaffrézic F, Pletcher SD
(2000) Statistical models for estimating the genetic basis of repeated measures and other function-valued traits. Genetics 156, 913–922.
| PubMed |
James GM,
Hastie TJ, Sugar CA
(2000) Principal component models for sparse functional data. Biometrika 87, 587–602.
| Crossref | GoogleScholarGoogle Scholar |
Jennrich RI, Schluchter MD
(1986) Unbalanced repeated-measures models with structured covariance matrices. Biometrics 42, 805–820.
| PubMed |
Kirkpatrick M, Heckman N
(1989) A quantitative genetic model for growth, shape, reaction norms, and other infinite-dimensional characters. Journal of Mathematical Biology 27, 429–450.
| Crossref | GoogleScholarGoogle Scholar | PubMed |
Kirkpatrick M,
Lofsvold D, Bulmer M
(1990) Analysis of the inheritance, selection and evolution of growth trajectories. Genetics 124, 979–993.
| PubMed |
Kirkpatrick M, Meyer K
(2004) Simplified analysis of complex phenotypes: direct estimation of genetic principal components. Genetics 168, 2295–2306.
| Crossref | GoogleScholarGoogle Scholar | PubMed |
Meyer K
(1998) Estimating covariance functions for longitudinal data using a random regression model. Genetics, Selection, Evolution 30, 221–240.
Meyer K
(2000) Random regressions to model phenotypic variation in monthly weights of Australian beef cows. Livestock Production Science 65, 19–38.
| Crossref | GoogleScholarGoogle Scholar |
Meyer K
(2001) Estimating genetic covariance functions assuming a parametric correlation structure for environmental effects. Genetics, Selection, Evolution 33, 557–585.
| Crossref | GoogleScholarGoogle Scholar |
Meyer K
(2005a) Estimates of covariance functions for growth of Angus cattle from random regression analyses fitting B-spline functions. Proceedings of the Association for Advancement of Animal Breeding Genetics 16, 52–55.
Meyer K
(2005b) Random regression analyses using B-splines to model growth of Australian Angus cattle. Genetics, Selection, Evolution 37, 473–500.
| Crossref |
Meyer K
(2005c) Reduced rank estimates of the genetic covariance matrix for live ultra-sound scan traits. Proceedings of the Association for Advancement of Animal Breeding Genetics 16, 56–59.
Meyer K
(2005d) Sampling behaviour of reduced rank estimates of genetic covariance functions. Proceedings of the Association for Advancement of Animal Breeding Genetics 16, 286–289.
Meyer K,
Carrick MJ, Donnelly BJP
(1993) Genetic parameters for growth traits of Australian beef cattle from a multi-breed selection experiment. Journal of Animal Science 71, 2614–2622.
| PubMed |
Meyer K, Kirkpatrick M
(2005a) Restricted maximum likelihood estimation of genetic principal components and smoothed covariance matrices. Genetics, Selection, Evolution. 37, 1–30.
| Crossref | GoogleScholarGoogle Scholar |
Meyer K, Kirkpatrick M
(2005b) Up hill, down dale: quantitative genetics of curvaceous traits. Philosophical Transactions of the Royal Society B 360, 1443–1455.
| Crossref | GoogleScholarGoogle Scholar |
Rice JA, Wu CO
(2001) Nonparametric mixed effects models for unequally sampled noisy curves. Biometrics 57, 253–259.
| Crossref | GoogleScholarGoogle Scholar | PubMed |
Ruppert D, Carroll RJ
(2000) Spatially-adaptive penalties for spline fitting. Australian and New Zealand Journal of Statistics 42, 205–223.
| Crossref | GoogleScholarGoogle Scholar |
Shi M,
Weiss RE, Taylor JMG
(1996) An analysis of pediatric CD4 counts for acquired immune deficiency syndrome using flexible random curves. Applied Statistics 45, 151–164.
Smith AB,
Cullis BR, Thompson R
(2001) Analysing variety by environment data using multiplicative mixed models and adjustments for spatial field trends. Biometrics 57, 1138–1147.
| Crossref | GoogleScholarGoogle Scholar | PubMed |
Thompson R,
Cullis BR,
Smith AB, Gilmour AR
(2003) A sparse implementation of the Average Information algorithm for factor analytic and reduced rank variance models. Australian and New Zealand Journal of Statistics 45, 445–459.
| Crossref | GoogleScholarGoogle Scholar |
Torres RAA, Quaas RL
(2001) Determination of covariance functions for lactation traits on dairy cattle using random-coefficient regressions on B-splines. Journal of Animal Science Abstract. 79(Suppl.1), 112.
Verbyla AR,
Cullis BR,
Kenward MG, Welham SJ
(1999) The analysis of designed experiments and longitudinal data by using smoothing splines (with discussion). Applied Statistics 48, 269–311.
White IMS,
Thompson R, Brotherstone S
(1999) Genetic and environmental smoothing of lactation curves with cubic splines. Journal of Dairy Science 82, 632–638.
| PubMed |
Wolfinger RD
(1993) Covariance structure selection in general mixed models. Communications in Statistics — Simulation and Computing 22, 1079–1106.