Free Standard AU & NZ Shipping For All Book Orders Over $80!
Register      Login
Animal Production Science Animal Production Science Society
Food, fibre and pharmaceuticals from animals
RESEARCH ARTICLE (Open Access)

The impact of QTL sharing and properties on multi-breed GWAS in cattle: a simulation study

Irene van den Berg https://orcid.org/0000-0002-9292-8636 A * and Iona M. MacLeod A B
+ Author Affiliations
- Author Affiliations

A Agriculture Victoria, AgriBio, Centre for AgriBioscience, 5 Ring Road, Bundoora, Vic. 3083, Australia.

B School of Applied Systems Biology, La Trobe University, 5 Ring Road, Bundoora, Vic. 3083, Australia.


Handling Editor: Sue Hatcher

Animal Production Science 63(11) 996-1007 https://doi.org/10.1071/AN22460
Submitted: 14 December 2022  Accepted: 13 March 2023   Published: 6 April 2023

© 2023 The Author(s) (or their employer(s)). Published by CSIRO Publishing. This is an open access article distributed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (CC BY-NC-ND)

Abstract

Context: Genome-wide association studies (GWAS) and meta-analyses can be used to detect variants that affect quantitative traits. Multi-breed GWAS may lead to increased power and precision compared with within-breed GWAS. However, not all causal variants segregate in all breeds, and variants that segregate in multiple breeds may have different allele frequencies in different breeds. It is not known how differences in minor allele frequency (MAF) affect multi-breed GWAS and meta-analyses.

Aims: Our aim was to study the impact of differences in MAF at causal variants on mapping power and precision.

Methods: We used real imputed sequence data to simulate quantitative traits in three dairy cattle breeds. Causal variants (QTN) were simulated according to the following three scenarios: variants with a similar MAF in all breeds, variants with a lower MAF in one breed than the other, and variants that each only segregated in one of the breeds. We analysed the simulated quantitative traits with three methods to compare mapping power and precision: within-breed GWAS, multi-breed GWAS and meta-analysis.

Key results: Our results indicated that the multi-breed analyses (multi-breed GWAS or meta-analysis) detected similar or more QTN than did within-breed GWAS, with improved mapping precision in most scenarios. However, when MAF differed between breeds, or variants were breed specific, the advantage of the multi-breed analyses over within breed GWAS decreased. Regardless of the type of QTN (similar MAF in all breeds, different MAF in different breeds, or only segregating in one breed), multi-breed GWAS and meta-analyses performed similar or better than did within-breed GWAS, demonstrating the benefits of multi-breed GWAS. We did not find large differences between the results obtained with the meta-analysis and multi-breed GWAS, confirming that a meta-analysis can be a suitable approximation of a multi-breed GWAS.

Conclusions: Our results showed that multi-breed GWAS and meta-analysis generally detect more QTN with improved precision than does within-breed GWAS, and that even with differences in MAF, multi-breed analyses did not perform worse than within-breed GWAS.

Implications: Our study confirmed the benefits of multi-breed GWAS and meta-analysis.

Keywords: allele frequency, dairy cattle, GWAS, meta-analysis, multi-breed, QTL detection, quantitative traits, within breed.


References

Bellinge RHS, Liberles DA, Iaschi SPA, O’brien PA, Tay GK (2005) Myostatin and its implications on animal breeding: a review. Animal Genetics 36, 1–6.
Myostatin and its implications on animal breeding: a review.Crossref | GoogleScholarGoogle Scholar |

Bouwman AC, Veerkamp RF (2014) Consequences of splitting whole-genome sequencing effort over multiple breeds on imputation accuracy. BMC Genetics 15, 105
Consequences of splitting whole-genome sequencing effort over multiple breeds on imputation accuracy.Crossref | GoogleScholarGoogle Scholar |

Bouwman AC, Daetwyler HD, Chamberlain AJ, Ponce CH, Sargolzaei M, Schenkel FS, Sahana G, Govignon-Gion A, Boitard S, Dolezal M, Pausch H, Brøndum RF, Bowman PJ, Thomsen B, Guldbrandtsen B, Lund MS, Servin B, Garrick DJ, Reecy J, Vilkki J, Bagnato A, Wang M, Hoff JL, Schnabel RD, Taylor JF, Vinkhuyzen AAE, Panitz F, Bendixen C, Holm L-E, Gredler B, Hozé C, Boussaha M, Sanchez M-P, Rocha D, Capitan A, Tribout T, Barbat A, Croiseau P, Drögemüller C, Jagannathan V, Vander Jagt C, Crowley JJ, Bieber A, Purfield DC, Berry DP, Emmerling R, Götz K-U, Frischknecht M, Russ I, Sölkner J, Van Tassell CP, Fries R, Stothard P, Veerkamp RF, Boichard D, Goddard ME, Hayes BJ (2018) Meta-analysis of genome-wide association studies for cattle stature identifies common genes that regulate body size in mammals. Nature Genetics 50, 362–367.
Meta-analysis of genome-wide association studies for cattle stature identifies common genes that regulate body size in mammals.Crossref | GoogleScholarGoogle Scholar |

Brøndum RF, Guldbrandtsen B, Sahana G, Lund MS, Su G (2014) Strategies for imputation to whole genome sequence using a single or multi-breed reference population in cattle. BMC Genomics 15, 728
Strategies for imputation to whole genome sequence using a single or multi-breed reference population in cattle.Crossref | GoogleScholarGoogle Scholar |

Brøndum RF, Su G, Janss L, Sahana G, Guldbrandtsen B, Boichard D, Lund MS (2015) Quantitative trait loci markers derived from whole genome sequence data increases the reliability of genomic prediction. Journal of Dairy Science 98, 4107–4116.
Quantitative trait loci markers derived from whole genome sequence data increases the reliability of genomic prediction.Crossref | GoogleScholarGoogle Scholar |

Daetwyler HD, Capitan A, Pausch H, Stothard P, van Binsbergen R, Brøndum RF, Liao X, Djari A, Rodriguez SC, Grohs C, Esquerré D, Bouchez O, Rossignol M-N, Klopp C, Rocha D, Fritz S, Eggen A, Bowman PJ, Coote D, Chamberlain AJ, Anderson C, VanTassell CP, Hulsegge I, Goddard ME, Guldbrandtsen B, Lund MS, Veerkamp RF, Boichard DA, Fries R, Hayes BJ (2014) Whole-genome sequencing of 234 bulls facilitates mapping of monogenic and complex traits in cattle. Nature Genetics 46, 858–865.
Whole-genome sequencing of 234 bulls facilitates mapping of monogenic and complex traits in cattle.Crossref | GoogleScholarGoogle Scholar |

Das S, Forer L, Schönherr S, Sidore C, Locke AE, Kwong A, Vrieze SI, Chew EY, Levy S, McGue M, Schlessinger D, Stambolian D, Loh P-R, Iacono WG, Swaroop A, Scott LJ, Cucca F, Kronenberg F, Boehnke M, Abecasis Gçalo R, Fuchsberger C (2016) Next-generation genotype imputation service and methods. Nature Genetics 48, 1284–1287.
Next-generation genotype imputation service and methods.Crossref | GoogleScholarGoogle Scholar |

de Roos APW, Hayes BJ, Spelman RJ, Goddard ME (2008) Linkage disequilibrium and persistence of phase in Holstein–Friesian, Jersey and Angus cattle. Genetics 179, 1503–1512.
Linkage disequilibrium and persistence of phase in Holstein–Friesian, Jersey and Angus cattle.Crossref | GoogleScholarGoogle Scholar |

Gautier M, Capitan A, Fritz S, Eggen A, Boichard D, Druet T (2007) Characterization of the DGAT1 K232A and variable number of tandem repeat polymorphisms in French Dairy Cattle. Journal of Dairy Science 90, 2980–2988.
Characterization of the DGAT1 K232A and variable number of tandem repeat polymorphisms in French Dairy Cattle.Crossref | GoogleScholarGoogle Scholar |

Hayes BJ, Daetwyler HD (2019) 1000 bull genomes project to map simple and complex genetic traits in cattle: applications and outcomes. Annual Review of Animal Biosciences 7, 89–102.
1000 bull genomes project to map simple and complex genetic traits in cattle: applications and outcomes.Crossref | GoogleScholarGoogle Scholar |

Jiang J, Ma L, Prakapenka D, VanRaden PM, Cole JB, Da Y (2019) A large-scale genome-wide association study in U.S. Holstein cattle. Frontiers in Genetics 10, 412
A large-scale genome-wide association study in U.S. Holstein cattle.Crossref | GoogleScholarGoogle Scholar |

Kemper KE, Hayes BJ, Daetwyler HD, Goddard ME (2015) How old are quantitative trait loci and how widely do they segregate? Journal of Animal Breeding and Genetics 132, 121–134.
How old are quantitative trait loci and how widely do they segregate?Crossref | GoogleScholarGoogle Scholar |

Loh P-R, Palamara PF, Price AL (2016) Fast and accurate long-range phasing in a UK Biobank cohort. Nature Genetics 48, 811–816.
Fast and accurate long-range phasing in a UK Biobank cohort.Crossref | GoogleScholarGoogle Scholar |

Marete AG, Guldbrandtsen B, Lund MS, Fritz S, Sahana G, Boichard D (2018) A meta-analysis including pre-selected sequence variants associated with seven traits in three French dairy cattle populations. Frontiers in Genetics 9, 522
A meta-analysis including pre-selected sequence variants associated with seven traits in three French dairy cattle populations.Crossref | GoogleScholarGoogle Scholar |

Pausch H, MacLeod IM, Fries R, Emmerling R, Bowman PJ, Daetwyler HD, Goddard ME (2017) Evaluation of the accuracy of imputed sequence variant genotypes and their utility for causal variant detection in cattle. Genetics Selection Evolution 49, 24
Evaluation of the accuracy of imputed sequence variant genotypes and their utility for causal variant detection in cattle.Crossref | GoogleScholarGoogle Scholar |

Raven L-A, Cocks BG, Hayes BJ (2014) Multibreed genome wide association can improve precision of mapping causative variants underlying milk production in dairy cattle. BMC Genomics 15, 62
Multibreed genome wide association can improve precision of mapping causative variants underlying milk production in dairy cattle.Crossref | GoogleScholarGoogle Scholar |

Raymond B, Bouwman AC, Schrooten C, Houwing-Duistermaat J, Veerkamp RF (2018a) Utility of whole-genome sequence data for across-breed genomic prediction. Genetics Selection Evolution 50, 27
Utility of whole-genome sequence data for across-breed genomic prediction.Crossref | GoogleScholarGoogle Scholar |

Raymond B, Bouwman AC, Wientjes YCJ, Schrooten C, Houwing-Duistermaat J, Veerkamp RF (2018b) Genomic prediction for numerically small breeds, using models with pre-selected and differentially weighted markers. Genetics Selection Evolution 50, 49
Genomic prediction for numerically small breeds, using models with pre-selected and differentially weighted markers.Crossref | GoogleScholarGoogle Scholar |

Sargolzaei M, Chesnais JP, Schenkel FS (2014) A new approach for efficient genotype imputation using information from relatives. BMC Genomics 15, 478
A new approach for efficient genotype imputation using information from relatives.Crossref | GoogleScholarGoogle Scholar |

Teissier M, Sanchez MP, Boussaha M, Barbat A, Hoze C, Robert-Granie C, Croiseau P (2018) Use of meta-analyses and joint analyses to select variants in whole genome sequences for genomic evaluation: an application in milk production of French dairy cattle breeds. Journal of Dairy Science 101, 3126–3139.
Use of meta-analyses and joint analyses to select variants in whole genome sequences for genomic evaluation: an application in milk production of French dairy cattle breeds.Crossref | GoogleScholarGoogle Scholar |

Van den Berg I, Boichard D, Lund MS (2016a) Comparing power and precision of within-breed and multibreed genome-wide association studies of production traits using whole-genome sequence data for 5 French and Danish dairy cattle breeds. Journal of Dairy Science 99, 8932–8945.
Comparing power and precision of within-breed and multibreed genome-wide association studies of production traits using whole-genome sequence data for 5 French and Danish dairy cattle breeds.Crossref | GoogleScholarGoogle Scholar |

Van den Berg I, Boichard D, Lund MS (2016b) Sequence variants selected from a multi-breed GWAS can improve the reliability of genomic predictions in dairy cattle. Genetics Selection Evolution 48, 83
Sequence variants selected from a multi-breed GWAS can improve the reliability of genomic predictions in dairy cattle.Crossref | GoogleScholarGoogle Scholar |

Van den Berg I, Xiang R, Jenko J, Pausch H, Boussaha M, Schrooten C, Tribout T, Gjuvsland AB, Boichard D, Nordbø Ø, Sanchez M-P, Goddard ME (2020) Meta-analysis for milk fat and protein percentage using imputed sequence variant genotypes in 94 321 cattle from eight cattle breeds. Genetics Selection Evolution 52, 37
Meta-analysis for milk fat and protein percentage using imputed sequence variant genotypes in 94 321 cattle from eight cattle breeds.Crossref | GoogleScholarGoogle Scholar |

Van den Berg I, Ho PN, Nguyen TV, Haile-Mariam M, MacLeod IM, Beatson PR, O’Connor E, Pryce JE (2022) GWAS and genomic prediction of milk urea nitrogen in Australian and New Zealand dairy cattle. Genetics Selection Evolution 54, 15
GWAS and genomic prediction of milk urea nitrogen in Australian and New Zealand dairy cattle.Crossref | GoogleScholarGoogle Scholar |

VanRaden PM, Tooker ME, O’Connell JR, Cole JB, Bickhart DM (2017) Selecting sequence variants to improve genomic predictions for dairy cattle. Genetics Selection Evolution 49, 32
Selecting sequence variants to improve genomic predictions for dairy cattle.Crossref | GoogleScholarGoogle Scholar |

Willer CJ, Li Y, Abecasis GR (2010) METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26, 2190–2191.
METAL: fast and efficient meta-analysis of genomewide association scans.Crossref | GoogleScholarGoogle Scholar |

Xiang R, Van Den Berg I, MacLeod IM, Hayes BJ, Prowse-Wilkins CP, Wang M, Bolormaa S, Liu Z, Rochfort SJ, Reich CM, Mason BA, Vander Jagt CJ, Daetwyler HD, Lund MS, Chamberlain AJ, Goddard ME (2019) Quantifying the contribution of sequence variants with regulatory and evolutionary significance to 34 bovine complex traits. Proceedings of the National Academy of Sciences of the United States of America 116, 19398–19408.
Quantifying the contribution of sequence variants with regulatory and evolutionary significance to 34 bovine complex traits.Crossref | GoogleScholarGoogle Scholar |

Yang J, Lee SH, Goddard ME, Visscher PM (2011) GCTA: a tool for genome-wide complex trait analysis. The American Journal of Human Genetics 88, 76–82.
GCTA: a tool for genome-wide complex trait analysis.Crossref | GoogleScholarGoogle Scholar |

Yang J, Ferreira T, Morris AP, Medland SE, Madden PAF, Heath AC, Martin NG, Montgomery GW, Weedon MN, Loos RJ, Frayling TM, McCarthy MI, Hirschhorn JN, Goddard ME, Visscher PM, Genetic Investigation of ANthropometric Traits (GIANT) Consortium DIAbetes Genetics Replication And Meta-analysis (DIAGRAM) Consortium (2012) Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits. Nature Genetics 44, 369–375.
Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits.Crossref | GoogleScholarGoogle Scholar |

Yengo L, Vedantam S, Marouli E, Sidorenko J, Bartell E, Sakaue S, Graff M, Eliasen AU, Jiang Y, Raghavan S, et al. (2022) A saturated map of common genetic variants associated with human height. Nature 610, 704–712.
A saturated map of common genetic variants associated with human height.Crossref | GoogleScholarGoogle Scholar |