Register      Login
Animal Production Science Animal Production Science Society
Food, fibre and pharmaceuticals from animals
RESEARCH ARTICLE

Effect of SNP origin on analyses of genetic diversity in cattle

Laercio R. Porto Neto A B C and William Barendse A B D
+ Author Affiliations
- Author Affiliations

A Cooperative Research Centre for Beef Genetic Technologies, CJ Hawkins Homestead, University of New England, Armidale, NSW 2351, Australia.

B CSIRO Livestock Industries, Queensland Bioscience Precinct, 306 Carmody Road, St Lucia, Qld 4067, Australia.

C The University of Queensland, School of Animal Studies, Gatton, Qld 4343, Australia.

D Corresponding author. Email: bill.barendse@csiro.au

Animal Production Science 50(8) 792-800 https://doi.org/10.1071/AN10073
Submitted: 14 May 2010  Accepted: 8 June 2010   Published: 31 August 2010

Abstract

The methods of single nucleotide polymorphism (SNP) identification can lead to ascertainment bias, which will affect population genetic analyses based on those data. In livestock species, the methods of SNP identification through genome sequencing are likely to suffer from this ascertainment bias. In the present study, a subset of data from the Bovine HapMap Project was re-analysed to quantify the effects of ascertainment bias on a range of common analyses and statistics. Data from 189 animals of the zebu breeds Brahman, Nelore and Gir, taurine beef Angus, Limousin and Hereford and taurine dairy Holstein, Jersey and Brown Swiss were analysed. There were 141 SNPs each of Angus, Brahman and Holstein origin, giving a total of 423 SNPs organised in 141 triplets. Each triplet consisted of one SNP of each breed, separated on average by 0.75 Mb within each triplet and where triplets were separated by 14.96 Mb to ensure that each triplet was unaffected by linkage disequilibrium. The minor allele frequency distribution, estimates of the F-statistic, FST, the partitioning of variance and population substructure were relatively unaffected by breed of origin of the SNPs. Estimates of heterozygosity were significantly affected by breed of origin of the SNPs. The clustering of animals of closely related breeds varied in the principal component analyses (PCA). However, in the PCA the effect of breed of origin of 141 SNPs was similar to the effect of using different panels of 141 SNPs of all three breeds, so the differences found in the PCA may not be all due to bias by the origin of the SNPs. Based on these results, analyses that depend on FST, including signatures of selection, gene flow and effective population size are unlikely to be strongly affected by SNP origin. Analyses that partition genetic variance and some analyses of population substructure will also be largely unaffected. However, analyses that are dependent on locus heterozygosity, which can be used for studying population bottlenecks, or those that study selection using extended haplotype homozygosity may be significantly affected by breed of origin of the SNPs.


Acknowledgements

We thank James Kijas for many valuable discussions of genetic diversity in livestock. We thank the Bovine HapMap Consortium for access to the raw genotypes in the BHP. LRPN is supported by an Endeavour International Postgraduate Research Scholarship, a University of Queensland International Student Living Allowance and a Beef CRC scholarship.


References


Akey JM, Zhang K, Xiong MM, Jin L (2003) The effect of single nucleotide polymorphism identification strategies on estimates of linkage disequilibrium. Molecular Biology and Evolution 20, 232–242.
Crossref | GoogleScholarGoogle Scholar | CAS | PubMed | open url image1

Barendse W, Reverter A, Bunch RJ, Harrison BE, Barris W, Thomas MB (2007) A validated whole-genome association study of efficient food conversion in cattle. Genetics 176, 1893–1905.
Crossref | GoogleScholarGoogle Scholar | CAS | PubMed | open url image1

Barendse W, Harrison BE, Bunch RJ, Thomas MB, Turner LB (2009) Genome wide signatures of positive selection: the comparison of independent samples and identification of regions associated to traits. BMC Genomics 10, 178.
Crossref | GoogleScholarGoogle Scholar | PubMed | open url image1

Bowcock AM, Kidd JR, Mountain JL, Hebert JM, Carotenuto L, Kidd KK, Cavalli-Sforza LL (1991) Drift, admixture, and selection in human evolution: a study with DNA polymorphisms. Proceedings of the National Academy of Sciences, USA 88, 839–843.
Crossref | GoogleScholarGoogle Scholar | CAS | open url image1

Caraux G, Pinloche S (2005) PermutMatrix: a graphical environment to arrange gene expression profiles in optimal linear order. Bioinformatics 21, 1280–1281.
Crossref | GoogleScholarGoogle Scholar | CAS | PubMed | open url image1

Clark AG, Hubisz MJ, Bustamante CD, Williamson SH, Nielsen R (2005) Ascertainment bias in studies of human genome-wide polymorphism. Genome Research 15, 1496–1502.
Crossref | GoogleScholarGoogle Scholar | CAS | PubMed | open url image1

Elsik CG, Tellam RL, Worley KC, Gibbs RA, Muzny DM , et al . (2009) The genome sequence of taurine cattle: a window to ruminant biology and evolution. Science 324, 522–528.
Crossref | GoogleScholarGoogle Scholar | PubMed | open url image1

Evanno G, Regnaut S, Goudet J (2005) Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Molecular Ecology 14, 2611–2620.
Crossref | GoogleScholarGoogle Scholar | CAS | PubMed | open url image1

Excoffier L, Laval G, Schneider S (2005) Arlequin (version 3.0): an integrated software package for population genetics data analysis. Evolutionary Bioinformatics 1, 47–50.
CAS |
open url image1

Gibbs RA, Taylor JF, Van Tassell CP, Barendse W, Eversole KA , et al . (2009) Genome-wide survey of SNP variation uncovers the genetic structure of cattle breeds. Science 324, 528–532.
Crossref | GoogleScholarGoogle Scholar | CAS | PubMed | open url image1

Goudet J (1995) FSTAT (Version 1.2): a computer program to calculate F-statistics. Journal of Heredity 86, 485–486. open url image1

Guillot G, Foll M (2009) Correcting for ascertainment bias in the inference of population structure. Bioinformatics 25, 552–554.
Crossref | GoogleScholarGoogle Scholar | CAS | PubMed | open url image1

Kijas J, Townley D, Dalrymple B, Heaton M, Maddox J, McGrath A, Wilson P, Ingersoll R, McCulloch R, McWilliam S (2009) A genome wide survey of SNP variation reveals the genetic structure of sheep breeds. PLoS ONE 4, e4668.
Crossref | GoogleScholarGoogle Scholar | PubMed | open url image1

López Herráez DL, Bauchet M, Tang K, Theunert C, Pugach I, Li J, Nandineni MR, Gross A, Scholz M, Stoneking M (2009) Genetic variation and recent positive selection in worldwide human populations: evidence from nearly 1 million SNPs. PLoS ONE 4, e7888.
Crossref | GoogleScholarGoogle Scholar | PubMed | open url image1

MacHugh DE, Loftus RT, Cunningham P, Bradley DG (1998) Genetic structure of seven European cattle breeds assessed using 20 microsatellite markers. Animal Genetics 29, 333–340.
Crossref | GoogleScholarGoogle Scholar | CAS | PubMed | open url image1

Mukesh M, Sodhi M, Bhatia S, Mishra BP (2004) Genetic diversity of Indian native cattle breeds as analysed with 20 microsatellite loci. Journal of Animal Breeding and Genetics 121, 416–424.
Crossref | GoogleScholarGoogle Scholar | CAS | open url image1

Nielsen R (2004) Population genetic analysis of ascertained SNP data. Human Genomics 1, 218–224.
CAS | PubMed |
open url image1

Patterson N, Price AL, Reich D (2006) Population structure and eigenanalysis. PLoS Genetics 2, 2074–2093.
Crossref | GoogleScholarGoogle Scholar | CAS | open url image1

Pritchard JK, Stephens M, Donnelly P (2000) Inference of population structure using multilocus genotype data. Genetics 155, 945–959.
CAS | PubMed |
open url image1

Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MAR , et al . (2007) PLINK: a tool set for whole-genome association and population-based linkage analyses. American Journal of Human Genetics 81, 559–575.
Crossref | GoogleScholarGoogle Scholar | CAS | PubMed | open url image1

Rosenberg NA (2004) DISTRUCT: a program for the graphical display of population structure. Molecular Ecology Notes 4, 137–138.
Crossref | GoogleScholarGoogle Scholar | open url image1

Rosenblum EB, Novembre J (2007) Ascertainment bias in spatially structured populations: a case study in the eastern fence lizard. The Journal of Heredity 98, 331–336.
Crossref | GoogleScholarGoogle Scholar | PubMed | open url image1

Sanders JO (1980) History and development of zebu cattle in the United States. Journal of Animal Science 50, 1188–1200. open url image1

Sharma R, Pandey AK, Singh Y, Prakash B, Mishra BP, Kathiravan P, Singh PK, Singh G (2009) Evaluation of genetic variation and phylogenetic relationship among North Indian cattle breeds. Asian-Australasian Journal of Animal Sciences 22, 13–19.
CAS |
open url image1

Snelling WM, Allan MF, Keele JW, Kuehn LA, McDaneld T, Smith TPL, Sonstegard TS, Thallman RM, Bennett GL (2010) Genome-wide association study of growth in crossbred beef cattle. Journal of Animal Science 88, 837–848.
Crossref | GoogleScholarGoogle Scholar | CAS | PubMed | open url image1

Turner LB, Harrison BE, Bunch RJ, Porto Neto LR, Li YT, Barendse W (2010) A genome wide association study of tick burden and milk composition in cattle. Animal Production Science 50, 235–245.
Crossref | GoogleScholarGoogle Scholar | open url image1

Womack JE (2006) The impact of sequencing the bovine genome. Australian Journal of Experimental Agriculture 46, 151–153.
Crossref | GoogleScholarGoogle Scholar | CAS | open url image1