Quality control of genotypes using heritability estimates of gene content at the marker.

Abstract

Quality control filtering of single-nucleotide polymorphisms (SNPs) is a key step when analyzing genomic data. Here we present a practical method to identify low-quality SNPs, meaning markers whose genotypes are wrongly assigned for a large proportion of individuals, by estimating the heritability of gene content at each marker, where gene content is the number of copies of a particular reference allele in a genotype of an animal (0, 1, or 2). If there is no mutation at the marker, gene content has an additive heritability of 1 by construction. The method uses restricted maximum likelihood (REML) to estimate heritability of gene content at each SNP and also builds a likelihood-ratio test statistic to test for zero error variance in genotyping. As a by-product, estimates of the allele frequencies of markers at the base population are obtained. Using simulated data with 10% permutation error (4% actual error) in genotyping, the method had a specificity of 0.96 (4% of correct markers are rejected) and a sensitivity of 0.99 (1% of wrong markers are accepted) if markers with heritability lower than 0.975 are discarded. Checking of Mendelian errors resulted in a lower sensitivity (0.84) for the same simulation. The proposed method is further illustrated with a real data set with genotypes from 3534 animals genotyped for 50,433 markers from the Illumina PorcineSNP60 chip and a pedigree of 6473 individuals; those markers underwent very little quality control. A total of 4099 markers with P-values lower than 0.01 were discarded based on our method, with associated estimates of heritability as low as 0.12. Contrary to other techniques, our method uses all information in the population simultaneously, can be used in any population with markers and pedigree recordings, and is simple to implement using standard software for REML estimation. Scripts for its use are provided.
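The filtering steps described in the abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' scripts: the function names, the two-letter genotype coding, and the input dictionaries are hypothetical, and the heritability estimates and log-likelihoods are assumed to come from separate REML software. The P-value assumes the likelihood-ratio statistic follows a 50:50 mixture of chi-square distributions with 0 and 1 degrees of freedom, a common choice when the null value (zero error variance) lies on the boundary of the parameter space; the abstract does not state which reference distribution was used.

```python
import math


def gene_content(genotype, ref_allele):
    """Count copies (0, 1, or 2) of ref_allele in a two-letter genotype such as 'AG'.

    The two-letter string coding is an assumption made for this example.
    """
    if len(genotype) != 2:
        raise ValueError("expected a two-letter genotype, got %r" % genotype)
    return genotype.count(ref_allele)


def lrt_p_value(loglik_full, loglik_null):
    """P-value for H0: zero genotyping error variance at one marker.

    loglik_full: REML log-likelihood with the error variance estimated.
    loglik_null: REML log-likelihood with the error variance fixed at zero.
    Assumes a 50:50 mixture of chi-square(0) and chi-square(1) under H0.
    """
    lrt = max(0.0, 2.0 * (loglik_full - loglik_null))
    if lrt == 0.0:
        return 1.0
    # For one degree of freedom, P(chi2_1 > x) = erfc(sqrt(x / 2)).
    return 0.5 * math.erfc(math.sqrt(lrt / 2.0))


def flag_markers(h2_by_marker, threshold=0.975):
    """Return the markers whose estimated heritability is below the threshold."""
    return {m for m, h2 in h2_by_marker.items() if h2 < threshold}


def sensitivity_specificity(flagged, truly_bad, all_markers):
    """Sensitivity: share of wrong markers that are discarded.
    Specificity: share of correct markers that are kept.
    """
    truly_good = set(all_markers) - set(truly_bad)
    sensitivity = len(set(flagged) & set(truly_bad)) / len(truly_bad)
    specificity = len(truly_good - set(flagged)) / len(truly_good)
    return sensitivity, specificity


# Hypothetical per-marker heritability estimates, for illustration only.
estimates = {"snp1": 0.999, "snp2": 0.42, "snp3": 0.98, "snp4": 0.90}
discarded = flag_markers(estimates)
```

In a simulation, where the truly erroneous markers are known, `sensitivity_specificity` would be applied to the discarded set to obtain figures comparable to those reported above. The 0.975 default mirrors the threshold quoted in the abstract.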

Author and article information

Journal

Journal ID (iso-abbrev): Genetics

Title: Genetics

Publisher: Genetics Society of America

ISSN (Electronic): 1943-2631

ISSN (Print): 0016-6731

Publication date (Electronic): Mar 2015

Volume: 199

Issue: 3

Affiliations

[1] Departamento de Producción Animal, Facultad de Agronomía, Universidad de Buenos Aires, C1417DSE Buenos Aires, Argentina; Consejo Nacional de Investigaciones Científicas y Técnicas, Av. Rivadavia 1917, C1033AAJ Buenos Aires, Argentina.

[2] INRA, Génétique, Physiologie et Systèmes d'Elevage (GenPhySE), F-31326 Castanet-Tolosan, France; Université de Toulouse, INP, ENSAT, Génétique, Physiologie et Systèmes d'Elevage (GenPhySE), F-31326 Castanet-Tolosan, France. E-mail: andres.legarra@toulouse.inra.fr

[3] INRA, Génétique, Physiologie et Systèmes d'Elevage (GenPhySE), F-31326 Castanet-Tolosan, France; Université de Toulouse, INP, ENSAT, Génétique, Physiologie et Systèmes d'Elevage (GenPhySE), F-31326 Castanet-Tolosan, France.

[4] Animal and Dairy Science, University of Georgia, Athens, Georgia 30602.

[5] Instituto Nacional de Investigación Agropecuaria, Canelones 90200, Uruguay.

Article

Publisher Item ID: genetics.114.173559

DOI: 10.1534/genetics.114.173559

PMC ID: 4349063

PubMed ID: 25567991

SO-VID: 3381d8dd-7bd1-419d-a2d8-724f02d8e964

Keywords: GenPred, REML, SNP, gene content, genomic selection, quality control, shared data resource
