Genomic prediction of host resistance to sea lice in farmed Atlantic salmon populations

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Background

Sea lice have significant negative economic and welfare impacts on marine Atlantic salmon farming. Since host resistance to sea lice has a substantial genetic component, selective breeding can contribute to control of lice. Genomic selection uses genome-wide marker information to predict breeding values, and can achieve markedly higher accuracy than pedigree-based methods. Our aim was to assess the genetic architecture of host resistance to sea lice, and test the utility of genomic prediction of breeding values. Individual lice counts were measured in challenge experiments using two large Atlantic salmon post-smolt populations from a commercial breeding programme, which had genotypes for ~33 K single nucleotide polymorphisms (SNPs). The specific objectives were to: (i) estimate the heritability of host resistance; (ii) assess its genetic architecture by performing a genome-wide association study (GWAS); (iii) assess the accuracy of predicted breeding values using varying SNP densities (0.5 to 33 K) and compare it to that of pedigree-based prediction; and (iv) evaluate the accuracy of prediction in closely and distantly related animals.

Results

Heritability of host resistance was significant (0.22 to 0.33) in both populations using either pedigree or genomic relationship matrices. The GWAS suggested that lice resistance is a polygenic trait, and no genome-wide significant quantitative trait loci were identified. Based on cross-validation analysis, genomic predictions were more accurate than pedigree-based predictions for both populations. Although prediction accuracies were highest when closely-related animals were used in the training and validation sets, the benefit of having genomic-versus pedigree-based predictions within a population increased as the relationships between training and validation sets decreased. Prediction accuracy reached an asymptote with a SNP density of ~5 K within populations, although higher SNP density was advantageous for cross-population prediction.

Conclusions

Host resistance to sea lice in farmed Atlantic salmon has a significant genetic component. Phenotypes relating to host resistance can be predicted with moderate to high accuracy within populations, with a major advantage of genomic over pedigree-based methods, even at relatively sparse SNP densities. Prediction accuracies across populations were low, but improved with higher marker densities. Genomic selection can contribute to lice control in salmon farming.

Electronic supplementary material

The online version of this article (doi:10.1186/s12711-016-0226-9) contains supplementary material, which is available to authorized users.

Related collections

Most cited references 24

Record: found
Abstract: found
Article: not found

Accuracy of Predicting the Genetic Risk of Disease Using a Genome-Wide Approach

Hans D Daetwyler, Beatriz Villanueva, John A. Woolliams (2008)

Background The prediction of the genetic disease risk of an individual is a powerful public health tool. While predicting risk has been successful in diseases which follow simple Mendelian inheritance, it has proven challenging in complex diseases for which a large number of loci contribute to the genetic variance. The large numbers of single nucleotide polymorphisms now available provide new opportunities for predicting genetic risk of complex diseases with high accuracy. Methodology/Principal Findings We have derived simple deterministic formulae to predict the accuracy of predicted genetic risk from population or case control studies using a genome-wide approach and assuming a dichotomous disease phenotype with an underlying continuous liability. We show that the prediction equations are special cases of the more general problem of predicting the accuracy of estimates of genetic values of a continuous phenotype. Our predictive equations are responsive to all parameters that affect accuracy and they are independent of allele frequency and effect distributions. Deterministic prediction errors when tested by simulation were generally small. The common link among the expressions for accuracy is that they are best summarized as the product of the ratio of number of phenotypic records per number of risk loci and the observed heritability. Conclusions/Significance This study advances the understanding of the relative power of case control and population studies of disease. The predictions represent an upper bound of accuracy which may be achievable with improved effect estimation methods. The formulae derived will help researchers determine an appropriate sample size to attain a certain accuracy when predicting genetic risk.

0 comments Cited 289 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: found

Genomic prediction in animals and plants: simulation of data, validation, reporting, and benchmarking.

Hans D Daetwyler, Mario Calus, Ricardo Pong-Wong … (2013)

The genomic prediction of phenotypes and breeding values in animals and plants has developed rapidly into its own research field. Results of genomic prediction studies are often difficult to compare because data simulation varies, real or simulated data are not fully described, and not all relevant results are reported. In addition, some new methods have been compared only in limited genetic architectures, leading to potentially misleading conclusions. In this article we review simulation procedures, discuss validation and reporting of results, and apply benchmark procedures for a variety of genomic prediction methods in simulated and real example data. Plant and animal breeding programs are being transformed by the use of genomic data, which are becoming widely available and cost-effective to predict genetic merit. A large number of genomic prediction studies have been published using both simulated and real data. The relative novelty of this area of research has made the development of scientific conventions difficult with regard to description of the real data, simulation of genomes, validation and reporting of results, and forward in time methods. In this review article we discuss the generation of simulated genotype and phenotype data, using approaches such as the coalescent and forward in time simulation. We outline ways to validate simulated data and genomic prediction results, including cross-validation. The accuracy and bias of genomic prediction are highlighted as performance indicators that should be reported. We suggest that a measure of relatedness between the reference and validation individuals be reported, as its impact on the accuracy of genomic prediction is substantial. A large number of methods were compared in example simulated and real (pine and wheat) data sets, all of which are publicly available. In our limited simulations, most methods performed similarly in traits with a large number of quantitative trait loci (QTL), whereas in traits with fewer QTL variable selection did have some advantages. In the real data sets examined here all methods had very similar accuracies. We conclude that no single method can serve as a benchmark for genomic prediction. We recommend comparing accuracy and bias of new methods to results from genomic best linear prediction and a variable selection approach (e.g., BayesB), because, together, these methods are appropriate for a range of genetic architectures. An accompanying article in this issue provides a comprehensive review of genomic prediction methods and discusses a selection of topics related to application of genomic prediction in plants and animals.

0 comments Cited 187 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Family-based association tests for genomewide association scans.

Wei-Min Chen, Gonçalo R Abecasis (2007)

With millions of single-nucleotide polymorphisms (SNPs) identified and characterized, genomewide association studies have begun to identify susceptibility genes for complex traits and diseases. These studies involve the characterization and analysis of very-high-resolution SNP genotype data for hundreds or thousands of individuals. We describe a computationally efficient approach to testing association between SNPs and quantitative phenotypes, which can be applied to whole-genome association scans. In addition to observed genotypes, our approach allows estimation of missing genotypes, resulting in substantial increases in power when genotyping resources are limited. We estimate missing genotypes probabilistically using the Lander-Green or Elston-Stewart algorithms and combine high-resolution SNP genotypes for a subset of individuals in each pedigree with sparser marker data for the remaining individuals. We show that power is increased whenever phenotype information for ungenotyped individuals is included in analyses and that high-density genotyping of just three carefully selected individuals in a nuclear family can recover >90% of the information available if every individual were genotyped, for a fraction of the cost and experimental effort. To aid in study design, we evaluate the power of strategies that genotype different subsets of individuals in each pedigree and make recommendations about which individuals should be genotyped at a high density. To illustrate our method, we performed genomewide association analysis for 27 gene-expression phenotypes in 3-generation families (Centre d'Etude du Polymorphisme Humain pedigrees), in which genotypes for ~860,000 SNPs in 90 grandparents and parents are complemented by genotypes for ~6,700 SNPs in a total of 168 individuals. In addition to increasing the evidence of association at 15 previously identified cis-acting associated alleles, our genotype-inference algorithm allowed us to identify associated alleles at 4 cis-acting loci that were missed when analysis was restricted to individuals with the high-density SNP data. Our genotype-inference algorithm and the proposed association tests are implemented in software that is available for free.

0 comments Cited 156 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Contributors

Hsin-Yuan Tsai: Hsin-yuan.Tsai@roslin.ed.ac.uk

Alastair Hamilton: Alastair.Hamilton@hendrix-genetics.com

Alan E. Tinch: Alan.Tinch@hendrix-genetics.com

Derrick R. Guy: Derrick.Guy@hendrix-genetics.com

James E. Bron: j.e.bron@stir.ac.uk

John B. Taggart: j.b.taggart@stir.ac.uk

Karim Gharbi: Karim.Gharbi@ed.ac.uk

Michael Stear: Michael.Stear@glasgow.ac.uk

Oswald Matika: oswald.matika@roslin.ed.ac.uk

Ricardo Pong-Wong: Ricardo.pong-wong@roslin.ed.ac.uk

Steve C. Bishop: info@roslin.ed.ac.uk

Ross D. Houston: Ross.Houston@roslin.ed.ac.uk

Journal

Journal ID (nlm-ta): Genet Sel Evol

Journal ID (iso-abbrev): Genet. Sel. Evol

Title: Genetics, Selection, Evolution : GSE

Publisher: BioMed Central (London )

ISSN (Print): 0999-193X

ISSN (Electronic): 1297-9686

Publication date (Electronic): 29 June 2016

Publication date PMC-release: 29 June 2016

Publication date Collection: 2016

Volume: 48

Electronic Location Identifier: 47

Affiliations

[ ]The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Midlothian, EH25 9RG UK

[ ]Landcatch Natural Selection Ltd., 15 Beta Centre, Stirling University Innovation Park, Stirling, FK9 4NF UK

[ ]Institute of Aquaculture, University of Stirling, Stirling, FK9 4LA UK

[ ]Edinburgh Genomics, Ashworth Laboratories, King’s Buildings, University of Edinburgh, Edinburgh, EH9 3JT UK

[ ]Institute of Biodiversity, Animal Health and Comparative Medicine, University of Glasgow, Bearsden Road, Glasgow, G61 1QH UK

Article

Publisher ID: 226

DOI: 10.1186/s12711-016-0226-9

PMC ID: 4926294

PubMed ID: 27357694

SO-VID: a2c8dd87-3d45-48fe-94ef-c25e2b7ef77a

License:

Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

History

Date received : 8 February 2016

Date accepted : 17 June 2016

Funding

Funded by: FundRef http://dx.doi.org/10.13039/501100000396, Technology Strategy Board;

Award ID: TP 5771-40299

Award Recipient : Ross D. Houston

Funded by: Innovate UK

Award ID: 45266-329178

Award Recipient : Ross D. Houston

Funded by: Biotechnology and Biological Sciences Research Council (GB)

Award ID: BB/H022007/1

Award ID: BB/J004235/1

Award ID: BB/J004324/1

Award Recipient : Ross D. Houston

Funded by: The Marine Alliance for Science and Technology for Scotland

Award ID: HR09011

Award Recipient : John B. Taggart

Custom metadata

ScienceOpen disciplines: Genetics

Data availability:

ScienceOpen disciplines: Genetics

Comments

Comment on this article

scite_

Cited by 89

See all cited by

Most referenced authors 1,101

See all reference authors

Genomic prediction of host resistance to sea lice in farmed Atlantic salmon populations

Read this article at

Abstract

Background

Results

Conclusions

Electronic supplementary material

Related collections