A haplotype-resolved draft genome of the European sardine ( Sardina pilchardus )

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Background

The European sardine ( Sardina pilchardus Walbaum, 1792) is culturally and economically important throughout its distribution. Monitoring studies of sardine populations report an alarming decrease in stocks due to overfishing and environmental change, which has resulted in historically low captures along the Iberian Atlantic coast. Important biological and ecological features such as population diversity, structure, and migratory patterns can be addressed with the development and use of genomics resources.

Findings

The genome of a single female individual was sequenced using Illumina HiSeq X Ten 10x Genomics linked reads, generating 113.8 gigabase pairs of data. Three draft genomes were assembled: 2 haploid genomes with a total size of 935 megabase pairs (N50 103 kilobase pairs) each, and a consensus genome of total size 950 megabase pairs (N50 97 kilobase pairs). The genome completeness assessment captured 84% of Actinopterygii Benchmarking Universal Single-Copy Orthologs. To obtain a more complete analysis, the transcriptomes of 11 tissues were sequenced to aid the functional annotation of the genome, resulting in 40,777 genes predicted. Variant calling on nearly half of the haplotype genome resulted in the identification of >2.3 million phased single-nucleotide polymorphisms with heterozygous loci.

Conclusions

A draft genome was obtained, despite a high level of sequence repeats and heterozygosity, which are expected genome characteristics of a wild sardine. The reference sardine genome and respective variant data will be a cornerstone resource of ongoing population genomics studies to be integrated into future sardine stock assessment modelling to better manage this valuable resource.

Related collections

Most cited references 25

Record: found
Abstract: found
Article: not found

Profile hidden Markov models.

S. Eddy (1998)

The recent literature on profile hidden Markov model (profile HMM) methods and software is reviewed. Profile HMMs turn a multiple sequence alignment into a position-specific scoring system suitable for searching databases for remotely homologous sequences. Profile HMM analyses complement standard pairwise comparison methods for large-scale sequence analysis. Several software implementations and two large libraries of profile HMMs of common protein domains are available. HMM methods performed comparably to threading methods in the CASP2 structure prediction exercise.

0 comments Cited 1255 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

The genomic basis of adaptive evolution in threespine sticklebacks

Felicity Jones, Manfred Grabherr, Yingguang Frank Chan … (2012)

Summary Marine stickleback fish have colonized and adapted to innumerable streams and lakes formed since the last ice age, providing an exceptional opportunity to characterize genomic mechanisms underlying repeated ecological adaptation in nature. Here we develop a high quality reference genome assembly for threespine sticklebacks. By sequencing the genomes of 20 additional individuals from a global set of marine and freshwater populations, we identify a genome-wide set of loci that are consistently associated with marine-freshwater divergence. Our results suggest that reuse of globally-shared standing genetic variation, including chromosomal inversions, plays an important role in repeated evolution of distinct marine and freshwater sticklebacks, and in the maintenance of divergent ecotypes during early stages of reproductive isolation. Both coding and regulatory changes occur in the set of loci underlying marine-freshwater evolution, with regulatory changes likely predominating in this classic example of repeated adaptive evolution in nature.

0 comments Cited 728 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: found

Is Open Access

The Dfam database of repetitive DNA families

Robert Hubley, Robert D. Finn, Jody Clements … (2015)

Repetitive DNA, especially that due to transposable elements (TEs), makes up a large fraction of many genomes. Dfam is an open access database of families of repetitive DNA elements, in which each family is represented by a multiple sequence alignment and a profile hidden Markov model (HMM). The initial release of Dfam, featured in the 2013 NAR Database Issue, contained 1143 families of repetitive elements found in humans, and was used to produce more than 100 Mb of additional annotation of TE-derived regions in the human genome, with improved speed. Here, we describe recent advances, most notably expansion to 4150 total families including a comprehensive set of known repeat families from four new organisms (mouse, zebrafish, fly and nematode). We describe improvements to coverage, and to our methods for identifying and reducing false annotation. We also describe updates to the website interface. The Dfam website has moved to http://dfam.org. Seed alignments, profile HMMs, hit lists and other underlying data are available for download.

0 comments Cited 304 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Journal

Journal ID (nlm-ta): Gigascience

Journal ID (iso-abbrev): Gigascience

Journal ID (publisher-id): gigascience

Title: GigaScience

Publisher: Oxford University Press

ISSN (Electronic): 2047-217X

Publication date Collection: May 2019

Publication date (Electronic): 21 May 2019

Publication date PMC-release: 21 May 2019

Volume: 8

Issue: 5

Electronic Location Identifier: giz059

Affiliations

[1 ]CCMAR Centre of Marine Sciences, University of Algarve, Campus de Gambelas, 8005–139 Faro, Portugal

[2 ]CIBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, InBIO, Laboratório Associado, Universidade do Porto, Vairão, Portugal

Author notes

Correspondence address. Adelino V. M. Canário, CCMAR Centre of Marine Sciences, University of Algarve, Campus de Gambelas, 8005-139 Faro, Portugal E-mail: acanario@ 123456ualg.pt

Authors contributed equally.

Author information

Bruno Louro http://orcid.org/0000-0001-8164-581X

Gianluca De Moro http://orcid.org/0000-0002-5542-0278

Ana Veríssimo http://orcid.org/0000-0003-3396-9822

Adelino V M Canário http://orcid.org/0000-0002-6244-6468

Article

Publisher ID: giz059

DOI: 10.1093/gigascience/giz059

PMC ID: 6528745

PubMed ID: 31112613

SO-VID: aeb7e9e2-4178-44d1-88cf-ef1396435ce5

License:

This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

History

Date received : 26 September 2018

Date revision received : 11 March 2019

Date accepted : 30 April 2019

Page count

Pages: 8

Funding

Funded by: Foundation for Science and Technology 10.13039/501100001871

Award ID: UID/Multi/04326/2016

Funded by: European Regional Development Fund 10.13039/501100008530

Award ID: 22153-01/SAICT/2016

Funded by: National Infrastruture of Distributed Computing of Portugal

Award ID: ALG-01-0145-FEDER-022121

Award ID: ALG-01-0145-FEDER-022231

Award ID: MAR2020

Funded by: European Maritime and Fisheries Fund

Award ID: MAR-01.04.02-FEAMP-0024

Funded by: Horizon 2020 10.13039/100010661

Award ID: 654008

Comments

Comment on this article

scite_

Cited by 6

See all cited by

Most referenced authors 1,691

See all reference authors

A haplotype-resolved draft genome of the European sardine ( Sardina pilchardus)

Read this article at

Abstract

Background

Findings

Conclusions

Related collections

Arabidopsis genomics

Most cited references 25

Profile hidden Markov models.

The genomic basis of adaptive evolution in threespine sticklebacks

The Dfam database of repetitive DNA families

Author and article information

Journal

Affiliations

Author notes

Author information

Article

History

Page count

Funding

Categories

Comments

Comment on this article

Similar content 450

Cited by 6

Most referenced authors 1,691