On the use of genome‐wide data to model and date the time of anthropogenic hybridisation: An example from the Scottish wildcat

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Related collections

Most cited references 84

Record: found
Abstract: found
Article: found

Is Open Access

Fast and accurate short read alignment with Burrows–Wheeler transform

Heng Li, Richard Durbin (2009)

Motivation: The enormous amount of short reads generated by the new DNA sequencing technologies call for the development of fast and accurate read alignment programs. A first generation of hash table-based methods has been developed, including MAQ, which is accurate, feature rich and fast enough to align short reads from a single individual. However, MAQ does not support gapped alignment for single-end reads, which makes it unsuitable for alignment of longer reads where indels may occur frequently. The speed of MAQ is also a concern when the alignment is scaled up to the resequencing of hundreds of individuals. Results: We implemented Burrows-Wheeler Alignment tool (BWA), a new read alignment package that is based on backward search with Burrows–Wheeler Transform (BWT), to efficiently align short sequencing reads against a large reference sequence such as the human genome, allowing mismatches and gaps. BWA supports both base space reads, e.g. from Illumina sequencing machines, and color space reads from AB SOLiD machines. Evaluations on both simulated and real data suggest that BWA is ∼10–20× faster than MAQ, while achieving similar accuracy. In addition, BWA outputs alignment in the new standard SAM (Sequence Alignment/Map) format. Variant calling and other downstream analyses after the alignment can be achieved with the open source SAMtools software package. Availability: http://maq.sourceforge.net Contact: rd@sanger.ac.uk

0 comments Cited 10187 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: found

Is Open Access

The variant call format and VCFtools

Petr Danecek, Adam Auton, Gonçalo R Abecasis … (2011)

Summary: The variant call format (VCF) is a generic format for storing DNA polymorphism data such as SNPs, insertions, deletions and structural variants, together with rich annotations. VCF is usually stored in a compressed manner and can be indexed for fast data retrieval of variants from a range of positions on the reference genome. The format was developed for the 1000 Genomes Project, and has also been adopted by other projects such as UK10K, dbSNP and the NHLBI Exome Project. VCFtools is a software suite that implements various utilities for processing VCF files, including validation, merging, comparing and also provides a general Perl API. Availability: http://vcftools.sourceforge.net Contact: rd@sanger.ac.uk

0 comments Cited 3396 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: found

Is Open Access

Second-generation PLINK: rising to the challenge of larger and richer datasets

Christopher Chang, Carson Chow, Laurent Tellier … (2015)

PLINK 1 is a widely used open-source C/C++ toolset for genome-wide association studies (GWAS) and research in population genetics. However, the steady accumulation of data from imputation and whole-genome sequencing studies has exposed a strong need for even faster and more scalable implementations of key functions. In addition, GWAS and population-genetic data now frequently contain probabilistic calls, phase information, and/or multiallelic variants, none of which can be represented by PLINK 1's primary data format. To address these issues, we are developing a second-generation codebase for PLINK. The first major release from this codebase, PLINK 1.9, introduces extensive use of bit-level parallelism, O(sqrt(n))-time/constant-space Hardy-Weinberg equilibrium and Fisher's exact tests, and many other algorithmic improvements. In combination, these changes accelerate most operations by 1-4 orders of magnitude, and allow the program to handle datasets too large to fit in RAM. This will be followed by PLINK 2.0, which will introduce (a) a new data format capable of efficiently representing probabilities, phase, and multiallelic variants, and (b) extensions of many functions to account for the new types of information. The second-generation versions of PLINK will offer dramatic improvements in performance and compatibility. For the first time, users without access to high-end computing resources can perform several essential analyses of the feature-rich and very large genetic datasets coming into use.

0 comments Cited 2811 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Contributors

Jo Howard‐McCombe: (View ORCID Profile)

Journal

Title: Molecular Ecology

Abbreviated Title: Molecular Ecology

Publisher: Wiley

ISSN (Print): 0962-1083

ISSN (Electronic): 1365-294X

Publication date Created: August 2021

Publication date (Electronic): July 2021

Publication date (Print): August 2021

Volume: 30

Issue: 15

Pages: 3688-3702

Affiliations

[1 ]School of Biological Sciences University of Bristol Bristol UK

[2 ]School of Mathematics University of Bristol Bristol UK

[3 ]Department of Natural Sciences National Museums Scotland Edinburgh UK

[4 ]RZSS WildGenes Laboratory Royal Zoological Society of Scotland Edinburgh UK

Article

DOI: 10.1111/mec.16000

PubMed ID: 34042240

SO-VID: d9679980-2699-4175-8ee0-7cc9bfeae35f

License:

http://creativecommons.org/licenses/by/4.0/

http://doi.wiley.com/10.1002/tdm_license_1.1

History

Data availability:

Comments

Comment on this article

scite_

Cited by 5

See all cited by

Most referenced authors 1,444

See all reference authors

On the use of genome‐wide data to model and date the time of anthropogenic hybridisation: An example from the Scottish wildcat

Read this article at

Related collections

Genome Integrity

Most cited references 84

Fast and accurate short read alignment with Burrows–Wheeler transform

The variant call format and VCFtools

Second-generation PLINK: rising to the challenge of larger and richer datasets

Author and article information

Contributors

Journal

Affiliations

Article

History

Comments

Comment on this article

Similar content 2,468

Cited by 5

Most referenced authors 1,444