Gene losses in the common vampire bat illuminate molecular adaptations to blood feeding


Abstract

Vampire bats are the only mammals that feed exclusively on blood. To uncover genomic changes associated with this dietary adaptation, we generated a haplotype-resolved genome of the common vampire bat and screened 27 bat species for genes that were specifically lost in the vampire bat lineage. We found previously unknown gene losses that relate to reduced insulin secretion (FFAR1 and SLC30A8), limited glycogen stores (PPP1R3E), and a unique gastric physiology (CTSE). Other gene losses likely reflect the biased nutrient composition (ERN2 and CTRL) and distinct pathogen diversity of blood (RNASE7) and predict the complete lack of cone-based vision in these strictly nocturnal bats (PDE6H and PDE6C). Notably, REP15 loss likely helped vampire bats adapt to high dietary iron levels by enhancing iron excretion, and the loss of CYP39A1 could have contributed to their exceptional cognitive abilities. These findings enhance our understanding of vampire bat biology and the genomic underpinnings of adaptations to blood feeding.
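The screening idea in the abstract (keep only genes lost in the vampire bat and in none of the other screened species) can be illustrated with a minimal sketch. This is not the authors' pipeline, which is not described on this page; the function name, the input layout (a mapping from species to the set of genes called as lost), and the placeholder species and gene names are all assumptions made for illustration.

```python
from typing import Dict, Set


def lineage_specific_losses(lost_genes: Dict[str, Set[str]], target: str) -> Set[str]:
    """Return genes lost in `target` but in no other screened species.

    `lost_genes` is a hypothetical input: species name -> set of genes
    that an upstream gene-loss caller flagged as lost in that species.
    """
    lost_elsewhere: Set[str] = set()
    for species, genes in lost_genes.items():
        if species != target:
            lost_elsewhere |= genes  # union of losses in all other species
    return lost_genes[target] - lost_elsewhere


# Toy data with placeholder names (not results from the paper).
toy = {
    "vampire_bat": {"GENE_A", "GENE_B", "GENE_C"},
    "bat_2": {"GENE_C"},
    "bat_3": {"GENE_D"},
}
specific = lineage_specific_losses(toy, "vampire_bat")
print(sorted(specific))
```

In this toy input, GENE_C is excluded because it is also lost in another species. A real screen would additionally need to rule out assembly gaps and annotation errors before calling a gene lost.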

Abstract

Genes that are specifically lost in the common vampire bat provide new insights into the genomic adaptations to blood feeding.


Author and article information

Contributors

Moritz Blumer:

ORCID: https://orcid.org/0000-0002-5775-1767

Roles: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Software, Validation, Visualization, Writing - original draft, Writing - review & editing

Tom Brown:

ORCID: https://orcid.org/0000-0001-8293-4816

Roles: Methodology, Software, Validation, Visualization, Writing - original draft, Writing - review & editing

Mariella Bontempo Freitas:

ORCID: https://orcid.org/0000-0001-5132-242X

Roles: Data curation, Formal analysis, Methodology, Resources, Supervision, Validation, Writing - review & editing

Ana Luiza Destro:

ORCID: https://orcid.org/0000-0002-0269-4654

Roles: Data curation, Formal analysis, Investigation, Resources, Validation, Writing - original draft

Juraci A. Oliveira:

ORCID: https://orcid.org/0000-0003-0150-2291

Roles: Methodology, Resources, Validation

Ariadna E. Morales:

ORCID: https://orcid.org/0000-0002-0637-7349

Role: Formal analysis

Tilman Schell:

Roles: Formal analysis, Investigation

Carola Greve:

ORCID: https://orcid.org/0000-0003-4993-1378

Roles: Investigation, Resources, Validation

Martin Pippel:

ORCID: https://orcid.org/0000-0002-8134-5929

Roles: Data curation, Formal analysis, Software, Validation

David Jebb:

Roles: Conceptualization, Formal analysis, Methodology, Project administration, Software, Writing - review & editing

Nikolai Hecker:

Roles: Resources, Writing - review & editing

Alexis-Walid Ahmed:

Roles: Data curation, Formal analysis, Methodology, Software, Writing - review & editing

Bogdan M. Kirilenko:

ORCID: https://orcid.org/0000-0002-9394-4275

Roles: Resources, Software

Maddy Foote:

ORCID: https://orcid.org/0000-0001-9837-6329

Roles: Conceptualization, Resources, Writing - review & editing

Axel Janke:

ORCID: https://orcid.org/0000-0002-9394-1904

Roles: Conceptualization, Supervision, Writing - review & editing

Burton K. Lim:

ORCID: https://orcid.org/0000-0002-0884-0421

Roles: Funding acquisition, Resources, Writing - review & editing

Michael Hiller:

ORCID: https://orcid.org/0000-0003-3024-1449

Roles: Conceptualization, Funding acquisition, Methodology, Project administration, Supervision, Validation, Visualization, Writing - original draft, Writing - review & editing

Journal

Journal ID (nlm-ta): Sci Adv

Journal ID (iso-abbrev): Sci Adv

Journal ID (publisher-id): sciadv

Journal ID (hwp): advances

Title: Science Advances

Publisher: American Association for the Advancement of Science

ISSN (Electronic): 2375-2548

Publication date Collection: March 2022

Publication date (Electronic, pub): 25 March 2022

Volume: 8

Issue: 12

Electronic Location Identifier: eabm6494

Affiliations

[1] Max Planck Institute of Molecular Cell Biology and Genetics, 01307 Dresden, Germany.

[2] Max Planck Institute for the Physics of Complex Systems, 01187 Dresden, Germany.

[3] Center for Systems Biology Dresden, 01307 Dresden, Germany.

[4] Goethe University, Faculty of Biosciences, Max-von-Laue-Str. 9, 60438 Frankfurt, Germany.

[5] Department of Animal Biology, Federal University of Viçosa, Viçosa, Brazil.

[6] Department of General Biology, Federal University of Viçosa, Viçosa, Brazil.

[7] LOEWE Centre for Translational Biodiversity Genomics, Senckenberganlage 25, 60325 Frankfurt, Germany.

[8] Senckenberg Research Institute, Senckenberganlage 25, 60325 Frankfurt, Germany.

[9] Native Bat Conservation Program, Toronto Zoo, 361A Old Finch Avenue, Toronto, Ontario M1B 5K7, Canada.

[10] Senckenberg Biodiversity and Climate Research Centre, Senckenberganlage 25, 60325 Frankfurt am Main, Germany.

[11] Department of Natural History, Royal Ontario Museum, 100 Queen’s Park, Toronto, Ontario M5S 2C6, Canada.

Author notes

[*] Corresponding author. Email: michael.hiller@senckenberg.de

[†] Present address: Department of Genetics, University of Cambridge, Downing Street, Cambridge CB2 3EH, UK.


Article

Publisher ID: abm6494

DOI: 10.1126/sciadv.abm6494

PMC ID: 8956264

PubMed ID: 35333583

SO-VID: c17070da-c69d-43e7-b279-ab59668911ea

Copyright © 2022 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works. Distributed under a Creative Commons Attribution NonCommercial License 4.0 (CC BY-NC).

License:

This is an open-access article distributed under the terms of the Creative Commons Attribution-NonCommercial license, which permits use, distribution, and reproduction in any medium, so long as the resultant use is not for commercial advantage and provided the original work is properly cited.

History

Date received: 01 October 2021

Date accepted: 03 February 2022

Funding

Funded by: FundRef http://dx.doi.org/10.13039/501100004189, Max-Planck-Gesellschaft;

Award ID: -

Custom metadata

Copyeditor Vivian Hernandez

