+1 Recommend
1 collections
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Challenges and Approaches to Genotyping Repetitive DNA


      Read this article at

          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.


          Individuals within a species can exhibit vast variation in copy number of repetitive DNA elements. This variation may contribute to complex traits such as lifespan and disease, yet it is only infrequently considered in genotype-phenotype associations. Although the possible importance of copy number variation is widely recognized, accurate copy number quantification remains challenging. Here, we assess the technical reproducibility of several major methods for copy number estimation as they apply to the large repetitive ribosomal DNA array (rDNA). rDNA encodes the ribosomal RNAs and exists as a tandem gene array in all eukaryotes. Repeat units of rDNA are kilobases in size, often with several hundred units comprising the array, making rDNA particularly intractable to common quantification techniques. We evaluate pulsed-field gel electrophoresis, droplet digital PCR, and Nextera-based whole genome sequencing as approaches to copy number estimation, comparing techniques across model organisms and spanning wide ranges of copy numbers. Nextera-based whole genome sequencing, though commonly used in recent literature, produced high error. We explore possible causes for this error and provide recommendations for best practices in rDNA copy number estimation. We present a resource of high-confidence rDNA copy number estimates for a set of S. cerevisiae and C. elegans strains for future use. We furthermore explore the possibility for FISH-based copy number estimation, an alternative that could potentially characterize copy number on a cellular level.

          Most cited references60

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          Genome evolution across 1,011 Saccharomyces cerevisiae isolates

          Large-scale population genomic surveys are essential to explore the phenotypic diversity of natural populations. Here we report the whole-genome sequencing and phenotyping of 1,011 Saccharomyces cerevisiae isolates, which together provide an accurate evolutionary picture of the genomic variants that shape the species-wide phenotypic landscape of this yeast. Genomic analyses support a single ‘out-of-China’ origin for this species, followed by several independent domestication events. Although domesticated isolates exhibit high variation in ploidy, aneuploidy and genome content, genome evolution in wild isolates is mainly driven by the accumulation of single nucleotide polymorphisms. A common feature is the extensive loss of heterozygosity, which represents an essential source of inter-individual variation in this mainly asexual species. Most of the single nucleotide polymorphisms, including experimentally identified functional polymorphisms, are present at very low frequencies. The largest numbers of variants identified by genome-wide association are copy-number changes, which have a greater phenotypic effect than do single nucleotide polymorphisms. This resource will guide future population genomics and genotype–phenotype studies in this classic model system.
            • Record: found
            • Abstract: found
            • Article: not found

            Ribosome biogenesis in the yeast Saccharomyces cerevisiae.

            Ribosomes are highly conserved ribonucleoprotein nanomachines that translate information in the genome to create the proteome in all cells. In yeast these complex particles contain four RNAs (>5400 nucleotides) and 79 different proteins. During the past 25 years, studies in yeast have led the way to understanding how these molecules are assembled into ribosomes in vivo. Assembly begins with transcription of ribosomal RNA in the nucleolus, where the RNA then undergoes complex pathways of folding, coupled with nucleotide modification, removal of spacer sequences, and binding to ribosomal proteins. More than 200 assembly factors and 76 small nucleolar RNAs transiently associate with assembling ribosomes, to enable their accurate and efficient construction. Following export of preribosomes from the nucleus to the cytoplasm, they undergo final stages of maturation before entering the pool of functioning ribosomes. Elaborate mechanisms exist to monitor the formation of correct structural and functional neighborhoods within ribosomes and to destroy preribosomes that fail to assemble properly. Studies of yeast ribosome biogenesis provide useful models for ribosomopathies, diseases in humans that result from failure to properly assemble ribosomes.
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              Scaling read aligners to hundreds of threads on general-purpose processors

              Abstract Motivation General-purpose processors can now contain many dozens of processor cores and support hundreds of simultaneous threads of execution. To make best use of these threads, genomics software must contend with new and subtle computer architecture issues. We discuss some of these and propose methods for improving thread scaling in tools that analyze each read independently, such as read aligners. Results We implement these methods in new versions of Bowtie, Bowtie 2 and HISAT. We greatly improve thread scaling in many scenarios, including on the recent Intel Xeon Phi architecture. We also highlight how bottlenecks are exacerbated by variable-record-length file formats like FASTQ and suggest changes that enable superior scaling. Availability and implementation Experiments for this study: https://github.com/BenLangmead/bowtie-scaling . Bowtie http://bowtie-bio.sourceforge.net . Bowtie 2 http://bowtie-bio.sourceforge.net/bowtie2 . HISAT http://www.ccb.jhu.edu/software/hisat Supplementary information Supplementary data are available at Bioinformatics online.

                Author and article information

                G3 (Bethesda)
                G3: Genes, Genomes, Genetics
                G3: Genes, Genomes, Genetics
                G3: Genes, Genomes, Genetics
                G3: Genes|Genomes|Genetics
                Genetics Society of America
                22 November 2019
                January 2020
                : 10
                : 1
                : 417-430
                [* ]Department of Genome Sciences,
                []Molecular and Cellular Biology Program, University of Washington, Seattle, 98195,
                []Department of Molecular Genetics, University of Toronto, Ontario M5S 1A8, Canada, and
                [§ ]Altius Institute for Biomedical Sciences, Seattle, Washington 98121
                Author notes
                [1 ]Corresponding author: University of Washington, 3720 15th Ave NE, Seattle, WA, 98195. E-mail: queitsch@ 123456uw.edu
                Author information
                Copyright © 2020 Morton et al.

                This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

                : 24 September 2019
                : 21 November 2019
                Page count
                Figures: 4, Tables: 2, Equations: 1, References: 83, Pages: 14

                ribosomal dna,repetitive dna,whole genome sequencing,copy number variation
                ribosomal dna, repetitive dna, whole genome sequencing, copy number variation


                Comment on this article