26
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      The structure, function and evolution of a complete human chromosome 8

      research-article
      1 , 1 , 1 , 1 , 2 , 3 , 3 , 4 , 1 , 3 , 5 , 6 , 1 , 1 , 6 , 7 , 8 , 8 , 3 , 1 , 1 , 9 , 1 , 1 , 1 , 1 , 10 , 5 , 2 , 4 , 11 , 3 , 1 , 9 ,
      Nature
      Nature Publishing Group UK
      Evolutionary genetics, Genomics, Genome evolution, Centromeres

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          The complete assembly of each human chromosome is essential for understanding human biology and evolution 1, 2 . Here we use complementary long-read sequencing technologies to complete the linear assembly of human chromosome 8. Our assembly resolves the sequence of five previously long-standing gaps, including a 2.08-Mb centromeric α-satellite array, a 644-kb copy number polymorphism in the β-defensin gene cluster that is important for disease risk, and an 863-kb variable number tandem repeat at chromosome 8q21.2 that can function as a neocentromere. We show that the centromeric α-satellite array is generally methylated except for a 73-kb hypomethylated region of diverse higher-order α-satellites enriched with CENP-A nucleosomes, consistent with the location of the kinetochore. In addition, we confirm the overall organization and methylation pattern of the centromere in a diploid human genome. Using a dual long-read sequencing approach, we complete high-quality draft assemblies of the orthologous centromere from chromosome 8 in chimpanzee, orangutan and macaque to reconstruct its evolutionary history. Comparative and phylogenetic analyses show that the higher-order α-satellite structure evolved in the great ape ancestor with a layered symmetry, in which more ancient higher-order repeats locate peripherally to monomeric α-satellites. We estimate that the mutation rate of centromeric satellite DNA is accelerated by more than 2.2-fold compared to the unique portions of the genome, and this acceleration extends into the flanking sequence.

          Abstract

          The complete assembly of human chromosome 8 resolves previous gaps and reveals hidden complex forms of genetic variation, enabling functional and evolutionary characterization of primate centromeres.

          Related collections

          Most cited references76

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          The Sequence Alignment/Map format and SAMtools

          Summary: The Sequence Alignment/Map (SAM) format is a generic alignment format for storing read alignments against reference sequences, supporting short and long reads (up to 128 Mbp) produced by different sequencing platforms. It is flexible in style, compact in size, efficient in random access and is the format in which alignments from the 1000 Genomes Project are released. SAMtools implements various utilities for post-processing alignments in the SAM format, such as indexing, variant caller and alignment viewer, and thus provides universal tools for processing read alignments. Availability: http://samtools.sourceforge.net Contact: rd@sanger.ac.uk
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            MAFFT Multiple Sequence Alignment Software Version 7: Improvements in Performance and Usability

            We report a major update of the MAFFT multiple sequence alignment program. This version has several new features, including options for adding unaligned sequences into an existing alignment, adjustment of direction in nucleotide alignment, constrained alignment and parallel processing, which were implemented after the previous major update. This report shows actual examples to explain how these features work, alone and in combination. Some examples incorrectly aligned by MAFFT are also shown to clarify its limitations. We discuss how to avoid misalignments, and our ongoing efforts to overcome such limitations.
              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Cutadapt removes adapter sequences from high-throughput sequencing reads

                Bookmark

                Author and article information

                Contributors
                eee@gs.washington.edu
                Journal
                Nature
                Nature
                Nature
                Nature Publishing Group UK (London )
                0028-0836
                1476-4687
                7 April 2021
                7 April 2021
                2021
                : 593
                : 7857
                : 101-107
                Affiliations
                [1 ]GRID grid.34477.33, ISNI 0000000122986657, Department of Genome Sciences, , University of Washington School of Medicine, ; Seattle, WA USA
                [2 ]GRID grid.48336.3a, ISNI 0000 0004 1936 8075, Developmental Therapeutics Branch, , National Cancer Institute, ; Bethesda, MD USA
                [3 ]GRID grid.94365.3d, ISNI 0000 0001 2297 5165, Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, , National Institutes of Health, ; Bethesda, MD USA
                [4 ]GRID grid.7644.1, ISNI 0000 0001 0120 3326, Department of Biology, , University of Bari, Aldo Moro, ; Bari, Italy
                [5 ]GRID grid.250820.d, ISNI 0000 0000 9420 1591, Stowers Institute for Medical Research, ; Kansas City, MO USA
                [6 ]GRID grid.15447.33, ISNI 0000 0001 2289 6897, Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, , Saint Petersburg State University, ; Saint Petersburg, Russia
                [7 ]GRID grid.266100.3, ISNI 0000 0001 2107 4242, Graduate Program in Bioinformatics and Systems Biology, , University of California, San Diego, ; San Diego, CA USA
                [8 ]GRID grid.4367.6, ISNI 0000 0001 2355 7002, McDonnell Genome Institute, Department of Genetics, , Washington University School of Medicine, ; St Louis, MO USA
                [9 ]GRID grid.34477.33, ISNI 0000000122986657, Howard Hughes Medical Institute, , University of Washington, ; Seattle, WA USA
                [10 ]GRID grid.21925.3d, ISNI 0000 0004 1936 9000, Department of Pathology, , University of Pittsburgh, ; Pittsburgh, PA USA
                [11 ]GRID grid.205975.c, ISNI 0000 0001 0740 6917, Center for Biomolecular Science and Engineering, , University of California, Santa Cruz, ; Santa Cruz, CA USA
                Author information
                http://orcid.org/0000-0003-2396-0656
                http://orcid.org/0000-0002-8651-1615
                http://orcid.org/0000-0002-9648-4278
                http://orcid.org/0000-0002-1472-8962
                http://orcid.org/0000-0002-3688-4501
                http://orcid.org/0000-0003-2223-9787
                http://orcid.org/0000-0002-9809-8127
                http://orcid.org/0000-0001-6340-6065
                http://orcid.org/0000-0001-8414-8966
                http://orcid.org/0000-0001-8413-6498
                http://orcid.org/0000-0003-0743-3637
                http://orcid.org/0000-0001-7762-8777
                http://orcid.org/0000-0002-3670-4507
                http://orcid.org/0000-0003-2983-8934
                http://orcid.org/0000-0002-8246-4014
                Article
                3420
                10.1038/s41586-021-03420-7
                8099727
                33828295
                8bbf5a71-49bd-4b95-9805-808aab6e8427
                © The Author(s) 2021

                Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

                History
                : 4 September 2020
                : 4 March 2021
                Categories
                Article
                Custom metadata
                © The Author(s), under exclusive licence to Springer Nature Limited 2021

                Uncategorized
                evolutionary genetics,genomics,genome evolution,centromeres
                Uncategorized
                evolutionary genetics, genomics, genome evolution, centromeres

                Comments

                Comment on this article