23
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Local sequence determinants of two in-frame triplet deletion/duplication hotspots in the RHD/RHCE genes

      letter
      1 , , 2 , 1
      Human Genomics
      BioMed Central

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Different types of human gene mutation can vary in size quite dramatically (e.g., single nucleotide substitutions vs. copy number variations), but what they all have in common is that their occurrence is often closely bound up with specific characteristics of the local DNA sequence environment [1]. Here, we highlight the importance of local sequence features that underlie the two in-frame triplet deletion/duplication hotspots in the cis-linked, highly homologous RHD and RHCE paralogs. The first hotspot refers to an 8-bp sequence tract in exon 1 of the RHD and RHCE genes, in which three different variants were reported (Figure 1a) [2-4]. The first variant is a deletion of one of two juxtaposed CTC triplets in the RHD gene, which gives rise to an in-frame deletion of a single amino acid, Leu27 [2]. The second variant is identical to the first but occurred at the analogous location in the RHCE gene [3]. Henceforth, we shall employ the term ‘deduplication’ [5], which emphasizes the identity of the deleted sequence and the sequence immediately abutting the site of the deletion, to describe this particular type of microdeletion (<21 bp in length in accordance with Ball et al. [6]). Deduplication accounts for a significant proportion of disease-causing microdeletions; indeed, for microdeletion events of 2–5 bp, 38 % were found to be deduplications [6]. Replication slippage is currently regarded as the major mechanism underlying the generation of deduplications: the primer strand containing the newly synthesized first direct repeat dissociates from the template strand and then misaligns (slipping forward) at the second direct repeat; continued DNA synthesis then leads to the deletion of one of the two direct repeats. It should be pointed out that while direct repeats are a prerequisite for replication slippage, they are certainly not the sole determinant of this mutational mechanism. In this regard, we noted that the two CTC triplets together constitute a DNA polymerase arrest site (consensus sequence WGGAG, where W = A or T [7]) (Figure 1a). It is therefore tempting to speculate that a combination of these two sequence features could have served to make this short region particularly prone to replication slippage. Figure 1 The three currently known in-frame triplet deletion/duplication variants in exon 1 of the RHD / RHCE genes. Their underlying mutational mechanisms are also shown. (a) Illustration of the identical deletion of one of two neighboring CTC repeats (in bold) which occurred at the corresponding positions of RHD[2] and RHCE[3] genes as well as the 3-bp in-frame duplication in the RHD gene [4], illustrated here as having arisen from the duplication of c.74_76 (boxed). The RHD and RHCE genes share 100% sequence identity in the region from c.70_87. The underlined CTCCT motif corresponds to the complementary strand of the DNA polymerase arrest site WGGAG. (b) Schema for how the 3-bp duplication could have been generated in accordance with the model of serial replication slippage (see Chen et al. [8] for details). Short direct repeats that could have mediated the two steps of replication slippage are highlighted in bold or are underlined. The horizontal arrows indicate the direction of DNA synthesis. The identical 3-bp deletions are explicable by a single step of forward slippage (not shown). Recently, Pereira and colleagues reported the first in-frame triplet duplication in the RHD gene; this duplication affected the same short region as the aforementioned two deduplications in exon 1 (Figure 1a) [4]. As pointed out by the original authors, this duplication could have resulted from either a duplication of c.74-76TTC or c.75_77TCT. These authors emphasized the importance of a DNA motif (i.e., TTCTC that was identified by analogy to previously reported deletion-predisposing DNA motifs in the RHD gene [9]) in generating this duplication but did not provide a model to explain how this duplication could have been generated. Given that the sequence tract in question is prone to replication slippage, we surmised that this duplication might also be explicable in terms of such a mechanism. Indeed, as illustrated in Figure 1b, it can be readily explained by the model of serial replication slippage [8], invoking one step of forward slippage and one step of backward slippage. The second hotspot refers to a 63-bp region of exon 5 in the RHD and RHCE genes, in which four in-frame triplet deletions (c.644_646delTCT [3], c.684_686delGAG, and c.705_707delAGA [9] in RHD; c.685_687delAGA [10] in RHCE) were reported (Figure 2). Several distinct DNA repeats or motifs (e.g., GAGAA and GAAGA) have previously been implicated in the generation of three of these four variants [9]. A comparative evaluation of the four variants led us to propose a consensus motif RAGAA (R = A or G) (Figure 2). Since only the c.644_646delTCT variant can be explained in terms of replication slippage, it may be that RAGAA is associated with a recombination-predisposing activity that is distinct from the DNA polymerase arrest site WGGAG. In other words, the different local sequence contexts in exons 1 and 5 of RHD and RHCE could predispose to subtly different mutational processes. Figure 2 The three 3-bp deletion variants in exon 5 of RHD . The deletions are highlighted in bold. The underlined sequences refer to the consensus motif RAGAA or its complement. The c.685_687AGA deletion in RHCE [10] is not shown.

          Related collections

          Most cited references9

          • Record: found
          • Abstract: found
          • Article: not found

          Microdeletions and microinsertions causing human genetic disease: common mechanisms of mutagenesis and the role of local DNA sequence complexity.

          In the Human Gene Mutation Database (www.hgmd.org), microdeletions and microinsertions causing inherited disease (both defined as involving < or = 20 bp of DNA) account for 8,399 (17%) and 3,345 (7%) logged mutations, in 940 and 668 genes, respectively. A positive correlation was noted between the microdeletion and microinsertion frequencies for 564 genes for which both microdeletions and microinsertions are reported in HGMD, consistent with the view that the propensity of a given gene/sequence to undergo microdeletion is related to its propensity to undergo microinsertion. While microdeletions and microinsertions of 1 bp constitute respectively 48% and 66% of the corresponding totals, the relative frequency of the remaining lesions correlates negatively with the length of the DNA sequence deleted or inserted. Many of the microdeletions and microinsertions of more than 1 bp are potentially explicable in terms of slippage mutagenesis, involving the addition or removal of one copy of a mono-, di-, or trinucleotide tandem repeat. The frequency of in-frame 3-bp and 6-bp microinsertions and microdeletions was, however, found to be significantly lower than that of mutations of other lengths, suggesting that some of these in-frame lesions may not have come to clinical attention. Various sequence motifs were found to be over-represented in the vicinity of both microinsertions and microdeletions, including the heptanucleotide CCCCCTG that shares homology with the complement of the 8-bp human minisatellite conserved sequence/chi-like element (GCWGGWGG). The previously reported indel hotspot GTAAGT and its complement ACTTAC were also found to be overrepresented in the vicinity of both microinsertions and microdeletions, thereby providing a first example of a mutational hotspot that is common to different types of gene lesion. Other motifs overrepresented in the vicinity of microdeletions and microinsertions included DNA polymerase pause sites and topoisomerase cleavage sites. Several novel microdeletion/microinsertion hotspots were noted and some of these exhibited sufficient similarity to one another to justify terming them "super-hotspot" motifs. Analysis of sequence complexity also demonstrated that a combination of slipped mispairing mediated by direct repeats, and secondary structure formation promoted by symmetric elements, can account for the majority of microdeletions and microinsertions. Thus, microinsertions and microdeletions exhibit strong similarities in terms of the characteristics of their flanking DNA sequences, implying that they are generated by very similar underlying mechanisms.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            On the sequence-directed nature of human gene mutation: the role of genomic architecture and the local DNA sequence environment in mediating gene mutations underlying human inherited disease.

            Different types of human gene mutation may vary in size, from structural variants (SVs) to single base-pair substitutions, but what they all have in common is that their nature, size and location are often determined either by specific characteristics of the local DNA sequence environment or by higher order features of the genomic architecture. The human genome is now recognized to contain "pervasive architectural flaws" in that certain DNA sequences are inherently mutation prone by virtue of their base composition, sequence repetitivity and/or epigenetic modification. Here, we explore how the nature, location and frequency of different types of mutation causing inherited disease are shaped in large part, and often in remarkably predictable ways, by the local DNA sequence environment. The mutability of a given gene or genomic region may also be influenced indirectly by a variety of noncanonical (non-B) secondary structures whose formation is facilitated by the underlying DNA sequence. Since these non-B DNA structures can interfere with subsequent DNA replication and repair and may serve to increase mutation frequencies in generalized fashion (i.e., both in the context of subtle mutations and SVs), they have the potential to serve as a unifying concept in studies of mutational mechanisms underlying human inherited disease. © 2011 Wiley-Liss, Inc.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Context of deletions and insertions in human coding sequences.

              We studied the dependence of the rate of short deletions and insertions on their contexts using the data on mutations within coding exons at 19 human loci that cause mendelian diseases. We confirm that periodic sequences consisting of three to five or more nucleotides are mutagenic. Mutability of sequences with strongly biased nucleotide composition is also elevated, even when mutations within homonucleotide runs longer than three nucleotides are ignored. In contrast, no elevated mutation rates have been detected for imperfect direct or inverted repeats. Among known candidate contexts, the indel context GTAAGT and regions with purine-pyrimidine imbalance between the two DNA strands are mutagenic in our sample, and many others are not mutagenic. Data on mutation hot spots suggest two novel contexts that increase the deletion rate. Comprehensive analysis of mutability of all possible contexts of lengths four, six, and eight indicates a substantially elevated deletion rate within YYYTG and similar sequences, which is one of the two contexts revealed by the hot spots. Possible contexts that increase the insertion rate (AT(A/C)(A/C)GCC and TACCRC) and decrease deletion (TATCGC) or insertion (GCGG) rates have also been identified. Two-thirds of deletions remove a repeat, and over 80% of insertions create a repeat, i.e., they are duplications.
                Bookmark

                Author and article information

                Journal
                Hum Genomics
                Hum. Genomics
                Human Genomics
                BioMed Central
                1473-9542
                1479-7364
                2012
                2 August 2012
                : 6
                : 1
                : 8
                Affiliations
                [1 ]Etablissement Français du Sang (EFS) – Bretagne and INSERM, U1078, Brest, France
                [2 ]Institute of Medical Genetics, Cardiff University, Cardiff, CF14 4XN, UK
                Article
                1479-7364-6-8
                10.1186/1479-7364-6-8
                3500233
                23245726
                78ac0f2d-8870-415a-bb6a-e66861fd1389
                Copyright ©2012 Chen et al.; licensee Biomed Central Ltd.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 10 May 2012
                : 14 May 2012
                Categories
                Letter to the Editor

                Genetics
                Genetics

                Comments

                Comment on this article