| Literature DB >> 18208328 |
Shuang Yang1, J Roman Arguello, Xin Li, Yun Ding, Qi Zhou, Ying Chen, Yue Zhang, Ruoping Zhao, Frédéric Brunet, Lixin Peng, Manyuan Long, Wen Wang.
Abstract
Previous studies of repetitive elements (REs) have implicated a mechanistic role in generating new chimerical genes. Such examples are consistent with the classic model for exon shuffling, which relies on non-homologous recombination. However, recent data for chromosomal aberrations in model organisms suggest that ectopic homology-dependent recombination may also be important. Lack of a dataset comprising experimentally verified young duplicates has hampered an effective examination of these models as well as an investigation of sequence features that mediate the rearrangements. Here we use approximately 7,000 cDNA probes (approximately 112,000 primary images) to screen eight species within the Drosophila melanogaster subgroup and identify 17 duplicates that were generated through ectopic recombination within the last 12 mys. Most of these are functional and have evolved divergent expression patterns and novel chimeric structures. Examination of their flanking sequences revealed an excess of repetitive sequences, with the majority belonging to the transposable element DNAREP1 family, associated with the new genes. Our dataset strongly suggests an important role for REs in the generation of chimeric genes within these species.Entities:
Mesh:
Substances:
Year: 2007 PMID: 18208328 PMCID: PMC2211543 DOI: 10.1371/journal.pgen.0040003
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Figure 1An Example Illustrating the Detection of New Genes
(A) The probe LD47348 (CG10595) detected two signals in the clade of D. yakuba-santomea-teissieri while only detecting one signal in other species. The new additional signal suggests a new gene candidate.
(B) Southern hybridization results further confirm the extra copy in the D. yakuba-santomea-teissieri clade (M is 1-kb extension marker [Invitrogen]). Lanes 1–8 correspond to Xho I digested DNAs of D. yakuba, D. teissieri, D. santomea, D. erecta, D. melanogaster, D. simulans, D. mauritiana, and D. sechellia, respectively).
(C) Cartoon figure displaying the gene structures of the parental gene (d, or CG10595) and the new duplicate (d-r). The duplicated region is indicated by vertical dash lines. d-r recruited one upstream exon as indicated by yellow box.
(D) Expression patterns of the parental gene. (E) expression patterns of the new gene d-r revealed by one round of RT-PCR and a second round of nested PCR (M indicates DL2000 DNA molecular marker (Takara); E+, E−, L2+, L2−, L3+, L3−, P+, P−, A+, and A− correspond to positive and negative reactions for embryos, second instar larvae, third instar larvae, pupae, and adults, respectively). From these gels, it is clear that d-r is only expressed in the third instar larvae while the parental copy is expressed ubiquitously. All the bands in the negative control lanes are primer dimer bands. E+ and L3+ are weak but clearly visible.
List of the Young Duplicates Identified and their Parental Loci
Figure 2The Phylogenetic Distribution of the 17 New DLR Duplicates Identified in This Study
The species phylogeny and time scale are from [58]. Different color bars show different gene families. The kep1 gene family has six new duplicates (indicated by red bars).
Figure 3The Gene Structures of 16 New Duplicates Mapped on the Species Phylogeny
CR33318 is not shown because it is a truncated copy without detectable expression and has frame shift mutations. Duplicated regions are indicated with vertical dash lines. Horizontal dash lines in CG7635-r, CG3101-r, d-r, and klg-r indicate that we only obtained partial coding regions with RT-PCR and longer coding regions may exist outward. Boxes are exon regions and lines indicate introns. Yellow boxes indicate recruited chimeric regions, green boxes indicate parental loci UTRs, and blue boxes indicate duplicate loci UTRs. Positions of start and stop codons are marked.
Repetitive Elements at the Breakpoints of Duplicate Pairs
Figure 4Two Examples of New Genes with Repetitive Sequences at the Breakpoints
(A) Shows a satellite DNA sequence (SAR) located at the 5′ breakpoints of mkg-r2 and its parental gene mkg-r.
(B) Shows the existence of a transposon (DNAREP1-DM) at all the four breakpoints of a CR9337 duplicate pair.
Figure 5A Simplified Schematic of the Repetitive Sequence Flanking New Genes and Their Distribution over the D. melanogaster Subgroup Phylogeny
Left panel displays the varying degrees of identity and degeneration between flanking regions of paralogs, with the right panel displaying the branches in which they are found; 1: (CG2952:CG2952-r), (kep1: CG4021), (kep1:CG9337), (CG9902:CG7692), (kep1:CG3875); 2: (CG3875-CG3927); 3: (CG9337-CG9337-r); 4: (mkgr-mkgr2); 5: (CR9337-CR33318); 6: (CG3101-CG3101-r). The red blocks in the left panel indicate alignable regions of the TEs and other repeat sequences. The black boxes represent sequences of TEs and other repeats; fragmented black boxes represent RE fragments. The long boxes in various colors represent the identified new genes. See also Figure S1, for the alignments.