| Literature DB >> 23651622 |
Chengjun Zhang1, Jun Wang, Nicholas C Marowsky, Manyuan Long, Rod A Wing, Chuanzhu Fan.
Abstract
In an effort to identify newly evolved genes in rice, we searched the genomes of Asian-cultivated rice Oryza sativa ssp. japonica and its wild progenitors, looking for lineage-specific genes. Using genome pairwise comparison of approximately 20-Mb DNA sequences from the chromosome 3 short arm (Chr3s) in six rice species, O. sativa, O. nivara, O. rufipogon, O. glaberrima, O. barthii, and O. punctata, combined with synonymous substitution rate tests and other evidence, we were able to identify potential recently duplicated genes, which evolved within the last 1 Myr. We identified 28 functional O. sativa genes, which likely originated after O. sativa diverged from O. glaberrima. These genes account for around 1% (28/3,176) of all annotated genes on O. sativa's Chr3s. Among the 28 new genes, two recently duplicated segments contained eight genes. Fourteen of the 28 new genes consist of chimeric gene structure derived from one or multiple parental genes and flanking targeting sequences. Although the majority of these 28 new genes were formed by single or segmental DNA-based gene duplication and recombination, we found two genes that were likely originated partially through exon shuffling. Sequence divergence tests between new genes and their putative progenitors indicated that new genes were most likely evolving under natural selection. We showed all 28 new genes appeared to be functional, as suggested by Ka/Ks analysis and the presence of RNA-seq, cDNA, expressed sequence tag, massively parallel signature sequencing, and/or small RNA data. The high rate of new gene origination and of chimeric gene formation in rice may demonstrate rice's broad diversification, domestication, its environmental adaptation, and the role of new genes in rice speciation.Entities:
Keywords: Oryza; chimera; comparative genomics; gene duplication; new gene
Mesh:
Year: 2013 PMID: 23651622 PMCID: PMC3673630 DOI: 10.1093/gbe/evt071
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FPhylogeny of six rice species showing the species divergence time and an illustration of new gene origination in Oryza sativa. Genes “A,” “C,” and “D” are orthologous in six species. Gene “B” is a new gene in O. sativa and/or Asian rice species. “AA” stands for the Oryza “A” genome type. “BB” stands for Oryza “B” genome type.
The New Genes, Paralogs, and Creation Mechanisms
| New Gene | Annotation | Paralogs | Possible Formation Mechanisms | |
|---|---|---|---|---|
| 1 | Os03g01008 | Expressed protein | ChrSy.fgenesh.mRNA.80 | Segmental duplication |
| 2 | Os03g01014 | Expressed protein | ChrSy.fgenesh.mRNA.82 | Segmental duplication |
| 3 | Os03g01020 | Pectinesterase inhibitor domain containing protein | ChrSy.fgenesh.mRNA.85 | Segmental duplication |
| 4 | Os03g01490 | Expressed protein | Os03g01420 | Tandem duplication, chimera |
| 5 | Os03g02130 | Hypothetical protein | Os01g63170 | Gene duplication |
| 6 | Os03g02340 | Expressed protein | Os05g05090 | Gene duplication, chimera |
| 7 | Os03g03050 | Expressed protein | Os07g20240 | Gene duplication |
| 8 | Os03g04760 | Expressed protein | Os05g11820 | Gene duplication |
| 9 | Os03g07090 | Expressed protein | Os11g08990 | Gene duplication |
| 10 | Os03g07270 | Glycine-rich cell wall protein | Os01g57250 | Gene duplication, chimera |
| 11 | Os03g07690 | Expressed protein | Os01g22910 | Gene duplication |
| 12 | Os03g09130 | Expressed protein | Os03g18760/Os11g07660 | Gene duplication, chimera |
| 13 | Os03g10840 | Expressed protein | Os03g11130 | Exon shuffling, chimera |
| 14 | Os03g11860 | Expressed protein | Os01g09060 | Gene duplication, chimera |
| 15 | Os03g12480 | Expressed protein | Os06g42410 | Gene duplication |
| 16 | Os03g12580 | Expressed protein | Os06g01010 | Exon shuffling, chimera |
| 17 | Os03g15060 | Expressed protein | Os01g19250 | Gene duplication, chimera |
| 18 | Os03g15110 | Expressed protein | Os03g46230 | Gene duplication, chimera |
| 19 | Os03g16320 | Expressed protein | Os04g50840 | Gene duplication |
| 20 | Os03g18650 | Hypothetical protein | Os05g38540 | Gene duplication |
| 21 | Os03g21310 | Ulp1 protease family | Os08g33280 | Gene duplication, chimera |
| 22 | Os03g24630 | Hypothetical protein | Os05g36060 | Gene duplication |
| 23 | Os03g24980 | SWIM zinc finger family protein | Os03g24970 | Tandem gene duplication, chimera |
| 24 | Os03g24990 | Ulp1 protease family | Os03g24960 | Tandem gene duplication, chimera |
| 25 | Os03g25950 | Expressed protein | Os12g32810 | Gene duplication, chimera |
| 26 | Os03g29140 | Expressed protein | Os01g09060 | Gene duplication, chimera |
| 27 | Os03g32526 | tRNA-splicing endonuclease positive effector related | Os06g20500 | Gene duplication |
| 28 | Os03g33920 | Conserved hypothetical protein | Os06g36630 | Gene duplication |
FIllustration and example of four general patterns of new gene origination in Oryza sativa genome. The genes above are new genes and the genes below are parental genes. (A) New gene formed chimeric gene structure from partial parental gene sequence. (B) New gene formed intact and nonchimeric structure from partial parental gene. (C) New gene formed from entire parental gene and shared same exon–intron gene structure. (D) New gene formed from entire parental gene but with different exon–intron gene structure. Exon, filled box; intron, solid line; homologous region, dash line. The start and stop codons are marked for each gene.
FIllustration and example of chimeric new gene. (A) New gene formed from one parental gene. (B) New gene formed from two parental genes. Exon, filled box; intron, solid line; homologous region, dash line. The start and stop codons are marked for each gene.
Expression of New Genes in Oryza sativa
| Locus | RNA-Seq Data | EST | MPSS | Small RNA |
|---|---|---|---|---|
| Os03g01008 | + | − | − | − |
| Os03g01014 | + | − | − | − |
| Os03g01020 | + | + | + | + |
| Os03g01490 | + | + | + | + |
| Os03g02130 | − | − | − | + |
| Os03g02340 | + | − | − | + |
| Os03g03050 | + | + | − | + |
| Os03g04760 | + | − | − | + |
| Os03g07090 | − | − | − | + |
| Os03g07270 | + | + | − | + |
| Os03g07690 | + | − | − | + |
| Os03g09130 | − | − | − | + |
| Os03g10840 | + | − | − | + |
| Os03g11860 | − | − | + | + |
| Os03g12480 | − | − | − | + |
| Os03g12580 | − | − | − | + |
| Os03g15060 | + | − | − | + |
| Os03g15110 | + | + | − | + |
| Os03g16320 | + | − | − | + |
| Os03g18650 | − | − | − | + |
| Os03g21310 | + | + | + | + |
| Os03g24630 | − | − | − | + |
| Os03g24980 | − | − | − | + |
| Os03g24990 | − | − | − | + |
| Os03g25950 | + | − | − | + |
| Os03g29140 | + | − | − | + |
| Os03g32526 | + | + | − | + |
| Os03g33920 | − | − | − | + |
Note.— +, present; −, absent.