| Literature DB >> 30858588 |
Li Zhang1, Yan Ren2, Tao Yang3, Guangwei Li3, Jianhai Chen1, Andrea R Gschwend1, Yeisoo Yu4, Guixue Hou3, Jin Zi2, Ruo Zhou2, Bo Wen2, Jianwei Zhang4, Kapeel Chougule4, Muhua Wang4, Dario Copetti4, Zhiyu Peng2, Chengjun Zhang1,5, Yong Zhang6, Yidan Ouyang3, Rod A Wing7, Siqi Liu8, Manyuan Long9.
Abstract
New protein-coding genes that arise de novo from non-coding DNA sequences contribute to protein diversity. However, de novo gene origination is challenging to study as it requires high-quality reference genomes for closely related species, evidence for ancestral non-coding sequences, and transcription and translation of the new genes. High-quality genomes of 13 closely related Oryza species provide unprecedented opportunities to understand de novo origination events. Here, we identify a large number of young de novo genes with discernible recent ancestral non-coding sequences and evidence of translation. Using pipelines examining the synteny relationship between genomes and reciprocal-best whole-genome alignments, we detected at least 175 de novo open reading frames in the focal species O. sativa subspecies japonica, which were all detected in RNA sequencing-based transcriptomes. Mass spectrometry-based targeted proteomics and ribosomal profiling show translational evidence for 57% of the de novo genes. In recent divergence of Oryza, an average of 51.5 de novo genes per million years were generated and retained. We observed evolutionary patterns in which excess indels and early transcription were favoured in origination with a stepwise formation of gene structure. These data reveal that de novo genes contribute to the rapid evolution of protein diversity under positive selection.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30858588 DOI: 10.1038/s41559-019-0822-5
Source DB: PubMed Journal: Nat Ecol Evol ISSN: 2397-334X Impact factor: 15.460