| Literature DB >> 31120894 |
Stephen Branden Van Oss1, Anne-Ruxandra Carvunis1.
Abstract
Entities:
Mesh:
Substances:
Year: 2019 PMID: 31120894 PMCID: PMC6542195 DOI: 10.1371/journal.pgen.1008160
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Genome-scale studies of orphan and de novo genes in various lineages.
For purposes of this table, genes are defined as orphan genes (when species-specific) or TRGs (when limited to a closely related group of species) when the mechanism of origination has not been investigated, and as when de novo origination has been inferred, irrespective of method of inference. The designation of de novo genes as “candidates” or “proto-genes” reflects the language used by the authors of the respective studies.
| Organism | Homology Detection Method(s) | Evidence of Expression? | Evidence of Selection? | Evidence of Physiological Role? | # Orphan/ | Notes | Ref. |
|---|---|---|---|---|---|---|---|
| Arthropods | BLASTP for all 30 species against each other, TBLASTN for | ESTs, RNA-seq; RT-PCR on select candidates | 37 | Prediction of signal peptides and subcellular localization for subset of orphans | ~65,000 orphan genes across 30 species | Abundance of orphan genes dependent on time since emergence from common ancestor; >40% of orphans from intergenic matches indicating possible | [ |
| BLASTP against 62 species, PSI-BLAST against NCBI nonredundant protein database, TBLASTN against PlantGDB-assembled unique transcripts database; searched syntenic region of two closely related species | Transcriptomic and translatomic data from multiple sources | Allele frequencies of | None | 782 | Also assessed DNA methylation and histone modifications | [ | |
| BLASTP against four | Microarray, RT-PCR | None | RNAi on five | 738 orphan genes | Five orphans identified as | [ | |
| BLASTP against NCBI nonredundant protein database, TBLASTN against NCBI nucleotide database, TBLASTN against NCBI EST database, PSI-BLAST against NCBI nonredundant protein database, InterProScan [ | Microarray | None | TRGs enriched for expression changes in response to abiotic stresses compared to other genes | 1761 nuclear TRGs; 28 mitochondrial TRGs | ~2% of TRGs thought to be | [ | |
| BLASTN of query cDNAs against | cDNA/ expressed sequence tags (ESTs) | Ka/Ks ratios calculated between retained new genes and their parental genes are significantly >1, indicating most new genes are functionally constrained | List includes several genes with characterized molecular roles | 72 orphan genes; 2 | Gene duplication dominant mechanism for new genes; 7/59 orphans specific to | [ | |
| Presence or absence of orthologs in other | Indirect (RNAi) | Youngest essential genes show signatures of positive selection (α = 0.25 as a group) | Knockdown with constitutive RNAi lethal for 59 TRGs | 195 “young” (>35myo) TRGs; 16 | Gene duplication dominant mechanism for new genes | [ | |
| RNA-seq in | RNA-seq | Nucleotide diversity lower in non-expressing relatives; | Structural features of | 106 fixed and 142 segregating | Specifically expressed in testes | [ | |
| BLASTP against other primates; BLAT against chimpanzee and orangutan genomes, manual check of syntenic regions in chimpanzee and orangutan | RNA-seq | Substitution rate provides some evidence for weak selection; 59/60 | None | 60 | Enabling mutations identified; highest expression seen in brain and testes | [ | |
| BLASTP against chimpanzee, BLAT and Ssearch of syntenic region in chimpanzee, manual check of syntenic regions in chimpanzee and macaque | EST/cDNA | No evidence of selective constraint seen by nucleotide divergence | One of the genes identified has a known role in leukemia | 3 | Estimated that human genome contains ~ 18 human-specific | [ | |
| BLASTP of all focal species against each other, BLASTP against NCBI nonredundant protein database, PSI-BLAST against NCBI nonredundant protein database, HMM Profile-Profile of TRG families against each other; families then merged and searched against four profile databases | Mass Spectrometry (MS) | Ka/Ks ratios across | None | 288 candidate | MS evidence of translation for 25 candidates | [ | |
| BLASTP of rat and mouse against each other, BLASTP against Ensembl compara database; searched syntenic regions in rat and mouse | UniGene Database | Subset of genes shows low nucleotide diversity and high ORF conservation across 17 strains | Two mouse genes cause morbidity when knocked out | 69 | Enabling mutations identified for 9 mouse genes | [ | |
| BLASTP against NCBI nonredundant protein database | Microarray | None | None | 781 orphan genes | Age-dependent features of genes compatible with | [ | |
| Protein-to-protein and nucleotide-to-nucleotide BLAT against eight | RNA-seq (all | 22 | Expression of | 175 | ~57% of | [ | |
| Primates | BLASTP against 15 eukaryotes, BLASTN against human genome, analysis of syntenic regions | ESTs | Ka/Ks ratios for TRGs below one but higher than established genes; coding scores consistent with translated proteins | Several genes have well-characterized cellular roles | 270 TRGs | ~5.5% of TRGs estimated to have originated | [ |
| BLASTP against NCBI nonredundant protein database | None | Mouse genes share 50% identity with rat ortholog | None | 84 TRGs | Species-specific genes excluded from analysis; | [ | |
| BLASTP and PSI-BLAST against 18 fungal species, HMMER and HHpred against several databases, TBLASTN against three close relatives | None | None | Majority of orphans have characterized fitness effects | 188 orphan genes | Ages of genes determined at level of individual residues | [ | |
| BLASTP, TBLASTX, and TBLASTN against 14 other yeast species, BLASTP against NCBI nonredundant protein database | Ribosome Profiling | All 25 | None | 25 | [ | ||
| BLASTP against NCBI nonredundant protein database, TBLASTN against ten outgroup species; BLASTP and phmmer against 20 yeast species reannotated using syntenic alignments | Transcript isoform sequencing (TIF-seq), Ribosome Profiling | Most genes weakly constrained but a subset under strong selection, according to Neutrality Index, Direction of Selection, Ka/Ks, and McDonald-Kreitman tests | Subcellular localization demonstrated for five genes | ~13,000 | >65% of | [ |