| Literature DB >> 22112745 |
Abstract
BACKGROUND: Genome-wide studies of intron dynamics in mammalian orthologous genes have found convincing evidence for loss of introns but very little for intron turnover. Similarly, large-scale analysis of intron dynamics in a few vertebrate genomes has identified only intron losses and no gains, indicating that intron gain is an extremely rare event in vertebrate evolution. These studies suggest that the intron-rich genomes of vertebrates do not allow intron gain. The aim of this study was to search for evidence of de novo intron gain in domesticated genes from an analysis of their exon/intron structures.Entities:
Mesh:
Substances:
Year: 2011 PMID: 22112745 PMCID: PMC3257199 DOI: 10.1186/1745-6150-6-59
Source DB: PubMed Journal: Biol Direct ISSN: 1745-6150 Impact factor: 4.540
DNA transposon-derived genes with de novo gained introns
| TE group (progenitor of the novel gene) | Gene name (ID) | Presence of introns | Location of introns | Number of introns |
|---|---|---|---|---|
| Tc1/mariner/pogo | JRK | yes | 5' UTR/coding | 2 |
| JRKL | yes | coding | 2 | |
| TIGD 1-7 | yes | 5' UTR | 1 | |
| POGZ | yes | coding | 5-13 | |
| POGK | yes | coding | 5 | |
| SETMAR | yes | coding | 2 | |
| Transib | RAG1 | yes | coding | 1-3 |
| hAT | ZBED1 | yes | 5' UTR | 1 |
| ZBED4 | yes | 5' UTR | 1-2 | |
| ZBED5 | yes | 5' UTR/coding | 2 | |
| ZMYM6 | yes | coding | 14 | |
| ZNF862 | yes | coding | 7 | |
| C5ORF54 (Buster3) | yes | 5' UTR | 1 | |
| GTF2IRD2 | yes | 5' UTR/coding | 15 | |
| PRKRIR (THAP0) | yes | coding | 4 | |
| THAP 1-11 | yes | 5' UTR/coding | diverse | |
| PIF/Harbinger | HARBI1 | yes | 5' UTR/coding | 2 |
| NAIF1 | yes | coding | 1 | |
| piggyBac | PGBD1 | yes | 5' UTR/coding | 1-6 |
| PGBD2 | yes | 5' UTR/coding | 2 | |
| PGBD3 | yes | 5' UTR/coding | 1-4 | |
| PGBD4 | yes | coding | 1 | |
| PGBD5 | yes | coding | 8 |
Retroelement (Metaviridae and ERV)-derived genes with de novo gained introns
| TE group (progenitor of the novel gene) | Gene name (ID) | Presence of introns | Location of introns | Number of introns |
|---|---|---|---|---|
| Chromovirus (sushi): MART | RGAG1 | yes | 5' UTR/coding | 3 |
| ZCCHC16 | yes | 5' UTR | 2-6 | |
| ZCCHC5 | yes | 5' UTR | 1 | |
| PEG10 | yes | 5' UTR or coding | 1-2 | |
| LDOC1L | yes | 5' UTR | 1 | |
| RGAG4 | yes | 3' UTR | 1 | |
| C22ORF29 | yes | 5' UTR | 2 | |
| FAM127b | yes | coding | 1 | |
| Barthez: PNMA | PNMA2 | yes | 5' UTR | 2 |
| PNMA3 | yes | 5' UTR | 1 | |
| MOAP1 | yes | 5' UTR | 1-2 | |
| PNMA5 | yes | 5' UTR or 3' UTR | 3 | |
| PNMA6A | yes | 5' UTR or coding | 1-2 | |
| PNMA6B | yes | 5' UTR | 1 | |
| ZCCHC12 | yes | 5' UTR | 3 | |
| ZCCHC18 | yes | 5' UTR | 2 | |
| CCDC8 | yes | coding | 1 | |
| PNMAL1 | yes | 5' UTR | 2 | |
| PNMAL2 | yes | coding | 1 | |
| Osvaldo: ARC | ARC | yes | 3' UTR | 2 |
| Gin1 | yes | 5' UTR/coding | 7 | |
| SCAND3 | yes | coding | 3 | |
| KRBA2 | yes | 5' UTR or coding | 1-3 | |
| NYNRIN | yes | 5' UTR/coding | 8 | |
| ERV: | ERVFRD-1 | yes | 5' UTR | 1 |
| ERVW-1 | yes | 5' UTR | 1 | |
| syncytin b | yes | 5' UTR | 2 |
Phylogenomic analysis of domesticated genes: insight into their origin and timing of intron gains
| Gene name | Origin in the LCA of Chordata | Origin in the LCA of Gnathostomata | Origin in the LCA of Tetrapoda | Origin in the LCA of Amniota | Origin in the LCA of Theria | Origin in the LCA of placentals |
|---|---|---|---|---|---|---|
| RGAG1 | ▲ | |||||
| ZCCHC16 | ▲ | |||||
| ZCCHC5 | ▲ | |||||
| PEG10 | ▲ | |||||
| LDOC1L | ▲ | |||||
| RGAG4 | ▲ | |||||
| C22ORF29 | ▲ | |||||
| PNMA2 | ▲ | |||||
| PNMA3 | ▲ | |||||
| MOAP1 | ▲ | |||||
| PNMA5 | ▲ | |||||
| PNMA6A | ▲ | |||||
| PNMA6B | ▲ | |||||
| ZCCHC12 | ▲ | |||||
| ZCCHC18 | ▲ | |||||
| CCDC8 | ▲ | |||||
| PNMAL1 | ▲ | |||||
| PNMAL2 | ▲ | |||||
| ARC | ▲ | |||||
| SCAND3 | ▲ | |||||
| KRBA2 | ▲ | |||||
| NYNRIN | ▲ | |||||
| POGK | ▲ | |||||
| ZBED1 | ▲ | |||||
| ZNF862 | ▲ | |||||
| Buster3 | ▲ | |||||
| PRKRIR | ▲ | |||||
| THAP 9 | ▲ | |||||
| HARBI1 | ▲ | |||||
| NAIF1 | ▲ | |||||
| PGBD1 | ▲ | |||||
| PGBD2 | ▲ | |||||
| PGBD5 | ▲ |
Figure 1Intron densities in domesticated genes compared with vertebrate, mammalian and human normal genes. The intron density is given as the number of introns per kb of coding sequence for vertebrate, mammalian and human normal genes, values were taken from literature [13]. The intron density of old domesticated genes is given as the number of introns per kb of coding sequence (introns in CDS) while the intron density of young Eutheria-specific domesticated genes is given as number of introns per kb of 5' UTR sequence (5' UTR introns).
Intron position conservation in Eutheria-specific domesticated genes
| Gene name | Euarchontoglires | Laurasiatheria | Xenarthra | Afrotheria |
|---|---|---|---|---|
| RGAG1 | ● | ● | ● | ● |
| RGAG4 | ● | ● | incomplete | ● |
| ZCCHC5 | ● | ● | ● | ● |
| LDOC1L | ● | ● | ● | ● |
| C22OF29 | ● | ● | ● | ● |
| PNMA2 | ● | ● | ● | ● |
| PNMA3 | ● | ● | incomplete | ● |
| MOAP1 | ● | ● | ● | ● |
| PNMA5 | ● | ● | ● | ● |
| PNMA6A | ● | ? | ? | ● |
| ZCCHC12 | ● | ● | ● | ● |
| PNMAL1 | ● | ● | ● | ● |
| KRBA2 | ● | ● | ● | ● |
| PGBD1 | ● | ● | ● | ● |
| Buster3 | ● | ● | ● | ● |
●: conserved intron position, ?: poor sequence quality, incomplete: only parts of the intron or intron positions can be seen in the current assembly.
Alternative splicing in Eutheria-specific domesticated genes
| Gene name | human | mouse | number of splicing events | splicing event details |
|---|---|---|---|---|
| PNMA2 | ● | 14 | EI, CE | |
| PNMA3 | ● | 3 | II, IR | |
| MOAP1 | ● | 1 | IR | |
| ZCCHC12 | ● | 7 | EI, CE, II, IR | |
| ZCCHC18 | ● | 9 | II, IR, EI | |
| POGK | ● | 5 | EI, CE, II | |
| POGK | ● | 3 | CE | |
| PRKRIR | ● | 1 | CE | |
| THAP9 | ● | 7 | EI, CE, II | |
| Buster3 | ● | 1 | CE | |
| PGBD1 | ● | 2 | EI, II | |
| PGBD2 | ● | 2 | CE, II | |
| PGBD5 | ● | 1 | IR |
●: alternative splicing is present. Splicing events: intron isoform (II), intron retention (IR), exon isoform (EI) and casette exon (CE).