| Literature DB >> 26484147 |
Juliann Shih1, Russ Hodge2, Miguel A Andrade-Navarro3.
Abstract
Variation is essential to species survival and adaptation during evolution. This variation is conferred by the imperfection of biochemical processes, such as mutations and alterations in DNA sequences, and can also be seen within genomes through processes such as the generation of antibodies. Recent sequencing projects have produced multiple versions of the genomes of humans and fruit flies (Drosophila melanogaster). These give us a chance to study how individual gene sequences vary within and between species. Here we arranged human and fly genes in orthologous pairs and compared such within-species variability with their degree of conservation between flies and humans. We observed that a significant number of proteins associated with mRNA translation are highly conserved between species and yet are highly variable within each species. The fact that we observe this in two species whose lineages separated more than 700 million years ago suggests that this is the result of a very ancient process. We hypothesize that this effect might be attributed to a positive selection for variability of virus-interacting proteins that confers a general resistance to viral hijacking of the mRNA translation machinery within populations. Our analysis points to this and to other processes resulting in positive selection for gene variation.Entities:
Keywords: Drosophila; Evolution; Human genome; Population; Variation
Year: 2014 PMID: 26484147 PMCID: PMC4536057 DOI: 10.1016/j.gdata.2014.11.010
Source DB: PubMed Journal: Genom Data ISSN: 2213-5960
Fig. 1Distributions of interspecies similarity (in percentage identity). Fly to human in nucleotides (A) and amino acids (B). Human to fly in nucleotides (C) and amino acids (D).
Fig. 2Distributions of intraspecies variation values. Fly genes in nucleotides (A) and amino acids (B). Human genes in nucleotides (C) and amino acids (D).
Fig. 3Distribution of intra- versus interspecies conservation. Cut-offs were applied to select sets of genes with high intraspecies variation and high interspecies conservation (filled diamonds). (A) Fly intra- versus interspecies (nt): cut-offs nt ≥ 0.36863452 and nt ≥ 0.544397227. (B) Fly intra- versus interspecies (aa): aa ≥ 0.036343094 and aa ≥ 0.49. (C) Human intra- versus interspecies (nt): nt ≥ 1.867292351 and nt ≥ 0.603982301. (D) Human intra- versus interspecies (aa): aa > = 0.0893 and aa ≥ 0.62. Regression lines (black) are shown for each distribution indicating r value for Pearson 's correlation, two tailed p-value, and the slope (m) and value at y = 0 (b) of the regression line.
Enriched gene ontology terms. For each cell in the table: (i) number of genes tested (annotated with any GO terms), (ii) definition of the enriched GO term, (iii) identifier, (iv) symbols of the genes with the GO term found in the test set, (v) number of genes found of all annotated with the GO term, and (vi) p-value indicating the significance of the enrichment. A larger list is shown as Supplementary Table 2.
| nt conservation | aa conservation | |
|---|---|---|
| Fly | 94 genes | 34 genes |
| Structural constituent of ribosome | Amino acid metabolic process | |
| GO: 0003735 | GO: 0006520 | |
| mrps7 tko rps23 rpl27a rpl13 cg4866 rpl23 rps26 rpl8 rpl27 mrpl14 rpl7a rps15 rps9 | cg9547 aats-gly aats-asn aats-arg cg10361 aats-gln aats-glupro cg6415 | |
| 14 of 111 | 8 of 52 | |
| 0.0312 | 0.00073 | |
| Human | 28 genes | 24 genes |
| Cytosolic small ribosomal subunit | Cytosolic ribosome | |
| GO: 0005843 | GO: 0005830 | |
| rps16 rps25 rps9 | rps2 rplp0 rps16 rps23 rps9 rpl8 | |
| 3 of 17 | 6 of 43 | |
| 0.13 | 0.000723 |