| Literature DB >> 22745226 |
Georgia Tsagkogeorga1, Vincent Cahais, Nicolas Galtier.
Abstract
Phylogenomics has revealed the existence of fast-evolving animal phyla in which the amino acid substitution rate, averaged across many proteins, is consistently higher than in other lineages. The reasons for such differences in proteome-wide evolutionary rates are still unknown, largely because only a handful of species offer within-species genomic data from which molecular evolutionary processes can be deduced. In this study, we use next-generation sequencing technologies and individual whole-transcriptome sequencing to gather extensive polymorphism sequence data sets from Ciona intestinalis. Ciona is probably the best-characterized member of the fast-evolving Urochordata group (tunicates), which was recently identified as the sister group of the slow-evolving vertebrates. We introduce and validate a maximum-likelihood framework for single-nucleotide polymorphism and genotype calling, based on high-throughput short-read typing. We report that the C. intestinalis proteome is characterized by a high level of within-species diversity, efficient purifying selection, and a substantial percentage of adaptive amino acid substitutions. We conclude that the increased rate of amino acid sequence evolution in tunicates, when compared with vertebrates, is the consequence of both a 2-6 times higher per-year mutation rate and prevalent adaptive evolution.Entities:
Mesh:
Year: 2012 PMID: 22745226 PMCID: PMC3509891 DOI: 10.1093/gbe/evs054
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Illumina sequencing and read mapping statistics
| Sample | Locality | No. of reads | Total length (Mb) | Mapped reads (%) | Genes ≥ 5X | Genes ≥ 10X |
|---|---|---|---|---|---|---|
| GA02G | Grimstad, Norway | 3,909,166 | 293 | 32.8 | 2,834 | 1,920 |
| GA02I | South Askøy, Norway | 3,474,750 | 261 | 30.2 | 2,248 | 1,557 |
| GA02J | South Søtra, Norway | 3,415,890 | 256 | 39.2 | 3,078 | 2,077 |
| GA12M | South Askøy, Norway | 4,941,551 | 371 | 29.3 | 1,473 | 797 |
| GA12N | South Søtra, Norway | 2,440,102 | 183 | 31.9 | 1,543 | 970 |
| GA02L | Chester, Canada | 5,048,582 | 379 | 29.6 | 2,727 | 1,702 |
| GA02M | Port La Tour, Canada | 5,493,035 | 412 | 21.5 | 1,864 | 1,182 |
| GA02N | Petit-de-Grat, Canada | 7,115,473 | 534 | 35.6 | 3,624 | 2,565 |
| Total | 35,838,549 | 2,688 | 31.0 | 4,744 | 3,261 |
Target gene sharing across individuals
| No. of individuals | Transcripts (all) | Genes (all) | Genes ≥ 5X | Genes ≥ 10X |
|---|---|---|---|---|
| 8 | 11,466 | 8,080 | 612 | 317 |
| 7 | 13,939 | 9,800 | 1,170 | 689 |
| 6 | 15,639 | 11,015 | 1,590 | 955 |
| 5 | 16,939 | 11,998 | 2,063 | 1,280 |
| 4 | 17,970 | 12,754 | 2,509 | 1,669 |
| 3 | 18,789 | 13,361 | 3,054 | 2,067 |
| 2 | 19,425 | 13,854 | 3,649 | 2,532 |
| 1 | 19,934 | 14,282 | 4,744 | 3,261 |
Error model assessment in SNP and genotype calls
| Model | site_p | Stop% | No. of codon sites | |
|---|---|---|---|---|
| M2 | 0.75 | 6.9 | 133 | |
| M1 | 0.75 | 4.8 | 138 | |
| M1 | 0.5 | 4.8 | 137 | |
| M1 | 0.25 | 4.8 | 122 | |
| M1 | 0.125 | 4.8 | 104 | |
| M2 | 0.125 | 6.9 | −0.036 | 101 |
Main population genomic statistics calculated under various genotype calling methods
| Model | site_p | 103
| 102
| |||||
|---|---|---|---|---|---|---|---|---|
| M2 | 0.75 | 2.60 | 5.48 | 0.048 | 0.074 | 0.208 | 0.546 | 0.792 |
| M1 | 0.75 | 2.62 | 5.70 | 0.046 | 0.074 | 0.256 | 0.542 | 0.782 |
| M1 | 0.25 | 2.64 | 5.73 | 0.046 | 0.074 | 0.251 | 0.541 | 0.790 |
| M1 | 0.125 | 2.69 | 5.64 | 0.048 | 0.078 | 0.260 | 0.532 | 0.812 |
| M2 | 0.125 | 2.67 | 5.42 | 0.049 | 0.077 | 0.215 | 0.535 | 0.820 |
| CI | 0.5 | ±0.14 | ±0.21 | ±0.003 | ±0.005 | ±0.05 | ±0.04 | ±0.15 |
aThe 95% confidence intervals (CIs, 90% for αKEW) around the estimates obtained under model M1, site_p = 0.5 (bolded).
Comparison of major population genomic statistics across three animal species
| No. of genes | 1,602 | 10,996 | 1,431 | 6,530 | 980 |
| 102
| 5.70 | 3.27 | 2.88 | 0.164 | 0.089 |
| 0.046 | 0.085 | 0.058 | 0.241 | 0.303 | |
| 0.074 | 0.115 | 0.100 | 0.229 | 0.181 |
FDistribution of πS and πN/πS across genes in three animal species. In D. simulans and H. sapiens, the subsets of orthologs to the C. intestinalis genes analyzed in this study were used here.
Estimates of the tunicate versus vertebrate mutation rate ratio obtained using population genomics parameters of three distinct vertebrate species
| Rabbit | Mouse | Human | |
|---|---|---|---|
| ( | 2 | 2 | 2 |
| 0.65 | 0.57 | 0.15 | |
| (1 − | 0.63 | 0.51 | 0.37 |
| 0.03 | 0.1 | 0.21 | |
| 1.34 | 4.34 | 9.13 | |
| µT/µV | 1.69 | 4.43 | 6.76 |