| Literature DB >> 21917143 |
Abstract
BACKGROUND: Complete exome resequencing has the power to greatly expand our understanding of non-human primate genomes. This includes both a better appreciation of the variation that exists in non-human primate model species, but also an improved annotation of their genomes. By developing an understanding of the variation between individuals, non-human primate models of human disease can be better developed. This effort is hindered largely by the lack of comprehensive information on specific non-human primate genetic variation and the costs of generating these data. If the tools that have been developed in humans for complete exome resequencing can be applied to closely related non-human primate species, then these difficulties can be circumvented.Entities:
Mesh:
Year: 2011 PMID: 21917143 PMCID: PMC3308050 DOI: 10.1186/gb-2011-12-9-r87
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Sample read coverage
| Human | Chimpanzee | Rhesus macaque | ||||
|---|---|---|---|---|---|---|
| Complete | 150,776 | 91.9% | 148,109 | 90.3% | 105,127 | 64.1% |
| Partial | 10,583 | 6.5% | 12,776 | 7.8% | 52,941 | 32.3% |
| Nothing | 2,683 | 1.6% | 3,157 | 1.9% | 5,974 | 3.6% |
| All | 164,042 | 164,042 | 164,042 | |||
| Complete | 155,669 | 94.8% | 154,322 | 94.0% | 136,123 | 82.8% |
| Partial | 5,775 | 3.6% | 6,583 | 4.1% | 21,547 | 13.2% |
| Nothing | 2,598 | 1.6% | 3,137 | 1.9% | 6,372 | 3.9% |
| All | 164,042 | 164,042 | 164,042 | |||
| Complete | 155,669 | 94.8% | 149,541 | 95.3% | 146,606 | 94.7% |
| Partial | 5,775 | 3.6% | 6,034 | 3.8% | 6,785 | 4.4% |
| Nothing | 2,598 | 1.6% | 1,293 | 0.8% | 1,469 | 0.9% |
| All | 164,042 | 156,868 | 154,860 | |||
Exomic coverage on the human genome in exon entirety, coding sequence (CDS) only and coding sequence in self-genomes.
Figure 1Human coverage compared to cross-species coverage. (a,b) Scatter plots showing the coverage level for each coding exon from human sample reads on human genome annotation compared to chimpanzee sample reads on human genome annotation (a) or rhesus sample reads on human genome annotation (b). (c,d) Histograms showing the difference between human and cross-species coverage, chimpanzee (c) or rhesus macaque (d), demonstrating the effects of species bias in capture efficacy.
Figure 2Coverage compared to divergence. (a,b) Scatter plots showing the relationship between coverage and divergence from human for each coding exon in chimpanzee (a) and rhesus macaque (b). (c,d) Scatter plots also show the relationship between divergence from human and coverage differences between human and chimpanzee (c) and rhesus macaque (d).
Nucleotide variation in self-self assemblies
| Human | Chimpanzee | Rhesus macaque | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Hom | Het | Hemi | Hom | Het | Hemi | Hom | Het | Hemi | |
| Frameshift | 10 | 53 | 0 | 114 | 119 | 15 | 169 | 162 | 4 |
| Amino acid loss | 13 | 63 | 0 | 7 | 68 | 3 | 16 | 73 | 0 |
| Frameshift | 13 | 52 | 4 | 85 | 83 | 22 | 128 | 169 | 6 |
| Amino acid gain | 12 | 42 | 1 | 5 | 57 | 1 | 9 | 62 | 1 |
| Synonymous | 23 | 34 | 2 | 22 | 57 | 5 | 20 | 77 | 1 |
| Non-synonymous | 79 | 269 | 2 | 312 | 571 | 47 | 245 | 793 | 9 |
| Stop codon gain | 0 | 7 | 0 | 1 | 19 | 2 | 0 | 25 | 1 |
| Stop codon loss | 1 | 0 | 0 | 0 | 2 | 0 | 0 | 1 | 0 |
| Synonymous | 4,259 | 8,159 | 206 | 3,204 | 11,549 | 198 | 8,777 | 14,878 | 233 |
| Non-synonymous | 3,618 | 8,336 | 155 | 3,725 | 12,465 | 346 | 6,158 | 13,252 | 220 |
| Stop codon gain | 8 | 171 | 2 | 11 | 286 | 2 | 34 | 339 | 3 |
| Stop codon loss | 9 | 7 | 0 | 2 | 21 | 0 | 18 | 20 | 1 |
| Deletion frameshift | 10 | 0 | 107 | 14 | 159 | 4 | |||
| Insertion frameshift | 13 | 4 | 85 | 21 | 125 | 6 | |||
| Either | 23 | 4 | 182 | 35 | 280 | 9 | |||
| Stop codon gain | 8 | 2 | 12 | 4 | 34 | 4 | |||
| Stop codon loss | 10 | 0 | 2 | 0 | 17 | 1 | |||
| Either | 18 | 2 | 14 | 4 | 51 | 5 | |||
| Any of above | 41 | 6 | 194 | 38 | 327 | 14 | |||
Nucleotide level variation observed in self-self assemblies. Hemi, hemizygous; Het, heterozygous; Hom, homozygous; MNP, multi-nucleotide polymorphism; SNP, single-nucleotide polymorphism.
Nucleotide variation in assemblies to human
| Chimpanzee | Rhesus macaque | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Sample | Genome | Sample | Genome | |||||||||
| Hom | Het | Hemi | Hom | Het* | Hemi | Hom | Het | Hemi | Hom | Het* | Hemi | |
| Frameshift | 147 | 185 | 11 | 550 | 296 | 72 | 337 | 360 | 18 | 504 | 200 | 32 |
| Amino acid loss | 191 | 210 | 12 | 183 | 74 | 10 | 643 | 295 | 35 | 697 | 290 | 37 |
| Frameshift | 104 | 163 | 11 | 242 | 260 | 29 | 190 | 321 | 15 | 281 | 300 | 17 |
| Amino acid gain | 109 | 269 | 13 | 69 | 237 | 3 | 319 | 600 | 27 | 246 | 755 | 34 |
| Synonymous | 289 | 125 | 10 | 78 | 13 | 4 | 1,310 | 318 | 81 | 963 | 52 | 41 |
| Non-synonymous | 1,986 | 1,024 | 56 | 1,971 | 293 | 79 | 13,022 | 2,822 | 502 | 17,762 | 982 | 682 |
| Stop codon gain | 7 | 27 | 1 | 20 | 16 | 4 | 36 | 106 | 1 | 55 | 28 | 0 |
| Stop codon loss | 2 | 2 | 0 | 0 | 2 | 0 | 8 | 4 | 0 | 6 | 2 | 0 |
| Synonymous | 76,204 | 14,506 | 2,355 | 75,930 | 826 | 1,864 | 318,662 | 23,085 | 11,117 | 346,779 | 2,688 | 11,653 |
| Non-synonymous | 49,699 | 16,017 | 2,220 | 49,093 | 1,105 | 1,696 | 139,476 | 22,633 | 5,943 | 160,418 | 2,574 | 6,479 |
| Stop codon gain | 204 | 379 | 19 | 228 | 17 | 15 | 400 | 732 | 20 | 445 | 26 | 21 |
| Stop codon loss | 85 | 34 | 4 | 27 | 1 | 3 | 137 | 50 | 5 | 75 | 2 | 3 |
| Deletion frameshift | 143 | 10 | 489 (678)a | 55 | 328 | 17 | 438 (579)a | 27 | ||||
| Insertion frameshift | 101 | 10 | 230 (439)a | 28 | 180 | 14 | 266 (519)a | 16 | ||||
| Either | 241 | 20 | 679 (1,016)a | 76 | 489 | 29 | 665 (1,013)a | 39 | ||||
| Stop codon gain | 200 | 16 | 237 (259)a | 14 | 406 | 21 | 443 (494)a | 19 | ||||
| Stop codon loss | 87 | 4 | 28 (32)a | 0 | 143 | 5 | 80 (84)a | 3 | ||||
| Either | 280 | 19 | 263 (289)a | 14 | 534 | 26 | 513 (567)a | 22 | ||||
| Any of above | 499 | 39 | 895 (1,316)a | 87 | 945 | 51 | 1082 (1,425)a | 56 | ||||
Comparison of nucleotide level variation in sample reads and genomic cross-species alignments to human. aValues in parentheses are Hom + Het*. Hemi, hemizygous; Het, heterozygous; Hom, homozygous; MNP, multi-nucleotide polymorphism; SNP, single-nucleotide polymorphism.