| Literature DB >> 20660110 |
Ryan M Ames1, Bharat M Rash, Kathryn E Hentges, David L Robertson, Daniela Delneri, Simon C Lovell.
Abstract
Population-level differences in the number of copies of genes resulting from gene duplication and loss have recently been recognized as an important source of variation in eukaryotes. However, except for a small number of cases, the phenotypic effects of this variation are unknown. Data from the Saccharomyces Genome Resequencing Project permit the study of duplication in genome sequences from a set of individuals within the same population. These sequences can be correlated with available information on the environments from which these yeast strains were isolated. We find that yeast show an abundance of duplicate genes that are lineage specific, leading to a large degree of variation in gene content between individual strains. There is a detectable bias for specific functions, indicating that selection is acting to preferentially retain certain duplicates. Most strikingly, we find that sets of over- and underrepresented duplicates correlate with the environment from which they were isolated. Together, these observations indicate that gene duplication can give rise to substantial phenotypic differences within populations that in turn can offer a shortcut to evolutionary adaptation.Entities:
Mesh:
Year: 2010 PMID: 20660110 PMCID: PMC2997561 DOI: 10.1093/gbe/evq043
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Number of Predicted Genes, Duplicates, and LSDs for Saccharomyces cerevisiae
| Strain | Contigs | Predicted Genes | Duplicate Genes | LSDs | |
| Inc. Trans. | No Trans. | ||||
| DBVPG6765 | 2,879 | 5,770 | 1,195 | 84 | 81 |
| RM11_1A | 384 | 5,501 | 1,353 | 86 | 77 |
| REF (S288c) | 18 | 5,464 | 1,356 | 149 | 100 |
| SK1 | 2,827 | 5,797 | 1,219 | 83 | 72 |
| W303 | 3,853 | 5,296 | 837 | 34 | 20 |
| Y55 | 2,751 | 5,875 | 1,282 | 94 | 92 |
| YJM789 | 207 | 5,437 | 1,292 | 29 | 15 |
| DBVPG1373 | 3,656 | 3,974 | 378 | 6 | 4 |
| DBVPG1788 | 3,617 | 3,621 | 340 | 2 | 2 |
| DBVPG6044 | 3,955 | 4,276 | 457 | 2 | 2 |
| L_1374 | 3,150 | 3,118 | 256 | 6 | 6 |
| L_1528 | 3,439 | 3,380 | 302 | 12 | 10 |
| S288c | 3,408 | 3,516 | 367 | 9 | 3 |
| UWOPS05_227_2 | 3,021 | 3,177 | 235 | 0 | 0 |
| YIIc17_E5 | 2,955 | 3,010 | 235 | 6 | 2 |
| YJM975 | 3,061 | 3,174 | 235 | 2 | 2 |
| YJM978 | 2,975 | 3,084 | 250 | 4 | 4 |
| YPS128 | 3,696 | 4,097 | 415 | 11 | 6 |
| YPS606 | 4,033 | 4,615 | 563 | 9 | 4 |
| YS4 | 3,021 | 3,119 | 280 | 26 | 24 |
| YS9 | 3,029 | 3,090 | 287 | 46 | 40 |
| 273614N | 2,591 | 2,485 | 159 | 2 | 2 |
| 322134S | 2,727 | 2,548 | 196 | 13 | 8 |
| 378604X | 3,014 | 2,998 | 255 | 14 | 10 |
| BC187 | 2,309 | 2,044 | 92 | 2 | 2 |
| DBVPG1106 | 2,013 | 1,802 | 86 | 4 | 2 |
| DBVPG1853 | 3,026 | 2,879 | 198 | 2 | 0 |
| DBVPG6040 | 2,487 | 2,384 | 210 | 16 | 14 |
| K11 | 2,657 | 2,629 | 163 | 2 | 2 |
| NCYC110 | 2,395 | 2,277 | 145 | 0 | 0 |
| NCYC361 | 1,873 | 1,301 | 83 | 16 | 14 |
| UWOPS03_461_4 | 2,927 | 2,969 | 223 | 4 | 4 |
| UWOPS05_217_3 | 2,454 | 2,591 | 241 | 8 | 8 |
| UWOPS83_787_3 | 2,713 | 2,721 | 206 | 19 | 16 |
| UWOPS87_2421 | 2,796 | 2,816 | 221 | 6 | 6 |
| Y12 | 2,584 | 2,472 | 147 | 6 | 6 |
| Y9 | 2,324 | 2,274 | 130 | 0 | 0 |
| YJM981 | 1,435 | 1,238 | 62 | 2 | 2 |
| YS2 | 2,294 | 1,639 | 112 | 8 | 6 |
Number of LSDs including transposable genes.
Number of LSDs excluding transposable genes.
Number of Predicted Genes, Duplicates, and LSDs for Saccharomyces paradoxus
| Strain | Contigs | Predicted Genes | Duplicate Genes | LSDs | |
| Inc. Trans. | No Trans. | ||||
| CBS432 | 1,773 | 5,409 | 1,140 | 50 | 50 |
| REF (CBS432) | 17 | 5,348 | 1,269 | 31 | 31 |
| CBS5829 | 3,439 | 5,656 | 1,095 | 54 | 54 |
| N_17 | 3,606 | 5,797 | 1,163 | 116 | 114 |
| N_45 | 3,005 | 5,907 | 1,257 | 125 | 123 |
| UWOPS91_917_1 | 4,589 | 5,172 | 1,139 | 543 | 541 |
| A12 | 3,709 | 3,767 | 388 | 14 | 12 |
| A4 | 3,745 | 3,935 | 346 | 5 | 5 |
| DBVPG4650 | 4,082 | 4,381 | 508 | 0 | 0 |
| DBVPG6304 | 4,094 | 4,617 | 536 | 19 | 19 |
| N_43 | 3,801 | 4,663 | 583 | 8 | 8 |
| N_44 | 3,704 | 4,091 | 421 | 2 | 2 |
| Q32_3 | 3,919 | 3,924 | 399 | 0 | 0 |
| Q59_1 | 3,856 | 3,871 | 393 | 4 | 4 |
| Q62_5 | 4,064 | 3,981 | 408 | 6 | 6 |
| Q95_3 | 4,029 | 4,411 | 484 | 6 | 6 |
| T21_4 | 3,953 | 4,106 | 474 | 6 | 6 |
| UFRJ50816 | 3,649 | 3,602 | 345 | 12 | 10 |
| Y6_5 | 3,305 | 3,103 | 277 | 2 | 2 |
| Y7 | 3,805 | 3,673 | 363 | 0 | 0 |
| YPS138 | 3,847 | 4,093 | 432 | 5 | 5 |
| IFO1804 | 2,668 | 2,564 | 160 | 0 | 0 |
| KPN3828 | 2,700 | 2,545 | 137 | 0 | 0 |
| KPN3829 | 2,666 | 2,502 | 162 | 4 | 2 |
| Q89_8 | 2,816 | 2,526 | 153 | 2 | 0 |
| S36_7 | 1,642 | 1,239 | 43 | 0 | 0 |
| UFRJ50791 | 2,315 | 2,130 | 103 | 0 | 0 |
| Z1_1 | 3,073 | 2,847 | 186 | 2 | 2 |
Number of LSDs including transposable genes.
Number of LSDs excluding transposable genes.
FChromosomal distribution of duplicate genes. The graphs show the distribution of duplicate genes (black) and randomly generated duplicate genes (white) for 16 Saccharomyces cerevisiae chromosomes. Arrows indicate positions of centromeres.
FThe age of population duplicates (white) and LSDs (black) is measured by the number of synonymous mutations (Ks). A higher value of Ks indicates that the duplicate genes have diverged and are therefore older. Genes from both Saccharomyces cerevisiae and S. paradoxus are shown.
FSigns of selection acting on duplicate genes. (A) The number of Saccharomyces cerevisiae duplicate genes; (B) the number of S. paradoxus duplicates; (C) the number of S. cerevisiae LSDs; and (D) the number of S. paradoxus LSDs. Here selection is measured by the ratio of nonsynonymous (Ka) to synonymous mutations (Ks). A higher Ka/Ks ratio indicates that one member of a duplicate pair has more nonsynonymous substitutions.
FThe distribution branch length ratios for (A) Saccharomyces cerevisiae and (B) S. paradoxus. The branch ratio is defined as the ratio between the branch lengths on a phylogenetic tree of each duplicate pair rooted by a Kluyveromyces waltii outgroup.
FPhenetic and phylogenetic trees for Saccharomyces cerevisiae and S. paradoxus. (A) The 21 S. cerevisiae strains with over- or underrepresented “Biological Process” GO terms. (B) The 18 S. paradoxus strains with over- or underrepresented “Biological Process” GO terms. Distances between strains were determined using the semantic distance between the over- and underrepresented “Biological Process” GO terms of each strain. Branches leading to each strain were then colored according to environmental background. Strains from similar backgrounds have similar overrepresented GO terms, indicating selection for similar types of duplicate genes. (C) Phylogenetic for all S. cerevisiae strains. (D) Phylogenetic tree for all S. paradoxus strains. Phylogenetic trees are taken from Liti et al. (2009) and are based on single nucleotide polymorphism data.