Yongzhong Zhao1, Richard J Epstein. 1. Laboratory of Computational Oncology, Faculty of Medicine, The University of Hong Kong, Pokfulam, Hong Kong.
Abstract
Tumor suppressor genes are classified by their somatic behavior either as caretakers (CTs) that maintain DNA integrity or as gatekeepers (GKs) that regulate cell survival, but the germ line role of these disease-related gene subgroups may differ. To test this hypothesis, we have used genomic data mining to compare the features of human CTs (n = 38), GKs (n = 36), DNA repair genes (n = 165), apoptosis genes (n = 622), and their orthologs. This analysis reveals that repair genes are numerically less common than apoptosis genes in the genomes of multicellular organisms (P < 0.01), whereas CT orthologs are commoner than GK orthologs in unicellular organisms (P < 0.05). Gene targeting data show that CTs are less essential than GKs for survival of multicellular organisms (P < 0.0005) and that CT knockouts often permit offspring viability at the cost of male sterility. Patterns of human familial oncogenic mutations confirm that isolated CT loss is commoner than is isolated GK loss (P < 0.00001). In sexually reproducing species, CTs appear subject to less efficient purifying selection (i.e., higher Ka/Ks) than GKs (P = 0.000003); the faster evolution of CTs seems likely to be mediated by gene methylation and reduced transcription-coupled repair, based on differences in dinucleotide patterns (P = 0.001). These data suggest that germ line CT/repair gene function is relatively dispensable for survival, and imply that milder (e.g., epimutational) male prezygotic repair defects could enhance sperm variation-and hence environmental adaptation and speciation-while sparing fertility. We submit that CTs and repair genes are general targets for epigenetically initiated adaptive evolution, and propose a model in which human cancers arise in part as an evolutionarily programmed side effect of age- and damage-inducible genetic instability affecting both somatic and germ line lineages.
Tumor suppressor genes are classified by their somatic behavior either as caretakers (CTs) that maintain DNA integrity or as gatekeepers (GKs) that regulate cell survival, but the germ line role of these disease-related gene subgroups may differ. To test this hypothesis, we have used genomic data mining to compare the features of humanCTs (n = 38), GKs (n = 36), DNA repair genes (n = 165), apoptosis genes (n = 622), and their orthologs. This analysis reveals that repair genes are numerically less common than apoptosis genes in the genomes of multicellular organisms (P < 0.01), whereas CT orthologs are commoner than GK orthologs in unicellular organisms (P < 0.05). Gene targeting data show that CTs are less essential than GKs for survival of multicellular organisms (P < 0.0005) and that CT knockouts often permit offspring viability at the cost of male sterility. Patterns of human familial oncogenic mutations confirm that isolated CT loss is commoner than is isolated GK loss (P < 0.00001). In sexually reproducing species, CTs appear subject to less efficient purifying selection (i.e., higher Ka/Ks) than GKs (P = 0.000003); the faster evolution of CTs seems likely to be mediated by gene methylation and reduced transcription-coupled repair, based on differences in dinucleotide patterns (P = 0.001). These data suggest that germ line CT/repair gene function is relatively dispensable for survival, and imply that milder (e.g., epimutational) male prezygotic repair defects could enhance sperm variation-and hence environmental adaptation and speciation-while sparing fertility. We submit that CTs and repair genes are general targets for epigenetically initiated adaptive evolution, and propose a model in which humancancers arise in part as an evolutionarily programmed side effect of age- and damage-inducible genetic instability affecting both somatic and germ line lineages.
A longstanding debate in evolutionary biology concerns how species of increasing structural complexity maintain their capacity for genetic variation—and, hence, adaptation and divergence—despite a predictably increasing need for genetic fidelity (Gulick 1893; Gould JL and Gould CG 1997). Relevant to this conflict, Cope's rule postulates that increasing body size creates a short-term reproductive advantage for the individual organism (Kingsolver and Pfennig 2004) while worsening long-term extinction risk for the clade (Van Valkenburgh et al. 2004; Hone and Benton 2005). This trade-off suggests that higher evolving organisms are subject to a progressive “Red Queen”–type clash between intensifying negative selection for phenotypic stability and weakening positive selection for genotypic variability (Markov 2000)—consistent with the modest proportion (0.03%) of coding sequence estimated to have been positively selected in humans, when compared with that negatively selected (2.5–5%) in humans or positively selected in simpler species such as Drosophila (20%) (Ponting and Lunter 2006). Indeed, prevailing theory teaches that most genetic novelty results from fixation of random (nonadaptive) drift affecting neutral (Kimura 1968) or near-neutral (Ohta 1998) alleles, rejecting the Lamarckian doctrine that environmental pressures can drive (i.e., not merely fix) beneficial mutations.In previous work, we showed that silent mutations may nonrandomly affect intragenic sites of differing functional importance (Epstein et al. 2000; Lin et al. 2003) and that such mutational patterns vary with both strand-specific transcription-related DNA repair (Tang et al. 2006) and gene expression levels (Tang and Epstein 2007). It therefore remains plausible that ambient stressors such as heat (Maresca and Schwartz 2006), starvation (Hastings et al. 2004), inflammation (Blanco et al. 2007; Lavon et al. 2007), toxins (Salnikow and Zhitkovich 2008), free radical injury (Cerda and Weitzman 1997), or other sources of DNA damage (Ponder et al. 2005) could modify gene transcription and thus alter the rate of mutations affecting fitness (Galhardo et al. 2007)—including the occasional generation of beneficial mutations (Monk 1995; Elena and de Visser 2003; Nei 2005). Clues favoring this inducible (adaptive) evolutionary paradigm over neutrality for metazoan genomes—as is already accepted for bacterial (Ponder et al. 2005; Cirz and Romesberg 2007) and plant genomes (Galloway and Etterson 2007)—include faster-than-expected rates of phenotype acquisition, close temporal correlation with environmental changes, proof of improved fitness, or convergence (Levasseur et al. 2007).A mechanism for such non-Darwinian genomic plasticity has been suggested in recent times by the discovery of heritable epigenetic changes capable of reprogramming developmental and adult gene expression (Martin et al. 2005; Morgan et al. 2005), coupled with the predisposition of such changes to cause germ line mutations (Cooper and Krawczak 1989) or postzygotic mosaicism (Ohlsson et al. 1999) that sometimes cause disease (Andrews et al. 1996; Smith and Hurst 1998; Esteller et al. 2001). The frequency of germ line epimutations or imprinting errors—estimated to be an order of magnitude higher than that of germ line mutations (Horsthemke 2006)—can be either environmentally regulated (Dolinoy and Jirtle 2008), as illustrated by the inducibility of spermatogonial stem cell DNA hypermethylation by air pollution (Yauk et al. 2008), or parentally age dependent (Oakes et al. 2003; Perrin et al. 2007). If such epimutations affect modifier genes involved in DNA repair, a “slippery slope” of somatic and transgenerational genetic instability (i.e., a mutator phenotype) may result (Jacinto and Esteller 2007), leading not only to an increase in deleterious (purifiable) mutations (Wu et al. 2007; Morak et al. 2008) but also to occasional advantageous (positively selectable) mutations (Sniegowski et al. 2000; Cirz and Romesberg 2007) and/or speciation events (Sniegowski 1998). Selection of such “driver” beneficial mutations may lead in turn to “hitchhiking” of mutator (epi)mutations in modifier genes (Johnson 1999) as “passengers” (Frohling et al. 2007). Such mutational buffering could enhance evolvability (Wagner 2008)—consistent with the idea that error-free DNA repair may be maladaptive in mutagenic or stressful environments (Breivik and Gaudernack 2004; Ponder et al. 2005; Siegl-Cachedenier et al. 2007)—yet may also impair performance and hence robustness (Lenski et al. 2006; Frank 2007; Petrie and Roberts 2007). The “evo-devo” conundrum thus remains as to whether evolvability is indeed selectable (Hendrikse et al. 2007; Lynch 2007), and if so, by what mechanism (Colegrave and Collins 2008; Pigliucci 2008). Even if such a mechanism exists (King and Jukes 1969; Monk 1995), traditional thinking predicts that such selection may act only very weakly at a “good-for-the-species” level (Hiraiwa-Hasegawa 2000).We have addressed this dilemma by comparing 2 classes of human genes implicated in prevention of cancer, a disease of disordered microevolution (Gatenby and Vincent 2003; Iwasa et al. 2004; Breivik 2005). Tumorigenesis is potentiated by genomic instability (Schneider and Kulesz-Martin 2004; Bielas et al. 2006) arising via multistep inactivation of so-called tumor suppressor genes (Nowak et al. 2004), which, like proto-oncogenes, have been reported to be under strong negative selection pressure (Thomas et al. 2003). These carcinogenic loss-of-function events mainly affect DNA repair—mediated by caretaker genes (CTs) such as BRCA1 and MLH1—or apoptosis, mediated by gatekeeper genes (GKs) such as TP53 and Rb (Kinzler and Vogelstein 1997). These suppressor gene subsets, as well as their disease-causing mutations (Futreal et al. 2004; University Medical Center Groningen 2006), are distinguishable using gene databases (Doctor et al. 2003; Wood et al. 2005). Although long regarded as recessive oncogenes that require 2 “hits” for disease expression (Knudson 2000), suppressor genes are increasingly recognized to exhibit clonal haploinsufficiency in tumors (Santarosa and Ashworth 2004; Smilenov 2006). Given recent evidence for the role of adaptive evolution in cancer progression (Babenko et al. 2006; Crespi and Summers 2006), the occurrence of such haploinsufficiency supports the view that gene loss and pseudogenization (“less is more”) can accelerate genome evolution in certain contexts (Olson 1999). Because deleterious mutations (those causing genetic death) are purged by negative selection, whereas nondeleterious mutations may be positively selected, systematic comparison of CT and GK evolutionary rates should clarify whether these repair and apoptosis gene subsets are subject to distinct evolutionary forces. Consistent with this possibility, comparisons of human and chimpanzee genomes have confirmed different evolutionary rates in functionally distinct gene categories related to tumorigenesis (Clark et al. 2003; Bustamante et al. 2005; Nielsen et al. 2005; Kelley et al. 2006; Voight et al. 2006), whereas adaptive evolution of the BRCA1CT has been well documented (Huttley et al. 2000; Fleming et al. 2003; Pavlicek et al. 2004). Here, we use genomic data mining to test the hypothesis that germ line CTs are commoner targets for methylation-dependent mutational inactivation than are GKs and, hence, that repair gene dysfunction contributes both to germ line evolvability and somatic tumor progression. A male-dependent prezygotic mechanism for this process, which we have termed programmed genetic instability or PGI (Epstein and Zhao 2006a), is also presented.
Materials and Methods
Identification and Classification of CTs and GKs
We mined data to compare the structural and functional characteristics of humanCTs and GKs (see supplement 1 [Supplementary Material online] for sources). Given the multigenic interdependence of DNA repair and cellular apoptosis (Wee and Aguda 2006), unambiguous identification of genes that exclusively mediate 1 of these 2 processes is not straightforward. We sought to minimize the “noise” of this functional overlap in 2 ways. First, we used a familial tumor suppressor gene database (Futreal et al. 2004; University Medical Center Groningen 2006) to restrict the choice of genes to those with major neoplastic effects (i.e., heritable cancer syndromes) when deleted in the germ line; this yielded a total of 74 tumor suppressor genes (table 1). Second, by cross-correlating the former data set with a database of DNA repair genes (Wood et al. 2005), we subclassified this familial cancer susceptibility gene subset as CTs (n = 38) and then designated the remainder—the majority of which were confirmed to mediate apoptosis (Doctor et al. 2003)—as GKs (n = 36; table 1).
Table 1
Classification of CT and GK Suppressor Genes, Listing RefSeq, EntrezGene, and Ensembl Identifiers
CTs
RefSeq
Entrez Gene
Ensembl
GKs
RefSeq
Entrez Gene
Ensembl
1
ATM
NM_000051
472
ENST00000278616
APC
NM_000038
324
ENST00000257430
2
BLM
NM_000057
641
ENST00000355112
AXIN2
NM_004655
8313
ENST00000307078
3
BRCA1
NM_007295
672
ENST00000309486
BMPR1A
NM_004329
657
ENST00000372037
4
BRCA2
NM_000059
675
ENST00000267071
BUB1B
NM_001211
701
ENST00000287598
5
BRIP1
NM_032043
83990
ENST00000259008
CDC73
NM_024529
79577
ENST00000367436
6
DDB2
NM_000107
1643
ENST00000256996
CDH1
NM_004360
999
ENST00000268794
7
ERCC2
NM_000400
2068
ENST00000221481
EXT1
NM_000127
2131
ENST00000378204
8
ERCC3
NM_000122
2071
ENST00000285398
MEN1
NM_130803
4221
ENST00000337652
9
ERCC4
NM_005236
2072
ENST00000311895
NF1
NM_000267
4763
ENST00000358273
10
ERCC5
NM_000123
2073
ENST00000375971
NF2
NM_000268
4771
ENST00000338641
11
ERCC6
NM_000124
2074
ENST00000355832
PRKAR1A
NM_212472
5573
ENST00000358598
12
ERCC8
NM_000082
1161
ENST00000265038
PTCH
NM_000264
5727
ENST00000331920
13
FANCA
NM_000135
2175
ENST00000305699
PTEN
NM_000314
5728
ENST00000371953
14
FANCB
NM_001018113
2187
ENST00000340604
RB1
NM_000321
5925
ENST00000267163
15
FANCC
NM_000136
2176
ENST00000289081
SBDS
NM_016038
51119
ENST00000246868
16
FANCD2
NM_033084
2177
ENST00000287647
SDHB
NM_003000
6390
ENST00000375499
17
FANCE
NM_021922
2178
ENST00000229769
SDHC
NM_003001
6391
ENST00000367975
18
FANCF
NM_022725
2188
ENST00000327470
SDHD
NM_003002
6392
ENST00000375549
19
FANCG
NM_004629
2189
ENST00000378643
SMAD4
NM_005359
4089
ENST00000342988
20
FANCL
NM_018062
55120
ENST00000233741
SMARCB1
NM_001007468
6598
ENST00000344921
21
FANCM
NM_020937
57697
ENST00000267430
STK11
NM_000455
6794
ENST00000326873
22
LIG4
NM_002312
3981
ENST00000310534
SUFU
NM_016169
51684
ENST00000369902
23
MLH1
NM_000249
4292
ENST00000231790
TCF1
NM_000545
6927
ENST00000257555
24
MLH3
NM_014381
27030
ENST00000238662
TSC1
NM_000368
7248
ENST00000298552
25
MRE11A
NM_005591
4361
ENST00000323929
TSC2
NM_000548
7249
ENST00000219476
26
MSH2
NM_000251
4436
ENST00000233146
TSHR
NM_000369
7253
ENST00000298171
27
MSH3
NM_002439
4437
ENST00000265081
VHL
NM_000551
7428
ENST00000256474
28
MSH6
NM_000179
2956
ENST00000234420
WT1
NM_024426
7490
ENST00000379079
29
MUTYH
NM_012222
4595
ENST00000372112
CDK4
NM_000075
1019
ENST00000257904
30
NBN
NM_001024688
4683
ENST00000265433
CHEK2
NM_001005735
11200
ENST00000382580
31
PMS1
NM_000534
5378
ENST00000342075
CYLD
NM_015247
1540
ENST00000311559
32
PMS2
NM_000535
5395
ENST00000265849
EXT2
NM_000401
2132
ENST00000358681
33
POLH
NM_006502
5429
ENST00000372236
FH
NM_000143
2271
ENST00000205832
34
RAD51
NM_002875
5888
ENST00000382643
FLCN
NM_144997
201163
ENST00000285071
35
RECQL4
NM_002907
5965
ENST00000314748
GPC3
NM_004484
2719
ENST00000370818
36
WRN
NM_000553
7486
ENST00000298139
TP53
NM_000546
7157
ENST00000269305
37
XPA
NM_000380
7507
ENST00000259463
38
XPC
NM_004628
7508
ENST00000285021
Classification of CT and GK Suppressor Genes, Listing RefSeq, EntrezGene, and Ensembl Identifiers
Analyses of Gene Sequences, Mutations, and Evolutionary Rate
Human and mouse reference sequences, and species gene numbers, were downloaded from NCBI Entrez Gene (http://www.ncbi.nlm.nih.gov/Entrez/Gene). Mutation data were downloaded from the Human Gene Mutation Database. K-estimator 6.1 (with window size of 33 codons and step size of 10 codons using Kimura 2-parameter [2p] method) (Comeron 1999) and PAML 3.15 with yn00 model (Yang and Nielsen 2002) were used for evolutionary rate calculations. For analysis of gene evolutionary profiles, we downloaded coding sequences from Ensembl (http://www.ensembl.org). Kendall package from R-gui (http://www.r-project.org) was used for statistical analysis. Other analyses were done using MATLAB (7.6) Statistical Toolbox (5.1) (http://www.mathworks.com) for principal component analysis and nonparametric tests, and R-gui (2.60) for χ2 or Fisher's exact test.
Supergene Concatenation
Orthologous gene sequences of human, mouse, rat, chimpanzee, and rhesus monkey were aligned with amino acid sequences using ClustalW (Thompson et al. 1994), then reverted to codon sequences. In-house Perl scripts (available upon request) were developed for aligned codon concatenation. All the aligned CT and GK sequences were concatenated for supergene construction—23,100 codons for the GK supergene and 32,335 for the CT supergene. The supergene tree was constructed using a neighbor-joining method, with a modified Nei–Gojobori approach, as input in MEGA4 (codon substitution number Molecular Evolutionary Genetics Analysis software, version 4.0), producing 2 types of tree: synonymous and nonsynonymous substitution trees.
Coding Sequence Feature Analysis
We used reference sequences downloaded from NCBI Entrez Gene. For multiple splicing forms, the longest coding sequence was used for analysis. Mono- and dinucleotide composition was assessed using in-house Perl scripts. Additional methodologic details are supplied in the supplements, text, and legends (Supplementary Material online).
Results
Phylogenetic Comparison of CT and GK Orthologs
As an initial assessment, we quantified the numbers of humanCT and GK orthologs among species of differing biological complexity. Table 2 shows that orthologs of humanCTs occur more often than those of GKs in unicellular (P = 0.047) than in multicellular organisms (P = 0.192)—suggesting that CTs are phylogenetically older, whereas GKs may be more essential to the evolution of multicellular organisms. Although this phylogenetic rise in GK ortholog frequency may reflect selection for increased developmental complexity as predicted by Cope's rule, it may also reflect the increasing importance of policing rogue elements (either intragenomic elements like meiotic drivers or intercellular outlaws like cancer cells) as multicellularity evolves and organismal cell number increases.
Table 2
Phylogenetic and Gene Essentiality Profiles of Familial Cancer Syndrome Genes
Parameters
CTs
GKs
P value
Yeast orthologs
0.047
Present
30
20
Absent
8
16
Worm orthologs
0.192
Present
30
33
Absent
8
3
Yeast essentiality
0.63
Deletion lethal
3
2
Deletion nonlethal
27
18
Worm essentiality
0.003
RNAi lethal
1
11
RNA nonlethal
29
22
Mouse essentiality
<0.00001
Knockout lethal
4
28
Normal
21
8
Male infertility
13
0
NOTE.—.For yeast phylogenetic analysis, CT analysis excludes the numerous Fanconi anemia gene orthologs in order to avoid data confounding. Gene knockout phenotypes were sourced using data mined from http://www.informatics.jax.org/. All P values were computed using Fisher's exact test (2 sided).
Phylogenetic and Gene Essentiality Profiles of Familial Cancer Syndrome GenesNOTE.—.For yeast phylogenetic analysis, CT analysis excludes the numerous Fanconi anemia gene orthologs in order to avoid data confounding. Gene knockout phenotypes were sourced using data mined from http://www.informatics.jax.org/. All P values were computed using Fisher's exact test (2 sided).We next assessed differences in CT and GK gene essentiality (Liao et al. 2006) based on deletion, RNAi, and gene-targeting data (supplement 2, Supplementary Material online). This analysis shows that germ line GK ortholog disruption is more often lethal than CT knockout in multicellular (worm RNAi data and mouse knockouts; P < 0.00001, Fisher's exact test, 2 sided) but not in unicellular organisms (yeast deletions; P = 0.63; table 2). Hence, relative to GK function, germ line CT function appears selectively dispensable in multicellular organisms.Analysis of mammalian gene-targeting phenotypes further reveals that, unlike GK knockouts, viable CT knockouts are associated with male sterility (table 2). We infer from this finding that CT dysfunction selectively permits (organism) viability at the expense of (genetic) fidelity, severe defects of which might be expected to cause sperm dysfunction or death. As discussed below, however, the possibility is raised that less profound (i.e., nondeletional or epimutational) germ line repair deficiencies could be associated with offspring fertility.To assess further the impact of repair and apoptotic gene defects transmitted through the germ line, we compared CT and GK mutation frequencies in cancer families and somatic tumors. This shows that isolated germ line CT mutations are commoner than isolated GK mutations (P < 0.00001; table 3), reinforcing the notion that CT/repair function is significantly more dispensable for survival than is GK/apoptosis function.
Table 3
Mutational Analysis of Familial (germ line) and Sporadic (somatic) Human Tumor Mutations of CTs and GKs (i.e., relative frequencies of familial vs. sporadic human tumor mutations in the germ line and/or somatic lineages)
Germ line versus somatic human tumor mutations
Germ line and somatic
Germ line only
P value
CT (n = 38)
6
32
<0.00001
GK (n = 36)
25
11
NOTE.—Mutation frequencies were determined by mining Cancer Gene Census and PubMed with P values computed using Fisher's exact test (2 sided).
Mutational Analysis of Familial (germ line) and Sporadic (somatic) HumanTumor Mutations of CTs and GKs (i.e., relative frequencies of familial vs. sporadic humantumor mutations in the germ line and/or somatic lineages)NOTE.—Mutation frequencies were determined by mining Cancer Gene Census and PubMed with P values computed using Fisher's exact test (2 sided).
Phylogenetic Analysis of Apoptosis versus Repair
The foregoing data apply only to tumor suppressor genes. To determine whether these results can be generalized, we performed a cross-species quantitation of orthologs implicated in either apoptosis or repair (supplement 1, Supplementary Material online). Based on the assumptions that organism complexity is increasing from yeast to humans and that assignation of gene ontology includes some random effects, we performed Kendall rank test (R-gui Kendall package, www.r-project.org) for correlation analysis of DNA repair genes and apoptosis genes. As shown in figure 1, phylogenetic differences in apoptosis gene numbers are significant (tau = 1, 2-sided P = 0.008535), whereas for DNA repair genes this is not the case (tau = 0.333, 2-sided P = 0.45237). Considered together with table 2, this difference confirms that increases in biological complexity depend more upon apoptotic than repair gene number.
F
Evolutionary characteristics of human CTs and GKs. (A) Overall quantitation of apoptotic gene versus repair gene number in different phyla: human (H. sapiens), mouse (Mus musculus), fish (D. rerio), fly (D. melanogaster), worm (C. elegans), and yeast (S. pombe). Blue filled squares, DNA repair genes; red filled circles, apoptosis genes; black stars, total gene number of respective genome from Ensemble 49 (www.ensembl.org). Data were mined as detailed in the Materials and Methods. (B) Relative divergence of CT and GK genes in mammals (human–mouse; divergence time approximately 85 MYA) and worms (C. elegans–C. briggsae; divergence time approximately 100 MYA) quantified using Ka/Ks. Blue squares, CT orthologs; red circles, GK orthologs. (C) Principal component analysis with parameters of Ka/Ks of CTs and GKs. We normalized the computed 10 pairwise divergence of human, chimpanzee, rhesus, monkey, mice, and rats using Ka/Ks analysis. 1st PC, first principal component (x axis); 3rd PC, third principal component (y axis).
Evolutionary characteristics of humanCTs and GKs. (A) Overall quantitation of apoptotic gene versus repair gene number in different phyla: human (H. sapiens), mouse (Mus musculus), fish (D. rerio), fly (D. melanogaster), worm (C. elegans), and yeast (S. pombe). Blue filled squares, DNA repair genes; red filled circles, apoptosis genes; black stars, total gene number of respective genome from Ensemble 49 (www.ensembl.org). Data were mined as detailed in the Materials and Methods. (B) Relative divergence of CT and GK genes in mammals (human–mouse; divergence time approximately 85 MYA) and worms (C. elegans–C. briggsae; divergence time approximately 100 MYA) quantified using Ka/Ks. Blue squares, CT orthologs; red circles, GK orthologs. (C) Principal component analysis with parameters of Ka/Ks of CTs and GKs. We normalized the computed 10 pairwise divergence of human, chimpanzee, rhesus, monkey, mice, and rats using Ka/Ks analysis. 1st PC, first principal component (x axis); 3rd PC, third principal component (y axis).
Comparison of CT and GK Evolutionary Rates
We then used nonparametric 2-sample Kolmogorov–Smirnov goodness of fit hypothesis testing with kstest2 function (MATLAB, http://www.mathworks.com, statistical toolbox) for Ka/Ks—a positive correlate of positive selection—ortholog analysis. This confirmed that CTs evolve more rapidly than GKs in sexually reproducing (human vs. mouse, estimated divergence time 85 Myr; P = 0.000003) but not in self-fertilizing (2 worm species, estimated divergence time 100 Myr; P = 0.2582) multicellular organisms (fig. 1; Kruskal–Wallis test, P < 0.004); Ks was different in worms (P = 0.007) but not in mammals (P = 0.091). Consistent with our earlier finding of selective male sterility in CT knockouts, these data support the hypothesis that rapid CT evolution is related in some way to sexual reproduction. Principal component analysis based on evolutionary rate as well as variables such as GC content (see fig. 5) and gene length (see supplement 3, Supplementary Material online) provides further visual evidence that CTs and GKs are distinguishable based on genetic evolutionary parameters (fig. 1).
F
Box plot illustration of CT- and GK-coding sequence features relating to DNA methylation and gene expression. The boxes feature lines at the lower quartile, median, and upper quartile values. The whiskers are lines extending from each end of the boxes to show the extent of the data; outliers are values beyond the ends of the whiskers. (A) Comparison of GC content and dinucleotide skew in CTs and GKs. (B) Frequency of methylation-related dinucleotides and asymmetry in CTs versus GKs. Outliers are denoted by plus signs.
We then used neighbor-joining supergene (gene concatenation) trees (Zhang et al. 1998) to characterize CT and GK branches under selection. This methodology, which has been reported to yield more accurate phylogenetic data than multigene approaches (Gadagkar et al. 2005), confirms that accelerated evolution of CTs compared with GKs is evident in human–macaca, human–mouse, and mouse–rat comparisons, though apparently not in human–chimpanzee comparisons (fig. 2). This difference is further illustrated by a tree diagram showing the distinct divergence parameters of CTs and GKs (fig. 2). It is noted that the ratio of missense to silent mutations (A/S) encoded by CT single-nucleotide polymorphisms (SNPs) is no higher than in GKs (supplement 1, Supplementary Material online) and that, compared with historical data (Zhang et al. 1998; Fay et al. 2001), larger polymorphisms are present in both rare and common SNPs (P < 0.001, Pearson's χ2). We have also noted that CTs evolve more rapidly than tissue-specific genes, whereas GKs evolve more slowly than housekeeping genes (P < 0.001, Mann–Whitney U test; data not shown), emphasizing the wide class differences in evolutionary rates.
F
Evolutionary rate comparison of CTs and GKs. (A) Distribution of Ka/Ks in pairwise comparisons between CTs and GKs within 4 lineages including human–chimpanzee (top left), human–macaca (top right), human–mouse (bottom left), and human–rat (bottom right). Bin size 0.05 was used for distribution computation. For this analysis—which compares the different evolutionary rates of human CTs and GKs using a variety of species comparators, as distinct from comparing gene evolutionary rates in multiple species—P values were calculated by ranked 2-sample Mann–Whitney U test using MATLAB Statistical toolbox function rank sum. (B) Lineage-specific comparison of evolutionary rate of CTs and GKs using supergene concatenation. Blue bars, CTs; red bars, GKs.
Evolutionary rate comparison of CTs and GKs. (A) Distribution of Ka/Ks in pairwise comparisons between CTs and GKs within 4 lineages including human–chimpanzee (top left), human–macaca (top right), human–mouse (bottom left), and human–rat (bottom right). Bin size 0.05 was used for distribution computation. For this analysis—which compares the different evolutionary rates of humanCTs and GKs using a variety of species comparators, as distinct from comparing gene evolutionary rates in multiple species—P values were calculated by ranked 2-sample Mann–Whitney U test using MATLAB Statistical toolbox function rank sum. (B) Lineage-specific comparison of evolutionary rate of CTs and GKs using supergene concatenation. Blue bars, CTs; red bars, GKs.To explore possible differences of divergence between CTs and GKs, we used a sliding window analysis of concatenated CT (38 genes, 41,169 codons) and GK (36 genes, 25,968 codons) supergenes. A window size of 33 codons and step size of 10 codons was used in conjunction with K-estimator (calculation of the number of nucleotide substitutions per site and the confidence intervals) 6.1, Kimura 2p model. As shown in figure 3, the Ka/Ks (red), but neither Ka nor Ks (green and blue, respectively), is overrepresented in CTs compared with GKs. We further tested the distribution of these 3 parameters by using the nonparametric 2-sample Kolmogorov–Smirnov test. Figure 3 confirms that only Ka/Ks is significant (P = 0.0000079) but neither Ka nor Ks, which correlate with deleterious mutation and neutral mutation rate, respectively (for further details, see supplement [Supplementary Material online] and fig. 3).
F
Sliding window analysis of human–mouse CT and GK divergence. (A) Evolutionary rates of CTs (top) and GKs (bottom) using human–mouse alignments (window size 33 codons, step size 10 codons). Red line, Ka/Ks; blue line, Ks; green line, Ka. Most regions with Ka/Ks >1.5, with P value <0.05 (bootstrap threshold computed with K-estimator 6.1). (B) Distribution of Ka, Ks, Ka/Ks in CTs and GKs using bin size 0.05 and nonparametric Kolmogorov–Smirnov test, revealing a significant difference of Ka/Ks (P = 0.0000079), thus confirming more rapid evolution of CTs than GKs.
Sliding window analysis of human–mouseCT and GK divergence. (A) Evolutionary rates of CTs (top) and GKs (bottom) using human–mouse alignments (window size 33 codons, step size 10 codons). Red line, Ka/Ks; blue line, Ks; green line, Ka. Most regions with Ka/Ks >1.5, with P value <0.05 (bootstrap threshold computed with K-estimator 6.1). (B) Distribution of Ka, Ks, Ka/Ks in CTs and GKs using bin size 0.05 and nonparametric Kolmogorov–Smirnov test, revealing a significant difference of Ka/Ks (P = 0.0000079), thus confirming more rapid evolution of CTs than GKs.To check whether human–chimpanzee divergences of CTs and GKs are in fact similar, as suggested by the findings in figure 2, we used the McDonald and Kreitman (1991) test for evolutionary rate analysis. Data were mined as described (Bustamante et al. 2005) using a Poisson random field model—a variant of the McDonald–Kreitman test—for divergence versus diversity comparison. Data were obtained from Celera Genomics, which applied exon-specific polymerase chain reaction amplification to 20,362 loci in 39 humans and 1 male chimpanzee. We note, however, that the following genes were not found in this data set: 9 CTs (BRCA2, BRIP1, FANCD2, FANCM, MLH1, MLH3, NBN, PMS2, and RECQL4) and 10 GKs (CDC73, EXT1, NF1, PTCH, SBDS, SDHD, TSHR, WT1, CHEK2, and FH). The following parameters were used: synonymous divergence, synonymous polymorphism, nonsynonymous divergence (DN), nonsynonymous polymorphism (PN), and γ (Poisson random field model parameters). All mined data are supplied (supplement [Supplementary Material online] and fig. 4). In contrast to figure 2, these results—which show a higher frequency of GKs lacking both PN and DN sites relative to CTs (Pearson's χ2, P = 0.0044)—confirm that the rapid evolution of CTs relative to GKs persisted during the human–chimpanzee divergence approximately 10 MYA (fig. 4). Hence, having shown that CTs evolved faster than GKs both during primate–rodent (figs. 1–3) and human–chimpanzee divergence (fig. 4), we conclude that the rapid evolution of CTs is likely independent of timescale.
F
McDonald–Kreitman testing of CT evolutionary rate relative to GKs during human–chimpanzee divergence. (A) Distributions of synonymous divergence (DS), synonymous polymorphism (PS), nonsynonymous divergence (DN), and nonsynonymous polymorphism (PN) of CTs and GKs using bin size 1 and nonparametric tests (median of 2 unpaired samples, using the MATLAB 7.6 statistical toolbox rank-sum function). This shows that DN is the most significant parameter (P = 0.00016), followed by PN (P = 0.004), with DS and PS not significant, thus confirming rapid CT evolution. (B) R-gui function χ2 test function in terms of the summarized codon changes of CTs and GKs. This also shows that the evolutionary rates of CTs and GKs are different (P = 0.0000037). (C) Poisson random field parametric test. The selection parameters are illustrated using a 95% confidence interval and show no significant difference (P = 0.1913, by nonparametric test of equal distribution of 2 samples using function kstest2), presumably reflecting numerous nonchange genes in terms of the parameters. (D) Parametric retesting of CT and GKs in terms of DS, PS, DN, and PN, confirming that CTs evolved more rapidly than GKs during human–chimpanzee divergence.
McDonald–Kreitman testing of CT evolutionary rate relative to GKs during human–chimpanzee divergence. (A) Distributions of synonymous divergence (DS), synonymous polymorphism (PS), nonsynonymous divergence (DN), and nonsynonymous polymorphism (PN) of CTs and GKs using bin size 1 and nonparametric tests (median of 2 unpaired samples, using the MATLAB 7.6 statistical toolbox rank-sum function). This shows that DN is the most significant parameter (P = 0.00016), followed by PN (P = 0.004), with DS and PS not significant, thus confirming rapid CT evolution. (B) R-gui function χ2 test function in terms of the summarized codon changes of CTs and GKs. This also shows that the evolutionary rates of CTs and GKs are different (P = 0.0000037). (C) Poisson random field parametric test. The selection parameters are illustrated using a 95% confidence interval and show no significant difference (P = 0.1913, by nonparametric test of equal distribution of 2 samples using function kstest2), presumably reflecting numerous nonchange genes in terms of the parameters. (D) Parametric retesting of CT and GKs in terms of DS, PS, DN, and PN, confirming that CTs evolved more rapidly than GKs during human–chimpanzee divergence.
Sequence-Based Evidence for Evolutionary CT Gene Methylation
Our previous studies implicated nonrandom CG → TA transitional mutations (CpG decay) as a correlate of adaptive evolution in less transcribed (Tang and Epstein 2007) and/or less essential coding sequences (Epstein et al. 2000); conversely, we have implicated CpG conservation as a correlate of negative selection in more transcribed (Tang et al. 2006) and/or more essential sequences (Lin et al. 2003). These results suggested an evolutionary paradigm in which concomitant promoter and coding sequence methylation accelerate (epi)mutational functional loss of nonessential coding sequences. To test whether any such methylation-dependent signatures distinguish CTs and GKs, we computed the sequence component of the relevant transcribed coding strand and applied a nonparametric 2-sample Kolmogorov–Smirnov goodness of fit hypothesis test with kstest2 function of MATLAB (http://www.mathworks.com) statistical tool box for all parameters compared. Meaningful CT and GK gene expression data were unable to be derived from GNF Expression Atlas 2 based on U133A and GNF1H Chips, perhaps reflecting parametric uncertainty (Su et al. 2004). However, for gene expression-related sequence features relating to CpG mutation and asymmetric transcription-related repair (Tang et al. 2006), we compared GC content ((G + C)/(G + C + A + T), P = 0.0106), TA skew ((T − A)/(T + A), P = 0.5242), GC skew ((G − C)/(G + C), P = 0.0028), and B factor ((G + T − A − C)/(A + T + G + C), P = 0.0860) (Majewski 2003) (fig. 5). These differences suggest greater methylation-dependent mutation of CTs relative to GKs during recent human evolution. We then compared the methylation-related sequence features of CT/GK transcribed strands, revealing differences in CpG content (CpG count/total dinucleotide count, P = 0.0011), CpA content (CpA count/total dinucleotide count, P = 0.2555), TpG content (TpG count/total dinucleotide count, P = 0.0174), and DNA methylation-related dinucleotide asymmetry (TpG × CpA)/(CpG × CpG), P = 0.00105) (fig. 5). These results indicate less transcription-related repair of methylated CTs relative to GKs in germ line–coding sequences. Considered together with the gene essentiality differences presented in table 2, the rapid evolution of CTs relative to GKs suggests an epimutational mechanism for CT functional inhibition that escapes purification while accelerating mutation.Box plot illustration of CT- and GK-coding sequence features relating to DNA methylation and gene expression. The boxes feature lines at the lower quartile, median, and upper quartile values. The whiskers are lines extending from each end of the boxes to show the extent of the data; outliers are values beyond the ends of the whiskers. (A) Comparison of GC content and dinucleotide skew in CTs and GKs. (B) Frequency of methylation-related dinucleotides and asymmetry in CTs versus GKs. Outliers are denoted by plus signs.Model of changing human male CT and GK function from germ line to somatic contexts. The x axis is divided into the following time frames: T1, prezygotic spermatogonia and spermatocytes; T2, postzygotic embryonic development; T3, prereproductive infancy and childhood; T4, reproductive adult life; and T5, postreproductive senescence. The y axis models the relative extent of either GK functionality (PCD) or CT dysfunctionality (PGI) during these time frames. Damage/stress-inducible prezygotic promoter (CpG island) methylation of spermatogonial/spermatocyte CTs and GKs, respectively, increases PGI while reducing PCD—thus maximizing genetic variability in response to changing environmental selection pressures. Postfertilization male germ line gene demethylation has the opposite effect, causing a sustained decline in PGI and a rise in PCD. During postreproductive adult life, an age-dependent (as well as damage-inducible) methylation clock induces somatic CT/GK gene inactivation, leading in turn to a senescent rise in PGI and decline of PCD that predisposes to sporadic tumor outgrowth.
Discussion
The central finding of this study is that CTs are evolving more rapidly than GKs and that this process—which appears likely to be mediated by methylation-dependent mutation—is confined to higher sexually reproducing species. These CT–GK distinctions are consistent with earlier work showing that apoptosis-regulatory genes are essential for development of higher organisms (Aravind et al. 2001), whereas germ line mutation of DNA repair genes drives species evolution (Fay et al. 2001). However, because loss of DNA repair capacity also contributes to cancer development (Cleaver et al. 1995; Breivik and Gaudernack 2004), it is also reasonable to expect tumor suppressor genes in general to be under strong purifying selection (Thomas et al. 2003). Here, we propose that this apparent discrepancy arises from a complex mix of dualistic variables: 1) the bifunctional evolutionary role of repair genes in either conserving genetic information or permitting genetic variation, depending on selection pressure; 2) the bifunctional ability of sexual reproduction either to promote (prezygotically) variation through repair inhibition and intra- or intermale sperm competition or to conserve (postzygotically) genetic fidelity through repair activation, apoptosis, and miscarriage; and 3) the bifunctional role of CpG dinucleotides and promoter CpG islands in either enhancing transcription and repair (when demethylated) or in repressing transcription and predisposing to mutation (when methylated).Epigenetic reprogramming occurs not only in the early embryo, where somatic patterns of gene expression are set, but also during germ cell development (Morgan et al. 2005)—which changes can be heritable for at least 2 generations (Anway et al. 2008). The methylation dynamics of sperm/testis DNA are unique (Oakes et al. 2007a); unlike oocyte DNA, prezygotic protamine-compacted male germ cell DNA tends to be methylated in nonpromoter regions (Oakes et al. 2007b) that undergo rapid demethylation following fertilization (Haaf 2006). Such sex-specific DNA methylation appears necessary but not sufficient (El-Maarri et al. 1998) to explain the higher mutation rate of mammalian male germ cells (Agulnik et al. 1997; Hurst and Ellegren 1998; Wyckoff et al. 2000; Makova and Li 2002)—suggesting in turn that exposures of the (external) testes to heat (Nikolopoulos et al. 2007), DNA damage (Barber et al. 2006), or other insults (Hara et al. 1999) could well play an evolutionarily programmed epimutagenic role, consistent with the postnatal timing of male germ cell promoter methylation (Driscoll and Migeon 1990). Notably, this hypothesis differs from the standard view of male-driven evolution reflecting a simple excess of male germ cell divisions, giving rise in turn to more replicative mutations (Drost and Lee 1995).In earlier work, we showed that rarely transcribed genes with promoter CpG islands are hot spots for adaptive evolution (Tang and Epstein 2007), raising the possibility that promoter methylation of sperm target gene classes such as CTs could be a mechanistic “missing link” between environmental selection for specific transcriptomes (Su et al. 2004) and/or coding sequence CpG mutations that permit transgenerational propagation of genetic instability (Wu et al. 2007; Morak et al. 2008). Consistent with this, CTs more often contain promoter CpG islands than do GKs (Zhao Y, unpublished data): classic GKs such as TP53 do not contain promoter CpG islands, whereas canonical CTs such as MLH1 and BRCA1 do. Instructively, the latter gene is mutated and not methylated in familial cancer syndromes (Chen et al. 2006), yet is methylated and not mutated in chemotherapy-induced second malignancies (Scardocci et al. 2006); similar exclusivity between repair gene methylation events and oncogenic indels or point mutations in repair-deficient tumors (Esteller et al. 2001; Toyooka et al. 2006) suggests the fluidity of such epimutations. Although in this model extrinsic damage is the major regulator of prezygotic male germ cell CT methylation—as well as being both a cause and effect of progressive suppressor gene repression in precancerous adult somatic tissues (Neri et al. 2007)—age may be as important as damage in the latter process (Kim et al. 2005), with parental (especially paternal) age playing a synergistic role in the former (Oakes et al. 2003).We define this model of a “methylation clock” regulating the epigenetic inactivation of CT/repair genes in the male germ line and adult somatic tissues—commensurate with extrinsic damage/stress or intrinsic ageing/senescence—as PGI; just as programmed cell death (PCD) is the mechanism of negative selection, so is PGI proposed to be the mechanism of positive selection (fig. 6). We suggest that PGI intensifies genetic divergence during sexual reproduction at the level of sperm–egg fusion, in contrast to PCD which eliminates both oocyte (Suh et al. 2006) and sperm DNA defects after syngamy (Fatehi et al. 2006). This sequence (sperm → ovum, zygote → embryo) of positive followed by negative selection fits with the notion of sexual conflict (Partridge and Hurst 1998), which should enhance biological system robustness and evolvability (Kitano 2004). We also note that heritable, though not necessarily familial, predisposition to carcinogenesis could also be propagated transgenerationally by PGI (Epstein and Zhao 2006b).
F
Model of changing human male CT and GK function from germ line to somatic contexts. The x axis is divided into the following time frames: T1, prezygotic spermatogonia and spermatocytes; T2, postzygotic embryonic development; T3, prereproductive infancy and childhood; T4, reproductive adult life; and T5, postreproductive senescence. The y axis models the relative extent of either GK functionality (PCD) or CT dysfunctionality (PGI) during these time frames. Damage/stress-inducible prezygotic promoter (CpG island) methylation of spermatogonial/spermatocyte CTs and GKs, respectively, increases PGI while reducing PCD—thus maximizing genetic variability in response to changing environmental selection pressures. Postfertilization male germ line gene demethylation has the opposite effect, causing a sustained decline in PGI and a rise in PCD. During postreproductive adult life, an age-dependent (as well as damage-inducible) methylation clock induces somatic CT/GK gene inactivation, leading in turn to a senescent rise in PGI and decline of PCD that predisposes to sporadic tumor outgrowth.
What evidence do we have for PGI operating through the male germ line? The original hypothesis arose from prior knowledge, namely, 1) evidence for male-driven evolution from other groups (Agulnik et al. 1997; Hurst and Ellegren 1998; Wyckoff et al. 2000; Makova and Li 2002); 2) evidence for repair genes being speciation genes, implying a permissive role in rapid evolution (Radman and Wagner 1993; Cleaver et al. 1995; Sniegowski 1998); and 3) the external anatomic location of the testis, making male germ cells uniquely vulnerable to mutagenic, yet potentially reparable, environmental damage (Roest et al. 1996; Hsia et al. 2003; Feitsma et al. 2007). Against this background, we needed to integrate the following new data: 1) CT/repair genes are evolving with unexpected rapidity; 2) CT knockouts are relatively dispensable for survival, resulting only in reduced male fertility; and 3) abnormally low CT/repair gene expression is characteristic of teratozoospermia (Zhao Y, unpublished data), a condition of abnormal sperm morphology that does not significantly reduce in vitro fertilization success rates (Keegan et al. 2007). These considerations invited the hypothesis: could the rapid evolution of CT/repair genes reflect selection for a permissive (i.e., causal loss-of-function) role in male-driven evolution? This latter possibility is supported by the finding that heterozygous males and homozygous females can remain fertile when homozygous male repair gene knockouts are sterile (Roest 1996) and that even minimal repair transgene expression suffices to rescue fertility in repair-knockout males (Hsia 2003). Moreover, homozygous repair-deficient males may survive—albeit at the cost of tumor susceptibility—in genetic backgrounds where homozygous females die (Cranston et al. 1997). This striking gender-specific difference strongly suggests a male-specific significance for accelerated CT evolution, which involves interaction of ambient damage with selection for subtle (e.g., epimutational) male germ cell repair defects.Our gene targeting and familial cancer data confirm that germ line CT defects are less lethal than GK defects (tables 2 and 3). The effects of repair gene targeting depend critically on the extent of functional inhibition and the presence or otherwise of associated mutations; in general, mild (e.g., haploinsufficient) CT/repair defects tend to increase mutation while decreasing apoptosis (Frank et al. 2005; Smilenov et al. 2005) and may even enhance fitness and lifespan (Siegl-Cachedenier et al. 2007), whereas severe repair defects tend to cause increased apoptosis and premature lethality (Henrie et al. 2003). Hence, because low-level sperm DNA damage can be repaired after fertilization of repair-proficient oocytes (Menezo 2006; Marchetti et al. 2007; Fernandez-Gonzalez et al. 2008)—particularly in the setting of prior DNA damage “conditioning” of such oocytes (Agrawal and Wang 2008)—mild nondeletional prezygotic male germ line repair defects could plausibly enhance sperm divergence with minimal fertility compromise, thereby safeguarding species’ evolvability while offsetting transgenerational risks of paternally transmitted birth defects (Marchetti and Wyrobek 2008) or cancer (Zenzes et al. 1999; Yauk et al. 2007). This conclusion is further supported by the finding that chronic exposure of spermatogonia to low-dose damage greatly reduces genetic damage induced by an acute second hit (Cai et al. 1993; Koana et al. 2007). Our findings therefore suggest the evolution of an environmentally interactive genetic program for promoting divergence in the germ line. We note that the related age-dependent predisposition to cancer is not wholly disadvantageous to the species as such mortality may promote redistribution of environmental resources to younger and more fertile individuals.As an extension of the “immortal strand” hypothesis of stem cell fidelity (Cairns 2006), our findings raise the notion that the transcribed (well repaired, CpG-demethylated) DNA strand represents the “immovable object” of negative selection, whereas the untranscribed (poorly repaired, CpG-methylated) strand provides the “irresistible force” of positive selection. Furthermore, consistent with the notion that sexual conflict selects for distinct gender phenotypes (Hosken et al. 2001), the evolutionary paradigm presented here implies that the usual targets of positive selection—choice and specialization (Barkman 2003; Hamm et al. 2007), discrimination and success (Swanson et al. 2003), and beauty (Morris RD and Morris JA 2004)—are intrinsically male (sperm/PGI dependent) in origin, whereas the targets of negative selection (normality, utility, longevity, and fidelity) are genetically female or oocyte/PCD dependent. Both positively and negatively selected traits are vital for fitness in sexually reproducing species: for example, the reproductive success of butterflies, birds, and peacocks depends on highly variegated yet symmetric surface patterns (Gould JL and Gould CG 1997) that predict a low underlying frequency of deleterious functional mutations (Morris RD and Morris JA 2004). In our model, PGI is driven by sexual selection pressure on the male germ line to optimize as opposed to normalize whatever phenotypes can be optimized—such as sperm velocity (Gage et al. 2004) or ion channel function regulating motility (Podlaha et al. 2005)—while also enhancing variation in species-discriminatory phenotypes such as egg-binding proteins (Moller and Cuervo 2003). Hence, the basis for sexual conflict in our model is that “average” can never be “best” and that “health” may not guarantee “popularity” (Zaidel et al. 2005)—whether with respect to beauty (DeBruine et al. 2007) or to sperm competition (Neff and Pitcher 2005)—thus helping to explain the paradox that mutation rates tend to be reduced by natural selection yet increased by sexual selection (Moller and Cuervo 2003).In conclusion, we have shown that CTs are less essential for germ line viability and hence more rapidly evolving than GKs. The resulting model of PGI implicates sexual reproduction as an evolutionary masterstroke: uniquely, sex plays off prezygotic positive selection (in which epimutational divergence is accelerated by environmental stress and damage, and efficiently selected and fixed by intra- and/or intermale sperm competition) against the evolutionary safety valves of postzygotic repair and negative selection (Menezo 2006). Via this mechanism, we propose that environmental stressors succeed in the otherwise oxymoronic task of “selecting for divergence” via epigenetic inhibition of DNA repair, thus helping to settle the “4-billion-year struggle of selfish genes to balance the need for variation with the equally important goal of conserving success” (Gould JL and Gould CG 1997). Moreover, if tumors do indeed arise in part as a side effect of PGI, cancer susceptibility may be most accurately viewed as the tumor-permissive “price” paid by multicellular organisms for genetic plasticity, with the species reaping the ultimate evolutionary “reward” of a delayed time to extinction.
Supplementary Material
Supplement 1, 2, and 3 are available at Molecular Biology and Evolution online (http://www.mbe.oxfordjournals.org/).
Authors: Stefan Fröhling; Claudia Scholl; Ross L Levine; Marc Loriaux; Titus J Boggon; Olivier A Bernard; Roland Berger; Hartmut Döhner; Konstanze Döhner; Benjamin L Ebert; Sewit Teckie; Todd R Golub; Jingrui Jiang; Marcus M Schittenhelm; Benjamin H Lee; James D Griffin; Richard M Stone; Michael C Heinrich; Michael W Deininger; Brian J Druker; D Gary Gilliland Journal: Cancer Cell Date: 2007-12 Impact factor: 31.743
Authors: David Hamm; Brian S Mautz; Mariana F Wolfner; Charles F Aquadro; Willie J Swanson Journal: Am J Hum Genet Date: 2007-05-08 Impact factor: 11.025
Authors: Vladimir N Babenko; Malay K Basu; Fyodor A Kondrashov; Igor B Rogozin; Eugene V Koonin Journal: BMC Cancer Date: 2006-02-09 Impact factor: 4.430