Vincent J Lynch1. 1. Department of Ecology and Evolutionary Biology, Yale University, New Haven, Connecticut 06520-8106, USA. vincent.j.lynch@yale.edu
Abstract
Transgenic animal models have played an important role in elucidating gene functions and the molecular basis development, physiology, behavior, and pathogenesis. Transgenic models have been so successful that they have become a standard tool in molecular genetics and biomedical studies and are being used to fulfill one of the main goals of the post-genomic era: to assign functions to each gene in the genome. However, the assumption that gene functions and genetic systems are conserved between models and humans is taken for granted, often in spite of evidence that gene functions and networks diverge during evolution. In this review, I discuss some mechanisms that generate functional divergence and highlight recent examples demonstrating that gene functions and regulatory networks diverge through time. These examples suggest that annotation of gene functions based solely on mutant phenotypes in animal models, as well as assumptions of conserved functions between species, can be wrong. Therefore, animal models of gene function and human disease may not provide appropriate information, particularly for rapidly evolving genes and systems.
Transgenic animal models have played an important role in elucidating gene functions and the molecular basis development, physiology, behavior, and pathogenesis. Transgenic models have been so successful that they have become a standard tool in molecular genetics and biomedical studies and are being used to fulfill one of the main goals of the post-genomic era: to assign functions to each gene in the genome. However, the assumption that gene functions and genetic systems are conserved between models and humans is taken for granted, often in spite of evidence that gene functions and networks diverge during evolution. In this review, I discuss some mechanisms that generate functional divergence and highlight recent examples demonstrating that gene functions and regulatory networks diverge through time. These examples suggest that annotation of gene functions based solely on mutant phenotypes in animal models, as well as assumptions of conserved functions between species, can be wrong. Therefore, animal models of gene function and human disease may not provide appropriate information, particularly for rapidly evolving genes and systems.
Biomedical studies of animal models have played an important role in elucidating the molecular basis and pathogenesis of human disease, as well as in developing and testing novel therapies [1,2]. As early as the 1930s, animal models such as the fruit fly (Drosophila melanogaster), zebrafish (Danio rerio), and mouse (Mus musculus) were being used to study the embryology and development of tissue and organ systems [1,2]. These early studies significantly advanced our understanding of embryology and development by taking advantage of forward-genetic approaches to describe and characterize mutants, but it was not until the development of target transgenesis that it became possible to generate specific animal models of human disease.The development of transgenic animals in the early 1980s was a major breakthrough in molecular genetics and marked the beginning of modern biomedical studies. Gene targeting in model organisms enabled the deletion (“knockout”) or replacement (“knockin”) of any gene in the genome, allowing investigation of that gene’s function in development, morphology, physiology, and even behavior [1-3]. Gene targeting has been so successful in elucidating the molecular function of genes that it has become a standard tool in molecular genetics and biomedical studies and now is being used to resolve one of the major goals of the post-genomic era: assigning a function to each gene in the genome [4]. Despite the great success of animal models in studies of gene function and disease, those studies are based on the critical underlying assumption that gene functions and developmental systems are conserved between models and humans — which may not always be the case.In this review, I outline some of the mechanisms that lead to evolutionary divergence in protein functions and gene regulatory networks and discuss how divergence between species can complicate studies of model organisms. I begin with a brief summary of the biological justifications for using animal models to study development and disease and discuss how principles emerging from evolutionary developmental biology challenge the assumption of conserved genetic architecture. Finally, I provide some examples of divergence in protein function and gene regulation between mouse — a common model organism for studying gene function and disease — and humans. While model organisms are a vitally important part of biomedical research, the assumption that functions are conserved between models and other organisms needs to be critically examined on a case-by-case basis.
Evolutionary and developmental justifications for animal models
Comparative anatomists have long recognized the similarity of tissue and organ systems between diverse animals, particularly within mammals. Indeed, these shared similarities led early comparative embryologists to formalize the concept of “homology,” or similarity between characteristics because of shared ancestry, as a distinct biological phenomenon [5]. The numerous similarities in nervous, cardiovascular, endocrine, immune, musculoskeletal, and reproductive system development and function between model organisms such as the mouse, fish, fly, and humans provide a strong evolutionary foundation that the developmental processes giving rise to homologous structures is likely to be similar between different species.Perhaps the strongest evidence that homologous genes control the development of homologous structures is the finding that proteins between remarkably divergent organisms can be functionally equivalent, often despite great divergence in both protein similarity and time. For example, the first use of transgenics to demonstrate that homologous genes functioned in homologous ways was between the human Hox transcription factor HoxB-4 and its Drosophila homolog Deformed (Dfd). McGinnis and colleagues (1990) tested whether the human HoxB-4 gene could substitute for the regulatory functions of Dfd in Drosophila embryos by inserting the HoxB-4 gene into the Drosophila genome. Amazingly, the human HoxB-4 gene can properly regulate endogenous Dfd expression in developing embryonic and larval cells through its autoregulatory element [6]. Thus, the autoregulatory function of the human and Drosophila homologs has been conserved after more than 800 million years of divergence.Probably the best-known example of homologous proteins initiating homologous developmental programs is the transcription factor Pax6. Loss of function mutations in Pax6 result in similar eye malformations in mice, humans, and Drosophila [7-9], indicating that Pax6 has a conserved function in eye formation, despite the vast differences between vertebrate and invertebrate eyes. Ectopic expression of eyeless, the Drosophila homolog of the vertebrate Pax6 gene, in the antennae, legs, and wings of flies induces eye development on these body parts, indicating eyeless functions a “master regulator” that switches on the developmental program for eye formation [10]. Remarkably, ectopic expression of the mouse Pax6 gene in Drosophila embryos induces the formation of well developed compound eyes [10], while expression of the Drosophila eyeless gene in Xenopus embryos leads to the induction of vertebrate eyes [11]. These data strongly indicate the function of Pax genes as high-level regulatory switches that activate the gene regulatory network leading to eye development is conserved between animals.Numerous other examples of functional equivalence have been identified between orthologous genes from very different organisms [12-19] and between paralogous genes from ancient gene and genome duplications [20-27]. The broad consensus resulting from these studies is that developmental systems do not change much through evolution, and, therefore, studies of gene function in one species are directly applicable to another. But is this conclusion justified?
Challenges to the assumption of conserved genetic architecture
I have briefly reviewed some of the evidence suggesting that protein functions and the developmental systems to which they contribute are so conserved during evolution that biomedical studies in model organisms are directly applicable to human development, physiology, and disease. However, numerous studies have found that homologous proteins do not necessarily remain functionally equivalent during evolution [28-55]. For example, the transcription factor HoxA-11 has evolved a novel ability to activate prolactin (PRL) expression in endometrial stromal cells in placental mammals, while HoxA-11 from non-placental mammals like opossum, platypus, and chicken lack activation functions and can only repress PRL expression. Similarly, mouse phospholipase C zeta 1 (PLCZ1) is actively transported into the nucleus shortly after fertilization, but PLCZ1 from other mammals lack the nuclear localization signal (NLS) found in mouse PLCZ1 and do not enter the nucleus. Furthermore, evolutionary modification of gene regulatory networks and developmental systems are mediated through changes in how genes are regulated and function [56-60], thus the assumption that homologous genes function in homologous ways between models and humans needs to be critically examined. In the following section, I review some of the ways that developmental systems can diverge and discuss how these factors can bias biomedical studies that assume conservation of regulatory systems between models and humans.
Positive selection promotes functional divergence
Many powerful methods for detecting adaptive molecular evolution have been developed over the last decade. These methods compare the rate of synonymous (non-amino acid changing) and non-synonymous (amino acid changing) nucleotide substitutions to infer the strength and direction of selection acting on protein coding genes [61-63]. Since synonymous nucleotide substitutions do not lead to amino acid changes, they are not exposed to natural selection acting on the protein structure/function and accumulate in the genome at a more or less clock-like, or neutral, rate. On the other hand, non-synonymous nucleotide changes generate amino acid changes and are, therefore, exposed to the action of selection. If an amino acid change is deleterious for the structure/function of a protein, the new alleles’ frequency in the population will remain low and eventually disappear. If, however, the amino acid change is beneficial because it generates a novel function or contributes other beneficial effects, the frequency of that allele in the population will increase until it completely replaces the ancestral allele. Thus, when amino acid changes are adaptive, the rate of fixation in the population will increase above the background rate of synonymous substitutions [61-63].Numerous cases of molecular adaptation have been identified in various systems from viruses to humans with the common theme that positive selection is very often associated with an escalating arms race, such as between immune genes and parasites, or the emergence of novel functions. For example, one of the first demonstrations of molecular adaption was in the antigen recognition site of major histocompatibility (MHC) genes. It was shown that amino acid substitutions occurred more rapidly at this locus than either synonymous substitutions or amino acid changes in other parts of the protein [64,65]. Positive selection can also act to modify ancestral functions or generate novel ones. For instance, the progesterone receptor (PGR) evolved extremely rapidly in humans and chimpanzees with the majority of amino acid substitutions occurring in parts of the protein important for transcriptional activity [66]. Remarkably, the episodes of rapid PGR evolution are coincident with changes in the mechanism of parturition in higher apes, suggesting that using rodents as a model organism to study PGR actions in humans and other primates may not produce meaningful results.One of the major findings from recent genome-wide scans for adaptively evolving genes is that positive selection may be more common than previously thought [67-69]. These analyses suggest that positive selection in humans is strongest for X-linked genes and genes related to the immune system, reproduction, and sensory perception [69]. Unlike early studies of positive selection in humans, which focused on a few genes of interest (PRM1, PRM2, SRY, FOXP2, G6PD, and MC1R), many of the genes identified from genome-wide scans are novel [69]. They include several genes with testis-specific expression (USP26, C15orf2, and HYAL3), immune regulation (CD58, APOBEC3F, and CD72), tumor antigens (SAGE1 and MAGEC2), and others with unknown functions (FLJ46156I, ABHD1, and LOC389458) [67-69]. Many of the genes that were positively selected in humans are involved in resistance to malaria (HBB, CD40L, FY, and CD36), HIV (CCR5-Δ32) or other infectious diseases (MHC). Other genes under selection are involved in skin pigmentation (OCA2, MYO5A, DTNBP1, TYRP1, and SLC24A5), diet (LCT and ADH), brain development (MCPH1, ASPM, CDK5RAP2, CENPJ, GABRA4, PSEN1, SYT1, SLC6A4, and SNTG1) and bone morphogenesis (BMP3, BMPR2, and BMP5), among many others [67-69]. Analysis of Affymetrix array data indicated that positively selected genes are more tissue-specific than other genes and more frequently expressed in spleen, testes, liver, and breast than other tissues [67]. Remarkably, the number of positively selected genes is substantially smaller in humans than in chimpanzees [70]. These results suggest that knockouts of mouse genes to identify their functions in humans may not identify human-specific functions, particularly for quickly evolving genes under positive selection. Indeed, a recent study (discussed in more detail below) found that genes with differential effects when mutated in humans vs. mice are commonly associated with increased rates of evolution and positive selection [96].
Orthologous proteins, divergent networks
One of the best studied gene regulatory networks specifies the development of the heart [71,72]. The fundamental functional components of hearts are the cardiac muscle cells that express an array of contractile proteins, such as actin, myosin, troponin, and tropomyosin. The first heart-like organs evolved more than 500 million years ago in the ancestor of extant bilaterians and probably resembled the simple tube-like hearts of tunicates, amphioxus, and Drosophila [71]. During evolution, this simple, single-layered tube, which pumped via peristaltic contractions, evolved into a more efficient pump, beating via synchronous contractions with thick muscular chambers functionally specialized for receiving (atria) and sending (ventricles) blood [71]. Interestingly, muscle cells are developmentally and evolutionarily derived from mesoderm, a phylogenetically older tissue type [71].While the structure of invertebrate and vertebrate hearts are dramatically different, the core gene regulatory network for heart formation is highly conserved and governed by homologous transcription factors and signaling molecules in all animals (Figure 1) [71,72]. This conserved core of regulators controls cardiac cell fate, the expression of genes for contractile proteins, and the morphogenesis of the heart [71]. For example, initiation of both Drosophila and mouse hearts during development is dependent on the homologous transcription factors tinman (tin) and Nkx2.5, respectively. Detailed network analysis showed that both tin and Nkx2.5 function by regulating many of the same target genes in their respective species. However, gene swap experiments that attempted to rescue tin-null flies with its mammalian homolog Nkx2.5 uncovered that they were only partially equivalent. Specifically, the mouse Nkx2.5 gene could completely rescue the expression of some target genes, like FascIII, and partially rescue others like MEF2, but the Nkx2.5 gene could not rescue the expression of eve and zfb-1 in the developing Drosophila heart [47,50].
Figure 1
Gene regulatory networks for heart development in Drosophila (top) and vertebrates (middle); shared linkages are shown at the bottom. A core set of regulatory genes are used in common between insect and vertebrates and are linked in a similar way in conserved subcircuits of the gene network. Gray boxes highlight different ways that the same two nodes of the network are linked in Drosophila and vertebrates. Orthologs of many regulatory genes in vertebrate heart formation are not known in Drosophila. Image used with permission from [72].
Similar to tin and Nkx2.5, human Otx1/2 genes can rescue their Drosophila ortholog, otd, in nervous system development [46,73]. The reverse, however, is only partially true. Otd can replace most Otx1/2 functions in mouse nervous system development, but it is unable to rescue the development of the mesencephalon, cerebellar foliation, or the lateral semicircular canals of the inner ear [74]. These differences in function between otd and Otx1/2 likely result from divergence in their ability to regulate downstream target genes. Montalta-He et al. (2002) tested this possibility using whole genome microarrays in Drosophila, overexpressing otd and Otx2. Amazingly, of the 287 and 682 genes that responded to otd and Otx2 overexpression, respectively, only 90 were shared target genes. Thus, even transcription factors that are functionally equivalent with respect to the development of a particular structure can be nonequivalent with respect to regulation of gene networks at the systems level.The functional equivalence of tin/Nxk2.5 with respect to FascIII and MEF2 expression, but not eve and zfb-1, in the developing heart and the divergence of otd/Otx2 target genes and functions in specific parts of nervous system development is particularly revealing. FascIII is a cell surface antigen expressed in derivatives of visceral mesoderm, and MEF2 is a transcription factor that is a central component of gene regulatory networks in all muscle types [47,50]. Thus, they are likely ancient targets of tin/Nxk2.5, whose expression in muscle cells predates the origin of the heart. However, the ability of tin, but not Nkx2.5, to regulate eve and zfb-1 suggests these genes are either derived, clade-specific targets of tin in Drosophila or Nkx2.5 has lost the ability to regulate them during the evolution of mammals. Similarly, the function of otd/Otx2 in nervous system development is ancient, and their ability to regulate partially overlapping sets of target genes reflects the underlying homologies in nervous system development in bilaterians. Nevertheless, clade-specific functions have evolved for both genes, suggesting that while the basic developmental plan of nervous system development is similar between bilaterians, otd/Otx2 have functionally diverged with respect to their target gene repertoires and the ability to direct the development of species-specific neural structures.Like the divergence in protein functions, gene regulatory networks can diverge over time. Thus, homologous proteins may not have equivalent functions in regulatory networks, or may not even be part of the same network between different species. This suggests that functional genomic and biomedical studies should be careful when determining gene function and network position in models and applying it to other species, particularly when the phylogenetic distance between the species grows.
Developmental and physiological systems drift
It is generally a reasonable assumption — and fundamental to biomedical studies using animal models — that homologous developmental processes govern the development and function of homologous cells, tissues, and organs. Indeed, animal models have provided invaluable information on development, disease, and gene function. However, a growing number of studies have uncovered surprising divergence in developmental pathways, often without overt changes in phenotype [75]. This phenomenon, the development of homologous systems via divergent processes, has been termed developmental system drift (DSD).DSD is emerging as a general process that arises after species diverge. Indeed, the divergence in target genes between insect and mammalian tin/Nxk2.5 and otd/Otx2 likely are special cases of DSD. While the divergence between insect and mammalian developmental systems may have occurred over 100 million years, DSD can arise over much shorter time frames. For example, thoracic bristles are missing in hybrids between Drosophila melanogaster and Drosophila simulans, even though the pattern of thoracic bristles is identical between D. melanogaster and D. simulans [76]. More dramatically, the second (T2) and third (T3) thoracic segments of hybrids between D. subobsura and D. madierensis are partially transformed into the first segment (T1) [77,78].Hints of DSD also have been discovered in limb development and spermatogenesis. Radical-fringe (R-fng), an extracellular protein that binds Notch during cell signaling, is expressed at the dorsal-ventral boundary of the limb bud in mouse and chicken, where it promotes the formation of the apical epidermal ridge (AER; Johnston et al. 1997). Remarkably, while R-fng is sufficient to induce AER formation in chickens [79], mouse knockouts of R-fng have no defects in limb development [80]. A particularly interesting example of DSD is the testis-specific protein, Y-encoded (TSPY) gene. One or more copies of the TSPY gene are found on the Y chromosome of all placental mammals, and, as its name implies, its expression is restricted to the testis [81]. Remarkably, although TSPY is functional in rat, Mus musculus has a single-copy TSPY pseudogene that does not generate a functional transcript; knockin of multiple human TSPY does not alter spermatogenesis [81]. Obviously, this natural knockout does not affect fertility in mice and restoration of functional copies of TSPY does not alter fertility, suggesting that the spermatogenesis has drifted away from TSPY-dependence in mice.Recent studies suggest that the molecular basis of DSD likely is transcriptional regulation. Various elements of transcriptional regulation play central roles in development, homeostasis, and evolution. Initial studies comparing genome-wide binding sites for two stem cell-specific transcription factors found large differences in binding site locations between human and mouse [82]. In a more comprehensive analysis, Odom et al. (2007) systematically compared the binding of four tissue-specific transcription factors (FOXA2, HNF1A, HNF4A, and HNF6) using ChIP-chip on hepatocytes purified from human and mouse livers. Despite the conserved function of these transcription factors, between 41 percent and 89 percent of their binding sites were species-specific [83]. Moreover, genome-wide analysis of polymorphisms in human populations using the Derived Allele Frequency (DAF) test of natural selection found that many human- and primate-specific transcription factor binding sites in cis-regulatory elements are evolving under positive selection [68]. These results suggest that divergence in gene regulation is adaptively driven and can accumulate particularly fast without leading to obvious differences in phenotype [54,84]. Thus, even morphologically similar organs can develop from divergent gene regulatory networks.
Pseudoorthology
The differential gain and loss of genes is an important source of functional variation between species. Differences in gene content between species arise from lineage-specific gains via duplication and losses via deletion or inactivation. A consequence of gene duplication is often functional differentiation because one copy is no longer constrained to perform the ancestral function and is free to acquire novel functions. This process of neofunctionalization has been proposed to be the main source of new protein functions during evolution [85].While gene duplication followed by neofunctionalization is a major source of molecular novelty, it can also mask true orthology — particularly if there is differential loss of duplicate genes in two species. For example, if gene A duplicates before a speciation event to produce paralogs A and A’, differential loss of duplicates after a subsequent speciation can lead to one species maintaining A while the other maintains A’. If the initial duplication event was associated with neofunctionalization, then even though both species apparently have a copy of the A gene, the gene’s functions will not be the same and can be very different.Unfortunately, there are few studies of how common differential gene loss is, thus, it is not clear how often it may bias functional genomic annotation of gene function or biomedical studies of gene functions [86]. However, a recent study of the mammalian beta-globin gene family suggests it can be common among some genes. Opazo et al. (2008) used a comparative genomic approach to investigate β-globin gene turnover in placental mammals. Interestingly, they found that there was large-scale differential loss and retention of genes after an initial expansion of the non-adult portion of the β-globin gene cluster in the common ancestor of placental mammals (Figure 2). The differential sorting and species-specific duplication/neofunctionalization of ε-, γ-, and η-globin gene lineages among groups of placental mammals has produced species differences in the functions of hemoglobin isoforms, thereby generating variation in the complement of globin genes among mammals [87].
Figure 2
β-globin gene family evolution in eutherian mammals. Two successive duplications of a proto-ε gene gave rise to the γ and η genes in the stem-lineage of eutherian mammals after their divergence from marsupials. Consequently, the full complement of embryonic globin genes (hemoglobin ε, γ and η) was present in the common ancestor of the eutherian mammals. The η gene was lost in the common ancestor of Xenarthrans and Afrotherians, and the γ gene was lost in Xenarthrans after divergence from the Afrotherians. The η and γ genes were independently lost in the Euarchontoglires and Laurasiatherians, respectively. Crosses indicate lineage-specific gene losses. Image used with permission from [87].
Functional divergence between mouse and human genes
The examples discussed above indicate that functional divergence in genes and regulatory networks accumulate during evolution. This suggests that a fundamental assumption of comparative genomics and biomedical studies, i.e., that gene functions are conserved between model organisms and humans, may be too simplistic. Indeed, beyond case-by-case studies and anecdotal evidence, there have been few systematic examinations to verify the assumption of functional conservation between models and humans. Examples of functional divergence, however, have received little attention within the biomedical community, despite a growing number of studies finding that changes in protein function may be common during mammalian evolution.Perhaps one of the most dramatic examples of an essential human gene that is non-essential in mouse is Acrosin (ACR). ACR enables sperm to penetrate the extracellular matrix and fertilize oocytes by proteolyzing the zona pellucida. Given the role of ACR in fertilization, it is not surprising that numerous studies have implicated ACR in human male infertility [88,89]. Unexpectedly, however, male ACR-null mice are fertile in spite of the complete absence of acrosin protease activity in the sperm [90]. Thus, ACR is not essential for sperm penetration of the zona pellucida or fertilization in mice, despite its necessity for human fertility. Interestingly, mouse sperm have much lower levels of ACR than other rodents, such as rat or hamster, suggesting that other proteases may function in place of ACR during mouse fertilization [91]. In fact, other serine proteases in addition to ACR are present in mouse sperm, but not in rat or hamster, which may be an indication of developmental system drift in mammalian fertilization systems [91].Like ACR, Deleted in Azoospermia (DAZ) and DAZ-like (DAZL) are involved in male germ cell differentiation and function. DAZL is highly conserved during evolution and has been isolated from species as diverse as mammals, frogs, fish, and worms; DAZ is only found in apes and other Old World monkeys [92]. Thus, the DAZ gene duplicated sometime after Old World and New World monkeys diverged but before the radiation of the Old World monkeys. In most mammals, the ancestral DAZL is sufficient to complete gametogenesis; however, in humans, deletions removing the Y-chromosomal DAZ gene are often associated with azoospermia or oligospermia [93], indicating DAZ is necessary for spermatogenesis. Interestingly, most amino acids in DAZ are under strong selective constraint, suggesting they are important for protein structure/function. Nevertheless, a few sites evolving under diversifying selection suggest they are involved in a molecular arms race or that these sites have different species-specific functions [94]. Lineage-specific analysis indicated that human members of this gene family were evolving by positive Darwinian selection, suggesting DAZ may have evolved human-specific functions [94].Vogel et al. (2002) studied the functions of the human DAZL and DAZ genes in mouse DAZL-null mice and found that both DAZL and DAZ enabled production of prophase spermatocytes. However, both human genes failed to promote differentiation into mid- to late pachytenes. The human DAZL transgene led to more early germ cells compared with that observed in DAZ. Human DAZL and DAZ can only substitute for early functions of the mouse DAZL, suggesting gene- and species-specific function have evolved in DAZL/DAZ after human and mouse diverged and/or after DAZL/DAZ duplicated, particularly in their ability to drive mid- to late spermatogenesis. Thus, even though both human DAZL/DAZ and mouse DAZ are essential genes, the essentiality has diverged.During fertilization, sperm trigger a series of oscillations in the concentration of intracellular calcium, which initiates egg activation. The major protein delivered by the sperm that induces the calcium flux is an isoform of phospholipase C termed phospholipase C zeta 1 (PLCZ1). Remarkably, mouse PLCZ1 is transported into the nucleus after fertilization, but PLCZ1 from other species, such as human, rat, and fish, are not. Ito et al. (2008) traced this difference in nuclear localization ability to a novel nuclear localization signal (NLS) that evolved in mouse within a preexisting lysine/arginine rich region of the protein (Figure 3). All PLCZ1 genes tested by Ito et al. (2008) could induce Ca2+ oscillations when expressed in mouse eggs, but the activity was highly variable. For example, Ca2+ oscillations induced by human PLCZ1 continued far beyond the time of pronuclear formation, whereas oscillations induced by rat PLCZ1 stopped far before pronuclear formation. Ito and colleagues found that PLCZ1 sequestration into the pronucleus participates in termination of Ca2+ oscillations of mouse embryos, but not other mammals, indicating the mechanism that terminates Ca2+ oscillations after fertilization has diverged significantly, even between the closely related mouse and rat [95].
Figure 3
Evolution of a novel nuclear localization signal (NLS) in PLCZ1. (a) Logo of the X-Y linker region from 31 species of amniotes. In this representation of a multiple sequence alignment the conservation of amino acid residues in the protein are indicated by the height of the column (Bit Score) while the conservation of specific amino acids in at that site are shown by the height of letters within columns. Note that conservation is generally poor. (b) Superimposed structural models of human (blue) and mouse (red) PLCZ1 proteins. Note that while the structure of human and mouse PLCZ1 proteins are generally the same, the X-Y linker region is much longer in humans leading to a longer disordered loop than in mouse. The mouse NLS is shown as red spheres and the human pre-NLS is shown as blue spheres.
The examples of ACR, DAZ/DAZL, and PLCZ1 show that homologous proteins are not necessarily functionally equivalent, but these case studies only hint that a larger pattern of functional divergence may exist. Recently, Liao and Zhang (2008) systematically compared the phenotype of null mutations for 120 essential human genes, i.e., genes associated with death before puberty or infertility, which have been knocked out in mice. Remarkably, ~22 percent of human essential genes were non-essential in mouse. Changes in essentiality between species were associated with adaptive evolution of the proteins, consistent with previous findings that changes in protein function are often associated with rapid evolution. By examining the evolution of genes that have changed essentiality between humans and mice in a broader context, they found that positive selection was much more common in the human lineage than in the rodent, suggesting that the differences in essentiality arose during human evolution [96].
Conclusions and outlook
Biomedical studies of animal models have played incredibly important roles in elucidating the molecular basis development, physiology, behavior, and pathogenesis of human disease. Transgenic models are so successful in elucidating the molecular function of genes that they are now a standard tool in molecular genetics and biomedical studies and are being used to assign functions to each gene in the genome through initiatives such as the Knockout Mouse Project (KOMP). Yet the assumption that gene functions and developmental systems are conserved between models and humans is taken for granted, in spite of the evidence that gene functions and gene networks can diverge through evolution. The mechanisms and examples of functional divergence discussed above indicate that gene studies in model organisms need to be examined carefully for several potential biases. Annotation of gene functions based solely on mutant phenotypes in animal models, as well as assumptions of conserved functions between species, may be misleading. Perhaps more importantly, animal models of human disease may not provide appropriate information on gene function, particularly for rapidly evolving genes and systems. It is vitally important to demonstrate equivalence between the model and human gene with respect to the particular function under study to avoid spurious conclusions.
Authors: Vincent J Lynch; Andrea Tanzer; Yajun Wang; Frederick C Leung; Birgit Gellersen; Deena Emera; Gunter P Wagner Journal: Proc Natl Acad Sci U S A Date: 2008-09-22 Impact factor: 11.205
Authors: Christina L Zheng; Beth Wilmot; Nicole Ar Walter; Denesa Oberbeck; Sunita Kawane; Robert P Searles; Shannon K McWeeney; Robert Hitzemann Journal: BMC Genomics Date: 2015-02-05 Impact factor: 3.969