Literature DB >> 22490996

Evolution of vertebrate tissues driven by differential modes of gene duplication.

Masanobu Satake1, Masakado Kawata, Aoife McLysaght, Takashi Makino.   

Abstract

In this study, we investigated the evolution of vertebrate tissues by examining the potential association among gene expression, duplication, and base substitution patterns. In particular, we compared whole-genome duplication (WGD) with small-scale duplication (SSD), as well as tissue restricted with ubiquitously expressed genes. All patterns were also analysed in the light of gene evolutionary rates. Among those genes characterized by rapid evolution and expressed in a restricted range of tissues, SSD was represented in a larger proportion than WGD. Conversely, genes with ubiquitous expression were associated with slower evolutionary rates and a larger proportion of WGD. The results also show that evolutionary rates were faster in genes expressed in endodermal tissues and slower in ectodermal genes. Accordingly, the proportion of the SSD and WGD genes was highest in the endoderm and ectoderm, respectively. Therefore, quickly evolving SSD genes might have contributed to the faster evolution of endodermal tissues, whereas the comparatively slowly evolving WGD genes might have functioned to maintain the basic characteristics of ectodermal tissues. Mesenchymal tissues occupied an intermediate position in this regard, whereas the patterns observed for haemocytes were unique. Rapid tissue evolution could be related to a specific gene duplication mode (SSD) and faster molecular evolution in response to exposure to the external environment. These findings reveal general patterns underlying the evolution of tissues and their corresponding genes.

Entities:  

Mesh:

Year:  2012        PMID: 22490996      PMCID: PMC3415292          DOI: 10.1093/dnares/dss012

Source DB:  PubMed          Journal:  DNA Res        ISSN: 1340-2838            Impact factor:   4.458


Introduction

Mutations are the main factors driving genome evolution and may happen within genes through base substitutions or involve their entire duplication.[1] In the latter case, two mechanisms have been recognized: whole-genome duplication (WGD) and small-scale duplication (SSD), which occurs in relatively a small region of the genome during evolution. For instance, the early vertebrate ancestor is thought to have undergone two rounds of WGD,[2-5] as suggested by four vertebrate Hox gene clusters located in different chromosomes. Paralogous genes originated from WGD are referred to as ohnologues.[6] Singletons, in turn, are genes that did not undergo either WGD or SSD. In general, duplicated genes are redundant and their functions overlap. Thus, in simple organisms such as yeasts and nematodes, the proportion of essential genes in duplicated genes is half as low as that in singletons.[7,8] In mice, however, the proportion of essential genes is comparable between duplicated genes and singletons.[9,10] Furthermore, ohnologues are likely to contain a larger proportion of essential genes than SSD genes.[11] Ohnologues are indeed associated with development, the regulation of transcription, and protein complex formation.[11] For these genes to function properly, the relative amount of their products must be balanced.[12-14] WGD typically involves the simultaneous duplication of all genes, therefore preserving the relative dosages of each gene. Because either the loss or the gain of ohnologues may lead to dosage imbalance, ohnologues are expected to be retained intact in the genome.[15] Similarly, ancient WGD-derived ohnologues are expected to be more conservative than recently evolved SSD genes. Gene expression can be tissue-specific, determining phenotypes such as the morphology and function of tissues, or ubiquitous. Previous research indicates that ubiquitously expressed genes are likely to evolve slowly.[16-18] For example, in humans and mice, the orthologous genes that are expressed in a limited number of tissues tend to evolve faster than ubiquitously expressed genes.[18] Little is known, however, about the extent to which the evolution of tissues is influenced by the differential modes of gene duplication and expression. Indeed, there are no reports exploring the effects of gene duplication events, such as WGD and SSD, on tissue-restricted or ubiquitous gene expression. In this study, the potential associations between gene evolutionary rates, duplication (WGD, SSD, and singletons), and gene expression breadth in different tissues (restricted or ubiquitous) were investigated. In addition, these parameters were also analysed in relation to the developmental origin of tissues (endodermal, mesenchymal, or ectodermal). The results support the notion that both base substitutions within genes and gene duplication are associated with gene expression breadth and that the nature of duplication (WGD or SSD) differs substantially depending on the germ-layer origin of the tissue. Tissue evolution is therefore discussed here as the outcome of a process involving the gene evolutionary rate, duplication, and expression.

Materials and Methods

Classification of human genes based on the gene duplication mode

Protein-coding genes of human origin were obtained from Ensembl release 52 (http://www.ensembl.org). A total of 7294 ohnologues and 9027 SSD genes were defined as described in Makino and McLysaght.[15] Briefly, duplicated genes were so judged when the two aligned sequences showed homology in their >30% length with e < 10−7 in BLAST search. Ohnologues were syntenic genes located on paralogous chromosomal regions and derived from WGD, whereas SSD genes were duplicated genes not experiencing WGD. Of the 9027 SSD genes, 1478 genes were classified as both ohnologues and SSD genes and, therefore, excluded from the analysis, resulting in 7549 pure SSD genes. An additional 6064 genes were classified as singletons. Thus, 20 907 genes (7294 + 7549 + 6064 = 20 907) were considered. To define the origin of SSDs, a sequence similarity search was performed within protein-coding human genes using the all-against-all BLASTP program. Synonymous substitution rates (KS) were estimated for each close paralogue. There were 2510 and 5039 SSD genes for which KS were <1 (recent SSD) and ≥1 (old SSD), respectively (2510 + 5039 = 7549).

Human orthologous genes and gene evolutionary rates

Human genes with orthologues in mice and other species were obtained from Ensembl release 52. Orthologous sequences were aligned using CLUSTALW,[19] and KS and non-synonymous substitution rates (KA) were deduced for each orthologous pair using the method of Yang and Nielsen[20] implemented in PAML.[21] Next, ω values (KA/KS) were calculated.

Human EST data

An expression sequence tag (EST) database[22] was used to determine the expression profile of human genes in various tissues. The data were registered at NCBI and included 3 199 559 reads from 47 different human tissues. Each tissue contained more than 10 000 ESTs (68 076 on an average). Using these 47 tissues, the breadth of expression of a gene was represented by the number of tissues in which the gene was identified, as based on the detection of ESTs.[22] Thus, breadth varied from 1 to 47. Among the 20 907 human genes considered for analysis, there were 3871 for which ESTs were not identified in the EST database. These genes were excluded from further analyses, therefore resulting in a total number of genes of 17 036, among which 6 952 were ohnologues, 5505 were SSDs, and 4579 were singletons.

Classification of human tissues based on the developmental origin

Forty-three tissues were classified into four subgroups based on their developmental origin, as follows: endoderm (nasopharynx, thymus, stomach, colon, bladder, liver, trachea, lung, pancreas, uterus, cervix, prostate, and intestine), ectoderm (breast, skin, caudate nucleus, hypothalamus, eye, thalamus, subthalamic nucleus, cerebellum, hippocampus, corpus callosum, nervous system, amygdala, and substantia nigra), mesenchymal (synovium, kidney, adipose tissue, bone, adrenal gland, cartilage, muscle, pericardium, and heart), and haemocytes (B cells, bone marrow, germinal centre B cells, blood cells, lymph node, spleen, blood vessels, and T lymphocytes). The developmental origin of four tissues, such as ovary, testis, amnion, and placenta, was not categorized into any of the above four subgroups. One caveat of this classification, however, is that the intestine is classified as an endodermal tissue based on the presence of endoderm-derived intestinal epithelium. However, as a macroscopic organ, the intestine includes not only epithelium but also mesoderm-derived tissues such as the submucosa and muscles. Thus, in the above classification, an endodermal tissue usually includes both endoderm- and mesoderm-derived tissues, whereas an ectodermal tissue includes both ectoderm- and mesoderm-derived tissues.

Definition of tissue evolutionary rates

The tissue evolutionary rate was originally calculated by Kuma et al.[23] and this method has been adopted in the succeeding studies.[18,24] Thus, we simply employed their definition in the present study. We assumed that a set of genes are expressed in a given tissue type and that the ω values of expressing genes are X1, X2, …, X (for ohnologues), Y1, Y2, …, Y (for SSD genes), and Z1, Z2, …, Z (for singletons). The evolutionary rate of this tissue is then given by: [(X1 + X2 + ··· + X) + (Y1 + Y2 + ··· + Y) + (Z1 + Z2 + ··· + Zo)]/(m + n + o), where m, n, and o are the respective number of ohnologues, SSD genes, and singletons.

Results

Association between expression breadth and duplication mode

Figure 1 shows the gene number distribution over various expression breadths (note that the actual number of genes expressed in each tissue at each expression breadth is shown in Supplementary Table S1). Figure 1A–C displays the numbers of ohnologues, SSD genes, and singletons, respectively. The average number of tissues in which each gene type was expressed was 18.7 for ohnologues (blue), 16.6 for SSD genes (red), and 18.0 for singletons (green). Figure 1D shows that the proportions of each gene type in each expression breadth were noticeably different. Specifically, the proportion of SSD genes was relatively higher among breadth-restricted genes (see n = 1, n ≤ 3, n ≤ 5, and n ≤ 10), whereas the proportion of ohnologues was increasingly higher among genes expressed in a larger number of tissues (see n > 10, n > 20, and n > 40).
Figure 1.

The number and proportion of ohnologues, SSD genes, and singletons at each tissue expression breadth (as determined by the number of tissues where a corresponding EST was detected for a respective gene). The number of genes at each expression breadth was counted for ohnologues (A, blue), SSD genes (B, red), and singletons (C, green). (D) The percentage of ohnologues, SSD genes, and singletons at each breadth.

The number and proportion of ohnologues, SSD genes, and singletons at each tissue expression breadth (as determined by the number of tissues where a corresponding EST was detected for a respective gene). The number of genes at each expression breadth was counted for ohnologues (A, blue), SSD genes (B, red), and singletons (C, green). (D) The percentage of ohnologues, SSD genes, and singletons at each breadth.

Association between gene evolutionary rates and duplication mode

Ohnologues are believed to be more conservative than SSD genes with respect to functional essentiality and dosage-balance requirement.[11,15] To determine whether ohnologues are also conservative in terms of molecular evolution, non-synonymous nucleotide divergence in the coding region (between humans and mice) were examined. Table 1 shows that the average ω value of ohnologues (0.11) was 0.55–0.57-fold lower than that of SSD genes (0.19; P < 2.2 × 10−16, the Mann–Whitney U-test) and of singletons (0.20; P < 2.2 × 10−16). The result confirms the conservative nature of ohnologues in the evolution of coding regions.
Table 1.

Average KA/Ks values

CategorySubcategoryNumber of compared orthologous gene pairsAverage KA/KsSD
Expressed genesOhnologues69520.110.11
Total SSD genes55050.190.16
 Old SSD genes (Ks ≥ 1)43560.170.14
 Recent SSD genes (Ks < 1)11490.290.20
Singletons45790.200.22
Narrowly expressed genes (n ≤ 10)Ohnologues21510.130.12
Total SSD genes20080.260.18
 Old SSD genes (Ks ≥ 1)13380.220.16
 Recent SSD genes (Ks < 1)6700.340.20
Singletons11010.290.19
Broadly expressed genes (n > 10)Ohnologues48010.100.11
Total SSD genes34970.150.14
 Old SSD genes (Ks ≥ 1)30180.140.12
 Recent SSD genes (Ks < 1)4790.230.19
Singletons34780.170.23
Average KA/Ks values

Tissue evolutionary rates in restricted expression breadths

In Fig. 2A, tissue evolutionary rates are plotted in an increasing order, for each of 47 tissues, and under a condition of n ≤ 10 (see ‘Proportions of ohnologues and SSD genes in various expression breadths’ as for a rationale of the use of n ≤ 10). The ω values gradually increased from 0.13 in a slowly evolving, leftward tissue to 0.29 in a fast evolving, rightward tissue.
Figure 2.

Contribution of ohnologues, SSD genes, and singletons among breadth-restricted (n ≤ 10 tissues) and breadth-ubiquitous (n > 10 tissues) genes. Blue, red, and green represent the ohnologues, SSD genes, and singletons, respectively. Tissues are aligned on the x-axis in the order of magnitude of the KA/KS (ω) values of their expressed genes. Parameters used in the y-axis are as follows: (A and E) the average ω values of all genes expressed in a given tissue, which corresponds to the defined tissue evolutionary rate; (B and F) the ω values of ohnologues, SSD genes, and singletons; (C and G) the proportion of ohnologues, SSD genes, and singletons; and (D and H) the ω values and the proportions calculated for ohnologues, SSD genes, and singletons. The order of tissues aligned are as follows from the left to the right. In (A)–(D), the nervous system, subthalamic nucleus, amygdale, cerebellum, cartilage, substantia nigra, hypothalamus, hippocampus, pericardium, corpus callosum, thalamus, T lymphocytes, ovary, eye, heart, caudate nucleus, adipose tissue, pancreas, skin, muscle, adrenal gland, prostate, breast, lymph, kidney, colon, amnion, cervix, placenta, stomach, bladder, lung, uterus, bone, germinal centre B cell, intestine, trachea, bone marrow, B cells, testis, liver, spleen, thymus, blood vessels, synovium, blood, and nasopharynx. In (E)–(H), the amnion, cartilage, B cells, skin, muscle, nervous system, hypothalamus, cervix, adipose tissue, substantia nigra, caudate nucleus, corpus callosum, T lymphocytes, subthalamic nucleus, heart, bone marrow, adrenal gland, lymph, blood vessels, bone, bladder, amygdale, ovary, hippocampus, pancreas, breast, colon, blood, eye, pericardium, cerebellum, thalamus, prostate, stomach, germinal centre B cell, kidney, liver, spleen, lung, uterus, testis, placenta, intestine, thymus, synovium, trachea, and nasopharynx.

Contribution of ohnologues, SSD genes, and singletons among breadth-restricted (n ≤ 10 tissues) and breadth-ubiquitous (n > 10 tissues) genes. Blue, red, and green represent the ohnologues, SSD genes, and singletons, respectively. Tissues are aligned on the x-axis in the order of magnitude of the KA/KS (ω) values of their expressed genes. Parameters used in the y-axis are as follows: (A and E) the average ω values of all genes expressed in a given tissue, which corresponds to the defined tissue evolutionary rate; (B and F) the ω values of ohnologues, SSD genes, and singletons; (C and G) the proportion of ohnologues, SSD genes, and singletons; and (D and H) the ω values and the proportions calculated for ohnologues, SSD genes, and singletons. The order of tissues aligned are as follows from the left to the right. In (A)–(D), the nervous system, subthalamic nucleus, amygdale, cerebellum, cartilage, substantia nigra, hypothalamus, hippocampus, pericardium, corpus callosum, thalamus, T lymphocytes, ovary, eye, heart, caudate nucleus, adipose tissue, pancreas, skin, muscle, adrenal gland, prostate, breast, lymph, kidney, colon, amnion, cervix, placenta, stomach, bladder, lung, uterus, bone, germinal centre B cell, intestine, trachea, bone marrow, B cells, testis, liver, spleen, thymus, blood vessels, synovium, blood, and nasopharynx. In (E)–(H), the amnion, cartilage, B cells, skin, muscle, nervous system, hypothalamus, cervix, adipose tissue, substantia nigra, caudate nucleus, corpus callosum, T lymphocytes, subthalamic nucleus, heart, bone marrow, adrenal gland, lymph, blood vessels, bone, bladder, amygdale, ovary, hippocampus, pancreas, breast, colon, blood, eye, pericardium, cerebellum, thalamus, prostate, stomach, germinal centre B cell, kidney, liver, spleen, lung, uterus, testis, placenta, intestine, thymus, synovium, trachea, and nasopharynx.

Gene evolutionary rate in restricted expression breadths

The average ω values were calculated separately for each type of gene as follows: (X1 + X2 + ··· + X)/m for ohnologues, (Y1 + Y2 + ··· + Y)/n for SSD genes, and (Z1 + Z2 + ··· + Z)/o for singletons. Figure 2B shows that the evolutionary rate (ω) of ohnologues (blue), SSD genes (red), and singletons (green) increased gradually in line with increases in tissue evolutionary rates, indicating that all gene types evolved in parallel with the tissue evolution. Therefore, gene evolutionary rates were higher in the genes that are expressed in fast evolving tissues irrespective of the gene type, whereas those were lower in slowly evolving tissues regardless of the gene type. It is worth noticing, however, that the average ω value of ohnologues was low (0.13), whereas that of SSD genes and singletons was high (0.26 and 0.29, respectively; Fig. 2B and see n ≤ 10 in Table 1).

Proportions of gene type in restricted expression breadths

Figure 2C shows that an increase in tissue evolutionary rates was accompanied by a corresponding decrease in the proportion of ohnologues and an increase in the proportion of SSD genes. This result indicates that ohnologues and SSD genes tended to be expressed in slowly and fast evolving tissues, respectively. The proportion of singletons also appeared to increase in parallel with tissue evolutionary rates. Figure 2B and C also indicates that ohnologues and SSD genes behave differently. In the case of SSD genes, ω values as well as proportions are positively associated with tissue evolutionary rates, suggesting that SSD genes contribute to faster tissue evolution. In the case of ohnologues, however, an increase in ω was associated with a decrease in proportion, accompanying the tissue evolutionary rate. Therefore, further analysis is needed to examine the relative contribution of ohnologues to tissue evolution.

Differential contribution of ohnologues, SSD genes, and singletons among genes with restricted expression

Both the evolutionary rate of each gene type and their proportion are incorporated in the following formula: where [(X1 + X2 + ··· + X)/m] × [m/(m + n + o)] represents ohnologues, [(Y1 + Y2 + ··· + Y)/n] × [n/(m + n + o)] represents SSD genes, and [(Z1 + Z2 + ··· + Z)/o] × [o/(m + n + o)} represents singletons. Figure 2D shows each ohnologue, SSD, and singleton component in relation to tissue evolutionary rates. For SSD genes and singletons, there was a positive association between tissue and gene evolutionary rates. In contrast, ohnologues were almost flat irrespective of tissue evolution. This flat line for ohnologues suggests that ohnologues did not play a major role in tissue evolution, in agreement with the notion that ohnologues are conservative in nature. However, this does not necessarily mean a lack of any role for ohnologues in tissue evolution. In fact, the proportions of ohnologues were substantially reduced in fast evolving tissues. This effect and the increase in the ω values cancelled out each other.

Equal contribution of ohnologues, SSD genes, and singletons among genes with ubiquitous expression

Figure 2E–H represents the results obtained under a condition of n > 10 tissues and show virtually no variation (flat lines) for all parameters measured and gene types among tissues. This is not surprising given the ubiquitous nature of the expression of each gene type in a broad range of tissues. The average ω values when n > 10 tissues were remarkably lower, particularly for SSD genes and singletons (0.15 and 0.17, respectively), than those in observed when n ≤ 10 tissues (0.26 and 0.29, respectively; Fig. 2F and B and Table 1). In terms of the proportion of gene types (Fig. 2G), the proportion of ohnologues (40%) was substantially higher than that of SSD genes and singletons (30% in both cases). This result is in sharp contrast to that found for expression breadths of n ≤ 10 tissues, indicating that among those genes that are expressed ubiquitously, there is a larger proportion of ohnologues and a lower proportion of SSD genes and singletons. Finally, the relative contribution of ohnologues, SSD genes, and singletons is nearly the same for all tissues (Fig. 2H), which is in high contrast to the situation where n ≤ 10 tissues (Fig. 2D). Overall, the results show that the relative proportion of ohnologues, SSD genes, and singletons, as well as their evolutionary rates, are substantially different between genes with ubiquitous and restricted expression breadths.

Proportions of ohnologues and SSD genes in various expression breadths

To confirm the previous findings, we analysed cases involving expression breadths other than n ≤ 10 and n > 10 tissues. Similar results to those shown in Fig. 2A–D were obtained for n = 1, n ≤ 3, n ≤ 5, n ≤ 20, and n ≤ 40 tissues, whereas similar observations as those indicated in Fig. 2E–H were obtained for n > 20 and n > 40 tissues (data not shown). Figure 3 shows the proportion of ohnologues (blue) and SSD genes (red) for each tissue under various expression breadths (n = 1, n ≤ 3, n ≤ 5, n ≤ 10, n ≤ 20, n ≤ 40, n > 10, n > 20, and n > 40). The x-axis represents the tissue evolutionary rate, whereas the y-axis represents the proportions of ohnologues and SSD genes. In tissue-restricted expression (n = 1, n ≤ 3, n ≤ 5, n ≤ 20, and n ≤ 40), the faster the tissue evolutionary rates, the higher the proportion of SSD genes and the lower the proportion of ohnologues expressed (correlation coefficients and P-values are shown in Supplementary Table S2). When the expression breadth was ≤10, a statistically significant correlation was detected for both the ohnologues and SSD genes (P = 1.6 × 10−13 and P = 2.1 × 10−5, respectively). A similar correlation was observed when the expression breadth was set at ≤20 and ≤40. However, the correlation became rather weak in these cases. Therefore, at expression breadth ≤10, correlation became maximum and most reliable.
Figure 3.

Proportion of ohnologues and SSD genes at various gene expression breadths. The y-axis represents the proportions of ohnologues (blue) and SSD genes (red), whereas the x-axis shows the tissue evolutionary rates [the average KA/KS (ω) value of all genes expressed in a given tissue]. P-values and linear correlation coefficients are shown in Supplementary Table S2.

Proportion of ohnologues and SSD genes at various gene expression breadths. The y-axis represents the proportions of ohnologues (blue) and SSD genes (red), whereas the x-axis shows the tissue evolutionary rates [the average KA/KS (ω) value of all genes expressed in a given tissue]. P-values and linear correlation coefficients are shown in Supplementary Table S2. On the other hand, in those cases where n > 20 and n > 40 tissues, the proportion of ohnologues was higher and the proportion of SSD genes was lower but there was no significant correlation between tissue evolutionary rates and the proportion of either gene type.

Orthologous genes between human and other vertebrates

The previous analyses involved genes that are orthologous in human and mice. However, the origin of ohnologues can be traced back to the emergence of vertebrates. Therefore, human orthologues in various vertebrates, including rat, cow, dog, opossum, and chicken, were extracted from Ensembl. Using an expression breadth of n ≤ 10, analyses similar to those shown in Fig. 2C were performed. In the results shown in Supplementary Fig. S1, human genes were used as references for pairwise comparisons. Similar to the results shown in Fig. 2C, the comparison between humans and other species (Supplementary Table S3) shows that those tissues with higher ω values were associated with a larger proportion of SSD genes than with ohnologues.

Recent SSD genes contribute to faster tissue evolution

Approximate gene duplication times can be estimated based on KS values, with lower and higher KS values corresponding to recent and ancient duplication events, respectively. To examine the relative contribution of each SSD gene to tissue expression, a threshold value of KS was set to 1.0 and the number and the percentage of each SSD gene at various expression breadths were counted (Fig. 4A–C). Narrower expression breadths were associated with a larger percentage of recent SSD genes (KS < 1.0, red), whereas the percentage of ancient SSD genes (KS ≥ 1.0, pink) was nearly constant regardless of expression breadth. These results indicate a higher contribution of recent SSD genes to tissue-restricted gene expression.
Figure 4.

Recent and ancient SSD genes. (A–C) The number and the percentage of ancient (pink) and recent (red) SSD genes are shown for each tissue expression breadth. (D) The proportions of ancient and recent SSD genes among genes with tissue-restricted expression limited to n ≤ 10 tissues.

Recent and ancient SSD genes. (A–C) The number and the percentage of ancient (pink) and recent (red) SSD genes are shown for each tissue expression breadth. (D) The proportions of ancient and recent SSD genes among genes with tissue-restricted expression limited to n ≤ 10 tissues. An additional analysis similar to that shown in Fig. 2C was performed using recent and ancient SSD genes as criteria. The results are shown in Fig. 4D (expression breadth ≤10 tissues). Co-linearity between ω values and gene proportions was 0.58 for recent (P = 1.7 × 10−5) and 0.33 for ancient SSD genes (P = 0.024), indicating a larger contribution of recent SSD genes to the positive co-linearity among all SSD genes observed in Fig. 2C. Yet, the slightly positive co-linearity observed for ancient SSD genes indicates that ancient SSD genes are distinct from ohnologues, which showed negative co-linearity. These results indicate that the evolution of SSD genes of both recent and ancient origins, as well as of ohnologues, is distinct in nature.

Association between gene evolutionary rates, duplication mode, and developmental origin of tissues

In the analyses of the results shown in Fig. 2, fast and slow tissue evolutionary rates were based on ω values but we did not address tissue types. To examine how ω values and the proportion of SSD genes and ohnologues are associated with the developmental origin of tissues, ω values as well as the proportions of ohnologues, SSD genes, and singletons were calculated for each tissue type. Values calculated for the four subgroups with a breadth of n ≤ 10 tissues are presented in Fig. 5 (Supplementary Table S4 shows P-values).
Figure 5.

Contribution of ohnologues, SSD genes, and singletons to gene sets expressed in haemocytes, tissues of endodermal and ectodermal origin, and mesenchymal tissues. (A) The average KA/KS (ω) values associated with each gene set. (B)–(D) The proportion of ohnologues, SSD genes, and singletons expressed in each tissue type. In each panel, averages and standard deviations are shown. P-values for pairwise comparisons are shown in Supplementary Table S4.

Contribution of ohnologues, SSD genes, and singletons to gene sets expressed in haemocytes, tissues of endodermal and ectodermal origin, and mesenchymal tissues. (A) The average KA/KS (ω) values associated with each gene set. (B)–(D) The proportion of ohnologues, SSD genes, and singletons expressed in each tissue type. In each panel, averages and standard deviations are shown. P-values for pairwise comparisons are shown in Supplementary Table S4. The average ω values were significantly higher in endodermal, intermediate in mesenchymal, and lower in ectodermal tissues (Fig. 5A). Conversely, the proportion of ohnologues was higher in ectodermal than in endodermal and mesenchymal tissues (Fig. 5B), whereas the proportion of SSD genes was highest in endodermal, intermediate in mesenchymal, and lowest in ectodermal tissues (Fig. 5C). The proportion of singletons did not vary among the three subgroups (Fig. 5D). Therefore, it is possible that SSD genes with higher ω values might have contributed to the faster evolution of endodermal tissues, whereas ohnologues with lower ω values might have functioned to maintain the essential characteristics of ectodermal tissues. Mesenchymal tissues occupied an intermediate position.

The unique evolutionary position of haemocytes

Haemocyte genes displayed unique features that were distinct from those of the three subgroups of tissues described previously. Specifically, the average ω value of haemocyte genes was the highest, whereas the proportion of haemocyte ohnologues was the lowest. Furthermore, the proportion of haemocyte singletons was higher than that of the other three subtypes of tissues.

Discussion

Gene evolutionary rate, duplication, and expression breadth

In the present study, the putative association between gene evolutionary rates, duplication, and their expression breadth in different tissues was examined. Gene evolutionary rates are affected by various factors. For example, duplicated genes are believed to evolve relatively fast, whereas singletons evolve more slowly.[1] Another important factor is the expression breadth of each gene. Genes expressed in a wide range of tissues tend to evolve more slowly, whereas those restricted to a narrow range of tissues evolve faster.[18,25] In addition, as the number of genes in a gene family increases, the number of tissues in which the corresponding gene is expressed decreases.[26] However, these findings were based on the analysis of only two parameters at a time, even though co-linearity among the three elements considered (gene evolutionary rate, gene duplication, and gene expression breadth) is also possible. To analyse the involvement of duplicated genes on tissue evolution, we initially focused on the relationship between gene expression breadth and gene duplication. However, the percentages of singletons (Fig. 1D, green) and duplicated genes (Fig. 1D; red + blue) remained nearly constant at 30–35 and 65–70%, respectively, and were independent of gene expression breadth. This result suggests that the association between gene expression breadth and gene duplication is not as simple as reported previously. The present study not only considered the analysis of gene evolutionary rates but also incorporated into it the distinction between SSD genes and ohnologues when considering gene duplication. Gene evolutionary rates were fast and slow for SSD genes and ohnologues, respectively (Table 1). Interestingly, SSD genes were represented in a larger proportion within narrow expression ranges, whereas the opposite was observed for ohnologues (Fig. 1D). The present study is therefore the first to show co-linearity among gene evolutionary rate, duplication, and expression breadth.

Contribution of SSD genes, ohnologues, and singletons to expression breadths in tissues

Genes that are expressed in a narrow range of tissues are considered to play significant roles in the establishment of the characteristic features of the corresponding tissues. Prior studies have shown that those genes that are expressed specifically in neural tissues evolve more slowly, whereas those expressed in other tissue types evolve faster.[18,23] Alternatively, a substantial proportion of those genes with a tissue-specific expression tend to encode secreted polypeptides irrespective of tissue types, and there are few such genes in neural tissues.[24] In this study, we also examined if the nature of SSD genes depended on the type of tissue in which the corresponding genes were expressed. To this end, the evolutionary rate of a tissue was defined as the average ω value of those genes expressed in that tissue, whereas the nature of the expressed genes was evaluated by their ω values, proportions as well as by the product of these parameters. These parameters were calculated for each type of gene (SSD, ohnologues, and singletons) and plotted for each tissue (Fig. 2). Among genes with a narrow spectrum of expression, SSD genes (and singletons) played a minor role in slowly evolving tissues with low ω values. However, the evolutionary rates as well as the proportion of SSD genes were associated positively with the evolutionary rates of tissues. Thus, SSD genes (and singletons) seemed to play major roles in fast evolving tissues with high ω values, indicating that the relative contribution of SSD genes differs substantially depending on the type of tissue in which the gene is expressed. In contrast, genes with a wide spectrum of expression are presumed to maintain basic cellular functions irrespective of the tissue type. Among these genes, the relative contribution of SSD genes and ohnologues was similar. Our results indicate that ohnologues with slower rates of evolution and wide expression ranges are associated with slowly evolving tissues, whereas SSD genes with faster rates of evolution and narrow range of expression are associated with fast evolving tissues. Specifically, there were less ohnologues and more SSD genes in endodermal tissues such as the digestive tract. These tissues face the outer environment directly and therefore they may differentiate functionally and pleiotropically to adjust to environmental changes. Rapid tissue evolutionary rates could be therefore related to a specific mode of duplication (SSD genes) and faster molecular evolution (high ω values) in response to this exposure to the outer environment. Conversely, in ectoderm-derived tissues, such as the nervous system, there were more ohnologues and less SSD genes. Because these tissues are not exposed to the outer environment, their evolution may not be driven by functional differentiation but rather by the need to maintain basal functions. Ohnologues with a slow evolutionary rate and lower probability of further duplication were therefore predominant in slowly evolving tissues such as the ectoderm. Mesenchymal tissues were associated with an intermediate phenotype in terms of ω values and the proportion of ohnologues and SSD genes. Those genes with expression restricted to haemocytes exhibited very unique features. The percentage of ohnologues was the smallest among all tissue types, whereas singletons were the most representative, a result not observed for the other three tissue types. A lower proportion of conservative ohnologues might allow the relaxation of those constraints required for the formation and functionality of tissues. This flexibility might be further accelerated by an increase in the ratio of singletons. Such findings might be expected in light of the dramatic transitions of haematopoietic organs and tissues from the aorta, gonads, and mesonephros regions to the liver, spleen, and bone marrow during mammalian development and vertebrate evolution. Furthermore, the haemocytes used in this study included immune-competent cells and tissues, and immune-related genes are known to evolve fast (immunoglobulin and T cell receptor gene families were excluded from the analysis here). In this sense, endodermal tissues and haemocytes appear to adopt distinct strategies to increase the proportion of SSD genes and singletons, respectively, when adapting to environmental changes.

Supplementary Data

Supplementary Data are available at www.dnaresearch.oxfordjournals.org.

Funding

The work was supported by research grants from the Ministry of Education, Science, Sports, Culture and Technology of Japan and by the Creative Interdisciplinary Research Program of the Center for Interdisciplinary Research, Tohoku University, Japan. M.S. is supported by a GCOE program, ‘Network Medicine’, at Tohoku University.
  24 in total

1.  Robustness--it's not where you think it is.

Authors:  K Wolfe
Journal:  Nat Genet       Date:  2000-05       Impact factor: 38.330

2.  Estimating synonymous and nonsynonymous substitution rates under realistic evolutionary models.

Authors:  Z Yang; R Nielsen
Journal:  Mol Biol Evol       Date:  2000-01       Impact factor: 16.240

3.  Exploring the etiology of haploinsufficiency.

Authors:  Reiner A Veitia
Journal:  Bioessays       Date:  2002-02       Impact factor: 4.345

4.  Elevated rates of protein secretion, evolution, and disease among tissue-specific genes.

Authors:  Eitan E Winter; Leo Goodstadt; Chris P Ponting
Journal:  Genome Res       Date:  2004-01       Impact factor: 9.043

5.  Duplicate genes and robustness to transient gene knock-downs in Caenorhabditis elegans.

Authors:  Gavin C Conant; Andreas Wagner
Journal:  Proc Biol Sci       Date:  2004-01-07       Impact factor: 5.349

6.  Gene dosage balance in cellular pathways: implications for dominance and gene duplicability.

Authors:  Reiner A Veitia
Journal:  Genetics       Date:  2004-09       Impact factor: 4.562

7.  Ohnologs in the human genome are dosage balanced and frequently associated with disease.

Authors:  Takashi Makino; Aoife McLysaght
Journal:  Proc Natl Acad Sci U S A       Date:  2010-05-03       Impact factor: 11.205

8.  Extensive genomic duplication during early chordate evolution.

Authors:  Aoife McLysaght; Karsten Hokamp; Kenneth H Wolfe
Journal:  Nat Genet       Date:  2002-05-28       Impact factor: 38.330

9.  Nonlinear effects in macromolecular assembly and dosage sensitivity.

Authors:  Reiner A Veitia
Journal:  J Theor Biol       Date:  2003-01-07       Impact factor: 2.691

10.  Role of duplicate genes in genetic robustness against null mutations.

Authors:  Zhenglong Gu; Lars M Steinmetz; Xun Gu; Curt Scharfe; Ronald W Davis; Wen-Hsiung Li
Journal:  Nature       Date:  2003-01-02       Impact factor: 49.962

View more
  10 in total

1.  Tissue-Specific Evolution of Protein Coding Genes in Human and Mouse.

Authors:  Nadezda Kryuchkova-Mostacci; Marc Robinson-Rechavi
Journal:  PLoS One       Date:  2015-06-29       Impact factor: 3.240

2.  Region-specific expression of young small-scale duplications in the human central nervous system.

Authors:  Edith Le Floch; Christophe Battail; Solène Brohard-Julien; Vincent Frouin; Vincent Meyer; Smahane Chalabi; Jean-François Deleuze
Journal:  BMC Ecol Evol       Date:  2021-04-21

3.  Effect of IRS4 levels on PI 3-kinase signalling.

Authors:  Gerta Hoxhaj; Kumara Dissanayake; Carol MacKintosh
Journal:  PLoS One       Date:  2013-09-10       Impact factor: 3.240

4.  Identification of 2R-ohnologue gene families displaying the same mutation-load skew in multiple cancers.

Authors:  Michele Tinti; Kumara Dissanayake; Silvia Synowsky; Luca Albergante; Carol MacKintosh
Journal:  Open Biol       Date:  2014-05-07       Impact factor: 6.411

5.  Relationship between gene duplicability and diversifiability in the topology of biochemical networks.

Authors:  Zhanyong Guo; Wen Jiang; Nuno Lages; Wade Borcherds; Degeng Wang
Journal:  BMC Genomics       Date:  2014-07-08       Impact factor: 3.969

6.  Global analysis of human duplicated genes reveals the relative importance of whole-genome duplicates originated in the early vertebrate evolution.

Authors:  Debarun Acharya; Tapash C Ghosh
Journal:  BMC Genomics       Date:  2016-01-22       Impact factor: 3.969

7.  Selective Constraints on Coding Sequences of Nervous System Genes Are a Major Determinant of Duplicate Gene Retention in Vertebrates.

Authors:  Julien Roux; Jialin Liu; Marc Robinson-Rechavi
Journal:  Mol Biol Evol       Date:  2017-11-01       Impact factor: 16.240

8.  Functional divergence of duplicate genes several million years after gene duplication in Arabidopsis.

Authors:  Kousuke Hanada; Ayumi Tezuka; Masafumi Nozawa; Yutaka Suzuki; Sumio Sugano; Atsushi J Nagano; Motomi Ito; Shin-Ichi Morinaga
Journal:  DNA Res       Date:  2018-02-21       Impact factor: 4.458

Review 9.  AMPK: An Epigenetic Landscape Modulator.

Authors:  Brendan Gongol; Indah Sari; Tiffany Bryant; Geraldine Rosete; Traci Marin
Journal:  Int J Mol Sci       Date:  2018-10-19       Impact factor: 5.923

10.  The lasting after-effects of an ancient polyploidy on the genomes of teleosts.

Authors:  Gavin C Conant
Journal:  PLoS One       Date:  2020-04-16       Impact factor: 3.240

  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.