| Literature DB >> 21935469 |
Abstract
Recent genome-wide analyses have revealed patterns of positive selection acting on protein-coding genes in humans and mammals. To assess whether the conclusions drawn from these analyses are valid for other vertebrates and to identify mammalian specificities, I have investigated the selective pressure acting on protein-coding genes of the puffer fishes Tetraodon and Takifugu. My results indicate that the strength of purifying selection in puffer fishes is similar to previous reports for murids but stronger in hominids, which have a smaller population size. Gene ontology analyses show that more than half of the biological processes targeted by positive selection in mammals are also targeted in puffer fishes, highlighting general patterns for vertebrates. Biological processes enriched with positively selected genes that are shared between mammals and fishes include immune and defense responses, signal transduction, regulation of transcription and several of their descendent terms. Mammalian-specific processes displaying an excess of positively selected genes are related to sensory perception and neurological processes. The comparative analyses also revealed that, for both mammals and fishes, genes encoding extracellular proteins are preferentially targeted by positive selection, indicating that adaptive evolution occurs more often in the extra-cellular environment rather than inside the cell. Moreover, I present here the first genome-wide characterization of neutrally-evolving regions of protein-coding genes. This analysis revealed an unexpectedly high proportion of genes containing both positively selected motifs and neutrally-evolving regions, uncovering a strong link between neutral evolution and positive selection. I speculate that neutrally-evolving regions are a major source of novelties screened by natural selection.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21935469 PMCID: PMC3172292 DOI: 10.1371/journal.pone.0024800
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Distribution of Ka, Ks and Ka/Ks.
These values are based on full-length CDS of Tetraodon and Takifugu orthologous genes. The gMYN method [55] was used. Genes were grouped into categories of 0.05 units. The X axis was cut at 2.4 for clarity purpose, yet the maximum Ka value reached 1.2, the maximum Ks category was 5 and the maximum Ka/Ks category was 7.
Biological Process GO terms enriched with either pPSG, with genes displaying small P-values as given by the MWU test, or with GNR.
| BP GO ID | Biological Process term | pPSG | MWU test | GNR |
| GO:0009889 | Regulation of biosynthetic process | 4.1e-2 | ||
| GO:0032268 | Regulation of cellular protein metabolic process | 3.3e-8 | ||
| GO:0006417 | Regulation of translation | 1.9e-7 | ||
| GO:0006446 | Regulation of translational initiation | 6.2e-8 | ||
| GO:0010468 | Regulation of gene expression | 4.9e-2 | ||
| GO:0010608 | Postranscriptional regulation of gene expression | 1.8e-6 | ||
| GO:0050896 | Response to stimulus | 2.4e-2 |
| |
| GO:0006950 | Response to stress | 1.2e-2 | ||
| GO:0006952 | Defense response | 2.4e-2 | ||
| GO:0002541 | Activation of plasma proteins involved in acute inflammatory response | 5e-3 | ||
| GO:0002376 | Immune system process | 2.142e-3 | ||
| GO:0006955 | Immune response | 3.1e-2 | 2.147e-3 | |
| GO:0002682 | Regulation of immune system process | |||
| GO:0002684 | Positive regulation of immune system process | 5e-3 | ||
| GO:0050776 | Regulation of immune response | 5e-3 | ||
| GO:0050778 | Positive regulation of immune response | 5e-3 | ||
| GO:0002253 | Activation of immune response | 5e-3 | ||
| GO:0019882 | Antigen processing and presentation | 5.1e-8 | ||
| GO:0006956 | Complement activation | 5e-3 | ||
| GO:0008152 | Metabolic process | |||
| GO:0006298 | Mismatch repair | 1.9e-2 | ||
| GO:0006302 | Double-strand break repair | 1.4e-2 | ||
| GO:0016999 | Antibiotic metabolic process | 8.9e-3 | ||
| GO:0022610 | Biological adhesion | 2.5e-10 | ||
| GO:0007155 | Cell adhesion | 2.5e-10 | ||
| GO:0007010 | Cytoskeleton organization | 4e-3 | ||
| GO:0000226 | Microtubule cytoskeleton organization | 2.1e-3 | ||
| GO:0006810 | Transport | |||
| GO:0015674 | Di-, tri-valent inorganic cation transport | 1.1e-2 | ||
| GO:0006816 | Calcium ion transport | 1.2e-2 | ||
| GO:0006820 | Anion transport | 3.3e-10 | ||
| GO:0015698 | Inorganic anion transport | 6.5e-13 | ||
| GO:0006817 | Phosphate transport | 3.5e-2 | 3.2e-21 | |
| GO:0007165 | Signal transduction | 2.272e-2 | ||
| GO:0006813 | Potassium ion transport | 4.9833e-2 | ||
| GO:0006355 | Regulation of transcription, DNA-dependent | 2.848e-2 | ||
| GO:0006468 | Protein amino acid phosphorylation | 2.8506e-2 |
For the MWU test, P-values that are significant after Holm correction are given in bold.
Molecular Function GO terms enriched with either pPSG, with genes displaying small P-values as given by the MWU test, or with GNR.
| MF GO ID | Molecular Function term | pPSG | MWU test | GNR |
| GO:0005515 | Protein binding | 2.3e-2 | ||
| GO:0030983 | Mismatched DNA binding | 4.7e-3 | ||
| GO:0005149 | Interleukin-1 receptor binding | 3.4e-3 | ||
| GO:0003712 | Transcription cofactor activity | 2.6e-2 | ||
| GO:0017016 | Ras GTPase binding | 9.6e-3 | ||
| GO:0005249 | Voltage-gated potassium channel activity | 4.97e-2 | ||
| GO:0043167 | Ion binding | 4.93e-2 | ||
| GO:0043169 | Cation binding | 4.9e-2 | ||
| GO:0046872 | Metal ion binding | 4.9e-2 | ||
| GO:0046914 | Transition metal ion binding | 8.8e-3 | ||
| GO:0008270 | Zinc ion binding | 6.2e-5 | ||
| GO:0030247 | Polysaccharide binding | 3.4e-3 | ||
| GO:0005540 | Hyaluronic acid binding | 1.7e-3 | ||
| GO:0003676 | Nucleic acid binding | 1.1e-5 | ||
| GO:0003743 | Translation initiation factor activity | 8.6e-5 | ||
| GO:0008227 | G-protein coupled amine receptor activity | 1.3e-3 | ||
| GO:0004969 | Histamine receptor activity | 1.9e-2 | ||
| GO:0003824 | Catalytic activity | |||
| GO:0004016 | Adenylate cyclase activity | 9.7e-3 | ||
| GO:0016859 | Cis-trans isomerase activity | 1.1e-2 | ||
| GO:0004869 | Cytein-type endopeptidase inhibitor activity | 5.9e-4 | ||
| GO:0005199 | Structural constituent of cell wall | 1.1e-2 | 2.4e-11 | |
| GO:0005201 | Extracellular matrix structural constituent | 4.6e-3 | 5.8e-6 |
Cellular Component GO terms enriched with either pPSG, with genes displaying small P-values as given by the MWU test, or with GNR.
| CC GO ID | Cellular Component term | pPSG | MWU test | GNR |
| GO:0005576 | Extracellular region | 1.76e-2 | ||
| GO:0044421 | Extracellular region part | 9.9e-3 | 3.04e-2 | 4.39e-6 |
| GO:0031012 | Extracellular matrix | 9.8e-3 | 1.8e-6 | |
| GO:0005578 | Proteinaceous extracellular matrix | 9.9e-3 | 8.23e-5 | |
| GO:0044420 | Extracellular matrix part | 9.9e-3 | 2.74e-9 | |
| GO:0005581 | Collagen | 9.88e-3 | 8.86e-12 | |
| GO:0016020 | Membrane | |||
| GO:0042611 | MHC protein complex | 6.59e-6 | ||
| GO:0042612 | MHC class I protein complex | 1.45e-3 | ||
| GO:0042613 | MHC class II protein complex | 4.14e-3 | ||
| GO:0034704 | Calcium channel complex | 7.87e-3 | ||
| GO:0005891 | Voltage-gated calcium channel complex | 7.87e-3 | ||
| G0:0005622 | Intracellular | 1.77e-2 | ||
| GO:0030134 | ER to Golgi transport vesicle | 1.07e-2 | ||
| GO:0012507 | ER to Golgi transport vesicle membrane | 1.07e-2 | ||
| GO:0030127 | COPII vesicle coat | 1.07e-2 | ||
| GO:0042579 | Microbody | 4.09e-2 | ||
| GO:0005777 | Peroxisome | 4.09e-2 |
Cellular Component GO terms enriched with PSG in human and mouse.
| GO term ID | Cellular Component GO term | Human | Mouse |
| GO:0044421 | Extracellular region part | 1.69E-07 | 6.27E-08 |
| GO:0005615 | Extracellular space | 2.62E-05 | 6.27E-08 |
| GO:0044464 | Cell part | 2.16E-05 | _ |
| GO:0009986 | Cell surface | 2.22E-06 | 1.97E-11 |
| GO:0009897 | External side of plasma membrane | 1.49E-08 | 1.97E-11 |
| GO:0016020 | Membrane | 1.58E-28 | 4.04E-07 |
| GO:0044425 | Membrane part | 1.35E-35 | 1.52E-07 |
| GO:0031224 | Intrinsic to membrane | 9.10E-41 | 6.27E-08 |
| GO:0016021 | Integral to membrane | 7.28E-41 | 6.27E-08 |
| GO:0005886 | Plasma membrane | 9.10E-41 | 3.28E-06 |
| GO:0044459 | Plasma membrane part | 3.32E-20 | 7.50E-06 |
| GO:0031226 | Intrinsic to plasma membrane | 2.38E-26 | _ |
| GO:0005887 | Integral to plasma membrane | 1.62E-25 | _ |
| GO:0043235 | Receptor complex | 1.19E-05 | 1.37E-04 |
| GO:0042101 | T cell receptor complex | 0.0014 | 6.83E-04 |
| GO:0042105 | Alpha-beta T cell receptor complex | _ | 1.82E-04 |
| GO:0008305 | Integrin complex | _ | 0.0352 |
| GO:0001772 | Immunological synapse | 5.65E-06 | 2.09E-05 |
| GO:0016324 | Apical plasma membrane | 0.0266 | _ |
| GO:0005773 | Vacuole | 1.17E-04 | 3.84E-04 |
| GO:0000323 | Lytic vacuole | 3.17E-05 | 1.56E-04 |
| GO:0005764 | Lysosome | 3.17E-05 | 1.56E-04 |
| GO:0005765 | Lysosomal membrane | _ | 0.015 |
The set of PSG were taken from [11] and analyzed against the human and mouse gene-associated GO database as implemented in GOstat [58].
Summary of main GO terms enriched in PSG that are shared between mammals and puffer fishes, that are specific to mammals or specific to puffer fishes.
| Main GO terms | Shared by mammals and fishes | Missing in fishes | Specific to fishes |
|
| X | ||
| Response to stimulus | X | ||
| Defense response | X | ||
| Response to stress | X | ||
| Immune system process, Immune response | X | ||
| Signal transduction | X | ||
| Regulation of transcription DNA-dependent | X | ||
| Ion transport | X | ||
| Sensory perception | X | ||
| Neurological process | X | ||
| Cytolysis | X | ||
| Single fertilisation | X | ||
| Mismatch repair | X | ||
| Protein phosphorylation | X | ||
|
| |||
| DNA binding | X | ||
| Protein binding | X | ||
| Metal ion binding | X | ||
| Olfactory receptor activity, Rhodopsin-like receptor activity, Taste receptor activity | X | ||
| Protease inhibitor activity | X | ||
| Transferase activity, RNA methyltransferase activity | X | ||
| Nucelotide binding | X | ||
| Serine hydrolase activity | X | ||
| Voltage-gated potassium channel activity | X | ||
| Adenylate cyclase activity | X | ||
| Cis-trans isomeras activity | X | ||
| Structural constituent of cell wall | X | ||
| Extracellular matrix structural constituent | X | ||
|
| |||
| Extracellular region, Extracellular matrix | X | ||
| Membrane, Intrinsic to membrane, MHC protein complex | X | ||
| Nucleus | X | ||
| Cytoplasm | X |
*Found in puffer fishes.
**found in mammals.
***the type of ion may differ between mammals and puffer fishes.
The data for mammals has been taken from [11] and from [10]. Some GO terms were grouped if they were close descendent/parent terms in the GO database hierarchy.