| Literature DB >> 21789237 |
Márton Miskei1, Csaba Ádám, László Kovács, Zsolt Karányi, Viktor Dombrádi.
Abstract
Phosphoprotein phosphatases (PPP), these ancient and important regulatory enzymes are present in all eukaryotic organisms. Based on the genome sequences of 12 Drosophila species we traced the evolution of the PPP catalytic subunits and noted a substantial expansion of the gene family. We concluded that the 18-22 PPP genes of Drosophilidae were generated from a core set of 8 indispensable phosphatases that are present in most of the insects. Retropositons followed by tandem gene duplications extended the phosphatase repertoire, and sporadic gene losses contributed to the species specific variations in the PPP complement. During the course of these studies we identified 5, up till now uncharacterized phosphatase retrogenes: PpY+, PpD5+, PpD6+, Pp4+, and Pp6+ which are found only in some ancient Drosophila. We demonstrated that all of these new PPP genes exhibit a distinct male specific expression. In addition to the changes in gene numbers, the intron-exon structure and the chromosomal localization of several PPP genes was also altered during evolution. The G-C content of the coding regions decreased when a gene moved into the heterochromatic region of chromosome Y. Thus the PPP enzymes exemplify the various types of dynamic rearrangements that accompany the molecular evolution of a gene family in Drosophilidae.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21789237 PMCID: PMC3137614 DOI: 10.1371/journal.pone.0022218
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Phosphoprotein phosphatase (PPP) catalytic subunits of D. melanogaster.
(A) Phylogenetic relationship of the amino acid sequences of 19 phosphatases. The bar indicates 0.2 amino acid substitutions per site, * labels male specific expression. (B) Chromosomal localization of the PPP genes. Chromosomes X, 2, 3, and 4 are depicted as solid bars with the numbering of the first and the last divisions. Chromosome Y is represented by an unproportional broken bar. The capital letters above the chromosomes label the chromosome arms termed the Muller elements. Although chromosome numbers vary in different Drosophila species the Muller elements are common to all species and can be conveniently used for the comparison of a gene's global location. The directions of the ORFs are indicated by arrows (left to right corresponds to 5′ to 3′ in the upper strand). The genes that have no introns in their coding regions are boxed.
Localization of PPP genes in Muller elements of 12 Drosophila species.
| Classification | Gene name, synonym | Dmel | Dsim | Dsec | Dyak | Dere | Dana | Dpse | Dper | Dwil | Dmoj | Dvir | Dgri |
| Type 1or PPP1 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
| |||||||||||
| Calcineurinor Pp2Bor PPP3 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
| Type 2or PPP2-4-6 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
| |||||||||||
|
|
|
|
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
| ||||||||||
|
|
|
|
|
|
|
| |||||||
| PPP5 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| PPP7 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| PPP | Total number |
|
|
|
|
|
|
|
|
|
|
|
|
According to Cohen [65].
The gene names are given according to Flybase (http://flybase.org/) and are followed by frequently used synonyms. The names of the genes that constitute the minimal PPP toolkit are written in bold face.
Species names are abbreviated as follows: Dmel: D. melanogaster, Dsim: D. simulans, Dsec: D. sechellia, Dyak: D. yakuba, Dere: D. erecta, Dana: D. ananassae, Dpse: D. pseudoobscura, Dper: D. persimilis, Dwil: D. willistoni, Dmoj: D. mojevensis, Dvir: D. virilis, Dgri: D. grimshawi.
The correct chromosomal localization of a gene is given in parentheses if different from the position inferred from the gene name.
These gene sequences were
identified,
revised, or
confirmed by PCR and DNA sequencing.
The gene size was confirmed by PCR.
The gene localization was corrected on the bases of genome environment.
x stands for a lost/missing gene.
The localization of intronless genes is highlighted in underlined italics.
Identification of PPP genes in sequenced insect genomes.
| Gene, Class |
|
|
|
|
|
|
|
|
| Abbreviations |
|
|
|
|
|
|
|
|
|
| XP_001843526 | XP_001663366 | XP_312797 | XP_966417 | XP_623273 | XP_001604472 | XP_001944422 | EEB19394 |
|
| XP_001849462 | XP_001653770 | XP_309483 | XP_001813974 | XM_392943 | XP_001602738 | XM_001945867 | |
|
| XP_001863301 | XP_001663281 | XP_319345 | XP_973546 | XP_623105 | XP_001602506 | NP_001119644 | EEB13988 |
|
| XP_001843269 | XP_001648308 | XP_310323 | XR_043119 | XM_624666 | XP_001606225 | XP_001950610 | EEB16959 |
|
| XP_001866724 | XP_001648846 | XP_311859 | XP_967314 | XP_394400 | XP_001603727 | XP_001951846 | EEB14281 |
|
| XP_968705 | XP_394519 | XP_001602102 (A) | XP_001945831 | EEB19897 | |||
|
|
|
|
| |||||
|
| XP_001850926 | XP_001650298 | XP_313034 | XP_971407 | XP_624242 | XP_001603324 | XP_001948640 | EEB13025 |
|
| XP_001844975 | XP_001663541 | XP_317894 | EEB16582 | ||||
|
|
|
|
|
|
|
|
|
|
Genes are termed as the D. melanogaster orthologs, PPP classification is given according to Cohen [65].
The revised protein sequence can be found at http://www.medchem.dote.hu/hu/node/1127.
The two Nvit_CanA1 sequences have been arbitrarily differentiated as A and B.
PPP genes that contain no introns in their coding regions are underlined. The two types of PPP3 are separated by a broken line.
Figure 2Phylogeny of PPP catalytic subunits in insects.
The phylogeny of 287 insect PPP catalytic subunits is presented. Enzyme subfamilies are labeled in line with the D. melanogaster nomenclature (see ). The color codes for the non-Drosophila species are given in the lower right corner, species abbreviations are listed in . The scale indicates 0.2 amino acid substitutions per site in the primary structure. Uncertain branching points are circled and labeled with the appropriate bootstrap value. The following 8 short, partial polypeptide sequences were excluded from the comparisons: Dsim flw, Dana Pp1-Y1, Dsec Pp1-Y2, Dwil PpN58A, Dyak Pp2B-14D, Dvir Pp2B-14D, Dsim CanA-14F, Dsim rdgC.
Figure 3Phylogeny of PPP catalytic subunits in Drosophilidae.
The phylogenetic analysis of 227 Drosophila PPP catalytic subunits is presented as in . The color of the branches indicates the chromosomal localization of the corresponding genes, the color code of the Muller elements is given in the lower right corner. The names of the 5 novel PPP enzymes that were identified in the present study are labeled by a+suffix and are surrounded by a heavy border. The scale indicates 0.1 amino acid substitutions per site in the primary structure.
Figure 4Gain and loss of PPP genes in Drosophilidae.
The basic PPP set of insects is in a black box. Gained genes are in blue and lost genes are in red boxes. The broken line indicates that the Pp2B-14D gene was gained in the Diptera. The family tree of the 12 Drosophila species with a mutational clock is presented in .
Expression of the new PPP genes in Drosophila imagos.
| Gene name | Dmel | Dana | Dpse | Dwil | Dvir | |||||
| Sex | M | F | M | F | M | F | M | F | M | F |
|
|
|
|
|
|
|
|
|
| ||
|
|
|
|
|
| ||||||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| ||||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| ||||||||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| ||||||||
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| ||||||||
|
|
|
|
|
|
|
|
|
|
|
|
The gene names are as in Table 1. New PPP genes are in bold.
Sexes are denoted as F: female, and M: male.
Adam et al. [8].
Empty fields indicate missing/unidentified genes.
Figure 5Rearrangements of the PpD6, PpD6+, Pp1- Y1, Pp1-Y2 gene cluster.
(A) Chromosomal localizations of PpD6 (D6), PpD6+ (D6+), Pp1- Y1 (Y1), and Pp1-Y2 (Y2) genes. Chromosome segments are symbolized by horizontal bars, the coloring identifies Muller elements C, B and chromosome Y in blue, pink, and black, respectively. The directions of the coding regions are indicated by arrows, pointing from 5′ to 3′ in the sense strand. In chromosome Y the positions and orientations of inactive viral reverse transcriptase (RT), pol and gag genes are also shown for orientation; here the arrows represent only the relative directions within a short DNA segment. (B) The G−C content in the coding regions of the PpD6, PpD6+, Pp1- Y, and Pp1-Y2 genes. The upper part of the panel shows the average G−C content + SD for all PPP genes of a given species, while the lower part depicts the G−C content of the four selected phosphatases.