Literature DB >> 25543670

Phylogenetic and structural analysis of the phospholipase A2 gene family in vertebrates.

Qi Huang1, Yuan Wu1, Chao Qin1, Wenwu He2, Xing Wei1.   

Abstract

The phospholipase A (PLA)2 family is the most complex gene family of phospholipases and plays a crucial role in a number of physiological activities. However, the phylogenetic background of the PLA2 gene family and the amino acid residues of the PLA2G7 gene following positive selection gene remain undetermined. In this study, we downloaded 49 genomic data sets of PLA from different species, including the human, house mouse, Norway rat, pig, dog, chicken, cattle, African clawed frog, Sumatran orangutan and the zebrafish species. Phylogenetic relationships were determined using the neighbor-joining (NJ), minimum evolution (ME) and maximum parsimony (MP) methods, as well as the Bayesian information criterion. The results were then presented as phylogenetic trees. Positive selection sites were detected using site, branch and branch‑site models. These methods led us to the following assumptions: i) closer lineages were observed between PLA2G16 and PLA2G6, PLA2G7 and PLA2G4, PLA2G3 and PLA2G12, as well as among PLA2G10, PLA2G5 and PLA2G15; ii) PLA2G5 appeared to be the origin of the PLA2 family, and PLA2G7 was one of the most evolutionarily distant PLA2 proteins; iii) 16 positive-selection sites were detected and were marked in the PLA2G7 protein sequence as 327D, 257Q, 276G, 34s, 66G, 67C, 319S, 28N, 50S, 54T, 58R, 75T, 88Q, 92R, 179H and 191K.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 25543670      PMCID: PMC4314415          DOI: 10.3892/ijmm.2014.2047

Source DB:  PubMed          Journal:  Int J Mol Med        ISSN: 1107-3756            Impact factor:   4.101


Introduction

The phospholipase gene family encodes enzymes that hydrolyze phospholipids into fatty acids and other micromolecules. This gene family is classified into four major classes, namely, phospholipase PLA, PLB, PLC and PLD (1), based on the types of catalytic reaction of phospholipids. The majority of coding enzymes play crucial roles in lipid metabolism (2), cell proliferation (3), muscle contraction (4-6) and in the inflammation process (5). The PLA class includes two subfamilies, namely PLA1 and PLA2. PLA1 cleaves the SN-1 acyl chain and is a major component of snake venom (7,8), whereas PLA2 cleaves the SN-2 acyl chain and releases arachidonic acid, which mediates anti-inflammatory and inflammatory responses (9). The PLA2 gene family is divided into nine groups based on their function: PLA2G3, PLA2G4, PLA2G5, PLA2G6, PLA2G7, PLA2G10, PLA2G12, PLA2G15 and PLA2G16 (10). The coding enzymes of these genes are important in platelet activity (11,12) and B-cell activity (13,14). The dysfunction of one or more of these genes leads to stroke (14) and other neurological diseases (15–17). However, all the functions of these genes have not yet been fully elucidated. A number of studies have focused on the association between the PLA2 gene family and various physiological and pathological conditions. The results of a clinical trial demonstrated that high levels of sPLA2 mass in the circulation are not associated with a high risk of cardiovascular disease (18), which is not consistent with some earlier basal medical studies (11). Apart from potential flaws during study design, the differences in the results obtained may be attributed to genetic alterations reflecting a greater sensitivity to cardiovascular risk. In addition, certain studies have performed phylogenetic analyses (19). However, the available information is still insufficient partly due to methodological limitations and gene data inclusion criteria. In the present study, we analyzed the functions and phylogenetic background of the PLA2 gene family. We aimed to firstly determine the phylogenetic background of the PLA2 gene family in vertebrates using the neighbor-joining (NJ), minimum evolution (ME) and maximum parsimony (MP) methods, as well as the Bayesian information criterion, and secondly, to detect the positive selection sites of the PLA2 gene family to define the structure and biological activity of the gene by site-directed mutagenesis, which may provide possible therapeutic targets. The data presented in this study provide insight into the phylogenetic relationships and functional differentiation of the phospholipase and PLA2 gene families.

Materials and methods

Data collection

We searched and downloaded natural and intact amino acid and gene sequences of the phospholipase and PLA2 families from the NCBI database (http://www.ncbi.nlm.nih.gov/gene/). These sequences included human, house mouse, Norway rat, pig, dog, chicken, cattle, African clawed frog, Sumatran orangutan and zebrafish sequences.

Sequence alignment

We used the EBI web tool, MUSCLE (20), to align the sequences of the phospholipase and PLA2 family proteins. Rearranged gene sequences were generated according to the new amino acid alignment. The results of the amino acid alignment were placed in an aligned CDS fasta file using the EMBL web tool, PAL2NAL (21) (http://www.bork.embl.de/pal2nal/), which can form multiple codon alignments from matching amino acid sequences. The format was converted with the use of MEGA4.0. software (22).

Phylogenetic analysis

The full alignment of sequences was used for the phylogenetic analysis. Akaike Information Criterion in PAUP* version 4.0 (23) was applied to evaluate the most appropriate model of amino acid substitution for early tree-building analyses. ML optimizations and distance methods were valued by the PhyML program in PAUP* version 4.0 (24). The most appreciated evolution type, GTR+I+G, was computed for the PLA2 gene family using Modeltest version 3.7 (25). Phylogenetic trees were reconstructed using the Bayesian method from the DNA alignment with the use of MrBayes version 3.1.2 software (26,27) according to the best-fit predictive model. The parameters for tree generation were as follows: 2×106 generations of the PLA2 gene family were included with sampling every 1,000 generations, and with four chains (three cold, one heated); the first 250,000 generations (250 trees) were discarded from every run for the two families (phospholipase and PLA2). Analyses with the NJ, ME and MP methods were performed using MEGA4.0. software (22).

Estimation of positive selection sites

Selective pressures of HA and NA genes were detected by CODEML in the PAML package version 4.4 (28). Three codon-based likelihood methods were run as branch, site and branch-site models. P<0.05 was used to determine whether or not the alternative hypothesis was significant. In these analyses, ML estimates of the selection pressure were based on the ratio dN/dS (ω), where dN and dS are the non-synonymous and synonymous substitution rates, respectively, which vary across codons; the probability of each codon being under positive selection was estimated. Positive selection sites can occur in very short episodes or on only a few sites during the evolution of duplicated genes when ω >1 (29). All alignments resulted from the PAL2NAL web tool. The parameter estimates (ω) and likelihood scores were calculated for three pairs of models: M0 (one ratio) vs. M3 (discrete); M1a (nearly neutral) vs. M2a (positive selection); and M7 (β) vs. M8 (β + ω). The likelihood ratio test (LRT) was used to compare the fit to the data of two nested models, assuming that twice the log likelihood difference between the two models (2ΔL) follows a χ2 distribution with a number of degrees of freedom equal to the difference in the number of free parameters (30). Naive empirical Bayes and empirical Bayes selection criteria implemented in PAML4 were used to identify sites under positive selection or relaxed purifying selection in the foreground group with significant LRTs. Each branch group was also labeled as a foreground group. The flow of positive selective site detection is presented in Fig. 1.
Figure 1

Flow of positive selective sites detection using hte site model, branch model and branch-site models.

Protein structure analysis and positive selection site marking

The protein sequence liner and 3D structure of PLA2 based on PLA2G7_Homo were created by the online tool, PredictProtein (31) (www.predictprotein. org), and I-TASSER (32–34) (http://zhanglab.ccmb.med.umich.edu/I-TASSER/). Functional areas were marked in Figs. 3 and 4.
Figure 3

Positive selection sites in planar structure. Amino acid residuals in orange background belong to positive selection sites detected using the site model. Amino acid residues in red background belong to positive selection sites detected using the branch model and branch-site model. #, protein kinase C phosphorylation site; &, casein kinase II phosphorylation site; @, serine active site.

Figure 4

Positive selection sites in three dimensional structures. (A) Positive selection sites from site model on α-helix; (B) Positive selection sites from site model on random coil. (C) Positive selection sites from branch model and branch-site model on α-helix; (D) positive selection sites from branch model and branch-site model on β-sheet; (E) positive selection sites from branch model and branch-site model on random coil.

Results

Phylogenetic analysis of PLA2 gene family in vertebrates

A total of 49 sequences from 10 species were used to reconstruct a phylogenetic tree for the PLA2 gene family using the NJ, ME, MP methods, as well as the Bayesian information criterion with bootstrap value detection. The details of the included data are presented in Table I. A total of 25 nodes (56.81% in total) showed bootstrap values ≥95% and 34 nodes (77.27% in total) had bootstrap values ≥80% in the Bayes building tree (Fig. 2D). In each subgroup, mammal data, including data from the Sumatran orangutan, pig, Norway rat, human, house mouse, dog and cattle were gathered. The data from the African clawed frog, chicken and zebrafish were much more original than those from mammals, indicating that the taxonomy of host organisms reflects the phylogenetic background of the PLA2 gene family. The vertebrate PLA2 gene family was sorted into nine lineages according to the type of reaction for catalyzing phospholipids. PLA2G7 seems to be the most distant lineage in this gene family, indicating a large number of structural changes accumulating on them. Furthermore, all the groups were divided into two major clades; clade 1 included PLA2G16, PLA2G6, PLA2G10, PLA2G5 and PLA2G15, whereas clade 2 included PLA2G7, PLA2G4, PLA2G3 and PLA2G12. Closer lineages were observed between PLA2G16 and PLA2G6, PLA2G7 and PLA2G4, PLA2G3 and PLA2G12, well as among PLA2G10, PLA2G5 and PLA2G15. Moreover, the phylogenetic relationships obtained by the NJ, ME and MP methods were different (Fig. 2A–C).
Table I

Data on PLA2 subfamily in vertebrates.

AbbSpeciesCountryYearNCBI-PIDNCBI-GIDChromosomeTaxonomic groups
PLA2G7_XenopusAfrican clawed frogUSA2002NP_001017267.1NM_001017267.2UnAmphibians
PLA2G7_SusPigBelgium2009NP_001106484.1NM_001113013.17Mammals
PLA2G7_RattusNorway ratSweden2012NP_001009353.1NM_001009353.19Mammals
PLA2G7_MusHouse mouseChina2013NP_038765.2NM_013737.517Mammals
PLA2G7_HomoHumanChina2013NP_001161829.1NM_001168357.16Mammals
PLA2G7_GallusChickenUSA1995NP_990300.1NM_204969.13Birds
PLA2G7_DanioZebrafishChina2004NP_998354.1NM_213189.120Fish
PLA2G7_CanisDogUSA1995NP_001003198.1NM_001003198.112Mammals
PLA2G7_BosCattleUSA2009NP_777003.2NM_174578.423Mammals
PLA2G6_XenopusAfrican clawed frogUSA2002NP_001072661.1NM_001079193.1UnAmphibians
PLA2G6_RattusNorway ratGermany2012NP_001005560.1NM_001005560.17Mammals
PLA2G6_MusHouse mouseJapan2013NP_001185954.1NM_001199025.115Mammals
PLA2G6_HomoHumanBrazil2013NP_003551.2NM_003560.222Mammals
PLA2G6_GallusChickenAustria2008NP_001124210.1NM_001130738.11Birds
PLA2G5_RattusNorway ratUK2004NP_058870.1NM_017174.15Mammals
PLA2G5_MusHouse mouseUSA2013NP_001116426.1NM_001122954.14Mammals
PLA2G5_HomoHumanUSA2013NP_000920.1NM_000929.21Mammals
PLA2G5_GallusChicken--NP_001264973.1NM_001278044.121Birds
PLA2G5_BosCattleUSA2009NP_001179981.1NM_001193052.12Mammals
PLA2G4_XenopusAfrican clawed frogUSA2002NP_001080867.1NM_001087398.1UnAmphibians
PLA2G4_RattusNorway ratUSA2011NP_598235.2NM_133551.213Mammals
PLA2G4_PongoSumatran orangutan--NM_001132692.1NP_001126164.11Mammals
PLA2G4_MusHouse mouseJapan2013NP_032895.1NM_008869.31Mammals
PLA2G4_HomoHumanJapan2013NP_077734.1NM_024420.21Mammals
PLA2G4_DanioZebrafishUSA2013NP_571370.1NM_131295.22Fish
PLA2G3_RattusNorway ratSingapore2013NP_001099485.1NM_001106015.114Mammals
PLA2G3_MusHouse mouseJapan2013NP_766379.2NM_172791.211Mammals
PLA2G3_HomoHumanSpain2013NP_056530.2NM_015715.322Mammals
PLA2G3_BosCattleUSA2009NP_001074379.1NM_001080910.117Mammals
PLA2G16_SusPigJapan2007NP_001231443.1NM_001244514.12Mammals
PLA2G16_RattusNorway ratFrance2001NP_058756.2NM_017060.21Mammals
PLA2G16_MusHouse mouseJapan2012NP_644675.2NM_139269.219Mammals
PLA2G16_HomoHumanUSA2012NP_009000.2NM_007069.311Mammals
PLA2G16_BosCattleChina2012NP_001068748.1NM_001075280.229Mammals
PLA2G15_XenopusAfrican clawed frogUSA2002NP_001089365.1NM_001095896.1UnAmphibians
PLA2G15_RattusNorway ratJapan2005NP_001004277.1NM_001004277.219Mammals
PLA2G15_MusHouse mouseUSA2013NP_598553.1NM_133792.28Mammals
PLA2G15_HomoHumanCanada2010NP_036452.1NM_012320.316Mammals
PLA2G15_CanisDogUSA2007NP_001002940.1NM_001002940.15Mammals
PLA2G15_BosCattleUSA2009NP_776985.2NM_174560.218Mammals
PLA2G12_XenopusAfrican clawed frogUSA2003NP_001017096.1NM_001017096.2UnAmphibians
PLA2G12_SusPigJapan2007NP_001230267.1NM_001243338.114Mammals
PLA2G12_RattusNorway ratUSA2002NP_001102035.1NM_001108565.12Mammals
PLA2G12_MusHouse mouseItaly2011NP_075685.2NM_023196.43Mammals
PLA2G12_HomoHumanCanada2010NP_110448.2NM_030821.44Mammals
PLA2G10_RattusNorway ratFrance1999NP_058872.1NM_017176.210Mammals
PLA2G10_MusHouse mouseJapan2013NP_036117.1NM_011987.216Mammals
PLA2G10_HomoHumanIran2013NP_003552.1NM_003561.116Mammals
PLA2G10_GallusChicken--NP_001171686.1NM_001184757.114Birds

Abb, abbreviation; NCBI-PID, protein ID in NCBI; NCBI-GID, gene ID in NCBI.

Figure 2

Phylogenetic trees of the PLA2 gene in vertebrates. (A) Phylogenetic tree produced using the NJ method; (B) phylogenetic tree produced using the ME method; (C) phylogenetic tree produced using the MP method. Genes with a crimson disc belong to the PLA2G16 group; genes with an orange disc belong to the PLA2G6 group; genes with a grey disc belong to the PLA2G10 group; genes with a dark blue disc belong to the PLA2G10 group; genes with a yellow disc belong to the PLA2G15 group; genes with a red disc belong to the PLA2G7 group; genes with a green disc belong to the PLA2G4 group; genes with a purple disc belong to the PLA2G3 group; genes with a light blue disc belong to the PLA2G12 group. (D) Phylogenetic tree of the PLA2 gene in vertebrates produced using the Bayesian method. Genes with a crimson branch belong to the PLA2G16 group; genes with an orange branch belong to the PLA2G6 group; genes with a grey branch belong to the PLA2G10 group; genes with a dark blue branch belong to the PLA2G10 group; genes with a yellow branch belong to the PLA2G15 group; genes with a red branch belong to the PLA2G7 group; genes with a green branch belong to the PLA2G4 group; genes with a purple branch belong to the PLA2G3 group; genes with a light blue branch belong to the PLA2G12 group; genes in pink belong to mammals; genes in dark yellow belong to birds; genes in dark green belong to amphibians; genes in sky blue belong to fish. NJ, neighbor-joining; ME, minimum evolution; MP, maximum parsimony.

Analysis of positive selection sites of the PLA2 gene family in vertebrates

Positive selection sites were also computed under site, branch and branch-site models for the PLA2 gene family. During site model computing, only P<0.05 and ω>1 indicated the presence of possible positive selection sites. As a result, only M7/M8 met the criteria (P=0.00000, w=2.52322) and showed the following eight positive selection sites: 28N, 34K, 50S, 54T, 58R, 75T, 88Q and 92R (all positive selection sites mentioned in this manuscript refer to amino acids of PLA2G7) (Table II).
Table II

Parameter estimates and likelihood scores of PLA2 for site models in PAML.

ModelnpEstimates of parameterslnLLRT pairsdf2ΔlnLp-valuePositively selected sites BEB (%)
M0:one ratio94ω=0.16513−41914.67M0/M34366.4 5.04128E-78None
M3:discrete98p0=0.18381, p1=0.61322, p2=0.20297, ω0=0.06840ω1=0.14537,ω2=0.42278−41731.47
M1a:neutral95p0=0.92349, p1=0.07651,ω0=0.15503, ω1=1.00000−41830.79M1a/M2a201.0000033M (55.5), 34K (52.8)
M2a:selection97p0=0.92349, p1=0.03595, p2=0.04056, ω0=0.15503, ω1=1.00000, ω2=1.00000−41830.79
M7:β95p=2.25732, q=10.02539−45800.80M7/M828103.860.0000028N (65.8), 34K (53.7), 50S (60.4), 54T (64.6), 58R (64.9), 75T (70.0), 88Q (77.2), 92R (50.1)
M8:β and ω97p0=0.99999, p=0.17653, q=1.31411, p1=0.00001, ω=2.52322−41748.87

Selection analysis by site models was performed using CODEML implemented in PAML. np, number of free parameters; lnL, loglikelihood; LRT, likelihood ratio test; df, degrees of freedom; 2ΔlnL, twice the log-likelihood difference of the models compared; BEB, Bayes empirical Bayes approach. p-values with bold font indicate statistical significance.

Additional calculations were performed to confirm and supplement the results. The branch model was used for positive branch selection. The free-ratio model was significantly higher than the one-ratio model (2ΔlnL=694.2, p=1.306E-93, df=185), indicating heterogeneous selection among branches. Two-ratio models were used using the selected 12 branches; the results revealed that two models (Td and Tf) were significantly different (Pd=3.978E-08, Pf=0.017) at ω>1. Subsequently, branch-site models were used to search for amino acid sites that underwent positive selection in the statistically significant foreground branches Td and Tf (Table III).
Table III

Parameter estimates and likelihood scores of PLA2 for site models in PAML parameter estimates and likelihood scores of PLA2 for branch models in PAML.

ModelnpEstimates of parameterslnLLRT pairsdf2ΔlnLp-value
Fr:free ratios185−41567.57M0/Fr91694.21.306E-93
Tx:two ratios
Ta95ω0=0.1649, ωa=3.6122−41913.33M0/Ta12.680.102
Tb95ω0=0.1645, ωb=49.9559−41914.34M0/Tb10.660.417
Tc95ω0=0.1651, ωc=21.4714−41917.97M0/Tc16.60.010
Td95ω0=0.1594, ωd=37.8875−41899.59M0/Td130.163.978E-08
Te95ω0=0.1652, ωe=59.5984−41914.39M0/Te10.560.454
Tf95ω0=0.1666, ωf=87.3449−41911.84M0/Tf15.660.017
Tg95ω0=0.1656, ωg=51.0797−41914.47M0/Tg10.40.527
Th95ω0=0.1650, ωh=34.3683−41914.62M0/Th10.10.752
Ti95ω0=0.1653, ωi=49.6517−41914.51M0/Ti10.30.584
Tj95ω0=0.1650, ωj=37.3306−41914.52M0/Tj10.30.584
Tk95ω0=0.1650, ωk=21.5084−41914.66M0/Tk10.020.888
Tl95ω0=0.1655, ωl=24.4651−41914.55M0/Tl10.240.624

Selection analysis by branch models was performed using CODEML implemented in PAML. np, number of free parameters; lnL, loglikelihood; LRT, likelihood ratio test; df, degrees of freedom; 2ΔlnL, twice the log-likelihood difference of the models compared. p-values with bold font indicate statistical significance.

Calculation parameters were set as model=2 and Nsite=2 in PAML package version 4.4. The H1 vs. H0 models of the two branches were differed significantly. Eight amino acid sites were found in branch df: 276G, 191K, 327D, 319S, 66G, 67C, 179G and 257Q (Table IV).
Table IV

Parameter estimates and likelihood scores of PLA2 for branch-site models in PAML.

ModelnpEstimates of parameterslnLLRT pairsdf2ΔlnLp-valuePositively selected sites BEB (%)
BSa197p0=0.78310, p1=0.05618, p2a=0.14996, p2b=0.01076, ω0=0.15077, ω1=1.00000, b:ω2a=0.15077, ω2b=1.00000, f:ω2a=999.00000, ω2b=999.00000−41811.53BSa1/BSa0-fix117.622.70E-05276G (61.7)191K (58.8)327D (58.7)319S (58.2)
BSa0-fix96p0=0.64515, p1=0.04724, p2a=0.28662, p2b=0.02099ω0=0.15015, ω1=1.00000, b:ω2a=0.15015, ω2b=0.15015, f:ω2a=1.00000, ω2b=1.00000−41820.34
BSb197p0=0.89293, p1=0.07409, p2a=0.03045, p2b=0.00253, w0=0.15484, ω1=1.00000, b:ω2a=0.15484, ω2b=1.00000, f:ω2a=111.98807, ω2b=111.98807−41828.90BSb1/BSb0-fix13.346.76E-0266G (88.2)67C (89.7)179H (75.9)257Q (58.0)
BSb0-fix96p0=0.89547, p1=0.07423, p2a=0.02798, p2b=0.00232, w0=0.15479, ω1=1.00000, b:ω2a=0.15479, ω2b=1.00000, f:ω2a=1.00000, ω2b=1.00000−41830.57

Selection analysis by branch-site models was performed using codeml implemented in PAML. BS, branch-site; np, number of free parameters; lnL, loglikelihood; LRT, likelihood ratio test; df, degrees of freedom; 2ΔlnL, twice the log-likelihood difference of the models compared. BEB, Bayes empirical Bayes approach.

Using I-TASSER (32–34) (http://zhanglab.ccmb.med.umich.edu/I-TASSER/), four positive selection sites, 327D, 257Q, 276G and 34s, were located in α-helix; three positive selection sites, 66G, 67C and 319S, were located in β-sheet; and nine positive selection sites, 28N, 50S, 54T, 58R, 75T, 88Q, 92R, 179H and 191K, were located in random coil. All details of the positive selection sites are presented in Table V. A planar structure of all positive selection sites is presented in Fig. 3. Positive selection sites, which were detected by site models are three-dimensionally presented in Fig. 4A and B. Positive selection sites, which were detected by branch and branch-site models, are three-dimensionally presented in Fig. 4C–E.
Table V

Positive selection sites by site model, branch model and site-branch model.

LocationAmino acidSecondary struscturePosterior probability (%)Model
28AsnRandom coil65.8Site model
34Lysα-helix53.7Site model
50SerRandom coil60.4Site model
54ThrRandom coil64.6Site model
58ArgRandom coil64.9Site model
66Glyβ-pleated sheet88.2Branch model and branch-site model
67Cysβ-pleated sheet89.7Branch model and branch-site model
75ThrRandom coil70.0Site model
88GlnRandom coil77.2Site model
92ArgRandom coil50.1Site model
179HisRandom coil75.9Branch model and branch-site model
191LysRandom coil58.8Branch model and branch-site model
257Glnα-helix58.0Branch model and branch-site model
276Glyα-helix61.7Branch model and branch-site model
319Serβ-pleated sheet58.2Branch model and branch-site model
327Asnα-helix58.7Branch model and branch-site model

All positive-selection sites were marked on AA sequence of PLA2G7_Homo.

Distribution of positive selection sites

The functional areas on the AA sequence of PLA2G7_Homo were predicted by PredictProtein. The positive selection site 276G was located in the serine active site, 75T was located in the protein kinase C phosphorylation site and 191K was located near the casein kinase II phosphorylation site.

Discussion

Available natural and complete sequences of the phospholipase gene family in humans and the PLA2 gene family of vertebrates from the NCBI database were included in the present study. The phospholipase and PLA2 gene families showed different phylogenetic backgrounds and relationships according to the method used for determination (the NJ, ME, MP methods and the Bayesian information criterion). This difference may be attributed to the weakness of these methods. The NJ method focuses on one final topology with branch length estimates, and the observed differences between sequences are inaccurate reflections of the evolutionary distances (35). The construction of an ME tree is time consuming, and examining all topologies is difficult (36). The MP method lacks statistical consistency and does not guarantee the production of a true tree with high probability, given sufficient data (37). Bayesian analysis, which is widely accepted as the most valuable method in phylogenetic analysis and the estimation of positive selection sites, was also employed (26). The PLA2 family is the most complex gene family of phospholipases (38,39). The majority of PLA2 genes encode secreted enzymes with physiological features involved in catalyzing platelet activity (40), controlling lipid metabolism (2) and mediating inflammations (5). The dysfunction of these genes may lead to stroke (14). The PLA2G7 coding enzyme, Lp-PLA2, has attracted considerable attention due to its crucial function in platelet gathering in cardiovascular and cerebrovascular diseases (11). Lp-PLA2 is a new biological marker for detecting vasculitis (41). Unlike multiple clinical trials and diagnostic estimations of Lp-PLA2 mass and activity in the circulation (18), data on the phylogenetic background of the PLA2 gene family and the positive selection of amino acid residues on PLA2G7 genes are limited (42–44). According to the PLA2 phylogenetic tree built using the Bayesian information criterion, PLA2G7 is one of the most evolutionarily distant members of PLA2 proteins, an indication of a fast-evolving lineage with numerous structural changes. Moreover, lineage-specific expansion and divergence events were not observed from low-order to high-order vertebrates. The first duplication of the PLA2G7 group led to the emergence of lineages in the Norway rat and the house mouse, and the residual mammals shared duplication with birds, fish and amphibians. Thus, at least two duplications are present in mammals. Moreover, the PLA2G4 family presented the closest lineage to the PLA2G7 family, indicating that PLA2G4 may be another gene that mediates platelet gathering. In the present study, we identified specific amino acid residues of PLA2G7, which are targets of positive selection. According to the site model result, eight positive selection sites, 28N, 34K, 50S, 54T, 58R, 75T, 88Q and 92R, were found, and eight amino acid sites, 276G, 191K, 327D, 319S, 66G, 67C, 179G and 257Q, were found by the branch and branch-site models. No identical positive selection sites were found among the site, branch and branch-site models. Functional structure, the protein kinase C phosphorylation site (45), the casein kinase II phosphorylation site (46) and the serine active site (47) were widely scattered along the PLA2G7 peptide chain. The serine active site is a conserved region centered on a serine residue and has the function of catalyzing fatty acid transfer between phosphatidylcholine and cholesterol. According to the Bayesian analysis, a positive selection site, 276G, was located on serine active region, indicating its similar function. It has been previoulsy demonstrated that Lp-PLA2 mediates atherosclerosis by promoting platelet gathering and adherence to vessels (48) and a previous study (49) suggests that, apart from promoting platelet gathering, Lp-PLA2 may also alter cholesterol metabolism in atherosclerosis. Protein kinase C can modify the function of a protein by increasing or decreasing the protein's activity, stabilizing it or marking it for destruction. The positive selection site, 75T, located on the protein kinase C phosphorylation site, indicated its function on altering Lp-PLA2 activity. Casein kinase II is a protein kinase that phosphorylates many different proteins and is relevant to changes in macrophage gene expression during atherosclerosis (50). We found that 191K was located near the casein kinase II phosphorylation site, indicating that Lp-PLA2 may also increase macrophage gene expression in atherosclerosis. However, further validation of such sites is required in order to obtain richer experimental data. In conclusion, the PLA2 gene family is the most complex gene family among the phospholipases. A number of studies, including clinical trials have focused on the diagnostic estimation of the mass and activity of PLA2 coding enzymes in the circulation (18,51,52); however, phylogenetic analyses of the PLA2 gene family and positive selection amino acid residues on PLA2G7 genes are limited. The present study focused on the phospholipase and PLA2 gene families employing phylogenetic analysis using the NJ, ME and MP methods, as well as the Bayesian information criterion. Positive selection sites were detected for the PLA2 family using site, branch and branch-site models. A total of 49 sequences from 10 different species were selected for the analysis. Phylogenetic analysis of the PLA2 gene family in vertebrates suggests that PLA2G5 is the origin of this gene family, and that PLA2G7 is one of the most evolutionarily distant PLA2 proteins. Eight positive selection sites were detected using the site model, whereas eight positive selection sites were detected using the branch and branch-site models.
  52 in total

1.  MRBAYES: Bayesian inference of phylogenetic trees.

Authors:  J P Huelsenbeck; F Ronquist
Journal:  Bioinformatics       Date:  2001-08       Impact factor: 6.937

2.  Lipoprotein-associated phospholipase A2 and risk of dementia in the Cardiovascular Health Study.

Authors:  Annette L Fitzpatrick; Michael C Irizarry; Mary Cushman; Nancy S Jenny; Gloria C Chi; Carol Koro
Journal:  Atherosclerosis       Date:  2014-05-22       Impact factor: 5.162

3.  The method of minimum evolution.

Authors:  E A Thompson
Journal:  Ann Hum Genet       Date:  1973-01       Impact factor: 1.670

4.  A comparison of the effects of phospholipase C and tetrodotoxin on spike generation in muscle.

Authors:  P Johansson; S Thesleff
Journal:  Eur J Pharmacol       Date:  1968-10       Impact factor: 4.432

5.  The phylogeny of human globin genes investigated by the maximum parsimony method.

Authors:  M Goodman; G W Moore; J Barnabas; G Matsuda
Journal:  J Mol Evol       Date:  1974-02-28       Impact factor: 2.395

6.  Lipoprotein phospholipase A2 mass and activity are not associated with the diagnosis of acute brain ischemia.

Authors:  Waimei Tai; Madelleine Garcia; Michael Mlynash; Stephanie Kemp; Gregory W Albers; Jean-Marc Olivot
Journal:  Cerebrovasc Dis       Date:  2014-11-21       Impact factor: 2.762

7.  Lack of evidence for associations between plasma platelet-activating factor acetylhydrolase deficiency and schizophrenia.

Authors:  Tsuyuka Ohtsuki; Hideki Watanabe; Michio Toru; Tadao Arinami
Journal:  Psychiatry Res       Date:  2002-01-31       Impact factor: 3.222

8.  [Lipoprotein-associated phospholipase A2 serum levels in patients from different categories of cardiovascular risk].

Authors:  D N Nozadze; I V Sergienko; T V Balakhonova; A E Semenova; T N Vlasik; V V Kukharchuk
Journal:  Kardiologiia       Date:  2014       Impact factor: 0.395

9.  I-TASSER server for protein 3D structure prediction.

Authors:  Yang Zhang
Journal:  BMC Bioinformatics       Date:  2008-01-23       Impact factor: 3.169

10.  Evaluation of lipoprotein-associated phospholipase A2 in healthy Chinese Han adult serum.

Authors:  Li-Min Feng; Guo-Fang Feng; Yu Chen
Journal:  Lipids Health Dis       Date:  2014-01-07       Impact factor: 3.876

View more
  6 in total

1.  The evolution of S100A7: an unusual gene expansion in Myotis bats.

Authors:  Ana Águeda-Pinto; L Filipe C Castro; Pedro J Esteves
Journal:  BMC Evol Biol       Date:  2019-05-14       Impact factor: 3.260

2.  Mechanism of tanshinones and phenolic acids from Danshen in the treatment of coronary heart disease based on co-expression network.

Authors:  Dongxue Wu; Mengqi Huo; Xi Chen; Yanling Zhang; Yanjiang Qiao
Journal:  BMC Complement Med Ther       Date:  2020-02-03

3.  Transversal gene expression panel to evaluate intestinal health in broiler chickens in different challenging conditions.

Authors:  L Criado-Mesas; N Abdelli; A Noce; M Farré; J F Pérez; D Solà-Oriol; R Martin-Venegas; A Forouzandeh; F González-Solé; J M Folch
Journal:  Sci Rep       Date:  2021-03-18       Impact factor: 4.379

4.  CD64 and Group II Secretory Phospholipase A2 (sPLA2-IIA) as Biomarkers for Distinguishing Adult Sepsis and Bacterial Infections in the Emergency Department.

Authors:  Toh Leong Tan; Nurul Saadah Ahmad; Dian Nasriana Nasuruddin; Azlin Ithnin; Khaizurin Tajul Arifin; Ida Zarina Zaini; Wan Zurinah Wan Ngah
Journal:  PLoS One       Date:  2016-03-22       Impact factor: 3.240

Review 5.  The role of group IIA secretory phospholipase A2 (sPLA2-IIA) as a biomarker for the diagnosis of sepsis and bacterial infection in adults-A systematic review.

Authors:  Toh Leong Tan; Yew Yip Goh
Journal:  PLoS One       Date:  2017-07-03       Impact factor: 3.240

6.  Biochemical characterization and cytotoxic effect of the skin secretion from the red-spotted Argentina frog Argenteohyla siemersi (Anura: Hylidae).

Authors:  Luciano S Fusco; Rodrigo Cajade; Jose M Piñeiro; Ana M Torres; Igor R F da Silva; Stephen Hyslop; Laura C Leiva; Daniel C Pimenta; Soledad Bustillo
Journal:  J Venom Anim Toxins Incl Trop Dis       Date:  2020-03-30
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.