
Mammalian Glutamyl Aminopeptidase Genes (ENPEP) and Proteins: Comparative Studies of a Major Contributor to Arterial Hypertension.

Roger S Holmes1,2, Kimberly D Spradling-Reeves1, Laura A Cox1.   

Abstract

Glutamyl aminopeptidase (ENPEP) is a member of the M1 family of endopeptidases which are mammalian type II integral membrane zinc-containing endopeptidases. ENPEP is involved in the catabolic pathway of the renin-angiotensin system forming angiotensin III, which participates in blood pressure regulation and blood vessel formation. Comparative ENPEP amino acid sequences and structures and ENPEP gene locations were examined using data from several mammalian genome projects. Mammalian ENPEP sequences shared 71-98% identities. Five N-glycosylation sites were conserved for all mammalian ENPEP proteins examined, although the total number of sites observed ranged from 9 to 18 depending on the species. Sequence alignments, key amino acid residues and predicted secondary and tertiary structures were also studied, including transmembrane and cytoplasmic sequences and active site residues. Highest levels of human ENPEP expression were observed in the terminal ileum of the small intestine and in the kidney cortex. Mammalian ENPEP genes contained 20 coding exons. The human ENPEP gene promoter and first coding exon contained a CpG island (CpG27) and at least 6 transcription factor binding sites, whereas the 3'-UTR region contained 7 miRNA target sites, which may contribute to the regulation of ENPEP gene expression in tissues of the body. Phylogenetic analyses examined the relationships of mammalian ENPEP genes and proteins, including primate, other eutherian, marsupial and monotreme sources, using chicken ENPEP as a primordial sequence for comparative purposes.


Keywords:  Amino acid sequence; Aminopeptidase A; Arterial hypertension; ENPEP; Evolution; Glutamyl aminopeptidase; Mammals; Peptidase M1 family; Zinc metallopeptidase

Abbreviations:  ENPEP: Glutamyl Aminopeptidase; BLAST: Basic Local Alignment Search Tool; BLAT: Blast-Like Alignment Tool; CpG island: Multiple C (cytosine)-G (guanine) Dinucleotide Region; NCBI: National Center for Biotechnology Information; QTL: Quantitative Trait Locus; RAS: Renin-Angiotensin System; SWISS-MODEL: Automated Protein Structure Homology-modeling Server; kbps: Kilobase Pairs; miRNA: microRNA Binding Region

Year:  2017        PMID: 29900035      PMCID: PMC5995572          DOI: 10.4172/2153-0602.1000211

Source DB:  PubMed          Journal:  J Data Mining Genomics Proteomics


Introduction

Glutamyl aminopeptidase (ENPEP; EC 3.4.11.7; aminopeptidase A [AMPE or APA]; differentiation antigen gp160; or CD249 antigen) is one of at least 12 members of the M1 family of endopeptidases which are zinc-containing single-pass type II transmembrane enzymes [1-6]. ENPEP is involved in the catabolic pathway of the renin-angiotensin system (RAS) forming angiotensin III, which participates in blood pressure regulation and blood vessel formation, and may contribute to risk of atrial fibrillation, angiogenesis, hypertension and tumorigenesis [7-14]. The gene encoding ENPEP (ENPEP in humans and most mammals; Enpep in rodents) is expressed at high levels in the epithelial cells of the kidney glomerulus and proximal tubule cells. ENPEP participates in the renin-angiotensin system by converting the biologically active angiotensin II (Ang II) to angiotensin III (Ang III) through hydrolysis of the N-terminal aspartate (or glutamate), thereby removing biological activity of the Ang peptides [15,16]. In studies of blood pressure control in hypertensive rats, ENPEP is expressed in brain nuclei where ENPEP activity generates angiotensin III, one of the major effector peptides of the brain renin-angiotensin system, causing a stimulatory effect on systemic blood pressure [7,17]. Genome-wide association studies have examined blood pressure variation and atrial fibrillation risk in human populations and identified an association with ENPEP variants [9,12,13,18]. In addition, studies of Enpep−/Enpep− knockout mice have shown that ischemia-induced angiogenesis is impaired in these mice, as a result of decreased growth factor secretion and capillary vessel formation [8]. Other studies that treated hypertension in animal models with inhibitors of ENPEP activity have also supported a direct link between ENPEP and arterial hypertension [19].
Biochemical and predictive structural studies of mammalian ENPEP proteins have shown that they comprise three major domains (human ENPEP numbers quoted): an N-terminus cytoplasmic sequence (residues 1-18); a transmembrane helical sequence (residues 19-39), the signal anchor for the type II membrane protein; and an extracellular domain (residues 40-957) [1,3]. A three-dimensional protein structure has been reported for the extracellular zinc-containing endopeptidase ENPEP domain and its complexes with different ligands, which identified a calcium-binding site in the S1 pocket of ENPEP [11]. In addition, inhibitor docking studies have identified specific amino acid residues (Asp213, Asp218 and Glu215) involved in enzyme catalysis, and Thr348 as performing a key role in determining substrate and inhibitor specificity for this enzyme [20]. This paper reports the predicted gene structures and amino acid sequences for several mammalian ENPEP genes and proteins, the predicted structures for mammalian ENPEP proteins, a number of potential sites for regulating human ENPEP gene expression and the structural, phylogenetic and evolutionary relationships of these mammalian ENPEP genes and proteins.
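The three-domain layout quoted above for human ENPEP can be expressed as a simple residue-to-domain lookup. The following is a minimal sketch only: the boundaries are those quoted in the text, while the names `HUMAN_ENPEP_DOMAINS` and `domain_of` are hypothetical and chosen for illustration.

```python
# Domain boundaries for human ENPEP as quoted in the Introduction
# (1-based, inclusive residue numbers). Names are illustrative only.
HUMAN_ENPEP_DOMAINS = [
    ("cytoplasmic", 1, 18),
    ("transmembrane", 19, 39),
    ("extracellular", 40, 957),
]


def domain_of(residue: int) -> str:
    """Return the name of the domain containing a 1-based residue number."""
    for name, start, end in HUMAN_ENPEP_DOMAINS:
        if start <= residue <= end:
            return name
    raise ValueError(f"residue {residue} is outside the range 1-957")


# Residues mentioned in the text: Cys13, Val26, Asp213 and Thr348.
for res in (13, 26, 213, 348):
    print(res, domain_of(res))
```

Under these boundaries, the catalytic and specificity residues named above (Asp213, Glu215, Asp218, Thr348) all fall in the extracellular domain.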

Methods

Mammalian ENPEP gene and protein identification

BLAST studies were undertaken using web tools from NCBI (http://www.ncbi.nlm.nih.gov/) [21,22]. Protein BLAST analyses used mammalian ENPEP amino acid sequences previously described (Table 1) [1,3,6]. Non-redundant protein and nucleotide sequence databases for several mammalian genomes were examined, including human (Homo sapiens), chimpanzee (Pan troglodytes), gorilla (Gorilla gorilla), orang-utan (Pongo abelii), colobus (Colobus angolensis), mangabey (Cercocebus atys), rhesus (Macaca mulatta), baboon (Papio anubis), snub-nosed monkey (Rhinopithecus roxellana), squirrel monkey (Saimiri boliviensis), marmoset (Callithrix jacchus), mouse lemur (Microcebus murinus), cow (Bos taurus), sheep (Ovis aries), water buffalo (Bubalus bubalis), bison (Bison bison), goat (Capra hircus), chiru (Pantholops hodgsonii), camel (Camelus ferus), alpaca (Vicugna pacos), mouse (Mus musculus), rat (Rattus norvegicus), guinea pig (Cavia porcellus), horse (Equus caballus), pig (Sus scrofa), rabbit (Oryctolagus cuniculus), dog (Canis familiaris), cat (Felis catus), dolphin (Tursiops truncatus), killer whale (Orcinus orca) and opossum (Monodelphis domestica). This procedure produced multiple BLAST ‘hits’ for each of the protein and nucleotide databases, which were individually examined and retained in FASTA format.
Table 1

Mammalian and chicken ENPEP genes and proteins.

ENPEP Gene | Species | Chromosome location | Exons# (strand) | Gene Size bps | GenBank ID* | UNIPROT ID | Amino acids | Subunit MW (pI)
--- | --- | --- | --- | --- | --- | --- | --- | ---
Human | Homo sapiens | 4:110,476,415-110,561,555 | 20 (+ve) | 85141 | NM_0019977 | Q07075 | 957 | 109,244 (5.3)
Chimpanzee | Pan troglodytes | 4:113,095,101-113,180,147 | 20 (+ve) | 85047 | *XP_5117397 | H2QQ15 | 957 | 109,115 (5.3)
Gorilla | Gorilla gorilla | 4:121,992,414-122,077,571 | 20 (+ve) | 85158 | *XP_018880573 | G3SK36 | 957 | 109,262 (5.3)
Orang-utan | Pongo abelii | 4:115,175,027-115,261,670 | 20 (+ve) | 86644 | NM_001132893 | H2PE46 | 957 | 109,098 (5.2)
Rhesus | Macaca mulatta | 5:109,436,911-109,519,173 | 20 (+ve) | 82263 | NM_001266656 | F7GTW9 | 957 | 109,188 (5.2)
Baboon | Papio anubis | 5:101,625,577-101,709,020 | 20 (+ve) | 83444 | *XP_003899143 | A0A096MTU4 | 957 | 109,192 (5.3)
Squirrel monkey | Saimiri boliviensis | *JH378138:4,950,114-5,038,890 | 20 (−ve) | 88777 | *XP_003929505 | na | 957 | 109,059 (5.2)
Marmoset | Callithrix jacchus | 3:83,132,279-83,220,293 | 20 (−ve) | 88015 | *XP_002806699 | na | 957 | 109,299 (5.4)
Mouse lemur | Microcebus murinus | *KQ053609v1:1,352,189-1,436,783 | 20 (−ve) | 84595 | *XP_012621645 | na | 962 | 109,104 (5.6)
Mouse | Mus musculus | 3:129,270,282-129,332,481 | 20 (−ve) | 62200 | NM_007934 | P16406 | 945 | 107,956 (5.3)
Rat | Rattus norvegicus | 2:252,992,139-253,065,721 | 20 (−ve) | 73583 | *CH473952 | P50123 | 945 | 107,995 (5.2)
Cow | Bos taurus | 6:16,067,640-16,146,013 | 20 (−ve) | 78374 | NM_001038027 | F1MEM5 | 956 | 109,801 (5.1)
Horse | Equus caballus | 2:115,349,261-115,422,839 | 20 (−ve) | 73579 | *XP_001502921 | F6XRR6 | 948 | 108,220 (4.8)
Pig | Sus scrofa | 8:119,969,527-120,060,884 | 20 (−ve) | 91358 | NM_214017 | Q95334 | 942 | 108,284 (5.1)
Rabbit | Oryctolagus cuniculus | 15:38,927,056-39,017,176 | 20 (−ve) | 90121 | *XP_002717229 | G1TBB2 | 956 | 109,013 (5.0)
Dog | Canis familiaris | 32:30,553,200-30,638,483 | 20 (+ve) | 85284 | *XP_535696 | F6XRM5 | 954 | 109,202 (5.4)
Cat | Felis catus | B1:113,256,430-113,341,776 | 20 (−ve) | 85347 | *XP_003985130 | M3VU18 | 952 | 109,480 (5.7)
Opossum | Monodelphis domestica | 5:63,362,365-63,488,028 | 20 (+ve) | 125664 | *XP_001363921 | F6TL25 | 957 | 110,151 (5.4)
Platypus | Ornithorhynchus anatinus | *DS181320v1:1,408,807-1,485,704 | 20 (+ve) | 76898 | *XP_001506613 | F7E6Z3 | 938 | 107,447 (5.6)
Chicken | Gallus gallus | 4:57,435,632-57,469,043 | 20 (−ve) | 33412 | *XP_426327 | A0A1D5PAZ7 | 943 | 107,918 (5.0)

RefSeq: The reference amino acid sequence; *Predicted NCBI-derived amino acid sequence; na: Not Available; GenBank IDs are derived from NCBI http://www.ncbi.nlm.nih.gov/genbank/; UNIPROT refers to UniprotKB/Swiss-Prot IDs for individual ENPEP proteins (http://kr.expasy.org); *JH and *KQ refer to a scaffold; bps refers to base pairs of nucleotide sequences; pI refers to theoretical isoelectric points; the number of coding exons is listed.

BLAT analyses were subsequently undertaken for each of the predicted ENPEP amino acid sequences using the UC Santa Cruz (UCSC) Genome Browser with the default settings to obtain the predicted locations for each of the mammalian M1 peptidase genes, including predicted exon boundary locations and gene sizes (Table 1) [23]. Structures for human isoforms (splicing variants) were obtained using the AceView website to examine predicted gene and protein structures [24].
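The gene sizes listed in Table 1 are consistent with the inclusive span of the chromosome coordinates (end − start + 1). A minimal sketch of that arithmetic follows; the helper names `parse_location` and `gene_size` are hypothetical and are not part of any tool used in the study.

```python
def parse_location(location: str):
    """Split a 'chrom:start-end' string (commas allowed in numbers) into parts."""
    chrom, span = location.rsplit(":", 1)
    start_text, end_text = span.split("-")
    start = int(start_text.replace(",", ""))
    end = int(end_text.replace(",", ""))
    return chrom, start, end


def gene_size(location: str) -> int:
    """Inclusive span length in base pairs: end - start + 1."""
    _, start, end = parse_location(location)
    return end - start + 1


# Coordinates taken from Table 1 (human and mouse rows).
print(gene_size("4:110,476,415-110,561,555"))
print(gene_size("3:129,270,282-129,332,481"))
```

For the human and mouse rows this gives 85141 and 62200 bps, matching the Gene Size column.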

Predicted structures and properties of mammalian ENPEP M1 endopeptidases

Predicted secondary and tertiary structures for mammalian ENPEP M1 endopeptidase proteins were obtained using the SWISS-MODEL web-server (http://swissmodel.expasy.org/) [25] using the reported tertiary structure for human ENPEP [11] (PDB: 4kx7A) with a modelling residue range of 76-954. Molecular weights, N-glycosylation sites, and predicted transmembrane, cytosolic and lumenal sequences for mammalian ENPEP M1 endopeptidase proteins were obtained using Expasy web tools [26,27] (http://au.expasy.org/tools/pi_tool.html). Identification of conserved domains for ENPEP was conducted using NCBI web tools [28].

Comparative human tissue (ENPEP) gene expression

RNA-seq gene expression profiles across 53 selected tissues (or tissue segments) were examined from the public database for human ENPEP, based on expression levels for 175 individuals [16] (Data Source: GTEx Analysis Release V6p (dbGaP Accession phs000424.v6.p1); http://www.gtex.org).

Phylogeny studies and sequence alignments

Alignments of mammalian ENPEP peptidase sequences were undertaken using Clustal Omega, a multiple sequence alignment program (Table 1) [29]. Percentage identities were derived from the results of these alignments (Table 2). Phylogenetic analyses used several bioinformatic programs, coordinated using the http://www.phylogeny.fr/ bioinformatic portal, to enable alignment (MUSCLE), curation (Gblocks), phylogeny (PhyML) and tree rendering (TreeDyn), to reconstruct phylogenetic relationships [30]. Sequences were identified as mammalian ENPEP M1 endopeptidase proteins (Table 1).
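As an illustration of how a percentage identity can be derived from a pairwise alignment, the sketch below counts identical residues over the aligned columns in which neither sequence has a gap. This denominator is an assumption; the text does not state which convention was used, and alignment tools differ on this point. The function name and the toy sequences are hypothetical and are not ENPEP data.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over aligned columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    identical = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # skip gap columns (assumed convention)
        compared += 1
        if a == b:
            identical += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * identical / compared


# Toy aligned pair: 7 ungapped columns, 6 of them identical.
print(round(percent_identity("MKT-AYLV", "MKTGAFLV"), 1))
```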
Table 2

Predicted locations of N-glycosylation sites for mammalian ENPEP proteins. The predicted N-glycosylation sites were numbered following alignments using Clustal Omega [29] from the N-terminal end; conserved N-glycosylation sites for all mammalian ENPEP sequences examined are highlighted in yellow; individual amino acid residues were identified using standard single letter nomenclature: N-asparagine; S-serine; T-threonine etc.

Species columns, in order: Human; Chimp; Gorilla; Orangutan; Rhesus; Baboon; Squirrel Monkey; Marmoset; Mouse Lemur; Mouse; Rat; Cow; Horse; Pig; Rabbit; Cat; Dog; Opossum. Within each row, sites are listed in species column order; columns without a site at that position are not shown.

Site No | Predicted N-glycosylation sites
--- | ---
1 | 43NHS; 44NTS
2 | 110NIS
3 | 124NLS; 124NLS; 124NLS; 124NLS; 124NLS; 124NLS; 124NLS; 124NLS; 122NLS; 116NLS; 116NLS; 126NVS; 115NVS; 114NVT; 123NVS; 119NVS; 121NVS; 123NVT
4 | 197NGS; 197NGS; 197NGS; 197NGS; 197NGS; 197NGS; 197NGS; 197NGS; 202NGS; 189NGS; 189NGS; 199NGS; 188NGS; 187NGS; 196NGS; 192HGS; 194NGS; 197NGS
5 | 236NIS; 241NIS
6 | 272NRT; 272NRT; 272NRT; 267NRT; 269NRT; 272NRT
7 | 324NIT; 324NIT; 324NIT; 324NIT; 324NIT; 324NIT; 324NIT; 324NIT; 329NIT; 316NIT; 316NIT; 326NIT; 315NIT; 314NIT; 323NIT; 321NIT; 324NIT
8 | 340NYS; 340NYS; 340NYS; 340NYS; 340NYS; 340NYS; 340NYS; 340NYS; 345NYS; 331NYS
9 | 383NES; 367NES; 367NES
10 | 545NLS
11 | 554NIT; 554NIT; 554NIT; 554NIT; 554NIT; 554NIT; 554NIT; 554NIT; 546NIT; 547NVT; 556NIT; 549NIT; 551NIT
12 | 558NSS; 557NSS; 562NSS; 564NLS
13 | 567NPS; 567NPS; 567NPS; 567NPS; 567NPS; 567NPS; 567NPS; 567NPS; 567NPS
14 | 589NIT; 589NIT; 589NIT; 589NIT; 589NIT; 589NIT; 589NIT; 589NIT; 589NIT; 584NIT; 580NVS; 579NES; 588NES; 584NVS; 586NVS; 592NIT
15 | 597NRS; 597NRS; 597NRS; 597NRS; 597NRS; 597NRS; 597NRS; 597NRS; 602NRS; 599NRS; 588NRS; 587NRS; 596NRS; 592NRS; 594NRS; 597NRT
16 | 607NSS; 607NSS; 607NSS; 607NSS; 607NSS; 607NSS; 607NSS; 612NSS; 601NLS; 601NLS; 601NPS; 597NSS; 606NPS; 602NSS; 604NSS; 607NST
17 | 610NPS; 610NPS; 610NPS; 610NPS; 610NPS; 610NPS; 610NPS; 610NPS; 615NPS
18 | 643NLS; 633NLS; 632NLS; 641NLS; 637NLS; 646NFS
19 | 637NHT; 647NHT; 646NHT
20 | 649NFS; 649NFS; 647NFS; 640NFS
21 | 678NLT; 678NLT; 678NLT; 678NLT; 678NLT; 678NLT; 678NLT; 678NLT; 683NLT; 669NLT; 669NLT; 679NLT; 669NLT; 668NLT; 677NLT; 672NLT; 675NLT; 678NLT
22 | 734NDT; 734NDT
23 | 763NAS; 763NAS; 763NAS; 763NAS; 763NAS; 763NAS; 763NAS; 763NAS; 768NAS; 754NAS; 754NAS; 764NAS; 754NAS; 753NAS; 762NAS; 758NAT; 760NAT; 763NAS
24 | 766NES
25 | 773NGT; 773NGT; 773NGT; 773NGT; 773NGT
26 | 796NET; 796NET; 801NET; 797NET; 787NET; 786NET; 795NET; 791NET; 793NET
27 | 801NYT; 801NYT; 801NYT; 801NYT; 801NYT; 801NYT; 801NYT; 801NYT; 806NYT; 792NYT; 792NYT; 802NYT; 792NYT; 791NYT; 800NYT; 796NYT; 798NYT; 800NYT
28 | 828NVT; 828NVT; 828NVT; 828NVT; 828NVT; 828NVT; 828NVT; 828NVT; 829NVT; 819NVT; 827NVT; 823NVT; 825NVT; 827NVT
Total (per species, in column order) | 15; 15; 14; 14; 16; 15; 18; 18; 16; 9; 12; 12; 15; 14; 12; 15; 16; 13

Results

Alignments of mammalian ENPEP amino acid sequences

The deduced amino acid sequences for baboon (Papio anubis), mouse (Mus musculus), opossum (Monodelphis domestica) and chicken (Gallus gallus) ENPEP are shown in Figure 1 together with a previously reported sequence for human ENPEP [1,19] (Table 1).
Figure 1

Amino acid sequence alignments for vertebrate ENPEP sequences. See Table 1 for sources of ENPEP sequences; * shows identical residues for ENPEP subunits; : shows similar alternate residues; . shows dissimilar alternate residues; N-glycosylated and potential N-glycosylated Asn sites are in red and numbered; human ENPEP active site residues are shown: Zinc binding sites, 393His, 397His, 416Glu; proton acceptor, 394Glu; and transition state stabilizer, 497Tyr; other active site residues are shown as ^; α-helices for vertebrate ENPEP [11] are in shaded yellow and numbered in sequence from the N-terminus end; predicted β-sheets are in grey and similarly numbered in sequence from the N-terminus; turns in the 3D structure are shown; bold underlined font shows residues corresponding to known or predicted exon start sites; exon numbers refer to human ENPEP gene exons; four major domains were identified as cytoplasmic (N-terminal tail) (1-19); signal membrane anchor transmembrane (for linking ENPEP to the plasma membrane) (20-39); N-terminal domain (M1 aminopeptidase N) (100-545); and C-terminal domain (ERAP1-like domain) (617-931).

Human and other mammalian ENPEP sequences examined were 71-98% identical in these alignments, suggesting that these are members of the same family of genes. The amino acid sequences for mammalian ENPEP proteins contained between 942 (pig) and 962 (mouse lemur) amino acids, with human and most other primate ENPEP sequences containing 957 amino acids (Figures 1 and 2; Table 1).
Figure 2

N-terminal amino acid sequence alignments (A) and 5′-nucleotide gene sequence alignments (B) for mammalian ENPEP proteins and genes. A: N-terminal mammalian ENPEP amino acid sequence alignments; * shows identical residues for ENPEP subunits; : shows similar alternate residues; . shows dissimilar alternate residues; predicted cytosolic and transmembrane helical residues are shown; see Table 1 for details of mammalian ENPEP proteins and genes; other mammalian ENPEP sequences were derived from NCBI as described in Methods; sn monkey: snub-nosed monkey; sq monkey: squirrel monkey; cap monkey: capuchin monkey. B: N-terminal mammalian ENPEP amino acid sequence alignments and 5′ mammalian ENPEP nucleotide sequence alignments; predicted cytosolic and transmembrane helical residues are shown; * shows identical residues for ENPEP subunits and nucleotide residues; : shows similar alternate residues; . shows dissimilar alternate residues; ENPEP gene regions showing areas of deletions are shown.

Previous studies have reported several key regions and residues for human and mouse ENPEP proteins (human ENPEP amino acid residues were identified in each case). These included an N-terminus cytoplasmic tail (1-18) followed by a hydrophobic transmembrane 21-residue segment (19-39). A comparison of 13 primate and 19 other mammalian ENPEP sequences for these N-terminal regions revealed a high degree of conservation, particularly for residues (human ENPEP numbers used) Cys13-Ile14, His18-Val19-Ala20, Cys23, Val26, Gly30-Leu31, Val33-Gly34-Leu35 and Gly38-Leu39-Thr40-Arg41, which were invariant among all mammalian ENPEP sequences examined (Figures 1 and 2). The biochemical roles for these conserved regions include forming an N-terminal cytoplasmic tail sequence (1-18) and establishing a hydrophobic transmembrane 21-residue segment (19-39) which may anchor the enzyme to the plasma membrane [1,3,19]. Residues 41-957 of the human ENPEP sequence were identified using bioinformatics as containing two domains: the N-terminal GluZincin Peptidase M1 (aminopeptidase N) domain (residues 100-545) and the ERAP1-like C-terminal domain (residues 617-931) [28]. The former domain includes the substrate binding site (223Glu); the Zinc binding site (1 Zinc ion per subunit) (393His, 397His, 416Glu); the proton acceptor (394Glu); and the transition state stabilizer (497Tyr) (Figure 1). The C-terminal region is predicted to be localized in the extracellular region. Five N-glycosylation sites were consistently found for all mammalian ENPEP sequences examined, namely Asn124-Leu125-Ser126 (site 3 for mammalian sequences), Asn197-Gly198-Ser199 (site 4), Asn678-Leu679-Thr680 (site 21), Asn763-Ala764-Ser765 (site 23) and Asn801-Tyr802-Thr803 (site 27) (Figure 1 and Table 2).
Other N-glycosylation sites were frequently observed for other mammalian ENPEP sequences, including Asn324-Ile325-Thr326 (site 7), Asn340-Tyr341-Ser342 (site 8), Asn554-Ile555-Thr556 (site 11), Asn567-Pro568-Ser569 (site 13), Asn589-Ile590-Thr591 (site 14), Asn597-Arg598-Ser599 (site 15), Asn607-Ser608-Ser609 (site 16), Asn610-Pro611-Ser612 (site 17) and Asn828-Val829-Thr830 (site 28). One site was found among some primate ENPEP sequences, namely Asn773-Gly774-Thr775 (site 25), whereas a neighboring site (Asn796-Glu797-Thr798: site 26) was restricted to some lower primate and other mammalian ENPEP sequences (Table 2). The total number of mammalian ENPEP N-glycosylation sites differed with the species examined, from a low of 9 sites for mouse ENPEP to 18 sites for squirrel monkey and marmoset ENPEP sequences. The specific roles for ENPEP N-glycosylation sites and the specific oligosaccharide residues attached to the asparagine residues have not been determined. However, given the level of conservation among the different mammalian sequences examined, these sites are likely to play key roles in determining the physiological roles and microlocations for this enzyme in different tissues of the body.
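The sites described above all follow the Asn-X-Ser/Thr pattern. A minimal sketch of scanning a sequence for this motif is given below. It is not the Expasy tool used in the study; the function name `find_sequons` and the toy sequence are hypothetical. Because Table 2 lists motifs with proline in the middle position (for example NPS), the sketch allows any middle residue by default, with an optional flag for the stricter rule that excludes proline.

```python
import re


def find_sequons(sequence: str, exclude_proline: bool = False):
    """Return (1-based position, motif) for every N-X-[S/T] match.

    A lookahead is used so that overlapping motifs are all reported.
    """
    middle = "[^P]" if exclude_proline else "."
    pattern = re.compile(f"(?=(N{middle}[ST]))")
    return [(m.start() + 1, m.group(1)) for m in pattern.finditer(sequence.upper())]


# Toy sequence (not ENPEP), labelled in the style used in Table 2.
toy = "AANLSGGNPSNNTT"
for position, motif in find_sequons(toy):
    print(f"{position}{motif}")
```

Calling `find_sequons(toy, exclude_proline=True)` drops the NPS match and keeps the others.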

Predicted secondary and tertiary structures for mammalian ENPEP

Predicted secondary structures for mammalian ENPEP sequences were examined, particularly for the extracellular sequences (Figure 1), using the known structure reported for human ENPEP [11] (PDB: 4kx7A), with 35 α-helices and 28 β-sheet structures being observed. Of particular interest were α-helices 8, 9 and 14, which contained the active site residues for human ENPEP. The tertiary structure for human ENPEP is shown in Figure 3, which demonstrates the distinct secondary structures for the N- and C-terminal regions of the protein, with β-sheet structures predominating in the N-terminal region and α-helices predominating in the C-terminal region. The two major domains for human ENPEP, previously mentioned, were readily apparent and enclose a large cavity previously shown to contain the enzyme’s active site [11]. The N-terminal domain (residues 100-545) contains the active site residues and has been recognized as a member of the peptidase M1 aminopeptidase N family, whereas the C-terminal domain (residues 617-931, recognized as an ERAP1-like domain) [31] is composed of 16 α-helices, organized as 8 HEAT-like repeats (2 α-helices joined by a short loop) [32], which form a concave face facing towards the peptidase active site. This C-terminal ENPEP domain has also been shown to function as an intramolecular chaperone contributing to the correct folding, cell surface expression and activity of this enzyme [33].
Figure 3

Tertiary structure for human ENPEP. The structure for human ENPEP is based on the reported structure [11] and obtained using the SWISS-MODEL web site based on PDB 4KX7A (http://swissmodel.expasy.org/workspace/). The rainbow color code describes the 3-D structure from the N- (blue) to C-termini (red color); α-helices and β-sheets are shown; note the separation of 2 major domains: N-terminal M1 aminopeptidase N domain (in blue, with predominantly β-sheets); and C-terminal ERAP1-like domain (multicolored, with predominantly α-helical structures).

Comparative human ENPEP tissue expression

Figure 4 shows RNA-seq gene expression profiles across 53 selected tissues (or tissue segments) for human ENPEP, obtained from the public database and based on expression levels for 175 individuals [16] (Data Source: GTEx Analysis Release V6p (dbGaP Accession phs000424.v6.p1); http://www.gtex.org). These data supported highest levels of gene expression for human ENPEP in the small intestine-terminal ileum and the kidney cortex, which is consistent with the enzyme’s role in digestive tract and renal sodium (Na+) reabsorption and the renin-angiotensin system [18,34]. Lower levels were also observed in the uterus, spleen, breast, visceral adipose tissue and coronary artery, whereas brain ENPEP levels were very low according to this method, even though ENPEP has been shown to contribute to the renin-angiotensin system in brain nuclei [7].
Figure 4

Tissue expression for human ENPEP. RNA-seq gene expression profiles across 53 selected tissues (or tissue segments) were examined from the public database for human ENPEP, based on expression levels for 175 individuals (Data Source: GTEx Analysis Release V6p (dbGaP Accession phs000424.v6.p1) (http://www.gtex.org). Tissues: 1. Adipose-Subcutaneous; 2. Adipose-Visceral (Omentum); 3. Adrenal gland; 4. Artery-Aorta; 5. Artery-Coronary; 6. Artery-Tibial; 7. Bladder; 8. Brain-Amygdala; 9. Brain-Anterior cingulate Cortex (BA24); 10. Brain-Caudate (basal ganglia); 11. Brain-Cerebellar Hemisphere; 12. Brain-Cerebellum; 13. Brain-Cortex; 14. Brain-Frontal Cortex; 15. Brain-Hippocampus; 16. Brain-Hypothalamus; 17. Brain-Nucleus accumbens (basal ganglia); 18. Brain-Putamen (basal ganglia); 19. Brain-Spinal Cord (cervical c-1); 20. Brain-Substantia nigra; 21. Breast-Mammary Tissue; 22. Cells-EBV-transformed lymphocytes; 23. Cells-Transformed fibroblasts; 24. Cervix-Ectocervix; 25. Cervix-Endocervix; 26. Colon-Sigmoid; 27. Colon-Transverse; 28. Esophagus-Gastroesophageal Junction; 29. Esophagus-Mucosa; 30. Esophagus-Muscularis; 31. Fallopian Tube; 32. Heart-Atrial Appendage; 33. Heart-Left Ventricle; 34. Kidney-Cortex; 35. Liver; 36. Lung; 37. Minor Salivary Gland; 38. Muscle-Skeletal; 39. Nerve-Tibial; 40. Ovary; 41. Pancreas; 42. Pituitary; 43. Prostate; 44. Skin-Not Sun Exposed (Suprapubic); 45. Skin-Sun Exposed (Lower leg); 46. Small Intestine-Terminal Ileum; 47. Spleen; 48. Stomach; 49. Testis; 50. Thyroid; 51. Uterus; 52. Vagina; 53. Whole Blood.

Gene locations, exonic structures and regulatory sequences for mammalian ENPEP genes

Table 1 summarizes the predicted locations and exonic structures for mammalian ENPEP genes based upon BLAT interrogations of several mammalian and chicken genomes using the reported sequences for human and mouse ENPEP [1,8,35] and the predicted sequences for other ENPEP enzymes and the UCSC genome browser [23]. The predicted mammalian ENPEP genes were transcribed on either the negative strand (lower primates and most non-primate genomes) or the positive strand (higher primates, dog and opossum genomes). Figure 1 summarizes the predicted exonic start sites for human, baboon, mouse, opossum and chicken ENPEP genes, with each having 20 coding exons in identical or similar positions to those predicted for the human ENPEP gene. Exon 1 encodes the largest segment for each of these genes, including the cytoplasmic N-terminus and signal anchor sequences, the first 10 β-sheet structures and four of the N-glycosylation sites for mammalian ENPEP. Figure 5 shows the predicted structure for the major human ENPEP transcript together with CpG27 and several Transcription Factor Binding Sites (TFBS), which are located at the 5′ end of the gene, consistent with potential roles in regulating the transcription of this gene and forming part of the ENPEP gene promoter. The human ENPEP transcript was 4,991 bps in length with an extended 3′-untranslated region (UTR) containing 7 microRNA target sites. The human ENPEP genome sequence also contained several predicted TFBS and a large CpG island (CpG27) located in the 5′-untranslated promoter region of human ENPEP on chromosome 4. CpG27 contained 412 bps with a C plus G count of 264 bps, a C plus G content of 64% and a ratio of observed to expected CpG of 0.64. It is therefore likely that the CpG27 island plays a key role in regulating this gene and may contribute to the very high level of gene expression observed in the small intestine-terminal ileum and the kidney cortex [36].
At least 6 TFBS were colocated with CpG27 in the human ENPEP promoter region, which may contribute to the high expression of this gene in human kidney and intestine.
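The CpG island statistics quoted above can be reproduced from sequence or from counts. The sketch below assumes the commonly used observed/expected definition (number of CpG dinucleotides × sequence length, divided by the product of the C and G counts); the text does not state which formula was applied. The function names are hypothetical. The worked check further assumes that "CpG27" denotes 27 CpG dinucleotides and that the 264 C plus G bases split evenly (132 each); neither assumption is stated in the source.

```python
def obs_exp_from_counts(cpg: int, c: int, g: int, length: int) -> float:
    """Observed/expected CpG ratio: (CpG count * length) / (C count * G count)."""
    if c == 0 or g == 0:
        return 0.0
    return cpg * length / (c * g)


def cpg_stats(sequence: str) -> dict:
    """C plus G fraction and observed/expected CpG ratio for a DNA sequence."""
    seq = sequence.upper()
    c, g = seq.count("C"), seq.count("G")
    return {
        "length": len(seq),
        "gc_fraction": (c + g) / len(seq),
        "obs_exp_cpg": obs_exp_from_counts(seq.count("CG"), c, g, len(seq)),
    }


# Toy sequence (not the ENPEP promoter).
print(cpg_stats("ACGCGTTCGA"))

# Check against the reported values: 264 of 412 bps are C or G.
print(round(264 / 412, 2))
# Assumed counts: 27 CpGs, 132 C and 132 G (an even split, for illustration only).
print(round(obs_exp_from_counts(27, 132, 132, 412), 2))
```

Under these assumptions both printed ratios come to 0.64, in line with the 64% C plus G content and the observed/expected value reported for CpG27.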
Figure 5

Gene structure and major gene transcript for the human ENPEP gene. Derived from the AceView website (http://www.ncbi.nlm.nih.gov/IEB/Research/Acembly/) [24]; shown with capped 5′- and 3′-ends for the predicted mRNA sequences; NM refers to the NCBI reference sequence; coding exons are in pink; the direction for transcription is shown as 5′ → 3′; a large CpG27 island is located at the gene promoter and the first exon; predicted transcription factor binding sites (TFBS) for human ENPEP are shown; 7 predicted miRNA target sites were identified within the extended 3′-UTR region of human ENPEP.

Of special interest among these identified ENPEP TFBS were the following: the chicken ovalbumin upstream promoter transcription factor II (COUP), which has been implicated in renin gene expression, a key member of the renin-angiotensin system [37] which is highly expressed in kidney cells [38,39]; the ecotropic viral integration site (EVI1), which is also highly expressed in the developing kidney distal tubule and duct in Xenopus and plays a key role in its formation [40,41]; and nuclear protein c-Myc, which plays an important role in intestinal epithelial cell proliferation [11]. It appears that the ENPEP gene promoter contains gene regulatory sequences and a large CpG island (CpG27) which may contribute to the high levels of expression observed in intestine and kidney cells. Among the microRNA binding sites observed, miR-125b has been shown to act as a tumor suppressor in breast tumorigenesis by directly targeting the ENPEP gene [10].

Phylogeny and divergence of mammalian ENPEP M1 peptidase sequences

A phylogenetic tree (Figure 6) was calculated by the progressive alignment of 19 ENPEP mammalian M1 peptidase amino acid sequences with the chicken (Gallus gallus) ENPEP sequence, which was used to ‘root’ the tree (Table 1). The phylogram showed clustering of the ENPEP sequences into groups which were consistent with their evolutionary relatedness and showing distinct groups for primate, other eutherian (mouse/rat, cow/pig and dog/cat), marsupial (opossum) and monotreme (platypus) ENPEP sequences, which were distinct from, and progressively related to each other. It is apparent that the ENPEP gene existed as a distinct mammalian gene family which has evolved from a more primitive vertebrate ENPEP gene and has been retained throughout monotreme, marsupial and eutherian mammalian evolution.
Figure 6

Phylogenetic tree of mammalian ENPEP amino acid sequences with the chicken ENPEP amino acid sequence. The tree is labeled with the ENPEP name and the name of the animal and is ‘rooted’ with the chicken (Gallus gallus) ENPEP sequence (Table 1). Note the single cluster corresponding to the ENPEP gene family. A genetic distance scale is shown. The number of times a clade (sequences common to a node or branch) occurred in the bootstrap replicates is shown. Replicate values of 0.9 or more, which are highly significant, are shown, with 100 bootstrap replicates performed in each case. A proposed sequence of gene evolution events is shown arising from an ancestral bird ENPEP gene.

Discussion

ENPEP is expressed at high levels in the epithelial cells of the kidney glomerulus and proximal tubule cells, where the enzyme participates in the renin-angiotensin system. Renin cleaves its substrate angiotensinogen, forming the decapeptide angiotensin I (Ang I) [42]. Ang I is cleaved by Angiotensin-Converting Enzyme (ACE) to produce the biologically active angiotensin II (Ang II) [43]. Ang II activates its receptor (AT1), which mediates key physiological functions in the kidney (systemic regulation) and brain (central regulation), including vasoconstriction, renal sodium (Na+) reabsorption and aldosterone secretion, increasing blood pressure and contributing to hypertension [44,45]. Ang II is converted to angiotensin III (Ang III) by ENPEP, which hydrolyzes the N-terminal aspartate (or glutamate), thereby removing biological activity of the Ang peptides [15,16]. The results of the present study indicated that mammalian ENPEP genes and encoded proteins represent a distinct gene and protein family of M1 peptidase proteins which share key conserved sequences that have been reported for other M1 peptidases previously studied [6,46,47]. Human ENPEP contains the following sites: a cytoplasmic N-terminus region (1-18); a hydrophobic transmembrane 21-residue segment (19-39), a helical signal anchor for type II membrane protein; an extracellular protein region (residues 100-545) containing the Zinc binding endopeptidase active site, comprising the substrate binding site (223Glu), the Zinc binding site (1 Zinc ion per subunit) (393His, 397His, 416Glu), the proton acceptor (394Glu) and the transition state stabilizer (497Tyr); and the ERAP1-like C-terminal domain (residues 617-931) (Figure 1) [28]. These regions contain a large number of N-glycosylation sites, several of which are conserved throughout mammalian evolution.
ENPEP plays a role in the catabolic pathway of the renin-angiotensin system and is a major contributor to the development of clinical arterial hypertension in the body [13,15,18,19,42,45].
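The processing steps described in this section can be traced on the peptide sequences themselves. The sketch below is illustrative only: the one-letter sequence used for Ang I (DRVYIHPFHL) is the standard human sequence and is not given in the text, the function names are hypothetical, and each enzyme is reduced to a single trimming step (ACE removing the C-terminal dipeptide, ENPEP removing an N-terminal Asp or Glu).

```python
# Standard human angiotensin I sequence (assumption: not stated in the text).
ANG_I = "DRVYIHPFHL"


def ace_cleave(peptide: str) -> str:
    """Model ACE as removing the C-terminal dipeptide (Ang I -> Ang II)."""
    if len(peptide) < 3:
        raise ValueError("peptide too short for dipeptide removal")
    return peptide[:-2]


def enpep_cleave(peptide: str) -> str:
    """Model ENPEP as removing an N-terminal Asp (D) or Glu (E) (Ang II -> Ang III)."""
    if not peptide or peptide[0] not in "DE":
        raise ValueError("ENPEP model requires an N-terminal Asp or Glu")
    return peptide[1:]


ang_ii = ace_cleave(ANG_I)
ang_iii = enpep_cleave(ang_ii)
print("Ang I  ", ANG_I, len(ANG_I))
print("Ang II ", ang_ii, len(ang_ii))
print("Ang III", ang_iii, len(ang_iii))
```

The output shows the decapeptide Ang I shortened to the octapeptide Ang II and then to the heptapeptide Ang III, which lacks the N-terminal aspartate.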

Conclusion

ENPEP is encoded by a single gene among the mammalian genomes studied and is highly expressed in human small intestine-terminal ileum and kidney cortex cells. The gene usually contained 20 coding exons and was located on the negative (lower primate and other mammalian) or positive (higher primate) strand, depending on the mammalian genome. The human ENPEP gene contained a large CpG island within the promoter region, as well as several transcription factor binding sites, which may contribute to the high level of gene expression in intestinal and kidney tissues. Alignments of mammalian ENPEP sequences demonstrated a high degree of conservation, particularly for those regions directing the catalytic functions and structural integrity of this enzyme, especially the extracellular sequences, which contain two domains: the N-terminal GluZincin Peptidase M1 (aminopeptidase N) domain (residues 100-545) and the ERAP1-like C-terminal domain (residues 617-931). Phylogenetic studies using 19 ENPEP mammalian M1 endopeptidase sequences indicated that the ENPEP gene existed as a distinct family which has apparently evolved from a more primitive vertebrate ENPEP gene and has been retained throughout monotreme, marsupial and eutherian mammalian evolution [48-53].
  52 in total

1.  Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes.

Authors:  A Krogh; B Larsson; G von Heijne; E L Sonnhammer
Journal:  J Mol Biol       Date:  2001-01-19       Impact factor: 5.469

2.  The PSIPRED protein structure prediction server.

Authors:  L J McGuffin; K Bryson; D T Jones
Journal:  Bioinformatics       Date:  2000-04       Impact factor: 6.937

Review 3.  Genetic associations and functional characterization of M1 aminopeptidases and immune-mediated diseases.

Authors:  N Agrawal; M A Brown
Journal:  Genes Immun       Date:  2014-08-21       Impact factor: 2.676

4.  Evi1 is specifically expressed in the distal tubule and duct of the Xenopus pronephros and plays a role in its formation.

Authors:  Claude Van Campenhout; Massimo Nichane; Aline Antoniou; Hélène Pendeville; Odile J Bronchain; Jean-Christophe Marine; Andre Mazabraud; Marianne L Voz; Eric J Bellefroid
Journal:  Dev Biol       Date:  2006-03-30       Impact factor: 3.582

Review 5.  Role of the Collecting Duct Renin Angiotensin System in Regulation of Blood Pressure and Renal Function.

Authors:  Nirupama Ramkumar; Donald E Kohan
Journal:  Curr Hypertens Rep       Date:  2016-04       Impact factor: 5.369

6.  Molecular cloning of the human kidney differentiation antigen gp160: human aminopeptidase A.

Authors:  D M Nanus; D Engelstein; G A Gastl; L Gluck; M J Vidal; M Morrison; C L Finstad; N H Bander; A P Albino
Journal:  Proc Natl Acad Sci U S A       Date:  1993-08-01       Impact factor: 11.205

7.  A naturally variable residue in the S1 subsite of M1 family aminopeptidases modulates catalytic properties and promotes functional specialization.

Authors:  Seema Dalal; Daniel R T Ragheb; Florian D Schubot; Michael Klemba
Journal:  J Biol Chem       Date:  2013-07-29       Impact factor: 5.157

8.  Structural basis for antigenic peptide precursor processing by the endoplasmic reticulum aminopeptidase ERAP1.

Authors:  Tina T Nguyen; Shih-Chung Chang; Irini Evnouchidou; Ian A York; Christos Zikos; Kenneth L Rock; Alfred L Goldberg; Efstratios Stratikos; Lawrence J Stern
Journal:  Nat Struct Mol Biol       Date:  2011-04-10       Impact factor: 15.369

9.  Aminopeptidase A initiates tumorigenesis and enhances tumor cell stemness via TWIST1 upregulation in colorectal cancer.

Authors:  Hui-Yu Chuang; Jeng-Kae Jiang; Muh-Hwa Yang; Hsei-Wei Wang; Ming-Chun Li; Chan-Yen Tsai; Yau-Yun Jhang; Jason C Huang
Journal:  Oncotarget       Date:  2017-03-28

10.  miR-125b acts as a tumor suppressor in breast tumorigenesis via its novel direct targets ENPEP, CK2-α, CCNJ, and MEGF9.

Authors:  Andrea Feliciano; Josep Castellvi; Ana Artero-Castro; Jose A Leal; Cleofé Romagosa; Javier Hernández-Losa; Vicente Peg; Angels Fabra; Francisco Vidal; Hiroshi Kondoh; Santiago Ramón Y Cajal; Matilde E Lleonart
Journal:  PLoS One       Date:  2013-10-03       Impact factor: 3.240

