Georgia Charkoftaki, Yewei Wang, Monica McAndrews, Elspeth A Bruford, David C Thompson, Vasilis Vasiliou, Daniel W Nebert.
Abstract
Lipocalins (LCNs) are members of a family of evolutionarily conserved genes present in all kingdoms of life. There are 19 LCN-like genes in the human genome, and 45 Lcn-like genes in the mouse genome, which include 22 major urinary protein (Mup) genes. The Mup genes, plus 29 of 30 Mup-ps pseudogenes, are all located together on chromosome (Chr) 4; evidence points to an "evolutionary bloom" that resulted in this Mup cluster in mouse, syntenic to the human Chr 9q32 locus at which a single MUPP pseudogene is located. LCNs play important roles in physiological processes by binding and transporting small hydrophobic molecules (such as steroid hormones, odorants, retinoids, and lipids) in plasma and other body fluids. LCNs are extensively used in clinical practice as biochemical markers. LCN-like proteins (18-40 kDa) have the characteristic eight β-strands creating a barrel structure that houses the binding site; LCNs are synthesized in the liver as well as in various secretory tissues. In rodents, MUPs are involved in communication of information in urine-derived scent marks, serving as signatures of individual identity, or as kairomones (to elicit fear behavior). MUPs also participate in regulation of glucose and lipid metabolism via a mechanism that is not well understood. Although much has been learned about LCNs and MUPs in recent years, more research is necessary to allow better understanding of their physiological functions, as well as their involvement in clinical disorders.
Year: 2019 PMID: 30782214 PMCID: PMC6381713 DOI: 10.1186/s40246-019-0191-9
Source DB: PubMed Journal: Hum Genomics ISSN: 1473-9542 Impact factor: 4.639
List of all human LCN and mouse Lcn genes, with official gene symbols, full protein name, aliases, chromosomal locations, isoforms, National Center for Biotechnology Information (NCBI) RefSeq mRNA accession numbers, NCBI RefSeq protein accession numbers, and total number of amino acids (# of AAs) [information retrieved and confirmed from https://www.genenames.org/]
| Gene symbol | Full protein name | Aliases | Chromosome | Isoforms | RefSeq mRNA number | RefSeq protein number | No. of AAs |
|---|---|---|---|---|---|---|---|
| | Lipocalin-1 isoform 1 precursor | TP; TLC; PMFA; VEGP | 9q34 | | NM_001252617.1 | NP_001239546 | 176 |
| | Lipocalin-1 isoform 2 precursor | | 9q34 | | NM_001252618.1 | NP_001239547.1 | 233 |
| | Lipocalin-1 isoform 3 precursor | | 9q34 | | NM_001252619.1 | NP_001239548.1 | 230 |
| | Lipocalin-2 | 24p3; MSFI; NGAL | 9q34 | | NM_005564.3 | NP_005555.2 | 198 |
| | Lipocalin-2 | NRL; 24p3; Sip24; AW212229 | 2 | | NM_008491.1 | NP_032517.1 | 200 |
| | Lipocalin-3 | Vnsp1 | 2 | | NM_010694.1 | NP_034824.1 | 187 |
| | Lipocalin-4 | Vnsp2; A630045M08Rik | 2 | | NM_010695.1 | NP_034825.1 | 185 |
| | Lipocalin-5 | Erabp; MEP10; ERABP | 2 | Epididymal-specific lipocalin-5 isoform X1 | XM_006497665.1 | XP_006497728.1 | 208 |
| | | | | Epididymal-specific lipocalin-5 isoform X2 | XM_006497666.3 | XP_006497729.1 | 202 |
| | | | | Epididymal-specific lipocalin-5 isoform X3 | XM_006497667.2 | XP_006497730.1 | 190 |
| | | | | Epididymal-specific lipocalin-5 isoform X4 | XM_006497668.2 | XP_006497731.1 | 190 |
| | | | | Epididymal-specific lipocalin-5 isoform X5 | XM_006497669.1 | XP_006497732.1 | 173 |
| | Lipocalin-6 | LCN5; hLcn5; UNQ643 | 9q34.3 | | NM_198946.2 | NP_945184.1 | 163 |
| | Lipocalin-6 | 9230101D24Rik | 2 | Epididymal-specific lipocalin-6 isoform 1 precursor | NM_001276448.1 | NP_001263377.1 | 245 |
| | | | | Epididymal-specific lipocalin-6 isoform 2 precursor | NM_177840.4 | NP_808508.2 | 181 |
| | Lipocalin-8 | EP17; LCN5 | 9q34.3 | | NM_178469.3 | NP_848564.2 | 152 |
| | Lipocalin-8 | EP17; Lcn5; mEP17; 9230106L18Rik | 2 | | NM_033145.1 | NP_149157.1 | 175 |
| | Lipocalin-9 | HEL129; 9230102I19Rik | 9q34.3 | | NM_001001676.1 | NP_001001676.1 | 176 |
| | Lipocalin-9 | 9230102I19Rik | 2 | | NM_029959.2 | NP_084235.1 | 178 |
| | Lipocalin-10 | | 9q34.3 | | NM_001001712.2 | NP_001001712.2 | 200 |
| | Lipocalin-10 | 9230112J07Rik | 2 | | NM_178036.4 | NP_828875.1 | 182 |
| | Lipocalin-11 | Gm109 | 2 | | NM_001100455.2 | NP_001093925.1 | 178 |
| | Lipocalin-12 | | 9q34.3 | | NM_178536.3 | NP_848631.2 | 192 |
| | Lipocalin-12 | 9230102M18Rik | 2 | | NM_029958.1 | NP_084234.1 | 193 |
| | Lipocalin-15 | PRO6093; UNQ2541 | 9q34.3 | | NM_203347.1 | NP_976222.1 | 184 |
| | Lipocalin-15 | Gm33749 | 2 A3; 2 | | XM_006498514.1 | XP_006498577.1 | 202 |
| | Lipocalin-16 | Gm39773 | 2 | | XM_011239226.1 | XP_011237528.1 | 181 |
| | Lipocalin-17 | Gm39774 | 2 | | XM_011239227.1 | XP_011237529.1 | 193 |
| | Odorant binding protein 2A | OBP; LCN13; OBP2C; OBPIIa; hOBPIIa | 9q34 | | NM_001293189.1 | NP_001280118.1 | 228 |
| | Odorant binding protein 2A | Lcn13; BC027556 | 2 | | NM_153558.1 | NP_705786.1 | 176 |
| | Odorant-binding protein-2B | LCN14; OBPIIb | 9q34 | | NM_001288987.1 | NP_001275916.1 | 170 |
| | Odorant-binding protein-2B | Lcn14 | 2 | | NM_001099301.1 | NP_001092771.1 | 176 |
| | Alpha-1-microglobulin/bikunin precursor | A1M; HCP; ITI; UTI; EDC1; HI30; ITIL; IATIL; ITILC | 9q32-q33 | | NM_001633.3 | NP_001624.1 | 352 |
| | Alpha 1 microglobulin/bikunin precursor | AI194774; ASPI; HI-30; Intin4; Itil; UTI | 4 B3; 4 33.96 cM | | NM_007443.4 | NP_031469.1 | 349 |
| | Apolipoprotein D | | 3q29 | | NM_001647.3 | NP_001638.1 | 189 |
| | Apolipoprotein D | | 16 B2; 16 21.41 cM | | NM_001301353.1 | NP_001288282.1 | 189 |
| | Apolipoprotein M | G3a; NG20; apo-M; HSPC336 | 6p21 | | NM_019101.2 | NP_061974.2 | 188 |
| | Apolipoprotein M | G3a; NG20; 1190010O19Rik | 17; 17 B1 | | NM_018816.1 | NP_061286.1 | 190 |
| | Complement component 8, gamma polypeptide | C8C | 9q34.3 | | NM_000606.2 | NP_000597.2 | 202 |
| | Complement component 8, gamma polypeptide | | 2 A3; 2 17.31 cM | | NM_001271777.1 | NP_001258706.1 | 168 |
| | Orosomucoid-1 | ORM; AGP1; AGP-A; HEL-S-153w | 9q32 | | NM_000607.2 | NP_000598.2 | 201 |
| | Orosomucoid-1 | Agp-1; Agp-2; Orm-1 | 4 B3; 4 33.96 cM | | NM_008768.2 | NP_032794.1 | 207 |
| | Orosomucoid-2 | AGP2; AGP-B; AGP-B | 9q32 | | NM_000608.2 | NP_000599.1 | 201 |
| | Orosomucoid-2 | Agp1; Orm-2 | 4 B3; 4 33.96 cM | | NM_011016.2 | NP_035146.1 | 207 |
| | Progestagen-associated endometrial protein | GD; GdA; GdF; GdS; PEP; PAEG; PP14 | 9q34 | | NM_001018049.1 | NP_001018059.1 | 180 |
| | Prostaglandin D2 synthase; 21 kDa (brain) | PDS; PGD2; PGDS; LPGDS; PGDS2; L-PGDS | 9q34.2-q34.3 | | NM_000954.5 | NP_000945 | 190 |
| | Prostaglandin D2 synthase; 21 kDa (brain) | PGD2; PGDS; 21 kDa; PGDS2; Ptgs3; L-PGDS | 2 A3; 2 17.28 cM | | NM_008963.3 | NP_032989.2 | 189 |
| | Retinol-binding protein-4, plasma | RDCCAS; MCOPCB10 | 10q23.33 | | NM_006744.3 | NP_006735.2 | 201 |
| | Retinol-binding protein-4, plasma | Rbp-4 | 19 C2; 19 32.75 cM | | NM_001159487.1 | NP_001152959.1 | 245 |
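The accession numbers in this table follow the NCBI RefSeq prefix convention: NM_ and NP_ denote known mRNA and protein records, whereas XM_ and XP_ denote computationally predicted model records (as for the Lcn5, Lcn15, Lcn16, and Lcn17 entries). The following minimal Python sketch shows one way such accessions could be parsed; the function name `parse_refseq` and the returned field names are illustrative assumptions and not part of any NCBI tool.

```python
import re

# RefSeq prefixes used in the table (NCBI convention):
# NM_/NP_ are known records; XM_/XP_ are predicted model records.
REFSEQ_PREFIXES = {
    "NM": ("mRNA", "known"),
    "NP": ("protein", "known"),
    "XM": ("mRNA", "model"),
    "XP": ("protein", "model"),
}

# Prefix, underscore, digits, and an optional ".version" suffix.
_ACCESSION_RE = re.compile(r"^(NM|NP|XM|XP)_(\d+)(?:\.(\d+))?$")


def parse_refseq(accession):
    """Split a RefSeq accession into prefix, number, version, molecule, status.

    Hypothetical helper for illustration only; it handles just the four
    prefixes that occur in the table above.
    """
    match = _ACCESSION_RE.match(accession.strip())
    if match is None:
        raise ValueError(f"not a recognised RefSeq accession: {accession!r}")
    prefix, number, version = match.groups()
    molecule, status = REFSEQ_PREFIXES[prefix]
    return {
        "prefix": prefix,
        "number": number,
        "version": int(version) if version is not None else None,
        "molecule": molecule,
        "status": status,
    }


if __name__ == "__main__":
    for acc in ("NM_005564.3", "NP_005555.2", "XM_006497665.1", "NP_000945"):
        print(acc, parse_refseq(acc))
```

Accessions listed without a version suffix (for example NP_000945) are accepted, and their version is reported as `None`.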
Fig. 1 Dendrogram of lipocalins (LCNs) in the human genome. Although the names listed are the official human gene symbols [https://www.genenames.org/], this dendrogram is based on the alignment of proteins (listed in Table 1), using multiple sequence alignment by CLUSTALW (http://www.genome.jp/tools/clustalw/)
Fig. 2 Dendrogram of mouse Lcn and Mup proteins. Although the names listed are the official mouse gene symbols [http://www.informatics.jax.org/], this dendrogram is based on the alignment of proteins (listed in Tables 1 and 2), using multiple sequence alignment by CLUSTALW (http://www.genome.jp/tools/clustalw/)
Mouse tissues known to express Mup mRNA [93]
| Mup mRNA | Tissue |
|---|---|
| | Liver |
| | Mammary gland, liver |
| | Liver |
| | Parotid gland, lacrimal gland, nasal expression |
| | Submandibular gland, sublingual gland, lacrimal gland |
| | Parotid gland |
List of all mouse Mup genes [http://www.informatics.jax.org/], with official gene symbols, full protein name, aliases, chromosomal locations, isoforms, National Center for Biotechnology Information (NCBI) RefSeq mRNA accession numbers, NCBI RefSeq protein accession numbers, and total number of amino acids (# of AAs) [information retrieved and confirmed from https://www.ncbi.nlm.nih.gov/genome]
| Gene symbol | Full protein name | Aliases | Chromosome | Isoforms | RefSeq mRNA number | RefSeq protein number | No. of AAs |
|---|---|---|---|---|---|---|---|
| | Major urinary protein-1 | Mup7; Up-1; Ltn-1; Mup-1; Mup-a; Mup10; Lvtn-1 | 4 | MUP 1 isoform b precursor | NM_001163010.1 | NP_001156482.1 | 121 |
| | Major urinary protein-2 | Mup4; Mup-2; AA589603 | 4 | MUP 2 isoform 1 precursor | NM_001045550.2 | NP_001039015.1 | 180 |
| | | | 4 | MUP 2 isoform 2 | NM_001286096.1 | NP_001273025.1 | 119 |
| | | | 4 | MUP 2 isoform 1 precursor | NM_008647.4 | NP_032673.3 | 180 |
| | Major urinary protein-3 | MUP15; Mup-3; Mup25; MUPIII | 4 | MUP 3 precursor | NM_001039544.1 | NP_001034633.1 | 184 |
| | Major urinary protein-4 | Mup1; Mup-4 | 4 | MUP 4 precursor | NM_008648.1 | NP_032674.1 | 178 |
| | Major urinary protein-5 | Mup18 | 4 | MUP 5 precursor | NM_008649.2 | NP_032675.2 | 180 |
| | Major urinary protein-6 | Mup2; Gm12544; OTTMUSG00000007423 | 4 | MUP (Mup)-like precursor | NM_001081285.1 | NP_001074754.1 | 179 |
| | Major urinary protein-7 | Mup3; Gm12546; OTTMUSG00000007428 | 4 | MUP 7 precursor | NM_001134675.1 | NP_001128147.1 | 235 |
| | Major urinary protein-8 | Mup5; Gm12809; OTTMUSG00000008509 | 4 | MUP 8 precursor | NM_001134676.1 | NP_001128148.1 | 235 |
| | Major urinary protein-9 | Mup2; Mup6; Gm14076; OTTMUSG00000015595 | 4 | MUP 2-like precursor | NM_001281979.1 | NP_001268908.1 | 180 |
| | Major urinary protein-10 | Mup8; 2610016E04Rik | 4 | MUP 10 precursor | NM_001122647.1 | NP_001116119.1 | 180 |
| | Major urinary protein-11 | Gm12549; OTTMUSG00000007431 | 4 | MUP 11 precursor | NM_001164526.1 | NP_001157998.1 | 181 |
| | Major urinary protein-12 | Gm2024 | 4 | MUP 12 precursor | NM_001199995.1 | NP_001186924.1 | 180 |
| | Major urinary protein-13 | Mup11; Gm13513; OTTMUSG00000012492 | 4 | MUP 13 precursor | NM_001134674.1 | NP_001128146.1 | 235 |
| | Major urinary protein-14 | Mup12; Gm13514; OTTMUSG00000012493 | 4 | MUP 14 precursor | NM_001199999.1 | NP_001186928.1 | 180 |
| | Major urinary protein-15 | Mup13; Gm2068 | 4 | MUP 15 precursor | NM_001200004.1 | NP_001186933.1 | 180 |
| | Major urinary protein-16 | | 4 | MUP 16 precursor | NM_001199936.1 | NP_001186865.1 | 180 |
| | Major urinary protein-17 | MUP 17; Gm12557; OTTMUSG00000007480 | 4 | MUP 17 precursor | NM_001200006.1 | NP_001186935.1 | 180 |
| | Major urinary protein-18 | Mup6 | 4 | MUP 6 precursor | NM_001199333.1 | NP_001186262.1 | 181 |
| | Major urinary protein-19 | Mup8; Mup11; Mup14; Mup17; Gm12552; 100039247; OTTMUSG00000007472 | 4 | MUP 11 and 8 precursor | NM_001135127.2 | NP_001128599.1 | 180 |
| | Major urinary protein-20 | Mup24; darcin; Gm12560; OTTMUSG00000007485 | 4 | MUP 20 precursor | NM_001012323.1 | NP_001012323.1 | 181 |
| | Major urinary protein-21 | Mup; Mup26; Gm11208; bM64F17.1; bM64F17.4; OTTMUSG00000000231 | 4 | MUP 26 precursor | NM_001009550.2 | NP_001009550.1 | 181 |
| | Major urinary protein-22 | Gm21320 | 4 | MUP 2-like isoform X1 | XM_003688783.3 | XP_003688831.1 | 181 |
Fig. 3 Structure of prototypical mouse urinary protein. The crystal structure consists of eight β-strands, forming a calyx-shaped barrel (red); this encloses an internal ligand-binding site. There are also an α-helix (green) and four 3₁₀-helices (blue); the hydrophobic pocket is located inside the barrel. AB, BC, CD, DE, EF, FG, GH, HI, and βI denote the amino-acid segments between the β-strands (diagram taken from Ref. [95])
Fig. 4 Chromosomal location of mouse Mup genes and pseudogenes. (a) The Mup cluster region, located at 60,498,012 bp to 60,501,960 bp (red vertical rectangle); taken from the Ensembl genome browser. (b) The Chr 4 region in greater detail, showing ten of the 22 Mup genes (Gm21320 is Mup22) in the Mup cluster and 12 of the 29 Mup-ps pseudogenes
Fig. 5 Dendrogram of human LCNs and mouse MUPs, combined. Although the names listed are the official human gene symbols and mouse Mup gene symbols, this dendrogram was based on the alignment of proteins (listed in Tables 1 and 3) using multiple sequence alignment by CLUSTALW (http://www.genome.jp/tools/clustalw/). Note that the human LCN9 gene is evolutionarily closest to the mouse Mup cluster in this dendrogram
List of the 19 LCN-like genes in the human genome and their similarity to their mouse orthologs, expressed as percent identity (%)
| Gene | Similarity to mouse ortholog (percent identity, %) |
|---|---|
| | NA |
| | 62 |
| | 74 |
| | 70 |
| | 54 |
| | 64 |
| | 56 |
| | 39 |
| | 39 |
| | 50 |
| | 77 |
| | 74 |
| | 81 |
| | 76 |
| | 49 |
| | 44 |
| | NA |
| | 72 |
| | 86 |
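Percent identity, as reported in this table, is commonly computed from a pairwise alignment as the number of identical residues divided by the number of compared positions. The source does not state which denominator was used, so the minimal Python sketch below assumes that columns containing a gap in either sequence are excluded; the function name `percent_identity` and the toy sequences are invented for illustration and are not real lipocalin sequences.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over alignment columns where neither sequence has a gap.

    Assumption: gapped columns are skipped. Other conventions divide by the
    full alignment length or by the shorter sequence length, and those give
    different values.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap or b == gap:
            continue  # skip columns with a gap in either sequence
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Toy aligned sequences (invented): 6 ungapped columns, 5 identical.
    print(round(percent_identity("MKT-LLV", "MRTALLV"), 1))
```

With the toy input, five of the six ungapped columns match, giving roughly 83.3%. An "NA" entry in the table corresponds to a human gene for which no mouse ortholog was available to align.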