| Literature DB >> 15588499 |
Daniel W Nebert1, Nickolas A Sophos, Vasilis Vasiliou, David R Nelson.
Abstract
Why is agreement on one particular name for each gene important? As one genome after another becomes sequenced, it is imperative to consider the complexity of genes, genetic architecture, gene expression, gene-gene and gene-product interactions and evolutionary relatedness across species. To agree on a particular gene name not only makes one's own research easier, it aids automated text-mining algorithms and search engines, which are increasingly employed to find relationships in the millions of abstracts in the medical research literature and sequence databases. A common nomenclature system will also be helpful to the present generation, as well as future generations, of graduate students and postdoctoral fellows who are about to enter genomics research. In this paper, the authors present some problems that arose when two separate research communities decided to choose the same root, CYP, for naming their gene families. They then offer a logical solution, by renaming the cyclophilin genes with a common root, such as cyn- in Caenorhabditis and CYN- in mammals (Cyn in mouse), and using evolutionary divergence to cluster genes of the highest level of relatedness.Entities:
Mesh:
Substances:
Year: 2004 PMID: 15588499 PMCID: PMC3525097 DOI: 10.1186/1479-7364-1-5-381
Source DB: PubMed Journal: Hum Genomics ISSN: 1473-9542 Impact factor: 4.639
List of cyclophilin and P450 genes in C. elegans.
| Cyclophilin genes | Alternative name | WormPep accession # | |
|---|---|---|---|
| CGC approved | CE22213 | ||
| CGC approved | CE16730 | ||
| CGC approved | CE20374 | ||
| mog-6 | CGC approved | CE01596 | |
| CGC approved | CE17730 | ||
| CGC approved | CE01301 | ||
| CGC approved | CE20371 | ||
| D1009.2b | CGC approved | CE04286 | |
| CGC approved | CE03745 | ||
| B0252.4b | CGC approved | CE02420 | |
| CGC approved | CE03588 | ||
| CGC approved | CE17506 | ||
| CGC approved | CE24152 | ||
| F39H2.2b | CGC approved | CE32410 | |
| CGC approved | CE24686 | ||
| CGC approved | CE19042 | ||
| CGC approved | CE28157 | ||
| CGC approved | CE01655 | ||
| CGC approved | CE09262 | ||
| C01F6.3 | CGC approved | unavailable | |
| cyp-44 | CGC approved | CE25682 |
Data taken from Jonathan Hodgkin, CGC Genetic Map and Nomenclature Curator (Caenorhabditis Genetics Center), Genetics Unit, Department of Biochemistry, University of Oxford, South Parks Road, Oxford OX1 3QU, UK.
List of putatively functional human cyclophilin genes.
| Approved gene symbol | Approved gene name | Chromosomal location |
|---|---|---|
| Peptidylprolyl isomerase A (cyclophilin A) | 7p13-p11.2 | |
| Peptidylprolyl isomerase A (cyclophilin A)-like-3 | 21 | |
| Peptidylprolyl isomerase B (cyclophilin B) | 15 | |
| Peptidylprolyl isomerase C (cyclophilin C) | [reserved] | |
| Peptidylprolyl isomerase D (cyclophilin D) | 4 | |
| Peptidylprolyl isomerase E (cyclophilin E) | 1p32 | |
| Peptidylprolyl isomerase F (cyclophilin F) | 10q22-q23 | |
| Peptidylprolyl isomerase G (cyclophilin G) | 2q31.1 | |
| Peptidylprolyl isomerase H (cyclophilin H) | 1p34.1 | |
| Peptidylprolyl isomerase (cyclophilin)-like 1 | 6p21.1 | |
| Peptidylprolyl isomerase (cyclophilin)-like 2 | 22 | |
| Peptidylprolyl isomerase (cyclophilin)-like 3 | 2 | |
| Peptidylprolyl isomerase (cyclophilin)-like 4 | 6q24-25 | |
| Peptidylprolyl isomerase (cyclophilin)-like 5 | 14q21.3 | |
| Peptidylprolyl isomerase (cyclophilin)-like 6 | 6q21 |
Not included here are PPIAL, PPIAL2, PPIAP, PPI AP2, PPIAP3, PPIAP4, PPIAP5, PPIAP6, PPIHP1, PPIHP2, PPIL1P1, PPIP1, PPIP2, PPIP3, PPIP4, PPIP5, PPIP6, PPIP7, PPIP8, PPIP9, PPIP10 and PPIP11, which represent the 22 cyclophilin pseudogenes in the Human Genome Project (HGP) database.
Figure 1Sequence alignment of the conserved regions of 17 .
Figure 2UPGMA tree and possible family and subfamily divisions of the . The root of cyn-, is acceptable because it has not yet been used by any other gene database, except 'cyclone' in mouse. Families are designated by Arabic numerals and represent amino acid identity of 40 per cent or greater. Subfamilies are designated by letters and represent amino acid identity of 54 per cent or greater. Individual genes within subfamilies are then given Arabic numbers. Identifiers are the WormPep accession numbers.
Figure 3Rectangular cladogram of the 15 human cyclophilin proteins aligned. If one were to drop vertical lines, as in Figure 2, one might name: PPIL1 through to PPIL6 as CYNIA1 through to CYNIA4, and CYNIB1 and CYNIB2, respectively; PPIB and PPIC as CYN2A1 and CYN2A2, respectively; PPIE, PPIF, PPIA and PPIAL3 as CYN2C1, CYN2C2, CYN2C3 and CYN2C4, respectively; PPIG as CYN3; and so on.