| Literature DB >> 18629345 |
Naoya Yuhki1, James C Mullikin, Thomas Beck, Robert Stephens, Stephen J O'Brien.
Abstract
Two sequences of major histocompatibility complex (MHC) regions in the domestic cat, 2.976 and 0.362 Mbps, which were separated by an ancient chromosome break (55-80 MYA) and followed by a chromosomal inversion were annotated in detail. Gene annotation of this MHC was completed and identified 183 possible coding regions, 147 human homologues, possible functional genes and 36 pseudo/unidentified genes) by GENSCAN and BLASTN, BLASTP RepeatMasker programs. The first region spans 2.976 Mbp sequence, which encodes six classical class II antigens (three DRA and three DRB antigens) lacking the functional DP, DQ regions, nine antigen processing molecules (DOA/DOB, DMA/DMB, TAPASIN, and LMP2/LMP7,TAP1/TAP2), 52 class III genes, nineteen class I genes/gene fragments (FLAI-A to FLAI-S). Three class I genes (FLAI-H, I-K, I-E) may encode functional classical class I antigens based on deduced amino acid sequence and promoter structure. The second region spans 0.362 Mbp sequence encoding no class I genes and 18 cross-species conserved genes, excluding class I, II and their functionally related/associated genes, namely framework genes, including three olfactory receptor genes. One previously identified feline endogenous retrovirus, a baboon retrovirus derived sequence (ECE1) and two new endogenous retrovirus sequences, similar to brown bat endogenous retrovirus (FERVmlu1, FERVmlu2) were found within a 140 Kbp interval in the middle of class I region. MHC SNPs were examined based on comparisons of this BAC sequence and MHC homozygous 1.9x WGS sequences and found that 11,654 SNPs in 2.84 Mbp (0.00411 SNP per bp), which is 2.4 times higher rate than average heterozygous region in the WGS (0.0017 SNP per bp genome), and slightly higher than the SNP rate observed in human MHC (0.00337 SNP per bp).Entities:
Mesh:
Substances:
Year: 2008 PMID: 18629345 PMCID: PMC2453318 DOI: 10.1371/journal.pone.0002674
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Predicted Genes, their Functions and Coordinates in FLA.
| Gene | Functional/physiological properties/other name/structure | Orientation | Start | End | Length |
|
| kinesin family member C1 | − | 1243 | 13226 | 11983 |
|
| ribomal protein S28 | + | 25869 | 27731 | 1862 |
|
| unknown: cfa chr12.6 - 029.a N-SCAN gene prediction | + | 66551 | 66793 | 242 |
|
| death-associated protein 6 | + | 74219 | 77417 | 3198 |
|
| zinc finger and BTB domain containing 22 | + | 79170 | 80865 | 1695 |
|
| TAP binding protein (tapasin) | + | 82211 | 92070 | 9859 |
|
| ral guanine nucleotide dissociation stimulator-like 2 | + | 96755 | 101963 | 5208 |
|
| WD domain | + | 105084 | 112722 | 7638 |
|
| UDP-Gal:betaGlcNAc beta 1,3-galactosyltransferase, polypeptide 4 | − | 113431 | 114583 | 1152 |
|
| ribomal protein S28 | − | 115583 | 120594 | 5011 |
|
| Yeast sac2 homolog, suppressor of actin mutant 2, Sacm2l, coiled coil structure | + | 121032 | 136177 | 15145 |
|
| ring finger protein 1 | − | 141255 | 167593 | 26338 |
|
| Steroid dehydrogenase-like protein (estradiol 17 beta-dehydrogenase 8) | − | 170695 | 172885 | 2190 |
|
| Transmembrane protein with histidine-rich charge clusters | − | 173342 | 175211 | 1869 |
|
| retinoid X receptor, beta | + | 177083 | 181628 | 4545 |
|
| collagen, type XI, alpha 2 | + | 184143 | 212170 | 28027 |
|
| unknown: cfa chr12: 5,766,607–5,773,167 | − | 216881 | 223065 | 6184 |
|
| Class II antigen beta chain, pseudogene | − | 231611 | 242652 | 11041 |
|
| Class II antigen alpha chain, pseudogene | + | 245150 | 268064 | 22914 |
|
| DOA, heterodimerize with DOB in pre-B cells, peptide loading for class II antigen at low PH | + | 278617 | 289676 | 11059 |
|
| bromodomain containing 2 | − | 303996 | 327744 | 23748 |
|
| Nonclassical class II antigen, alpha chain, peptide loading for class II antigen | + | 329474 | 332504 | 3030 |
|
| Nonclassical class II antigen, beta chain, heterodimer with DMA,peptide loading for class II antigen | + | 342380 | 351767 | 9387 |
|
| Proteosome subunit to cleave peptides for class I antigen | − | 416025 | 419082 | 3057 |
|
| transporter 1, ATP-binding cassette, sub-family B (MDR/TAP) | + | 421610 | 428032 | 6422 |
|
| Proteosome subunit to cleave peptides for class I antigen | + | 431491 | 433586 | 2095 |
|
| transporter 2, ATP-binding cassette, sub-family B (MDR/TAP) | + | 435278 | 446029 | 10751 |
|
| Nonclassical class II antigen, beta chain, heterodimer with DNA, H2-IAB2 in mouse | + | 456252 | 465656 | 9404 |
|
| Class II antigen, beta chain | − | 509012 | 514820 | 5808 |
|
| Glycerol aldehyde phosphodehydrase, pseudogene | − | 563173 | 575620 | 12447 |
|
| Class II antigen, beta chain | + | 596698 | 607755 | 11057 |
|
| Class II antigen, alpha chain | − | 621375 | 624059 | 2684 |
|
| ribomal protein S28 gene fragment | + | 631565 | 633427 | 1862 |
|
| Class II antigen, beta chain pseudogene with intron 1 and exon 2 | + | 652000 | 659950 | 7950 |
|
| Class II antigen, beta chain | + | 664799 | 679467 | 14668 |
|
| Class II antigen, alpha chain | − | 688714 | 691459 | 2745 |
|
| Class II antigen, alpha chain | − | 722112 | 725240 | 3128 |
|
| butyrophilin-like 2 (MHC class II associated) | + | 735556 | 754731 | 19175 |
|
| butyrophilin-like 2 (MHC class II associated) | − | 765965 | 777458 | 11493 |
|
| ribomal protein S28 | + | 782864 | 783037 | 173 |
|
| heterogeneous nuclear ribonucleoprotein A2/B1 | − | 829773 | 830860 | 1087 |
|
| ribomal protein S28 | + | 984596 | 984769 | 173 |
|
| Notch homolog 4 (Drosophila) | + | 1206316 | 1229484 | 23168 |
|
| G-protein signalling modulator 3 (AGS3-like, C. elegans) | − | 1230443 | 1231562 | 1119 |
|
| pre-B-cell leukemia transcription factor 2 | + | 1234538 | 1236556 | 2018 |
|
| advanced glycosylation end product-specific receptor | + | 1238751 | 1241285 | 2534 |
|
| 1-acylglycerol-3-phosphate O-acyltransferase 1 (lysophosphatidic acid acyltransferase, alpha) | + | 1250819 | 1252976 | 2157 |
|
| EGF-like-domain, multiple 8 | − | 1254290 | 1255788 | 1498 |
|
| palmitoyl-protein thioesterase 2 | − | 1258652 | 1263918 | 5266 |
|
| proline-rich transmembrane protein 1 | + | 1267796 | 1269170 | 1374 |
|
| FK506 binding protein like | + | 1284306 | 1285355 | 1049 |
|
| cAMP responsive element binding protein-like 1 | + | 1285915 | 1300603 | 14688 |
|
| tenascin XB | + | 1310433 | 1357909 | 47476 |
|
| cytochrome P450, family 21, subfamily A, polypeptide 2 | − | 1358128 | 1360631 | 2503 |
|
| complement component 4A (Rodgers blood group) | − | 1363648 | 1378177 | 14529 |
|
| serine/threonine kinase 19 | − | 1379597 | 1384827 | 5230 |
|
| dom-3 homolog Z (C. elegans) | + | 1385598 | 1387305 | 1707 |
|
| superkiller viralicidic activity 2-like (S. cerevisiae) | − | 1387465 | 1394012 | 6547 |
|
| complement factor B | − | 1394591 | 1402014 | 7423 |
|
| complement component 2 | − | 1402258 | 1415710 | 13452 |
|
| zinc finger and BTB domain containing 12 | + | 1448172 | 1449551 | 1379 |
|
| euchromatic histone-lysine N-methyltransferase 2 | + | 1454322 | 1466627 | 12305 |
|
| solute carrier family 44, member 4 | + | 1467409 | 1479530 | 12121 |
|
| Neuraminidase 4 | + | 1480105 | 1483273 | 3168 |
|
| ribomal protein S28 | + | 1488926 | 1489106 | 180 |
|
| heat shock 70 kDa protein 1A | − | 1513315 | 1515240 | 1925 |
|
| heat shock 70 kDa protein 1A | − | 1524090 | 1526015 | 1925 |
|
| heat shock 70 kDa protein 1A | + | 1528138 | 1529946 | 1808 |
|
| LSM2 homolog, U6 small nuclear RNA associated (S. cerevisiae) | + | 1532633 | 1537716 | 5083 |
|
| valyl-tRNA synthetase | + | 1539067 | 1553500 | 14433 |
|
| chromosome 6 open reading frame 27 | + | 1555291 | 1563506 | 8215 |
|
| chromosome 6 open reading frame 26 | − | 1564291 | 1565750 | 1459 |
|
| mutS homolog 6 (E. coli) | − | 1566395 | 1590460 | 24065 |
|
| chloride intracellular channel 1 | + | 1594157 | 1598924 | 4767 |
|
| dimethylarginine dimethylaminohydrolase 2 | + | 1600599 | 1602528 | 1929 |
|
| chromosome 6 open reading frame 25 | − | 1604591 | 1606252 | 1661 |
|
| lymphocyte antigen 6 complex, locus G6C | + | 1607799 | 1610137 | 2338 |
|
| lymphocyte antigen 6 complex, locus G6D | − | 1611498 | 1613549 | 2051 |
|
| lymphocyte antigen 6 complex, locus G6E | + | 1615087 | 1616230 | 1143 |
|
| chromosome 6 open reading frame 21 | − | 1618100 | 1620941 | 2841 |
|
| HLA-B associated transcript 5 | + | 1623128 | 1636811 | 13683 |
|
| lymphocyte antigen 6 complex, locus G5C | + | 1642789 | 1645897 | 3108 |
|
| lymphocyte antigen 6 complex, locus G5B | − | 1649307 | 1650335 | 1028 |
|
| casein kinase 2, beta polypeptide | − | 1651325 | 1654509 | 3184 |
|
| HLA-B associated transcript 4 | + | 1656917 | 1658511 | 1594 |
|
| chromosome 6 open reading frame 47 | + | 1660571 | 1661426 | 855 |
|
| lymphotoxin beta (TNF superfamily, member 3) | + | 1663590 | 1665305 | 1715 |
|
| tumor necrosis factor (TNF superfamily, member 2) | − | 1668300 | 1670066 | 1766 |
|
| lymphotoxin alpha (TNF superfamily, member 1) | − | 1672461 | 1673412 | 951 |
|
| nuclear factor of kappa light polypeptide gene enhancer in B-cells inhibitor-like 1 | − | 1685899 | 1694249 | 8350 |
|
| ATPase, H+ transporting, lysosomal 13 kDa, V1 subunit G2 | + | 1695942 | 1697113 | 1171 |
|
| HLA-B associated transcript 1 | + | 1699863 | 1710709 | 10846 |
|
| mitochondrial coiled-coil domain 1 | − | 1711244 | 1714362 | 3118 |
|
| unknown: cfa chr12: 4,026,686–4,028,708 | − | 1716465 | 1722099 | 5634 |
|
| similar to peptidase (prosome, macropain) 26S subunit, ATPase 1 | − | 1722382 | 1723468 | 1086 |
|
| non classical class I molecule | + | 1727339 | 1733904 | 6565 |
|
| ribomal protein S28 | + | 1741091 | 1741264 | 173 |
|
| Classical class I antigen gene fragment | + | 1754279 | 1755539 | 1260 |
|
| non classical class I molecule | − | 1778815 | 1781240 | 2425 |
|
| ribomal protein S28 | + | 1791524 | 1791697 | 173 |
|
| MHC class I releated gene 1 | − | 1802050 | 1803673 | 1623 |
|
| Classical class I antigen gene fragment | − | 1812028 | 1826561 | 14533 |
|
| unknown | − | 1816059 | 1819441 | 3382 |
|
| unknown: cfa chr12: 3,304,294–3,451,219 | + | 1822307 | 1852375 | 30068 |
|
| BAT1 fragement | − | 1854809 | 1855450 | 641 |
|
| Classical class I antigen ( with long cytoplasmic tail) | + | 1859534 | 1862925 | 3391 |
|
| unknown | − | 1865368 | 1878940 | 13572 |
|
| MHC class I releated gene 2 fragment | + | 1880418 | 1880645 | 227 |
|
| ribomal protein S28 | + | 1892390 | 1892515 | 125 |
|
| non classical class I molecule | + | 1918438 | 1921808 | 3370 |
|
| Classical class I antigen gene fragment | − | 1938101 | 1940148 | 2047 |
|
| MHC class I releated gene 3 | − | 1962637 | 1964257 | 1620 |
|
| Classical class I antigen | − | 1973577 | 1976991 | 3414 |
|
| BAT1 fragement | + | 1979546 | 1980192 | 646 |
|
| Classical class I antigen gene fragment | + | 2003006 | 2003225 | 219 |
|
| non classical class I molecule | − | 2011852 | 2015260 | 3408 |
|
| BAT1 fragement | + | 2017763 | 2018402 | 639 |
|
| BAT1 fragement | − | 2059127 | 2059750 | 623 |
|
| Classical class I antigen | − | 2083736 | 2087122 | 3386 |
|
| BAT1 fragement | + | 2089903 | 2090242 | 339 |
|
| endogenous retrovirus similar to brown bat (Motis Lucifugus) endogenous retrovirus 2 | − | 2103256 | 2105890 | 2634 |
|
| MHC class I releated gene 4 fragment | − | 2117309 | 2117536 | 227 |
|
| unknown: cfa chr10: 6,669,201–6,794,639 | − | 2117453 | 2120737 | 3284 |
|
| non classical class I molecule | − | 2145008 | 2148428 | 3420 |
|
| BAT1 fragement | + | 2150915 | 2151554 | 639 |
|
| non classical class I molecule | − | 2183520 | 2187094 | 3574 |
|
| BAT1 fragement | − | 2205884 | 2206254 | 370 |
|
| baboon retrovius related endogenous retrovirus | + | 2212532 | 2215463 | 2931 |
|
| endogenous retrovirus similar to brown bat (Motis Lucifugus) endogenous retrovirus 1 | + | 2219742 | 2244701 | 24959 |
|
| Classical class I antigen gene fragment | + | 2221196 | 2221264 | 68 |
|
| BAT1 fragement | − | 2258148 | 2258782 | 634 |
|
| non classical class I molecule | + | 2260882 | 2264315 | 3433 |
|
| BAT1 fragement | + | 2293369 | 2294534 | 1165 |
|
| Classical class I antigen gene fragment | − | 2301870 | 2302131 | 261 |
|
| non classical class I molecule | + | 2329070 | 2332537 | 3467 |
|
| POU domain, class 5, transcription factor 1, OCT3 | + | 2354289 | 2362844 | 8555 |
|
| TCF19, transcription factor 19 | − | 2364737 | 2367161 | 2424 |
|
| coiled-coil alpha-helical rod protein 1 | + | 2370568 | 2382864 | 12296 |
|
| corneodesmosin | + | 2401577 | 2410158 | 8581 |
|
| unknown: chr12: 3,698,941–3,701,819 | + | 2446230 | 2449015 | 2785 |
|
| valyl-tRNA synthetase like | − | 2449807 | 2461727 | 11920 |
|
| general transcription factor IIH, polypeptide 4, 52 kDa | − | 2461975 | 2465896 | 3921 |
|
| discoidin domain receptor family, member 1 | + | 2466245 | 2488040 | 21795 |
|
| TAX response lement-binding protein | + | 2527942 | 2547435 | 19493 |
|
| immediate early response 3 | + | 2607659 | 2607996 | 337 |
|
| flotillin 1 | + | 2610071 | 2622235 | 12164 |
|
| tubulin, beta | − | 2625114 | 2628405 | 3291 |
|
| mediator of DNA damage checkpoint 1 | + | 2630309 | 2644393 | 14084 |
|
| nurim (nuclear envelope membrane protein) | + | 2648569 | 2653735 | 5166 |
|
| KIAA1949 | + | 2653850 | 2660634 | 6784 |
|
| DEAH (Asp-Glu-Ala-His) box polypeptide 16 | + | 2663783 | 2678921 | 15138 |
|
| chromosome 6 open reading frame 136 | − | 2679243 | 2683432 | 4189 |
|
| Drosophila melanogaster protein Cg3967-pc homolog | − | 2683765 | 2697339 | 13574 |
|
| mitochondrial ribosomal protein S18B | − | 2698088 | 2703714 | 5626 |
|
| protein phosphatase 1, regulatory subunit 10 | + | 2711308 | 2722165 | 10857 |
|
| ATP-binding cassette, sub-family F (GCN20), member 1 | − | 2724683 | 2735486 | 10803 |
|
| proline rich 3 | − | 2743092 | 2747113 | 4021 |
|
| guanine nucleotide binding protein-like 1 | + | 2748233 | 2755021 | 6788 |
|
| eukaryotic translation elongation factor 1 alpha 1 | − | 2779084 | 2786223 | 7139 |
|
| ribomal protein S28 | + | 2800184 | 2807333 | 7149 |
|
| poly(A) binding protein, cytoplasmic 4 (inducible form) | + | 2818343 | 2818789 | 446 |
|
| poly(A) binding protein, cytoplasmic 4 (inducible form) | + | 2820533 | 2821355 | 822 |
|
| unknown: PLEC1, PLECTIN1 | − | 2825427 | 2840447 | 15020 |
|
| unknown: SLC12AL Intron, sodium potassium chloride cotransporter2 | + | 2842351 | 2843559 | 1208 |
|
| ribomal protein S28 | + | 2850617 | 2852479 | 1862 |
|
| Ribonuclease | + | 2867225 | 2867494 | 269 |
|
| tripartite motif-containing 39 | − | 2870168 | 2879926 | 9758 |
|
| Classical class I antigen gene fragment | + | 2905161 | 2905383 | 222 |
|
| non classical class I molecule | − | 2910524 | 2913237 | 2713 |
|
| Patatin-like phospholipase domain containing 6 | − | 2956238 | 2961356 | 5118 |
| PeriCentromic Region and chromosomal break and inversion | |||||
| Subtelomeric Region | |||||
| X11 | unknown: cfa chr6: 27,705,326–27,717,672 | − | 7700 | 29900 | 22200 |
| TRIM26 | tripartite motif-containing 26 protein | + | 102400 | 111040 | 8640 |
| TRIM15 | tripartite motif-containing 26 protein | − | 125740 | 132100 | 6360 |
| TRIM10 | tripartite motif-containing 26 protein | + | 134820 | 140560 | 5740 |
| X12 | unknown: cfa chr35: 29,351,535–29,362,511 | − | 142140 | 158060 | 15920 |
| PPP1R11 | protein phosphatase 1, regulatory (inhibitor) subunit 11 | − | 189360 | 193120 | 3760 |
| MOG | myelin oligodendrocyte glycoprotein | − | 219300 | 228580 | 9280 |
| GABBR1 | gamma-aminobutyric acid (GABA) B receptor 1 | + | 248040 | 268920 | 20880 |
| OLFR1 | Olfactory receptor | − | 282860 | 289000 | 6140 |
| UBD | ubiquitin | + | 299600 | 301440 | 1840 |
| X13 | unknown | − | 303120 | 303740 | 620 |
| OLFR2 | Olfactory receptor | − | 316640 | 317400 | 760 |
| OLFR3 | Olfactory receptor | + | 330520 | 352180 | 21660 |
Figure 1Gene Organization, SNP level, GC contents in FLA.
(A) Gene organization of FLA. Genes with forward orientation, which towards to telomere in human HLA, but towards to centromere in FLA, and away from telomere in distal class I region in FLA were placed above the solid line. A position of the ancient chromosome break and an inversion was indicated by double slashed lines and genes with opposite orientation were placed below the solid line. (B) Coding (CDS) SNPs. CDS SNPs were counted based on exon structure of each gene. Pseudogenes CDS SNPs were omitted. No. of SNPs per 10 kbp were plotted. (C) Single nucleotide polymorphism (SNP). SNP was counted in 10 Kbp window and shift 1 Kb. (D) GC content. GC content was counted in 10 Kbp window and shift 1 Kb and number was plotted.
Figure 2Percent Identity Plots between FLA and HLA.
(A) Percent identity Plot of FLA extend class II, classical class II, class III, proximal and central class I regions. Genes and exons were highlighted with yellow and green colors, respectively. Gene, exon, UTR, simple repeats, MIR, other SINE, LINE1, LINE2, LTR, other repeats, CpG/GpC ratios were indicated. FLA sequence was compared with human HLA 6COX sequence. (B) Percent Identity Plot of FLA distal class I region. Same methods and criterions were used as in Figure 2(A). Only Figure 2A-1 was included in this text. The rest of Figure 2A and 2B can be found in Supporting Information File, Figure 2AB_All.tar.bzip2.
Interspersed Elements in FLA subregions.
| extended class II | classical class II | class III | proximal class I | central class I | distal class I |
| cat genome |
| |
| SINES: | 14.86 | 10.19 | 11.16 | 8.04 | 10.93 | 8.82 | 8.53 | 11.2 | 17.59 |
| MIRs | 2.29 | 1.69 | 2.46 | 0.44 | 2.07 | 1.56 | 1.05 | 3.10 | 16.06 |
| LINES: | 12.57 | 34.82 | 10.84 | 32.07 | 23.41 | 21.02 | 21.31 | 14.26 | 16.59 |
| LINE 1 | 8.61 | 32.88 | 6.86 | 29.13 | 19.76 | 19.39 | 18.63 | 10.79 | 13.35 |
| LINE 2 | 3.87 | 1.79 | 3.58 | 2.88 | 3.18 | 1.44 | 2.54 | 2.82 | 3.09 |
| L3/CR1 | 0.07 | 0.14 | 0.40 | 0.05 | 0.36 | 0.13 | 0.14 | 0.36 | 0.16 |
| LTR elements: | 2.84 | 4.49 | 1.39 | 5.17 | 4.51 | 4.93 | 2.69 | 4.44 | 10.55 |
| MaLRs | 1.90 | 0.79 | 0.82 | 1.51 | 1.57 | 0.91 | 1.04 | 2.14 | 2.61 |
| ERVL | 0.34 | 0.88 | 0.27 | 1.79 | 0.57 | 1.59 | 0.81 | 1.21 | 2.11 |
| ERV classI | 0.60 | 2.74 | 0.29 | 1.87 | 2.32 | 2.43 | 0.81 | 1.05 | 4.25 |
| DNA elements: | 5.23 | 1.56 | 1.63 | 2.56 | 1.72 | 1.43 | 1.62 | 2.19 | 2.64 |
| MER1_type | 2.30 | 1.16 | 0.97 | 1.73 | 1.07 | 0.81 | 1.31 | 1.26 | 1.52 |
| MER2_type | 1.46 | 0.17 | 0.26 | 0.83 | 0.33 | 0.23 | 0.14 | 0.39 | 0.88 |
| Total of Interspersed | 35.48 | 51.07 | 25.07 | 47.84 | 40.57 | 36.19 | 34.14 | 32.1 | 48.14 |
Percentage of sequence (%) was shown in each subregion.
Simple Tandem Repeats (STRs) in FLA.
|
| ||||||||||||||
| Extended class II | Classical class II | Class III | Proximal class I | Central class I | Distal class I | Total | ||||||||
| DI | 25 | 9.26 | 118 | 8.26 | 30 | 17.36 | 47 | 13.34 | 52 | 11.95 | 26 | 13.91 | 298 | 11.20 |
| TRI | 11 | 21.05 | 19 | 51.30 | 13 | 40.07 | 10 | 62.70 | 13 | 47.79 | 10 | 36.15 | 76 | 43.91 |
| TETRA | 9 | 25.73 | 37 | 26.34 | 13 | 40.07 | 39 | 16.08 | 23 | 27.01 | 17 | 21.27 | 138 | 24.18 |
| PENTA | 2 | 115.79 | 7 | 139.24 | 8 | 65.11 | 7 | 89.58 | 4 | 155.31 | 1 | 361.55 | 29 | 115.07 |
| TOTAL | 47 | 4.93 | 181 | 5.39 | 64 | 8.14 | 103 | 6.09 | 92 | 6.75 | 54 | 6.70 | 541 | 6.17 |
| size (bp) | 231580 | 974699 | 520919 | 627039 | 621255 | 361545 | 3337061 | |||||||
No. of more than 12 perfect repeats or 5 repeats were counted for dinucleotide (DI) and other STRs (TRI, TETRA, PENTA), respectively.
Average interval (kbp) of occurrence of STR was shown.
Figure 3Dotplot analyses.
Two FLA sequences were connected based on HLA organization and oriented as follows; Telomeric side of FcaB2q→B2qcen→B2pter→B2p. The centromeric side of two DLA sequences, one on cfa12qcen and the other on cfa35qter were also connected based on HLA organization as follows; telomeric side of cfa12q→cfa12qcen→cfa35qter→cfa35q. (A) Dotplot analysis between DLA (KIFC1 to the third olfactory receptor genes from MOG) and HLA 6COX sequences (X axis vs. Y axis). (B) Dotplot analysis between FLA (KIFC1 to the third olfactory receptor genes from MOG) and HLA 6 COX sequences (X axis vs. Y axis). (C) Dot plot analysis between FLA (X axis) and DLA (Y axis).
Figure 4Neighbor-Joining Tree with 1,000 bootstrap for domestic cat endogeneous retrovirus sequences.
ECE1 represent RD114 endogeneous retrovirus transmitted from baboon, enFeLV represent a full length FeIV endogeneous retrovirus. enRVMlu represent brown bat retrovirus sequence and FERVmlu1, FERVmlu2 represent new endogeneous genes found in the proximal class I region of FLA in this study.
Figure 5Single Nucleotide Polymorphism (SNP) plot on cat chromosome B2 coordinates.
Number of SNPs were counted based on whole genome shotgun sequences and the number of SNPs per 10 Kbp were plotted. A solid line represents average SNP rate (per 10 Kbp) in heterozygous regions of a female Abyssinian cat genome. Areas of FLA were indicated as brackets.
Characterization of 19 Class I Genes in FLA.
| Methods Applied | |||||||
| Gene | Dotplot | Coding Prediction | Sequence | 31 conserved | Assignment |
| |
| I | II | ||||||
|
| + | − | − | nonclassical | − | ||
|
| − | gene fragment | − | ||||
|
| + | − | − | nonclassical | − | ||
|
| − | gene fragment | − | ||||
|
| + | − | ++ | ++ (All) | classical | + | |
|
| + | − | ++ | − (−5) | nonclassical | − | |
|
| − | gene fragment | − | ||||
|
| + | + | + | + | + (−1) | classical | + |
|
| − | gene fragment | − | ||||
|
| + | + | + | − (−3) | nonclassical | + | |
|
| + | + | ++ | + | ++ (All) | classical | + |
|
| + | − | + | − (−5) | nonclassical | + | |
|
| + | − | + | − (−6) | nonclassical | + | |
|
| − | gene fragment | − | ||||
|
| + | + | − | − (−5) | nonclassical | + | |
|
| − | gene fragment | + | ||||
|
| + | − | − | nonclassical | − | ||
|
| − | gene fragment | − | ||||
|
| + | − | − | nonclassical | − | ||
PIPmaker dotplot ( ) was used. + and − represent full-length and partial length, respectively compared with full length FLAIA24 cDNA.
GENSCAN was used to predict coding region for only full-length class I genes. + and − symbols represent right and wrong prediction of exon and intron boundaries in each gene.
Spidey was used to examine sequence alignment of genomic cDNA class I sequences and splicing donor/acceptor sites. ++, +, and − symbols represent typical class I exon/intron structures reported in human class I genes with all correct splicing donor/acceptor sites, with one or two missing splicing donor/acceptor sites, and atypical exon/intron structures, respectively.
Class I cDNA sequences from MHC homozygous feline fibroblast cells were compared with all class I genomic sequences by Megablast Search ( ). + symbol represents >99% sequence identity.
Thirty-one highly conserved amino acid residues found in α1 and α2 domains of human and cat class I antigens were examined. ++, +, − numbers represent all 31 conserved residues, one substitution and more than one substitutions, respectively.
Assignment of classical/nonclassical/gene fragment class I genes based on this study.
Symbols + and − represent presence and absence of BAT1gene fragment in vicinity of class I gene.
Figure 6Haplotype analysis of the domestic cat MHC class II DR region.
(A) Gene organization of the domestic cat MHC class II DR region based on the nucleotide sequence of a composite haplotype as previously reported in Yuhki et al. [14]. The location of eight DR genes is shown with the transcriptional orientation indicated by arrows. (B) Analysis of the haplotype structure of Fca273 (used to make the BAC library) based on mapping of gene content of individual BAC clones by hybridization and sequence-based typing of exon 2 of BAC clones. DRB alleles were identified based on comparison to 71 domestic cat DRB exon 2 sequences spanning 233 bp (after removal of primer sequences) present in the NCBI nucleotide database. (C) Deduced amino acid sequences from two haplotypes were aligned in each loci and different residues in each loci were depicted in red. Antigen recognition sites were shown as asterisks below the sequence alignment.
Figure 7Dotplot analysis of two DR haplotype sequences from a single BAC library.
Figure 8Transcriptional factor binding site prediction.
A total of 6 kb sequence spanning 1 kb downstream and 5 kb upstream from translation start site (ATG) were analyzed for (A) DRA genes, (B) DRB genes, and (C) Classical class I genes. The S-X/X2-Y module and enh.A-ISRE-W/S-X1-X2-Y.enh.B were depicted as a red box. HLA-DRB1, FLA- DRB4 modules were located at 52 Kb, 7 Kb upstream from ATG site, respectively. Forward and reverse orientation of TF binding sites were depicted above and below lines respectively: (D) enh.A-ISRE-W/S-X1-X2-Y/enh.B module sequences found in FLA-E, -H, -K and HLA-A, -B, -C genes and S-X1-X2-T module sequences found in FLA-DRA1, -DRA2, -DRA3, -DRB1, -DRB3 and HLA-DRA, -DRB1, were aligned and each promoter/enhancer cis-motifs were boxed. Coordinates of FLA were based on 6 Kb sequence described above.
Single Nucleotide Polymorphism (SNP)s.
|
|
| |
| Size (Mbp) compared | 2.84 Mbp | 4.75 Mbp |
| No. of SNPs | 11,654 | 16,013 |
| SNP rate (per bp) | 0.00411 | 0.00337 |
| No. of CDS SNPs | 732 | 341 |
| Class I & II genes S/N | 48/145 | 48/68 |
Nonsynonymous and Synonymous Coding SNPs in FLA class I and II genes.
| Class |
| No.of Synonymous and Nonsynonymous SNPs (S/N) |
| II |
| 8/25 |
| II |
| 1/2 |
| II |
| 5/13 |
| II |
| 2/0 |
| II |
| 0/0 |
| II |
| 0/0 |
| I |
| 1/0 |
| I |
| 2/1 |
| I |
| 0/0 |
| I |
| 4/40 |
| I |
| 15/50 |
| I |
| 6/7 |
| I |
| 0/1 |
| I |
| 1/1 |
| I |
| 0/1 |
| I |
| 3/4 |
Figure 9Comparisons of MHC genomic structures in cat, human, mouse, dog, and opossum.
Framework genes in class II, III, I regions were shown as green, blue, red boxes, respectively. Forward and reverse orientations of each gene were shown above and below line, respectively. Classical class II antigen coding genes/gene fragments were shown in orange and classical and non-classical class I genes were shown in black.