Wei-Hua Chen, Xue-Xia Wang, Wei Lin, Xiao-Wei He, Zhen-Qiang Wu, Ying Lin, Song-Nian Hu, Xiao-Ning Wang.
Abstract
BACKGROUND: The cynomolgus monkey (Macaca fascicularis) is one of the most widely used surrogate animal models for a growing number of human diseases and vaccines, especially those related to the immune system. To better understand the gene expression background underlying its immunogenetics, we constructed a cDNA library from Epstein-Barr virus (EBV)-transformed B lymphocytes of a cynomolgus monkey and sequenced 10,000 randomly picked clones.
Year: 2006 PMID: 16618371 PMCID: PMC1522023 DOI: 10.1186/1471-2164-7-82
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1. Statistics of ESTs obtained from the cynomolgus monkey cDNA library: length distributions of the initial ESTs (A) and of the putative ORFs of unigenes after assembly (B).
Figure 2. Comparison of gene-oriented clustering (BLAST against human mRNA) and non-gene-oriented clustering (direct phrap). The two methods produced similar numbers of EST clusters, and the distributions of the two sets of clusters were also similar.
The most abundantly expressed unigenes containing at least 15 ESTs and their annotations in the cDNA library of the cynomolgus monkey B lymphocytes.
| Unigene ID | Blastx NCBI nr: E-value | Blastx NCBI nr: Annotation | Blastn NCBI nt: E-value | Blastn NCBI nt: Annotation | Clustered ESTs |
| Contig998 | 1.00E-119 | HLA-DR-gamma [Pan troglodytes] | 0 | Pongo pygmaeus mRNA; cDNA DKFZp469K1522 | 73 |
| Contig1006 | 1.00E-103 | Unknown (protein for MGC:22645) [Homo sapiens] | 0 | Homo sapiens cDNA clone MGC:22645 IMAGE:4700961, complete cds | 61 |
| Contig1003 | 0 | glyceraldehyde-3-phosphate dehydrogenase [Homo sapiens] | 0 | Homo sapiens cDNA clone MGC:20338 IMAGE:4541305, complete cds | 52 |
| Contig997 | 9.00E-93 | Chain C, Antibody Gnc92h2 Bound To Ligand | 0 | Macaca fascicularis CDR1, CDR2, CDR3 mRNA | 36 |
| Contig988 | 0 | cytoskeletal beta actin [Sus scrofa] | 0 | Pan troglodytes actb mRNA for beta-actin, complete cds | 30 |
| Contig989 | 2.00E-25 | TMSB4L [Homo sapiens] | 0 | Homo sapiens thymosin-like 3 (TMSL3), mRNA | 30 |
| Contig984 | 0 | elongation factor 1 alpha [Canis familiaris] | 0 | Pan troglodytes chromosome 7 clone RP43-5L2, complete sequence | 24 |
| Contig978 | 5.00E-99 | peptidylprolyl isomerase A isoform 1 [Pan troglodytes] | 0 | Pan troglodytes similar to peptidylprolyl isomerase A isoform 1 | 21 |
| Contig975 | 1.00E-61 | macrophage migration inhibitory factor [Macaca mulatta] | 0 | Homo sapiens macrophage migration inhibitory factor, complete cds | 19 |
| Contig971 | 1.00E-180 | cytosolic thyroid hormone-binding protein (EC 2.7.1.40) | 0 | Homo sapiens pyruvate kinase, muscle, mRNA | 18 |
| Contig966 | 4.00E-48 | Very hypothetical protein | 0 | Homo sapiens cDNA clone IMAGE:4566256 | 16 |
| Contig961 | 7.00E-38 | Alu subfamily SB sequence contamination warning entry | 1.00E-174 | Human DNA sequence from clone RP11-535C21 on chromosome 9 | 15 |
| Contig964 | 1.00E-144 | hypothetical protein [Homo sapiens] | 0 | Homo sapiens enolase 1, (alpha), mRNA, partial cds | 15 |
The mapping results of the 75 unigenes to the human genome.
| Length | Matched length | Query Start | Query End | Chr # | Strand | # of Blocks | Aligned Position b | GenBank Acc a |
| 708 | 632 | 10 | 705 | 21 | + | 4 | INTERGENIC | |
| 646 | 574 | 23 | 646 | 16 | + | 6 | INTERGENIC | |
| 647 | 561 | 7 | 647 | 11 | - | 7 | INTERGENIC | |
| 590 | 552 | 1 | 590 | 14 | + | 4 | INTERGENIC | |
| 600 | 549 | 13 | 600 | 12 | + | 4 | INTERGENIC | |
| 589 | 549 | 6 | 589 | 13 | - | 3 | INTERGENIC | |
| 595 | 548 | 8 | 595 | 7 | - | 6 | INTERGENIC | |
| 592 | 506 | 4 | 588 | 7 | + | 4 | INTERGENIC | |
| 559 | 503 | 20 | 559 | 7 | + | 5 | INTERGENIC | |
| 558 | 479 | 33 | 554 | 11 | - | 7 | INTERGENIC | |
| 506 | 477 | 4 | 506 | 8 | + | 3 | INTERGENIC | |
| 554 | 471 | 39 | 553 | 17 | + | 9 | INTERGENIC | |
| 497 | 451 | 15 | 497 | 6 | - | 6 | INTERGENIC | |
| 535 | 447 | 18 | 535 | 15 | + | 6 | INTERGENIC | |
| 502 | 430 | 26 | 502 | 20 | + | 8 | INTERGENIC | |
| 496 | 425 | 47 | 495 | 18 | - | 3 | INTERGENIC | |
| 454 | 395 | 34 | 446 | 12 | + | 6 | INTERGENIC | |
| 373 | 328 | 3 | 373 | X | + | 7 | INTERGENIC | |
| 295 | 273 | 0 | 295 | 10 | + | 3 | INTERGENIC | |
| 546 | 229 | 111 | 361 | 17 | - | 3 | INTERGENIC | |
| 465 | 125 | 143 | 272 | 24 | - | 3 | INTERGENIC | |
| 686 | 619 | 6 | 686 | 10 | + | 5 | INTRON | |
| 667 | 603 | 5 | 667 | 16 | + | 5 | INTRON | |
| 647 | 574 | 38 | 647 | 9 | + | 1 | INTRON | |
| 609 | 571 | 4 | 609 | 15 | - | 2 | INTRON | |
| 609 | 565 | 7 | 609 | 18 | + | 11 | INTRON | |
| 638 | 565 | 15 | 638 | 21 | - | 6 | INTRON | |
| 583 | 563 | 2 | 583 | 23 | - | 4 | INTRON | |
| 652 | 538 | 24 | 635 | 7 | - | 7 | INTRON | |
| 599 | 536 | 25 | 599 | 8 | + | 4 | INTRON | |
| 607 | 529 | 7 | 594 | 11 | - | 7 | INTRON | |
| 575 | 513 | 0 | 561 | 12 | - | 6 | INTRON | |
| 620 | 511 | 7 | 620 | 7 | + | 7 | INTRON | |
| 561 | 505 | 11 | 561 | 11 | - | 10 | INTRON | |
| 601 | 505 | 45 | 601 | 15 | + | 12 | INTRON | |
| 617 | 504 | 11 | 577 | 10 | + | 10 | INTRON | |
| 555 | 498 | 19 | 555 | 14 | - | 5 | INTRON | |
| 582 | 495 | 57 | 582 | 16 | - | 3 | INTRON | |
| 527 | 478 | 3 | 527 | 9 | + | 5 | INTRON | |
| 505 | 474 | 1 | 505 | 7 | - | 4 | INTRON | |
| 504 | 466 | 17 | 504 | 15 | + | 4 | INTRON | |
| 529 | 455 | 45 | 529 | 19 | + | 6 | INTRON | |
| 562 | 448 | 61 | 562 | 12 | + | 6 | INTRON | |
| 501 | 438 | 28 | 501 | 22 | - | 8 | INTRON | |
| 508 | 434 | 19 | 508 | 11 | - | 8 | INTRON | |
| 445 | 406 | 4 | 445 | 16 | + | 5 | INTRON | |
| 457 | 403 | 21 | 457 | 11 | + | 4 | INTRON | |
| 435 | 400 | 0 | 427 | 16 | - | 8 | INTRON | |
| 416 | 370 | 4 | 416 | 13 | - | 9 | INTRON | |
| 405 | 361 | 4 | 405 | 7 | + | 6 | INTRON | |
| 537 | 325 | 14 | 537 | 19 | - | 9 | INTRON | |
| 314 | 288 | 4 | 314 | 13 | - | 4 | INTRON | |
| 317 | 287 | 7 | 317 | 10 | + | 2 | INTRON | |
| 314 | 261 | 40 | 314 | 8 | - | 2 | INTRON | |
| 558 | 255 | 3 | 293 | 19 | - | 3 | INTRON | |
| 363 | 233 | 108 | 355 | 17 | - | 4 | INTRON | |
| 213 | 198 | 4 | 213 | 14 | - | 3 | INTRON | |
| 220 | 195 | 0 | 213 | 7 | - | 5 | INTRON | |
| 208 | 132 | 23 | 169 | 17 | + | 2 | INTRON | |
| 633 | 122 | 361 | 566 | 17 | - | 6 | INTRON | |
| 645 | 565 | 38 | 637 | 19 | + | 3 | OVERLAPED | |
| 616 | 550 | 6 | 616 | 12 | + | 4 | OVERLAPED | |
| 552 | 449 | 43 | 552 | 11 | + | 9 | OVERLAPED | |
| 472 | 429 | 5 | 471 | 15 | - | 1 | OVERLAPED | |
| 504 | 173 | 25 | 216 | 22 | - | 8 | OVERLAPED | |
| 577 | 144 | 78 | 244 | 16 | + | 7 | OVERLAPED | |
| 605 | 112 | 3 | 605 | X | - | 7 | OVERLAPED | |
| 607 | 569 | 4 | 607 | 11 | + | 3 | 3'UTR | |
| 155 | 138 | 6 | 155 | 7 | - | 4 | 3'UTR | |
| 608 | 571 | 0 | 608 | 21 | - | 4 | 5'UTR | |
| 573 | 544 | 9 | 573 | 8 | + | 1 | 5'UTR | |
| 589 | 535 | 25 | 589 | 12 | - | 5 | 5'UTR | |
| 610 | 509 | 20 | 607 | 21 | + | 6 | 5'UTR | |
| 483 | 433 | 5 | 475 | 14 | + | 8 | 5'UTR | |
| 519 | 352 | 8 | 381 | 8 | - | 7 | 5'UTR |
a. GenBank identifiers for the 75 unigenes that could be mapped to the human genome but showed no homology to any sequence in the nr database or to publicly available cynomolgus monkey and human ESTs.
b. The aligned regions were determined by comparing the BLAT results with the refGene database.
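The region labels in the table can be illustrated with a minimal Python sketch. The source does not give the exact rules used to compare BLAT alignments with refGene, so the rules below are assumptions: the `Gene` record is a hypothetical, simplified stand-in for a refGene entry, coordinates are half-open, and "OVERLAPED" (the table's own spelling) is taken to mean that the alignment touches coding exon sequence.

```python
from dataclasses import dataclass
from typing import List, Tuple

Interval = Tuple[int, int]  # half-open [start, end) genomic coordinates


@dataclass
class Gene:
    """Hypothetical, simplified stand-in for one refGene record."""
    strand: str            # "+" or "-"
    tx_start: int
    tx_end: int
    cds_start: int
    cds_end: int
    exons: List[Interval]


def overlap(a: Interval, b: Interval) -> int:
    """Number of bases shared by two half-open intervals."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]))


def classify(blocks: List[Interval], gene: Gene) -> str:
    """Label an alignment (a list of blocks) relative to one gene.

    Assumed rules, not the authors' documented procedure.
    """
    tx = (gene.tx_start, gene.tx_end)
    if all(overlap(b, tx) == 0 for b in blocks):
        return "INTERGENIC"
    # Pieces of each alignment block that fall inside annotated exons.
    exonic = [(max(b[0], e[0]), min(b[1], e[1]))
              for b in blocks for e in gene.exons if overlap(b, e) > 0]
    if not exonic:
        return "INTRON"
    cds = (gene.cds_start, gene.cds_end)
    if any(overlap(p, cds) > 0 for p in exonic):
        return "OVERLAPED"  # spelling kept to match the table
    # Exonic but non-coding: pick the UTR from position and strand.
    if all(p[1] <= gene.cds_start for p in exonic):
        return "5'UTR" if gene.strand == "+" else "3'UTR"
    if all(p[0] >= gene.cds_end for p in exonic):
        return "3'UTR" if gene.strand == "+" else "5'UTR"
    return "OVERLAPED"


# Toy gene on the plus strand; all coordinates are made up.
toy = Gene("+", 1000, 5000, 1500, 4500,
           [(1000, 1600), (3000, 3200), (4400, 5000)])
print(classify([(2000, 2100)], toy))
```

A real comparison would check each alignment against every gene on the same chromosome and would need a policy for alignments that hit more than one gene; this sketch handles a single gene only.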
Figure 3. Gene Ontology (GO) categories of genes from the cynomolgus monkey lymphocyte cDNA library (CYLA) and the RAMOS cell line (RAMOS). Genes were functionally categorized according to the Gene Ontology Consortium, and the level 2 assignments are plotted. In this ontology, "biological process", "cellular component" and "molecular function" are categorized independently. GO classified 57% of the unigenes from the cynomolgus monkey cDNA library (2,124 of 3,728) and 63.9% of the unigenes from the RAMOS cell line.
Figure 4. Level 3 GO classification of genes from the cynomolgus monkey lymphocyte cDNA library (CYLA) and the RAMOS cell line (RAMOS). Panels show "biological process" (A) and "molecular function" (B).
The most frequently occurring protein families and functional domains in the cynomolgus monkey cDNA library.
| IPR003006 | Immunoglobulin/major histocompatibility complex | 41 |
| IPR007110 | Immunoglobulin-like | 38 |
| IPR000345 | Cytochrome c heme-binding site | 37 |
| IPR003597 | Immunoglobulin C-type | 33 |
| IPR000504 | RNA-binding region RNP-1 (RNA recognition motif) | 26 |
| IPR002218 | Glucose-inhibited division protein A | 26 |
| IPR006209 | EGF-like domain | 22 |
| IPR007087 | Zn-finger, C2H2 type | 16 |
| IPR000298 | Cytochrome c oxidase, subunit III | 15 |
| IPR002429 | Cytochrome c oxidase, subunit II | 15 |
Sequence similarity of MHC genes between cynomolgus monkeys and humans.
| MHC class | MHC gene | RefSeq ID a | Nucleotide identity (%) b | Amino acid identity (%) c | Aligned exons d |
| Class I | A | NM_002116 | 84.7 | 88.6 | 3–5 |
| Class I | B | NM_005514 | 86.1 | 86.5 | 1–7 |
| Class I | E | NM_005516 | 80.1 | 84.3 | 2–4 |
| Class I | F | NM_018950 | 73.9 | 75 | 4–6 |
| Class II | DMA | NM_006120 | 94 | 96.6 | 2–3 |
| Class II | DOA | NM_002119 | 92 | 95 | 3–4 |
| Class II | DPA | NM_033554 | 93 | 96 | 2–5 |
| Class II | DQA | NM_002122 | 91.5 | 93 | 1–3 |
| Class II | DQB | NM_002123 | 94.6 | 95 | 2–3 |
| Class II | DRA | NM_019111 | 94 | 96 | 1–4 |
| Class II | DRB | NM_002124 | 88 | 89 | 1–4 |
a. GenBank identifiers for the human reference sequences
b. Nucleotide sequence identity between cynomolgus monkeys and humans
c. Amino acid sequence identity between cynomolgus monkeys and humans
d. Exons of the human reference sequences to which the MHC genes in our library aligned
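The identity columns above can be read as the share of aligned positions that match. The source does not state how the authors computed identity or handled gaps, so the following is only a sketch under an assumed convention: both sequences are already aligned to equal length, and columns containing a gap ("-") are skipped. The function name and the toy sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of ungapped alignment columns where the two sequences match.

    Assumes the inputs are pre-aligned to equal length; columns with a
    gap in either sequence are ignored (one of several conventions).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" or y == "-":
            continue  # skip gapped columns
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Toy nucleotide alignment: 7 of 8 columns match.
print(percent_identity("ACGTACGT", "ACGTACGA"))  # 87.5
```

The same function works for amino acid alignments. Counting gapped columns as mismatches instead would lower the reported identity, so the convention matters when comparing values across studies.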
Figure 5. Comparison of gene expression profiles of MHC class I, II and III molecules. Expression data for MHC genes in human blood and lymph node were downloaded from the NCBI UniGene database and compared with data from the RAMOS cell line and from our cDNA library of cynomolgus monkey B lymphocytes (CYLA). To make the data sets comparable, gene expression abundance is given in "transcripts per million".
Figure 6. Expression abundance of lymphocyte cluster of differentiation (CD) genes in our cDNA library (CYLA).
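Scaling to "transcripts per million" lets libraries of different sizes be compared. For EST data this is commonly done by dividing the number of ESTs assigned to a gene by the total number of ESTs in the library and multiplying by one million. The sketch below assumes that definition; the source does not spell out the calculation, and the library size used in the example is hypothetical.

```python
def transcripts_per_million(gene_est_count: int, library_est_total: int) -> float:
    """Scale an EST count to a per-million abundance for one library."""
    if library_est_total <= 0:
        raise ValueError("library_est_total must be positive")
    if gene_est_count < 0:
        raise ValueError("gene_est_count must be non-negative")
    return gene_est_count * 1_000_000 / library_est_total


# Example: a unigene with 73 clustered ESTs in a library of 8,000 ESTs.
# The total of 8,000 is a made-up figure for illustration only.
print(transcripts_per_million(73, 8000))  # 9125.0
```

Because the result depends on the total EST count of each library, the same raw count gives different abundances in libraries of different depth.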