| Literature DB >> 20698968 |
Jianfeng Gao1, Ka Liu, Haibo Liu, Hugh T Blair, Gang Li, Chuangfu Chen, Pingping Tan, Runlin Z Ma.
Abstract
BACKGROUND: The ovine Major Histocompatibility Complex (MHC) harbors clusters of genes involved in overall resistance/susceptibility of an animal to infectious pathogens. However, only a limited number of ovine MHC genes have been identified and no adequate sequence information is available, as compared to those of swine and bovine. We previously constructed a BAC clone-based physical map that covers entire class I, class II and class III region of ovine MHC. Here we describe the assembling of a complete DNA sequence map for the ovine MHC by shotgun sequencing of 26 overlapping BAC clones.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20698968 PMCID: PMC3091662 DOI: 10.1186/1471-2164-11-466
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Assembly of 26 BAC-clone based DNA sequences covering entire Ovine MHC region
| BAC clone ID | GenBank Accession Number | Insert length (bp) | High repeat sequence | No. of Gaps outside | |||||
|---|---|---|---|---|---|---|---|---|---|
| 271H22 | 159959 | 7.93491 | 1.118 × 10-4 | 2974 | No | 1 | 0 | 0 | |
| 304C7 | 134586 | 8.0733 | 2.127 × 10-4 | 2509 | Yes | 1 | 1 | 0 | |
| 142M19 | 134479 | 8.11391 | 4.257 × 10-4 | 2715 | Yes | 1 | 2 | 0 | |
| 373D13 | 172485 | 8.15691 | 5.308 × 10-4 | 3311 | No | 1 | 3 | 0 | |
| 283N15 | 155021 | 7.165 | 0.268 × 10-4 | 2472 | No | 1 | 0 | 0 | |
| 222G18 | 167309 | 7.78757 | 0.790 × 10-4 | 2783 | No | 1 | 0 | 0 | |
| 55L9 | 145292 | 8.22565 | 0.195 × 10-4 | 2941 | No | 1 | 0 | 0 | |
| 197N2 | 90102 | 6.50404 | 0.488 × 10-4 | 1438 | No | 1 | 2 | 0 | |
| 429P24 | 198404 | 9.02502 | 3.497 × 10-4 | 4009 | No | 1 | 2 | 0 | |
| 225J15 | 139059 | 7.9057 | 68.30 × 10-4 | 2335 | Yes | 1 | 1 | 0 | |
| 453O11 | 143310 | 8.05201 | 0.802 × 10-4 | 2473 | No | 1 | 2 | 0 | |
| 63M17 | 129209 | 8.52801 | 37.780 × 10-4 | 2394 | No | 1 | 1 | 0 | |
| 163P3 | 165447 | 7.55517 | 1.049 × 10-4 | 2833 | No | 1 | 0 | 0 | |
| 119K1 | 156603 | 7.75008 | 0.309 × 10-4 | 2665 | No | 1 | 2 | 0 | |
| 349I12 | 149708 | 8.4984 | 7.600 × 10-4 | 2994 | No | 1 | 2 | 0 | |
| 345B17 | 134643 | 7.69046 | 10.58 × 10-4 | 2736 | No | 2 | 0 | 1 | |
| 68G10 | 165531 | 8.14164 | 6.535 × 10-4 | 3681 | Yes | 1 | 1 | 0 | |
| 346G21 | 138311 | 8.67944 | 9.059 × 10-4 | 2807 | No | 1 | 1 | 0 | |
| 44I10 | 134434 | 8.32584 | 16.47 × 10-4 | 2705 | No | 1 | 2 | 0 | |
| 282P19 | 174317 | 7.27217 | 0.490 × 10-4 | 2989 | No | 2 | 1 | 1 | |
| 239C1 | 142287 | 7.9438 | 0.736 × 10-4 | 2940 | No | 1 | 2 | 0 | |
| 141C4 | 160633 | 7.65743 | 0.624 × 10-4 | 2992 | No | 1 | 1 | 0 | |
| 374N21 | 83460 | 9.112 | 3.860 × 10-4 | 3648 | No | 2 | 1 | 1 | |
| 21H3 | 119055 | 8.13723 | 0.380 × 10-4 | 2157 | No | 1 | 1 | 0 | |
| 304D17 | 140735 | 8.14013 | 4.291 × 10-4 | 2599 | No | 1 | 1 | 0 | |
| 58G13 | 135958 | 8.827674 | 2.187 × 10-4 | 3845 | No | 2 | 4 | 0 |
a Defined as a ratio between total number of base pairs sequenced and total number of base pairs of the inserts in a given BAC clone.
b Error probability of a particular base call, corresponding to a quality value as determined by the equation:
Q = -10log10(Pe), where Pe is the error probability.
c The total number of shotgun DNA sequencing reactions performed for a given BAC clone.
d In genomic mapping, a series of contigs that are in the right order but not necessarily connected in one continuous stretch of sequence.
e The number of regions where the exact nucleotide base (G, A, T, or C) could not determined, represented by a strips of "N" in a given BAC clone.
Figure 1A feature map of Ovine MHC sequence. The map spans 2,434,000 nucleotide bases in length, containing 177 protein-coding genes/ORFs and 18 miRNA coding genes. Each locus is represented by an arrow or arrow head, and annotated according to type, orientation, and location within the MHC. The tiling path of the sequenced BACs and the MHC structure are shown on the top. Micro RNA (18 shown): Identified gene that has high sequence homology with conserved gene encoding the functional mircoRNA in other species, noted following the given name of that species. Predicted (36 shown): Gene that either has high sequence similarity to that of the predicted gene in other species, or has a predicted ORF but no high sequence homologies with ESTs of Ovine or other species. Novel (10 shown): Ovine-specific gene identified with a defined open reading frame (ORF) that has not been found in any other species to date. The novel genes are annotated with OaN1 to OaN8 (Oa for Ovis aries; N for novel; another two with no cloning data, no annotation), from left to the right of the map. Known (131 in total): The functional genes previously annotated in Bovine, Ovine or other species.
List of non-protein-codning microRNA genes identified in MHC by Rfam analysis
| Gene Name | Rfam Accession No. | Start coordinate | End coordinate | Orientation | |
|---|---|---|---|---|---|
| let-7 | RF00027 | 35 | 51 | + | 34.2 |
| miR-160 | RF00247 | 173327 | 173343 | + | 34.2 |
| miR-156 | RF00073 | 180706 | 180722 | - | 34.2 |
| miR-17 | RF00051 | 243137 | 243153 | - | 34.2 |
| miR-166 | RF00075 | 465267 | 465295 | + | 34.2 |
| miR-166 | RF00075 | 1062114 | 1062131 | - | 36.2 |
| miR-2 | RF00047 | 1243062 | 1243078 | + | 34.2 |
| lin-4 | RF00052 | 1608564 | 1608580 | + | 34.2 |
| miR-10 | RF00104 | 1978042 | 1978059 | + | 36.2 |
| miR-101 | RF00253 | 2046800 | 2046816 | + | 34.2 |
| miR-395 | RF00451 | 2082358 | 2082373 | - | 32.2 |
| miR-156 | RF00073 | 2084006 | 2084023 | - | 36.2 |
| miR-219 | RF00251 | 2098884 | 2098955 | - | 127 |
| miR-219 | RF00251 | 2098930 | 2098950 | - | 42.1 |
| miR-156 | RF00073 | 2131290 | 2131307 | - | 36.2 |
| miR-156 | RF00073 | 2339014 | 2339030 | + | 34.2 |
| miR-166 | RF00075 | 2348202 | 2348217 | - | 32.2 |
| miR-399 | RF00445 | 2354071 | 2354086 | + | 32.2 |
a The scores are bits (logs-odds) scores which represent the log of the probability of the query given the model over the probility of random sequence given the model.
Figure 2Gene order comparisons for the selected class II loci from HLA, OLA, and BoLA. Genetic loci in class II region were compared by aligning HLA, OLA and BoLA at telemere→centromere orientation. The orthologous loci were linked by solid lines. Solid and open box represent the selected class II loci and non-MHC loci, respectively. Shaded carmine boxes indicate regions of conservation among species. Red ellipses indicate the potential breaking points.Tel: Telomere, Cen: Centromere.
Figure 3A comparative alignment of . Solid and open box represent the known coding gene and predicted gene, respectively, with an arrow head of box indicating the orientation of gene transcription. A numeric number in a solid box indicates the gene family number of Btnl families. MHC of swine was interrupted by the Centromere as shown