| Literature DB >> 17705864 |
Elizabeth A Hart1, Mario Caccamo, Jennifer L Harrow, Sean J Humphray, James G R Gilbert, Steve Trevanion, Tim Hubbard, Jane Rogers, Max F Rothschild.
Abstract
BACKGROUND: We describe here the sequencing, annotation and comparative analysis of an 8 Mb region of pig chromosome 17, which provides a useful test region to assess coverage and quality for the pig genome sequencing project. We report our findings comparing the annotation of draft sequence assembled at different depths of coverage.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17705864 PMCID: PMC2374978 DOI: 10.1186/gb-2007-8-8-r168
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
List of manually annotated pig loci
| Locus name | Locus description | Start coordinate | End coordinate |
| Tyrosine phosphatase 1B | 192295 | 261948 | |
| Orthologue of human | 263263 | 296984 | |
| Novel transcript | 291165 | 303697 | |
| Par-6 partitioning defective 6 homolog beta ( | 370305 | 389059 | |
| Breast carcinoma amplified sequence 4 | 414609 | 479471 | |
| Activity-dependent neuroprotector | 492879 | 525950 | |
| Dolichly-phosphate mannosyltransferase polypeptide 1, catalytic subunit | 529091 | 552289 | |
| Molybdenum cofactor synthesis 3 | 552563 | 554931 | |
| Potassium voltage-gated channel, subfamily G, member 1 | 600472 | 620055 | |
| Novel transcript | 862318 | 889344 | |
| Putative novel transcript | 891646 | 892957 | |
| Nuclear factor of activated T-cells | 918501 | 1065177 | |
| Putative novel transcript | 1034571 | 1035756 | |
| Atpase, class II, type 9A | 1111110 | 1247324 | |
| Sal-like 4 ( | 1257555 | 1278123 | |
| Putative novel transcript | 1323104 | 1324011 | |
| Pseudogene similar to part of human protein regulator of cytokinesis 1 ( | 1323229 | 1323658 | |
| Ribosomal protein L27a ( | 1496762 | 1497205 | |
| Zinc finger protein 64 homolog (mouse) | 1577847 | 1666097 | |
| Novel transcript | 1689884 | 1709546 | |
| Teashirt family zinc finger 2 | 2317086 | 2773025 | |
| Zinc finger protein 217 | 2813463 | 2839285 | |
| Thioltransferase ( | 2937465 | 2937783 | |
| Novel transcript | 3057211 | 3066632 | |
| Putative novel transcript | 3077982 | 3079132 | |
| Breast carcinoma amplified sequence 1 | 3128687 | 3247224 | |
| 25-Hydroxyvitamin D3-24-hydroxylase | 3318523 | 3339440 | |
| Prefoldin 4 | 3365055 | 3377364 | |
| Docking protein 5 | 3600039 | 3751783 | |
| Novel transcript | 3756050 | 3764628 | |
| Pseudogene similar to human | 3817744 | 3817975 | |
| Cerebellin precursor | 4884433 | 4892814 | |
| Ribosomal protein L27 ( | 4992534 | 4992878 | |
| Melanocortin 3 receptor | 5077795 | 5078875 | |
| Orthologue of human | 5151070 | 5162634 | |
| Serine/threonine kinase 6 | 5164100 | 5183118 | |
| Cleavage stimulation factor, 3' pre-RNA, subunit 1, 50 kda | 5181641 | 5193333 | |
| Orthologue of human | 5200443 | 5240295 | |
| Putative novel transcript | 5224576 | 5225992 | |
| Orthologue of human | 5250245 | 5294040 | |
| Orthologue of human | 5271092 | 5277853 | |
| Orthologue of human | 5296497 | 5298326 | |
| Transcription factor AP-2 gamma (activating enhancer binding protein 2 gamma) | 5374025 | 5384555 | |
| Novel transcript | 5409258 | 5410949 | |
| Ribosomal protein L27 ( | 5690286 | 5690695 | |
| Bone morphogenetic protein 7 (osteogenic protein 1) | 5794879 | 5886410 | |
| SPO11 meiotic protein covalently bound to DSB-like ( | 5940695 | 5955823 | |
| RAE1 RNA export 1 homolog ( | 5961819 | 5977901 | |
| RNA-binding region (RNP1, RRM) containing 1 | 5993104 | 6007945 | |
| CCCTC-binding factor (zinc finger protein-like) | 6070196 | 6102129 | |
| Novel transcript | 6114046 | 6116136 | |
| Phosphoenolpyruvate carboxykinase 1 (soluble) | 6140516 | 6146484 | |
| Z-DNA binding protein 1 | 6182270 | 6192447 | |
| Transmembrane, prostate androgen induced RNA | 6205610 | 6260081 | |
| Orthologue of human | 6580341 | 6590326 | |
| Orthologue of human | 6632471 | 6641977 | |
| Protein phosphatase 4, regulatory subunit 1-like | 6644116 | 6665957 | |
| RAB22A, member RAS oncogene family | 6721985 | 6779521 | |
| VAMP (vesicle-associated membrane protein)-associated protein B and C | 6800573 | 6850820 | |
| Syntaxin 16 | 6911191 | 6939474 | |
| Aminopeptidase-like 1 | 6946539 | 6962106 | |
| GNAS complex locus | 7056486 | 7123907 | |
| Th1-like ( | 7199960 | 7212384 | |
| Cathepsin Z | 7212382 | 7220265 | |
| Tubulin, beta family 1 | 7232312 | 7239278 | |
| ATP synthase, H+ transporting, mitochondrial F1 complex, epsilon subunit | 7241534 | 7245591 | |
| Orthologue of human | 7245938 | 7255973 | |
| Orthologue of human | 7384250 | 7452090 | |
| Endothelin 3 | 7496706 | 7519565 | |
| Phosphatase and actin regulator 3 | 7697213 | 7882258 | |
| Synaptonemal complex protein 2 | 7895310 | 7972154 |
The locus name, description and relative co-ordinates within the 8 Mb region are given. Locus names denoted in bold indicate that the locus is orthologous to a known human locus.
Figure 1Feature map of the 8 Mb region of pig chromosome 17. Each locus is depicted according to type, orientation and position. The tiling path of the sequenced BACs is shown along the top. Below this, the distribution of repeats and C + G content is shown. Box 1 illustrates the zinc-finger locus expansion that has occurred in mouse between EDN3 and PHACTR3. The three regions described in the comparative analyses, PTPN1-CYP24A1, PFDN4-VAPB and STX16-SYCP2, are defined using double-headed arrows.
Comparison of loci type and number in pig, human and mouse
| Locus type | Pig | Human | Mouse |
| Known coding | 53 | 54 | 52 |
| Novel CDS | - | - | 51 |
| Novel transcript | 7 | 15 | 22 |
| Putative | 5 | 24 | 12 |
| Processed pseudogene | 6 | 20 | 22 |
| Unprocessed pseudogene | - | 1 | 31 |
| Expressed pseudogene | - | 1 | 1 |
| Total |
Figure 2Comparison of GNAS transcripts in human, pig and mouse. A screenshot taken from VEGA Pig MultiContigView, comparing GNAS transcripts annotated in human (top panel), pig (middle panel) and mouse (bottom panel). The vertical blues lines joining loci in VEGA MultiContigView represent orthologous relationships between loci across species.
Figure 3Comparison of 5× and 7.5× coverage assemblies. Dot-plots of finished BAC sequence against either 5× or 7.5× assembled sequence for BACS (a) CH242-247L10 and (b) CH242-155M9. Individual contigs, represented on the x-axis, are separated by vertical green lines. In (a) the black rectangle depicted on the graphs represents the GNAS downstream region. In (b) the black rectangle depicted on the graphs defines the vicinity of the pig C20orf106 locus.