| Literature DB >> 26432030 |
Heather M Holl1, Shan Gao2, Zhangjun Fei3, Caroline Andrews4, Samantha A Brooks5.
Abstract
BACKGROUND: Laminitis, the structural failure of interdigitated tissue that suspends the distal skeleton within the hoof capsule, is a devastating disease that is the second leading cause of both lameness and euthanasia in the horse. Current transcriptomic research focuses on the expression of known genes. However, as this tissue is quite unique and equine gene annotation is largely derived from computational predictions, there are likely yet uncharacterized transcripts that may be involved in the etiology of laminitis. In order to create a novel annotation resource, we performed whole transcriptome sequencing of sagittal lamellar sections from one control and two laminitis affected horses.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26432030 PMCID: PMC4592545 DOI: 10.1186/s12864-015-1948-8
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
De novo assembly statistics
| Metrics | Raw Assembly |
|---|---|
| Total reads (100 bp) | 112,979,003 |
| Reads after filtering | 86,275,849 |
| Average read length after filtering | 88.3 bp |
| # Unigenes | 74,860 |
| N50 | 2,272 |
| Minimum Length | 201 |
| Average Length | 1,098 |
| Maximum Length | 17,667 |
Fig. 1Distribution of exon counts within the unfiltered assembly. Longer models range from 10 to 119 exons
Isoform statistics by locus. Unigenes are clustered together into unique loci based on an overlap of at least 1 bp
| Statistics | All Transcripts | Long (3+ Exon) Transcripts |
|---|---|---|
| Total Unigenes | 55,120 | 27,884 |
| Unique Loci | 23,779 | 12,905 |
| Min Unigenes per Locus | 1 | 1 |
| Max Unigenes per Locus | 125 | 89 |
| Average Unigenes per Locus | 2.32 | 2.16 |
Mapping statistics for RNA-seq onto the equCab2 reference genome assembly
| Sample | Phenotype | Total Reads | Mapped Reads | % Mapped | SNPs |
|---|---|---|---|---|---|
| CU1 | control | 36,277,643 | 31,561,549 | 87 % | 60,580 |
| CU18 | acute | 43,422,463 | 38,211,767 | 88 % | 72,281 |
| LSU-J | chronic | 33,278,897 | 29,618,218 | 89 % | 58,368 |
Unigenes matching records in selected databases. The repeat-filtered assembly was utilized for EquCab2 alignment-based annotation
| Database | Total Records | Unigenes |
|---|---|---|
| NCBI NR Protein | 37,818,139 | 36,195 / 74,860a (48 %) |
| Equine-Specific Repeats | 2,905,169 | 19,740 / 74,860a (26 %) |
| Non-Horse RefSeq | 255,606 | 24,501 / 55,120b (44 %) |
| Ensembl Predictions | 29,196 | 15,538 / 55,120b (28 %) |
| Horse RefSeq | 1,169 | 604 / 55,120b (1 %) |
| None | n/a | 31,091 / 74,860a (42 %) |
aUnfiltered transcriptome assembly
bRepeat-filtered transcriptome assembly
Fig. 2Example custom annotation on UCSC Genome Browser. Clicking on the identifier “UN09857” loads the screen in the lower panel. Custom identifiers provide corresponding gene and protein annotation for each unigene
Putative novel loci validated by RT-PCR
| Name | Chr | Start | End | ORF length | Exons | E | BLASTX |
|---|---|---|---|---|---|---|---|
| UN20159 | 1 | 160767946 | 160771835 | 85 | 4 | 1.2E-40 | AAA80518 T-cell receptor alpha chain (IgC TCRA) [Equus caballus] |
| UN14299 | 4 | 24422341 | 24427035 | 83 | 4 | none | none |
| UN30143 | 5 | 15325547 | 15363449 | 515 | 16 | 7.3E-22 | XP_003209297 PREDICTED: intraflagellar transport protein 80 homolog [Meleagris gallopavo] |
| UN27297 | 5 | 74952780 | 74965671 | 177 | 6 | 1.7E-56 | XP_001493637 PREDICTED: guanylate-binding protein 5 [Equus caballus] |
| UN27113 | 9 | 81709438 | 81711119 | 137 | 4 | 2.9E-60 | XP_001917082 PREDICTED: lymphocyte antigen 6H-like [Equus caballus] |
| UN28086 | 11 | 880333 | 894532 | 240 | 6 | 9.9E-31 | NP_663348 secreted and transmembrane protein 1A precursor [Mus musculus] |
| UN62514 | 11 | 37045433 | 37058857 | 162 | 5 | 1.5E-11 | ACI67873163 Perlwapin [Salmo salar] |
| UN21936 | 12 | 19828520 | 19834360 | 195 | 5 | 2.1E-14 | NP_001243909 placenta-specific protein 1 precursor [Equus caballus] |
| UN70945 | 22 | 34309639 | 34312469 | 141 | 4 | 1.7E-54 | AES10462 antileukoproteinase-like protein [Mustela putorius furo] |
| UN26965 | 24 | 44802220 | 44806274 | 142 | 5 | 3.5E-21 | XP_002696828 PREDICTED: uncharacterized protein LOC509029 [Bos taurus] |
| UN26584 | 28 | 21567866 | 21620053 | 248 | 7 | 5.8E-102 | XP_003952339 PREDICTED: uncharacterized protein LOC101059192 [Pan troglodytes] |
| UN50658 | X | 2231083 | 2240297 | 174 | 7 | 1.0E-38 | XP_003134963 PREDICTED: odorant-binding protein-like [Sus scrofa] |
Fig. 3BLAT alignment of sequenced cDNA from UN21936 and assembled transcripts to the reference genome. Screenshot was captured from the UCSC Genome Browser. Dark boxes represent exons while thin lines are introns. The empty RefSeq Genes, Ensembl Gene Predictions, Horse ESTs, and Non-Horse RefSeq tracks indicate that there has never been expression or computational predictions placed here. Although the amplicon shows three mismatches to the reference (shown as vertical red lines on the SangerSequence data), this sequence aligned perfectly to the sequence from the de novo assembled transcript (RawAssemblySequence)
Fig. 4Agarose gel demonstrating the expression of UN21936 and UN27113. Expression of UN21936 appears to be limited to laminitic samples. CU1, CU5 = control; CU17, CU18 = acute laminitis; LSUE, LSUJ = chronic laminitis; NTC = non-template (negative) control
Summary of samples used in this study. Laminitis phenotype was determined through medical history and histological examination. Demographic information was not available for most horses
| Sample ID | Phenotype | Age | Sex | Breed | History | Experiment |
|---|---|---|---|---|---|---|
| CU1 | control | unknown | unknown | unknown | healthy | RNA-seq, RT-PCR |
| CU5 | control | 3 years | gelding | Quarab | healthy | RT-PCR |
| CU17 | acute | 13 years | stallion | unknown | enterocolitis | RT-PCR |
| CU18 | acute | unknown | unknown | unknown | enterocolitis | RNA-seq, RT-PCR |
| LSU-E | chronic | unknown | unknown | unknown | Equine Metabolic Syndrome | RT-PCR |
| LSU-J | chronic | unknown | unknown | unknown | Equine Metabolic Syndrome | RNA-seq, RT-PCR |
Primers used to confirm expression of unannotated transcripts. All PCRs were performed with an annealing temperature of 62°C and an elongation time of 30s
| Name | Forward Seq | Reverse Seq | Size |
|---|---|---|---|
| UN20159 | TTCAAGAGCAATGGGATGCT | CGCAGTGTCATGAACAGGTTA | 227 bp |
| UN14299 | TTTTCCTCTGAAGCATTTCC | TAGAGCATCGCTTTCCTGGT | 284 bp |
| UN30143 | CCCACCCCCAACCTAGATAC | AGGTAAGACAGGCTGGGTCA | 499 bp |
| UN27297 | GTCCGAATTCAGCCAATCAT | GAAACGATTTATGGCCTCCA | 495 bp |
| UN27113 | TGAAAGGCATCCATCTGGTC | ACCCCGTTACAGAGGTCCTT | 329 bp |
| UN28086 | TCCTTGCTAGGATGCTCTGG | GAGCACCAGGATGAAGAGGA | 506 bp |
| UN62514 | GGCTCCTCCTCCTTGTGAG | AACAGCAGTTTGGCAGGAGT | 437 bp |
| UN21936 | CTATGTTCTGGGCTGTGGTG | TGTAGCCACGTTTGCACTCT | 485 bp |
| UN70945 | CCTCATGACCTTCGTGGTTC | ATCTTTTTGAGCTGGCAAGG | 409 bp |
| UN26965 | GCACCCTACTCCCACATACG | GCTCACATCCACGTCTGCTA | 422 bp |
| UN26584 | GTACATTCCTCCCCTGCAAA | TCGACACCATCCAGTTGAAA | 479 bp |
| UN50658 | CTGACCAGGACCCTCAGTCT | TCAGTGACCAGGCCTTCTTC | 343 bp |