| Literature DB >> 23153308 |
Kenneth I Porter1, Bruce R Southey, Jonathan V Sweedler, Sandra L Rodriguez-Zas.
Abstract
BACKGROUND: The pig is a biomedical model to study human and livestock traits. Many of these traits are controlled by neuropeptides that result from the cleavage of prohormones by prohormone convertases. Only 45 prohormones have been confirmed in the pig. Sequence homology can be ineffective to annotate prohormone genes in sequenced species like the pig due to the multifactorial nature of the prohormone processing. The goal of this study is to undertake the first complete survey of prohormone and prohormone convertases genes in the pig genome. These genes were functionally annotated based on 35 gene expression microarray experiments. The cleavage sites of prohormone sequences into potentially active neuropeptides were predicted.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23153308 PMCID: PMC3499383 DOI: 10.1186/1471-2164-13-582
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Prohormone and convertase genes identified across pig genome resources
| P | complete | Not Found | F1RXU1 | 100517471 | ||
| P | complete | Ssc.26627 | A5LHG2 | 100101476 | ||
| P | complete | Ssc.314 | P53366 | 397195 | ||
| P | complete | Ssc.16245 | P24259 | 397496 | ||
| P | complete | Ssc.629 | P07634 | 396844 | ||
| P | complete | Ssc.23867 | P18104 | 493772 | ||
| P | complete | CU928865 | Not Found | 100625006 | ||
| P | complete | Ssc.22487 | F1SU23 | 100512958 | ||
| P | complete | Ssc.14052 | A6P7L6 | 100125547 | ||
| P | complete | Ssc.56129 | A6P7L7 | 100124407 | ||
| P | complete | Ssc.15900 | Q307W6 | 397252 | ||
| P | complete | Ssc.717 | P01356 | 397468 | ||
| P | complete | Ssc.4653 | P04404 | 397540 | ||
| P | complete | Ssc.14556 | P01192 | 396863 | ||
| P | complete | Not Found | F1RIF7 | 100526112 | ||
| P | complete | Ssc.69887 | P06296 | 100127468 | ||
| P | complete | Ssc.3741 | Q862B1 | 396563 | ||
| P | complete | Ssc.18558 | Q766Y7 | 396574 | ||
| P | complete | Ssc.17879 | Q766Y6 | 396573 | ||
| P | complete | Not Found | A0A761 | Not Found | ||
| P | complete | Ssc.9364 | P09558 | 396915 | ||
| P | complete | Not Found | Not Found | Not Found | ||
| P | complete | Ssc.31972 | A5A752 | 100049663 | ||
| P | complete | Ssc.713 | P07480 | 397465 | ||
| P | complete | Ssc.4875 | Q9TT95 | 396772 | ||
| P | complete | Ssc.644 | P01351 | 445524 | ||
| P | complete | Ssc.440 | Q9GKY5 | 396728 | ||
| P | complete | Ssc.38713 | P01281 | 100621117 | ||
| P | complete | Ssc.17225 | P01274 | 397595 | ||
| P | complete | Ssc.16310 | P49921 | 397516 | ||
| P | Not Found | Not Found | F1S8B1 | 100523475 | ||
| P | complete | Ssc.13923 | P63153 | Not Found | ||
| P | complete | Ssc.376 | Q8MJ80 | 397207 | ||
| P | complete | Ssc.8324 | Q29119 | 100520838 | ||
| P | complete | Ssc.16231 | P16545 | 397491 | ||
| P | fragment | Ssc.9365 | P23695 | 396916 | ||
| P | complete | Ssc.583 | P01315 | 397415 | ||
| P | complete | Ssc.11990 | P51461 | 397024 | ||
| P | complete | Not Found | Not Found | 100620109 | ||
| P | complete | Ssc.46919 | F1SK47 | 100158105 | ||
| P | complete | Ssc.73565 | B5M447 | 100145896 | ||
| P | complete | Ssc.3287 | Q9TTS8 | 396962 | ||
| P | complete | Ssc.714 | P01307 | 397466 | ||
| P | complete | Ssc.15668 | P01177 | 100152272 | ||
| P | complete | Ssc.4210 | P01183 | 396995 | ||
| P | complete | Ssc.38680 | F1SPX3 | 100739079 | ||
| P | complete | Ssc.2083 | B0LUW4 | 100141313 | ||
| P | complete | Ssc.12508 | C3UZJ1 | 100294685 | ||
| P | complete | Ssc.12508 | P34964 | 100523263 | ||
| P | complete | Ssc.82498 | Not Found | Not Found | ||
| P | complete | Ssc.44958 | F1SFP1 | 100518250 | ||
| P | complete | Ssc.73596 | F1RSG4 | 100188981 | ||
| P | complete | Ssc.15796 | Q8MI35 | 396680 | ||
| P | complete | Ssc.15981 | P01304 | 397304 | ||
| P | complete | Ssc.15983 | O77668 | 397305 | ||
| P | complete | Ssc.5148 | A5JHN9 | 100049691 | ||
| P | complete | Not Found | F1S0X5 | 100524361 | ||
| P | complete | Ssc.27598 | P41535 | 414283 | ||
| P | complete | Ssc.456 | P01300 | 397272 | ||
| P | complete | Ssc.17429 | Not Found | 100621697 | ||
| P | complete | Ssc.6173 | F1RIZ0 | 100519764 | ||
| P | complete | Ssc.54182 | P20034 | 100126843 | ||
| P | complete | Ssc.49835 | F1SV50 | 100524161 | ||
| P | complete | Ssc.121 | P01214 | 445529 | ||
| P | complete | Ssc.11281 | Q7M3H2/Q7M2Z7 | 100152093 | ||
| P | complete | Ssc.15910 | P55791 | 397257 | ||
| P | fragment | EW633867 | Not Found | 100526076 | ||
| P | fragment | Not Found | Not Found | Not Found | ||
| P | complete | Ssc.9991 | Q866H2 | 396951 | ||
| P | complete | Ssc.668 | P01269 | 399502 | ||
| P | complete | Ssc.63650 | P68005 | 445018 | ||
| P | complete | Ssc.162 | P01348 | 396891 | ||
| P | complete | Ssc.42647 | Q8HY17 | 503836 | ||
| P | complete | Ssc.49266 | F1SR77 | 100154377 | ||
| P | complete | Ssc.75350 | C4P9W1 | 100302024 | ||
| P | complete | Ssc.15718 | Q9GLG4 | 397154 | ||
| P | complete | Ssc.13645 | Q5FZP5 | 497237 | ||
| P | complete | Ssc.6770 | F1RYP7 | 100154760 | ||
| P | complete | Ssc.710 | P63298 | 397464 | ||
| P | complete | Ssc.71374 | P01287 | 100499556 | ||
| P | complete | Ssc.19520 | P01168 | 494469 | ||
| P | complete | Ssc.57764 | F1SR03 | 100155886 | ||
| P | complete | Not Found | F1RHZ | 100515141 | ||
| P | complete | Ssc.18075 | F1SF85 | 100525179 | ||
| P | complete | Ssc.23153 | F1RTB7 | 100511101 | ||
| P | complete | Ssc.19565 | P67934 | 492314 | ||
| P | fragment | Ssc.67158 | B6VD08 | 100519815 | ||
| P | complete | Not Found | P62968 | 100513309 | ||
| P | Traces | Not Found | F8R6K7 | Not Found | ||
| P | complete | Not Found | F1SKM2 | 100521865 | ||
| P | complete | Not Found | F1RYW0 | 100737810 | ||
| P | complete | Ssc.437 | Q95J46 | 397268 | ||
| P | complete | Not Found | F1SFH3 | 100626084 | ||
| P | complete | Ssc.12790 | F1RT19 | 100525960 | ||
| P | complete | Ssc.29289 | F1SQU4 | 100155670 | ||
| P | fragment | Ssc.90772 | Not Found | 100624333 | ||
| P | complete | Ssc.47759 | E0Y441 | 100500718 | ||
| C | complete | Ssc.155 | P01165 | 397110 | ||
| C | complete | Ssc.94009 | F1RMJ1 | 100156882 | ||
| C | complete | Ssc.92884 | Q28959 | 397103 | ||
| C | complete | Ssc.109 | Q03333 | 445533 | ||
| C | complete | Ssc.47037 | Not Found | 100626523 | ||
| C | incomplete | Ssc.43614 | Not Found | 100519237 | ||
| C | incomplete | Ssc.73551 | F1RZ92 | 100152144 | ||
| C | complete | Ssc.5628 | F1SJT0 | 100523009 | ||
| C | complete | Ssc.84357 | Not Found | 100620501 |
a P: prohormone gene, C: prohormone convertase gene.
b Genome sequence found: complete or incomplete in the pig genome assembly, found in the Traces archive, or Not Found in any genome repository.
c,d,e Identifiers in the UniGene, UniProt and Gene databases.
Distribution of the prohormone gene predictions across UniProt and UniGene resources
| complete | Present | 38 | 7 | 17 | 14 | 3 |
| complete | Not Found | 0 | 1 | 1 | 7 | 2 |
| fragment | Present | 1 | 0 | 1 | 0 | 2 |
| fragment | Not Found | 0 | 0 | 0 | 0 | 1 |
| Not Found | Not Found | 0 | 0 | 0 | 2 | 0 |
1 UniProt Evidence: “type of evidence that supports the existence of the protein”; Protein : complete protein sequence; Partial: incomplete protein sequence such as presence of a peptide; Transcript: “existence of a protein has not been strictly proven but there is expression data (such as existence of cDNAs, RT-PCR or Northern blots) that indicate the existence of a transcript.”; Predicted: Complete or partial sequence of the protein has been predicted; Not Found: no match found in the UniProt database.
2 Genome: prediction of the protein sequence from the genome assembly: complete denotes full sequence, fragment denotes incomplete prediction and Not Found denotes no match.
3 UniGene Present or Not Found denote whether the gene had any EST evidence or not, respectively.
Differentially expressed prohormone and prohormone convertase genes (-value < 0.005) across 35 microarray experiments by tissue class
| | | | | | | | | | |
| Ssc.26627.1.A1_at | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | |
| Ssc.314.1.S1_at | 2 | 0 | 1 | 0 | 1 | 0 | 1 | 5 | |
| Ssc.16245.1.S1_at | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 1 | |
| Ssc.629.1.S1_at | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 1 | |
| Ssc.23867.1.A1_at | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 2 | |
| Ssc.22487.1.S1_at | 2 | 0 | 0 | 0 | 1 | 1 | 0 | 4 | |
| Ssc.15900.1.S1_at | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 2 | |
| Ssc.717.1.S1_at | 1 | 0 | 0 | 1 | 1 | 0 | 0 | 3 | |
| Ssc.4653.1.S1_at | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 2 | |
| Ssc.14556.1.S1_at | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | |
| Ssc.3741.1.S1_at | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| Ssc.18558.1.S1_at | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | |
| Ssc.17879.1.S1_at | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | |
| Ssc.9364.1.S1_at | 2 | 0 | 0 | 0 | 0 | 1 | 0 | 3 | |
| Ssc.713.1.S1_at | 1 | 1 | 0 | 0 | 0 | 0 | 1 | 3 | |
| Ssc.4875.1.S1_at | 1 | 1 | 0 | 0 | 1 | 0 | 0 | 3 | |
| Ssc.644.1.S1_at | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | |
| Ssc.440.1.S1_at | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| Ssc.17225.1.S1_at | 0 | 1 | 0 | 1 | 0 | 0 | 1 | 3 | |
| Ssc.16310.1.S1_at | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 2 | |
| Ssc.376.1.S1_at | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| Ssc.8324.1.A1_at | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | |
| Ssc.16231.1.S1_a_at | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 2 | |
| | Ssc.16231.2.A1_a_at | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| | Ssc.16231.3.S1_a_at | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 |
| Ssc.9365.1.S1_at | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | |
| | Ssc.9365.2.S1_a_at | 1 | 1 | 0 | 0 | 0 | 1 | 0 | 3 |
| | Ssc.9365.3.S1_a_at | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| | Ssc.9365.3.S1_x_at | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| | Ssc.9365.4.S1_a_at | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 |
| | Ssc.9365.5.A1_at | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| | Ssc.9365.5.S1_at | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 2 |
| | Ssc.9365.5.S1_a_at | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 |
| | Ssc.9365.6.A1_a_at | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| | Ssc.9365.6.A1_x_at | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| | Ssc.9365.6.S1_x_at | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 2 |
| | Ssc.9365.7.A1_x_at | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Ssc.583.1.S1_at | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| Ssc.11990.1.S1_at | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | |
| Ssc.3287.1.S1_at | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| Ssc.714.1.S1_at | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| Ssc.15668.1.A1_at | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| Ssc.4210.1.S1_at | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 1 | |
| Ssc.2083.1.A1_at | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | |
| Ssc.12508.1.A1_at | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | |
| Ssc.15796.1.S1_at | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | |
| Ssc.15981.1.A1_at | 1 | 1 | 0 | 0 | 1 | 0 | 0 | 3 | |
| | Ssc.15981.1.S1_at | 0 | 2 | 1 | 0 | 0 | 0 | 0 | 3 |
| Ssc.15983.1.S1_at | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| Ssc.27598.1.S1_at | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | |
| Ssc.456.1.S1_at | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | |
| Ssc.17429.1.S1_at | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 2 | |
| Ssc.6173.3.S1_a_at | 1 | 0 | 1 | 0 | 0 | 0 | 1 | 3 | |
| Ssc.121.1.S1_at | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 2 | |
| Ssc.11281.1.A1_at | 0 | 1 | 0 | 1 | 0 | 1 | 1 | 4 | |
| | Ssc.11281.2.S1_at | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 2 |
| Ssc.15910.1.A1_at | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| | Ssc.15910.1.S1_at | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Ssc.9991.1.S1_at | 0 | 1 | 1 | 2 | 0 | 0 | 0 | 4 | |
| Ssc.668.1.S1_at | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | |
| Ssc.162.1.S1_at | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 2 | |
| Ssc.15718.1.A1_at | 1 | 1 | 0 | 0 | 0 | 0 | 1 | 3 | |
| Ssc.13645.1.A1_at | 1 | 0 | 0 | 0 | 0 | 1 | 1 | 3 | |
| Ssc.6770.1.A1_at | 1 | 1 | 0 | 1 | 0 | 0 | 0 | 3 | |
| Ssc.710.1.S1_at | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | |
| Ssc.19520.1.A1_at | 1 | 1 | 0 | 0 | 1 | 0 | 0 | 3 | |
| Ssc.18075.1.A1_at | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| | Ssc.18075.2.S1_at | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 |
| Ssc.23153.1.S1_at | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| Ssc.19565.1.S1_at | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| | Ssc.19565.2.A1_at | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Ssc.437.1.S1_a_at | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | |
| Ssc.12790.1.A1_at | 1 | 1 | 1 | 0 | 1 | 0 | 1 | 5 | |
| Ssc.29289.1.A1_at | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 2 | |
| Total | | 30 | 35 | 12 | 7 | 10 | 7 | 9 | 110 |
| | | | | | | | | | |
| Ssc.141.1.S1_at | 1 | 1 | 0 | 0 | 0 | 0 | 1 | 3 | |
| Ssc.109.1.S1_at | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | |
| Ssc.5628.1.S1_at | 1 | 1 | 0 | 0 | 0 | 0 | 1 | 3 | |
| Total | 2 | 2 | 0 | 0 | 0 | 0 | 2 | 6 |
aAffymetrix microarray gene probe identifier.
b Experiment classes: Imm: primary immune-response tissues, Emb: embryo and placenta, CNS: brain and central nervous system, Repro: reproduction, Musc: muscle, fat, and gut.
Performance of various cleavage prediction models to predict cleavage in pig prohormones
| True Positives | 181 | 165 | 160 | 158 | 164 | 167 |
| True Negatives | 1520 | 1640 | 1724 | 1670 | 1735 | 1747 |
| False Positives | 329 | 209 | 125 | 179 | 114 | 102 |
| False Negatives | 54 | 70 | 75 | 77 | 71 | 68 |
| Correct Classification | 0.8162 | 0.8661 | 0.904 | 0.8772 | 0.9112 | 0.9184 |
| Sensitivity | 0.7702 | 0.7021 | 0.6809 | 0.6723 | 0.6979 | 0.7106 |
| Specificity | 0.8221 | 0.887 | 0.9324 | 0.9032 | 0.9383 | 0.9448 |
| Positive predictive power | 0.3549 | 0.4412 | 0.5614 | 0.4688 | 0.5899 | 0.6208 |
| Negative predictive power | 0.9657 | 0.9591 | 0.9583 | 0.9559 | 0.9607 | 0.9625 |
| Correlation | 0.4358 | 0.4856 | 0.5645 | 0.4944 | 0.5919 | 0.6184 |
| AUC | 0.8006 | 0.847 | 0.86 | 0.8186 | 0.8589 | 0.8802 |
a Performance criteria. True positives: number of correctly predicted cleaved sites; True negatives: number of correctly predicted non-cleaved sites; False positives: number of incorrectly predicted cleaved sites; False negatives: number of incorrectly predicted non-cleaved sites; Correct classification rate: number of correctly predicted sites divided by the total number of sites; Sensitivity (one minus false positive rate): number of true positives divided by the total number of sites cleaved; Specificity (one minus false negative rate): number of true negatives divided by the total number of sites not cleaved; Positive predictive power: number of true positives divided by the total number of sites predicted to be cleaved; Negative predictive power: number of true negatives divided by the total number of sites predicted to not be cleaved; Correlation coefficient: Mathew’s correlation coefficient between observed and predicted cleavage; and AUC: Area under the receiver operator characteristic or ROC curve relating sensitivity and 1-specificity.
b AA: models trained only on amino acids.
c AA prop: models trained with amino acids combined with the physicochemical properties of amino acids.
d ANN: artificial neural network approach.