| Literature DB >> 21729922 |
Qinghua Nie1, Meixia Fang, Xinzheng Jia, Wei Zhang, Xiaoning Zhou, Xiaomei He, Xiquan Zhang.
Abstract
Pig (Sus scrofa) is an important organism for both agricultural and medical purpose. This study aims to investigate the S. scrofa transcriptome by the use of Roche 454 pyrosequencing. We obtained a total of 558 743 and 528 260 reads for the back-leg muscle and ovary tissue each. The overall 1 087 003 reads give rise to 421 767 341 bp total residues averaging 388 bp per read. The de novo assemblies yielded 11 057 contigs and 60 270 singletons for the back-leg muscle, 12 204 contigs and 70 192 singletons for the ovary and 18 938 contigs and 102 361 singletons for combined tissues. The overall GC content of S. scrofa transcriptome is 42.3% for assembled contigs. Alternative splicing was found within 4394 contigs, giving rise to 1267 isogroups or genes. A total of 56 589 transcripts are involved in molecular function (40 916), biological process (38 563), cellular component (35 787) by further gene ontology analyses. Comparison analyses showed that 336 and 553 genes had significant higher expression in the back-leg muscle and ovary each. In addition, we obtained a total of 24 214 single-nucleotide polymorphisms and 11 928 simple sequence repeats. These results contribute to the understanding of the genetic makeup of S. scrofa transcriptome and provide useful information for functional genomic research in future.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21729922 PMCID: PMC3190955 DOI: 10.1093/dnares/dsr021
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Figure 1.Schematic of 454 EST analyses. The steps include 454 sequencing, assembly of reads into contigs and isogroups, GO annotation, KEGG analysis and discovery of SNPs and SSRs.
Draft sequence data by 454 sequencing
| Types | Muscle (RL4) | Ovary (RL10) | Combined tissues |
|---|---|---|---|
| Number of reads | 558 743 | 528 260 | 1 087 003 |
| Total residues (bp) | 219 021 745 | 202 745 596 | 421 767 341 |
| Smallest (bp) | 27 | 23 | 23 |
| Largest (bp) | 772 | 813 | 813 |
| Average length (bp) | 392 | 384 | 388 |
Summary on assemble analysis
| Types | Muscle (RL4) | Ovary (RL10) | Combined tissues |
|---|---|---|---|
| Num of contigs | 11 057 | 12 204 | 18 938 |
| Smallest (bp) | 42 | 44 | 42 |
| Largest (bp) | 3540 | 3462 | 4218 |
| Total length (bp) | 8 703 645 | 9 517 255 | 15 332 944 |
| Average length (bp) | 787 | 780 | 810 |
| Num of isogroups | 9496 | 10 440 | 15 825 |
| Num of isogroups (contigs ≥ 2) | 662 | 719 | 1267 |
| Singleton | 60 270 | 70 192 | 102 361 |
| Total | 71 327 | 82 396 | 121 299 |
Statistics of contigs by 454 sequencing
| Length | Muscle | Ovary | Combined tissues | |||
|---|---|---|---|---|---|---|
| Numbers | Per cent (%) | Numbers | Per cent (%) | Numbers | Per cent (%) | |
| 1–100 | 12 | 0.11 | 11 | 0.09 | 12 | 0.06 |
| 101–400 | 225 | 2.03 | 309 | 2.53 | 411 | 2.17 |
| 401–700 | 5678 | 51.35 | 6320 | 51.79 | 9397 | 49.62 |
| 701–1000 | 2813 | 25.44 | 3050 | 24.99 | 4760 | 25.13 |
| 1001–1500 | 1756 | 15.88 | 1928 | 15.80 | 3187 | 16.83 |
| 1501–2000 | 449 | 4.06 | 474 | 3.88 | 864 | 4.56 |
| >2000 | 124 | 1.12 | 112 | 0.92 | 307 | 1.62 |
| Total | 11 057 | 100 | 12 204 | 100 | 18 938 | 100 |
Variant transcripts by assemble analysis
| No.a | Numbers of isogroups (contigs) | ||
|---|---|---|---|
| Muscle | Ovary | Combined tissues | |
| 2 | 452 (904) | 498 (996) | 857 (1714) |
| 3 | 71 (142) | 76 (152) | 143 (286) |
| 4 | 64 (128) | 58 (116) | 115 (230) |
| 5 | 20 (40) | 12 (24) | 29 (58) |
| 6 | 18 (36) | 20 (40) | 29 (58) |
| 7 | 6 (12) | 8 (16) | 17 (34) |
| 8 | 5 (10) | 10 (20) | 15 (30) |
| 9 | 4 (8) | 8 (72) | 14 (28) |
| ≥10 | 22 (529) | 29 (644) | 48 (1106) |
| In total (≥2) | 662 (2228) | 719 (2488) | 1267 (4393) |
aNumbers of contigs per isogroup.
Figure 2.Functional classification of S. scrofa transcriptome. (A) GO: Biological process. (B) Cellular component. (C) GO: Molecular function. In some cases, one transcript or gene has multiple functions.
The top 20 pathways with the highest EST numbers
| No. | Pathways | Number of ESTs |
|---|---|---|
| 1 | Biosynthesis of secondary metabolites | 1352 |
| 2 | Oxidative phosphorylation | 1111 |
| 3 | Microbial metabolism in diverse environments | 964 |
| 4 | Purine metabolism | 690 |
| 5 | Biosynthesis of plant hormones | 625 |
| 6 | Biosynthesis of phenylpropanoids | 466 |
| 7 | Biosynthesis of alkaloids derived from histidine and purine | 417 |
| 8 | Pyrimidine metabolism | 408 |
| 9 | Biosynthesis of terpenoids and steroids | 401 |
| 10 | Biosynthesis of alkaloids derived from terpenoid and polyketide | 399 |
| 11 | Methane metabolism | 399 |
| 12 | Biosynthesis of alkaloids derived from shikimate pathway | 380 |
| 13 | Glycolysis/gluconeogenesis | 379 |
| 14 | Biosynthesis of alkaloids derived from ornithine, lysine and nicotinic acid | 368 |
| 15 | Glutathione metabolism | 318 |
| 16 | Pyruvate metabolism | 268 |
| 17 | Arginine and proline metabolism | 261 |
| 18 | Glycerophospholipid metabolism | 222 |
| 19 | Fatty acid metabolism | 221 |
| 20 | Valine, leucine and isoleucine degradation | 218 |
Distribution of SNPs in the S. scrofa genome
| Chromosomesa | Counts |
|---|---|
| 1 | 2358 |
| 2 | 1901 |
| 3 | 1467 |
| 4 | 1693 |
| 5 | 1133 |
| 6 | 1492 |
| 7 | 1922 |
| 8 | 1260 |
| 9 | 1415 |
| 10 | 968 |
| 11 | 533 |
| 12 | 1073 |
| 13 | 1418 |
| 14 | 2008 |
| 15 | 1086 |
| 16 | 580 |
| 17 | 680 |
| 18 | 494 |
| X | 608 |
| MT | 125 |
| Total | 24 214 |
aChromosomes 1–18 and X indicate 18 autosomes and sex chromosome each, whereas MT indicates mtDNA.
Summary on microsatellite loci in S. scrofa transcriptome
| Number of repeats | Di-nucleotide repeats | Tri-nucleotide repeats | Tetra-nucleotide repeats | Penta-nucleotide repeats |
|---|---|---|---|---|
| 4 | — | 2511 | 1134 | 303 |
| 5 | — | 1004 | 386 | 74 |
| 6 | 1858 | 434 | 118 | 18 |
| 7 | 986 | 189 | 31 | 6 |
| 8 | 592 | 80 | 20 | — |
| 9 | 384 | 23 | 26 | 1 |
| 10 | 257 | 22 | 17 | — |
| 11 | 202 | 16 | 9 | 1 |
| 12 | 187 | 11 | 6 | — |
| 13 | 184 | 6 | 13 | — |
| 14 | 131 | 2 | 11 | — |
| 15 | 115 | 1 | — | — |
| 16 | 100 | — | — | — |
| 17 | 93 | — | — | — |
| 18 | 49 | — | — | — |
| 19 | 63 | 2 | — | — |
| 20 | 54 | — | — | — |
| 21 | 48 | — | — | — |
| 22 | 41 | — | — | — |
| 23 | 29 | — | — | — |
| 24 | 18 | — | — | — |
| 25 | 18 | — | — | — |
| 26 | 9 | — | — | — |
| 27 | 10 | — | — | — |
| 28 | 14 | — | — | — |
| 29 | 11 | — | — | — |
| Total | 5453 | 4301 | 1771 | 403 |