| Literature DB >> 24929829 |
Aaron R Jex1, Peter Nejsum2, Erich M Schwarz3, Li Hu4, Neil D Young1, Ross S Hall1, Pasi K Korhonen1, Shengguang Liao4, Stig Thamsborg2, Jinquan Xia4, Pengwei Xu4, Shaowei Wang4, Jean-Pierre Y Scheerlinck1, Andreas Hofmann5, Paul W Sternberg6, Jun Wang7, Robin B Gasser1.
Abstract
Trichuris (whipworm) infects 1 billion people worldwide and causes a disease (trichuriasis) that results in major socioeconomic losses in both humans and pigs. Trichuriasis relates to an inflammation of the large intestine manifested in bloody diarrhea, and chronic disease can cause malnourishment and stunting in children. Paradoxically, Trichuris of pigs has shown substantial promise as a treatment for human autoimmune disorders, including inflammatory bowel disease (IBD) and multiple sclerosis. Here we report whole-genome sequencing at ∼140-fold coverage of adult male and female T. suis and ∼80-Mb draft assemblies. We explore stage-, sex- and tissue-specific transcription of mRNAs and small noncoding RNAs.Entities:
Mesh:
Year: 2014 PMID: 24929829 PMCID: PMC4105696 DOI: 10.1038/ng.3012
Source DB: PubMed Journal: Nat Genet ISSN: 1061-4036 Impact factor: 38.330
Features of the scaffolded assembly of the adult male and female T. suis genomes
| Male genome | Female genome | |
|---|---|---|
| 83.6 | 87.2 | |
| Total read data; estimated coverage | 11.73 Gb; 140× | 12.36 Gb; 142× |
| Total scaffolded assembly size (Mb); total scaffolds | 81.3; 60,856 | 76.0; 42,663 |
| Total scaffolds of >200 bp: length (Mb); no. of scaffolds | 74.2; 4,293 | 71.0; 3,288 |
| Largest scaffold (Mb) | 1.59 | 1.44 |
| N50 in kb (scaffolds >200 bp)a | 500 | 440 |
| N90 in kb (scaffolds >200 bp); total number for >N90a | 81.0; 185 | 104; 168 |
| % GC content, whole genome; scaffolds >200 bp) | 43.9; 43.6 | 43.6; 43.5 |
| % repetitive sequence (scaffolds >200 bp) | 31.7 | 32.3 |
| Proportion of the genome that is coding, exonic; including introns | 26.2; 66.2 | 29.2; 73.4 |
| Number of protein-encoding genes | 14,781 | 14,470 |
| Mean gene length (kb) | 3.7 | 3.9 |
| Mean number of exons per gene | 5.4 | 5.7 |
| Mean exon length (bp) | 271 | 270 |
| Mean number of introns per gene | 4.4 | 4.7 |
| Mean intron length (bp) | 511 | 509 |
| Number of transfer RNAs | 991 | 1,021 |
aN50 and N90 denote that 50% and 90% of assembly, respectively, is represented by scaffolds of at least this size.
Figure 1Homologs shared between T. suis (class Enoplea, order Trichocephalida) and related nematode species.
Figure 2Stage- and tissue-specific small-RNA and mRNA transcriptome of T. suis.
(a) Association between gene function and alternative splice variation. Charts show the inferred function of protein domains encoded by genes showing a statistically significant (P ≤ 0.05; Pearson's chi-squared analysis) positive (+) or negative (−) bias toward skipped exon (SE), alternative first (FE) or last exon (LE) splice events. Only genes encoding ten or more transcripts are included in this analysis. (b) Proportional representation of major protein classes or groups encoded by the genome (Gen), and their proportional abundance in all transcriptomic data (All) and in larval (L1/2, L3 and L4), adult male (Am), female (Af) and tissue-specific libraries, including in the male (Mp) and female posterior body (Fp) and the stichosome (St). (c) Self-organizing heatmap (transcripts per million (TPM) values normalized by gene) clustering miRNAs by their transcription abundance (represented as log2-transformed reads per kilobase per million reads (RPKM) values) in each larval, adult and tissue-specific library. (d) Self-organizing heatmap (TPM values normalized by gene) clustering 22A-RNAs by their transcription abundance (represented as log2-transformed RPKM values) in each larval, adult and tissue-specific library.