| Literature DB >> 27519257 |
Olukemi O Ifeonu1, Marcus C Chibucos1, Joshua Orvis1, Qi Su1, Kristin Elwin2, Fengguang Guo3, Haili Zhang3, Lihua Xiao4, Mingfei Sun5, Rachel M Chalmers2, Claire M Fraser1, Guan Zhu3, Jessica C Kissinger6, Giovanni Widmer7, Joana C Silva8.
Abstract
Human cryptosporidiosis is caused primarily by Cryptosporidium hominis, C. parvum and C. meleagridis. To accelerate research on parasites in the genus Cryptosporidium, we generated annotated, draft genome sequences of human C. hominis isolates TU502_2012 and UKH1, C. meleagridis UKMEL1, also isolated from a human patient, and the avian parasite C. baileyi TAMU-09Q1. The annotation of the genome sequences relied in part on RNAseq data generated from the oocyst stage of both C. hominis and C. baileyi The genome assembly of C. hominis is significantly more complete and less fragmented than that available previously, which enabled the generation of a much-improved gene set for this species, with an increase in average gene length of 500 bp relative to the protein-encoding genes in the 2004 C. hominis annotation. Our results reveal that the genomes of C. hominis and C. parvum are very similar in both gene density and average gene length. These data should prove a valuable resource for the Cryptosporidium research community. © FEMS 2016.Entities:
Keywords: C. hominis TU502_2012; Cryptosporidium; Cryptosporidium baileyi; Cryptosporidium meleagridis; annotation; genome assembly
Mesh:
Year: 2016 PMID: 27519257 PMCID: PMC5407061 DOI: 10.1093/femspd/ftw080
Source DB: PubMed Journal: Pathog Dis ISSN: 2049-632X Impact factor: 3.166
Summary statistics of whole-genome sequence and transcriptome data, assemblies and annotation.
|
|
|
| |||
|---|---|---|---|---|---|
|
|
| ||||
| Isolate: DNA | TU502 | TU502_2012 | UKH1 | UKMEL1 | TAMU-09Q1 |
| gDNA Illumina library fragment size (bp) | N/A | 460 | 461 | 517 | 654 |
| No. MiSeq reads | N/A | 6,871,858 | 7,596,410 | 22,862,044 | 6,240,960 |
| No. base pairs | N/A | 1,724,836,358 | 1,906,698,910 | 6,881,475,244 | 1,566,480,960 |
| Assembly size (bp) | 8,743,570 | 9,107,739 | 9,156,091 | 8,973,200 | 8,493,640 |
| No. of contigs | 1422 | 119 | 156 | 57 | 145 |
| Contig N50 | 14,504 | 238,509 | 179,408 | 322,908 | 203,018 |
| Largest contig (bp) | 90,444 | 1,270,815 | 542,781 | 732,862 | 702,637 |
| G + C content (%) | 30.9 | 30.1 | 30.1 | 31.0 | 24.3 |
| No. protein-coding genes | 3,885 | 3,745 | 3,765 | 3,758 | 3,692 |
| Average gene length (bp) | 1,360 | 1,892 | 1,830 | 1,844 | 1,778 |
| Percent coding | 60.4% | 77.8% | 75.2% | 77.2% | 77.3% |
| Accession no. | AAEL00000000 | JIBM00000000 | JIBN00000000 | JIBK00000000 | JIBL00000000 |
| SNPs relative to TU502 | 1303 : 2,567 | 718 : 1336 | N/A | N/A | |
| SNPs relative to TU502_2012 synonymous : non-syn | N/A | 143 : 339 | N/A | N/A | |
| Isolate: RNA | TU502_2012 | UKH1 | UKMEL1 | TAMU-10GZ1 | |
| No. HiSeq read pairs | 16,568,115 | 92,878,236 | N/A | 55,829,305 | |
| No. expressed genes | 1,868 | 2,454 | N/A | 2,235 | |
| Accession no. | SRX481527 | SRX481475 | N/A | SRX481530 | |
2004 assembly (Xu et al.2004).
Minimum 5X CDS coverage.
Figure 1.Inter- and intraspecies genome-wide comparisons of genome composition. (A) Comparison of protein length between C parvum and the 2004 and 2012 versions of the C. hominis TU502. (B) Distribution of orthologous gene clusters in five Cryptosporidium species. (C) Distribution of SNPs and short indels among three C. hominis isolates, TU502, TU502_2012 and UKH1. DNA sequence reads from the C. hominis TU502_2012 and UKH1 were mapped against the reference genome assembly of C. hominis TU502, as well as against each other, using BWA (Li and Durbin 2009). SNPs and small indels were identified using GATK (McKenna et al.2010). Identified variants were further filtered for reliability, according to the following parameter values: (DP < 12) ∥ (QUAL < 50) ∥ (SB > –0.10) ∥ (MQ0 > = 2 && (MQ0/(1.0 * DP)) > 0.1). SNPs were categorized as coding and non-coding, given the assembly and the annotation, using VCFtools.
Figure 2.Gene expression in Cryptosporidium oocysts is correlated within and between species. (A) Correlation in oocyst gene expression is highly correlated between two isolates of C. hominis (r2 ∼ 96%). (B) Correlation in oocyst gene expression is correlated between C. hominis and C. baylei (r2 ∼ 51%), particularly among the most highly expressed genes.