| Literature DB >> 35166336 |
Vipin K Menon1, Pablo C Okhuysen2, Cynthia L Chappell3, Medhat Mahmoud1, Medhat Mahmoud1, Qingchang Meng1, Harsha Doddapaneni1, Vanesa Vee1, Yi Han1, Sejal Salvi1, Sravya Bhamidipati1, Kavya Kottapalli1, George Weissenberger1, Hua Shen1, Matthew C Ross4, Kristi L Hoffman4, Sara Javornik Cregeen4, Donna M Muzny1, Ginger A Metcalf1, Richard A Gibbs1, Joseph F Petrosino4, Fritz J Sedlazeck1.
Abstract
BACKGROUND: Cryptosporidium parvum is an apicomplexan parasite commonly found across many host species with a global infection prevalence in human populations of 7.6%. Understanding its diversity and genomic makeup can help in fighting established infections and prohibiting further transmission. The basis of every genomic study is a high-quality reference genome that has continuity and completeness, thus enabling comprehensive comparative studies.Entities:
Keywords: Cryptosporidium; assembly; assembly comparisons; nanopore
Mesh:
Year: 2022 PMID: 35166336 PMCID: PMC8848321 DOI: 10.1093/gigascience/giac010
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Figure 1: Workflow for the generation of Cryptosporidium parvum assembly.
Figure 2: Read length distribution and cumulative coverage over the Oxford Nanopore sequencing. We obtained a total of 53× coverage with long reads and even 10× coverage with reads larger than 30 kb (x axis). The longest read measured was 808 kb.
: Overall assembly statistics and comparison using Quast (QUAST, RRID:SCR_001228) between the present assembly and the previously established assembly
| Statistic | GCA_000165345.1 | GCA_019844115.1 (newly established) |
|---|---|---|
| Total sequence length | 9,102,324 | 9,197,619 |
| Total ungapped length | 9,087,655 | 9,197,619 |
| Unresolved sequences | 14,669 | 0 |
| N50 | 1,104,417 | 1,108,772 |
| N90 | 985,969 | 993,129 |
| L50 | 4 | 4 |
| Total No. of chromosomes | 8 | 8 |
Figure 3: Assembly comparisons. (A) The Canu assembly shows a high concordance with the previously published C. parvum assembly (GCA_015245375.1) [27] (dot plots) and agreements in length (bottom). Nevertheless, clear assembly differences are visual when comparing it to GCA_000165345.1 [13]. (B) The Flye assembly vs the C. parvum assembly (GCA_015245375.1) shows large disagreements. Contig 3 is merged between 2 different Cryptosporidium chromosomes, and 1 chromosome is missing. Also, the length comparison (bottom) shows discrepancies in the beginning, highlighting a very short contig in the end (green track). Interestingly GCA_000165345.1 shows structural differences over both assemblies, likely indicating errors in the previous reference.