| Literature DB >> 33239096 |
Jenny G Maloney1, Aleksey Molokin1, Monica Santin2.
Abstract
BACKGROUND: Blastocystis sp. is one of the most common enteric parasites of humans and animals worldwide. It is well recognized that this ubiquitous protist displays a remarkable degree of genetic diversity in the SSU rRNA gene, which is currently the main gene used for defining Blastocystis subtypes. Yet, full-length reference sequences of this gene are available for only 16 subtypes of Blastocystis in part because of the technical difficulties associated with obtaining these sequences from complex samples.Entities:
Keywords: Blastocystis; Long-read sequencing; MinION; Ribosomal RNA; Subtypes
Mesh:
Substances:
Year: 2020 PMID: 33239096 PMCID: PMC7687777 DOI: 10.1186/s13071-020-04484-6
Source DB: PubMed Journal: Parasit Vectors ISSN: 1756-3305 Impact factor: 3.876
Information of Blastocystis specimens used in this study including host, geographic origin, and subtype
| Specimen ID | Host | Location | |
|---|---|---|---|
| 1 | Humana | USA | ST1 |
| 2 | Humanb | USA | ST4 |
| 3 | Humanc | Spain | ST4 |
| 4 | Elephant | USA | ST11 |
| 5 | Cattled | USA | ST10 |
| 6 | Cattle | USA | ST10/ST14 |
| 7 | Cattle | USA | ST14 |
aIsolate acquired from ATCC (Blastocystis ATCC 50177™)
bIsolate acquired from ATCC (Blastocystis ATCC 50608™)
cIsolate H-1 reported in Santin et al. [24]
dIsolate C-3073 reported in Santin et al. [24]
Bioinformatic analysis data for each step in processing of MinION sequences obtained from the specimens used in this study
| Specimen ID | 1 | 2 | 3 | 4 | 5 | 6 | 7 |
|---|---|---|---|---|---|---|---|
| Total MinION reads | 260,471 | 272,100 | 574,327 | 562,189 | 509,070 | 505,177 | 335,913 |
| Total Mbases | 458.1 | 510 | 887.2 | 547.9 | 657.2 | 474.4 | 259.5 |
| Reads > Q7 | 236,039 | 247,399 | 515,724 | 529,182 | 431,051 | 456,117 | 296,619 |
| Mbases > Q7 | 428.1 | 479 | 815.5 | 524.3 | 574.4 | 434.6 | 233 |
| Reads with length 1000–2100 nt | 93,828 | 89,141 | 131,127 | 105,440 | 109,609 | 87,019 | 55,970 |
| Reads after canu correction | 79,682 | 61,484 | 77,149 | 69,586 | 57,435 | 50,107 | 28,966 |
| Reads after canu trimming | 71,895 | 55,643 | 46,519 | 44,990 | 29,693 | 27,873 | 19,805 |
| Strand (+) reads with both forward and reverse primers | 15,730 | 9,573 | 7,310 | 8,734 | 4,620 | 3,430 | 1,376 |
| Strand (−) reads with both forward and reverse primers | 14,242 | 9,407 | 5,646 | 6,427 | 3,816 | 2,616 | 986 |
| Strand (+) and (−) reads combined | 29,972 | 18,980 | 12,956 | 15,161 | 8,436 | 6,046 | 2,362 |
| Reads that aligned to a | 26,314 | 17,960 | 4,626 | 2,581 | 4,309 | 1,077 | 287 |
| No. clusters after clustering reads at 98% identity | 34 | 4,556 | 265 | 256 | 682 | 152 | 47 |
| No. clusters with an abundance > 5 | 18 | 25 | 12 | 7 | 11 | 7 | 3 |
| Total abundance of all clusters that had at least 5 reads | 26,116 | 10,674 | 4,353 | 2,158 | 3,585 | 725 | 30 |
| No. clusters that aligned to expected | 1 | 1 | 1 | 2 | 1 | 2 | 1 |
| No. clusters after nanopolishing and re-clustering | 1 | 1 | 1 | 2 | 1 | 2 | 1 |
Comparison of full-length Blastocystis SSU rRNA gene sequences generated in this study by MinION sequencing to Illumina MiSeq sequences from the same sample and closest full-length match available on GenBank
| Specimen ID | Length of sequence generated by MinION in this study (GenBank accession number) | Similarity to Illumina MiSeq sequencea (%) | Similarity to closest match available in GenBank (sequence length, accession number) | |
|---|---|---|---|---|
| 1 | ST1 | 1766 bp (MT898451) | 100 | 99.8 (1770 bp, U51151) |
| 2 | ST4 | 1772 bp (MT898452) | 100 | 100 (1730 bp, AY590114) |
| 3 | ST4 | 1773 bp (MT898453) | 100 | 99.9 (1730 bp, AY590114) |
| 4 | ST11 | 1762 bp (MT898454) | 100 | 99.9 (989 bp, GU256903) |
| 1763 bp (MT898455) | 100 | 98.6 (989 bp, GU256929) | ||
| 5 | ST10 | 1770 bp (MT898456) | 99.8 | 99.5 (1728 bp, KC148207) |
| 6 | ST10 | 1770 bp (MT898457) | 99.8 | 99.5 (1728 bp, KC148207) |
| ST14 | 1771 bp (MT898458) | 99.8 | 99.6 (1772 bp, KC148205) | |
| 7 | ST14 | 1771 bp (MT898459) | 99.8 | 99.7 (1772 bp, KC148205) |
aIllumina MiSeq sequence corresponds to the approximately 480 bp region of the SSU rRNA gene amplified using the primers reported in [11] using methods reported in [12]