| Literature DB >> 28861401 |
Jin Luo1, Min-Xuan Liu1,2, Qiao-Yun Ren1, Ze Chen1, Zhan-Cheng Tian1, Jia-Wei Hao1, Feng Wu1, Xiao-Cui Liu1, Jian-Xun Luo1, Hong Yin1,3, Hui Wang1,4,5, Guang-Yuan Liu1.
Abstract
Ticks are important vectors in the transmission of a broad range of micropathogens to vertebrates, including humans. Because of the role of ticks in disease transmission, identifying and characterizing the micropathogen profiles of tick populations have become increasingly important. The objective of this study was to survey the micropathogens of Hyalomma rufipes ticks. Illumina HiSeq2000 technology was utilized to perform deep sequencing of small RNAs (sRNAs) extracted from field-collected H. rufipes ticks in Gansu Province, China. The resultant sRNA library data revealed that the surveyed tick populations produced reads that were homologous to St. Croix River Virus (SCRV) sequences. We also observed many reads that were homologous to microbial and/or pathogenic isolates, including bacteria, protozoa, and fungi. As part of this analysis, a phylogenetic tree was constructed to display the relationships among the homologous sequences that were identified. The study offered a unique opportunity to gain insight into the micropathogens of H. rufipes ticks. The effective control of arthropod vectors in the future will require knowledge of the micropathogen composition of vectors harboring infectious agents. Understanding the ecological factors that regulate vector propagation in association with the prevalence and persistence of micropathogen lineages is also imperative. These interactions may affect the evolution of micropathogen lineages, especially if the micropathogens rely on the vector or host for dispersal. The sRNA deep-sequencing approach used in this analysis provides an intuitive method to survey micropathogen prevalence in ticks and other vector species.Entities:
Keywords: Hyalomma rufipes; high-throughput sequencing; micropathogen community; small RNA; ticks
Mesh:
Substances:
Year: 2017 PMID: 28861401 PMCID: PMC5559533 DOI: 10.3389/fcimb.2017.00374
Source DB: PubMed Journal: Front Cell Infect Microbiol ISSN: 2235-2988 Impact factor: 5.293
The sequencing chromatogram is converted into sequence data during the base calling step.
| Total reads | Total sequenced reads, which must be > 5 M in general (except for serum samples) |
| High-quality | Number of high quality reads with no Ns, no more than 4 bases with quality scores lower than 10 and no more than 6 bases with quality scores lower than 13 |
| 3′ Adaptor null | Number of reads with no 3′ adaptor sequence |
| Insert null | Number of reads with no insertion |
| 5′ Adaptor contaminants | Number of 5′ contaminants |
| Smaller than 18 nt | Number of reads < 18 nt; generally, small RNA tags are between 18 and 30 nt long and therefore, tags that are too short should be removed from the data prior to further analysis |
| Poly A | Number of reads with Poly As |
| Clean reads | Number of clean reads after adaptors and contaminants are removed that are used in subsequent analyses; detailed information for the clean reads has been submitted to the manuscript by Supplementary Data Sheet |
In addition, per a standard, sequencing quality is assigned.
The data is processed by the following steps: (1) removal of low quality reads (the criteria for this step is listed in the explanation of each row in Table 1); (2) removal of reads with 5′ primer contaminants; (3) removal of reads without a 3′ primer sequence; (4) removal of reads without the insert tag; (5) removal of reads with poly As; (6) removal of reads shorter than 18 nt; and (7) summary of the length distribution of the clean reads.
| Total reads | 13756626 | |
| High quality | 13736581 | 100 |
| 3′ Adapter null | 11255 | 0.08 |
| Insert null | 3236 | 0.02 |
| 5′ Adapter contaminants | 29681 | 0.22 |
| Smaller_than_18 nt | 36234 | 0.26 |
| PolyA | 186 | 0.00 |
| Clean reads | 13655989 | 99.41 |
In addition, the numbers are shown in this table.
Figure 1Summary of the tag length distribution following the sequencing and annotation of small RNA from Hyalomma rufipes. The length of the small RNAs varied between 18 and 30 nt. Length distribution analysis is helpful when elucidating the composition of small RNA samples. For example, miRNA is normally 21 or 22 nt, siRNA is 24 nt, and piRNA is 30 nt. The x-axis represents the length of the small RNAs. The y-axis represents the percentage frequency of small RNAs of specific lengths among the sequencing reads. Red indicates the length distribution of total clean reads from ticks and the first peak generated by the Dicer enzyme; the second peak was generated by the Piwi enzyme. Black indicates the length distribution of SCRV. The peak was mediated by Dicer activity. Blue indicates the length distribution produced by Dicer and Piwi from the bacteria communities. Nt indicates nucleotides.
Figure 2Distribution of small RNAs among different categories. Unann, unannotated; rRNA, ribosomal RNA; snoRNA, small nucleolar RNA; tRNA, transfer RNA; miRNA, microRNA; and snRNA, small nuclear RNA.
Summary of the bioinformatics data assembly and micropathogen diversity analysis.
| Virus | 1E-07 | 60.5 | 1345 | 13.79 | |
| Rickettsiales | 8E-05 | 50.8 | 811 | 19.24 | |
| 2E-03 | 46.2 | 323 | 0.34 | ||
| Bacteria | 1E-05 | 53.3 | 360 | 1.38 | |
| 0.003 | 45.1 | 299 | 2.06 | ||
| 4E-04 | 53.1 | 63 | 0.69 | ||
| 0.002 | 45.4 | 50 | 0.34 | ||
| 1E-06 | 57 | 50 | 1.37 | ||
| 3E-08 | 62.4 | 169 | 0.34 | ||
| 7E-09 | 48.5 | 103 | 2.06 | ||
| 2E-05 | 64.3 | 95 | 0.69 | ||
| 2E-05 | 53.1 | 94 | 0.34 | ||
| 2E-04 | 49.7 | 261 | 1.03 | ||
| 6E-05 | 51.2 | 50 | 1.03 | ||
| 1E-04 | 50.1 | 109 | 0.34 | ||
| 6E-03 | 44.7 | 93 | 0.69 | ||
| 2E-04 | 49.3 | 174 | 0.34 | ||
| 2E-03 | 46.2 | 189 | 0.34 | ||
| 4E-04 | 48.5 | 80 | 0.34 | ||
| 2E-05 | 52.8 | 40 | 0.34 | ||
| 6E-05 | 51.2 | 395 | 0.34 | ||
| 0.001 | 47 | 122 | 0.34 | ||
| 0.001 | 46.6 | 86 | 0.34 | ||
| 0.004 | 45.1 | 195 | 1.03 | ||
| 1E-09 | 67 | 67 | 0.34 | ||
| Fungus | 0.003 | 45.1 | 82 | 0.69 | |
| 1E-06 | 56.6 | 383 | 0.34 | ||
| 2E-05 | 52.8 | 131 | 1.03 | ||
| 3E-03 | 45.8 | 129 | 0.34 | ||
| 1E-04 | 50.4 | 58 | 0.34 | ||
| 2E-04 | 49.7 | 118 | 0.34 | ||
| 1E-08 | 63.5 | 160 | 0.69 | ||
| Parasite | 0.008 | 43.9 | 920 | 2.41 | |
| 3E-07 | 58.9 | 1564 | 0.69 | ||
| 4 E-04 | 48.5 | 99 | 0.34 | ||
| 6E-06 | 54.7 | 1324 | 0.34 | ||
| 3E-05 | 52.4 | 141 | 0.34 | ||
| 0.008 | 43.9 | 21 | 0.34 |
Scores indicate the homology between sequences; higher scores indicate greater similarity. The e-values indicate the reliability of the evaluation score. The underlining shows that 12 human-infective micropathogens were identified in the ticks.
Figure 3Mapping of micropathogen sequencing reads to genomes. (A) Cleaned sequence reads were mapped to the SCRV virus genome (GenBank accession number PRJNA14941). The SCRV genome contains 10 open reading frames (Vp1, Vp2, Vp3, Vp4, Vp5, Vp7, Vp9, Vp10, NS1, and NS2), which are flanked by inverted repeats and direct repeats at both termini of the genome. Dashed lines represent the regions to which sequenced reads were mapped. Yellow indicates the coverage (occurrence frequency) of the sense strand reads, and blue indicates the coverage of the antisense strand reads. (B) Small RNA mapping to the reference genome sequences of the corresponding micropathogens. b1–b6: These figures indicate deep coverage of the virus genome by the small RNA map following use of the pileup format. Both 1x and 10x indicate (yes and no) % coverage, average depth, and the proportion of forward and reverse (+/-) strands. b1: Mapping of the Francisella 16S rRNA to the genome (ID: 13795) for the negative control. b2: A 100% match was observed for the rpoA gene of CMM from H. rufipes and the CMM IricVA chromosome (ID: NC_015722). b3, b4, and b6: Genes SN2, Vp2, and Vp9 matched (100%) the genome of SCRV. b5: The reads from SCRV from H. rufipes show one mismatch with the reference genome.
Target genes and list of primers used in this study.
| ScrvVp6f | 5′-ACGCTGGATCGGACATGAA-3′ | The primers were designed manually based on the high-throughput sequencing data | |
| ScrvVp6r | 5′-GGGTATGAGAAGAGATGCATG-3′ | ||
| ScrvSN2f | 5′-GTAATGCAAGAGATCAGCATG-3′ | ||
| ScrvSN2r | 5′-CACCGCCCTGATAAACATACC-3′ | ||
| Cmmf | 5′-ATGTATGGTCCAGCTATTGG-3′ | ||
| Cmmr | 5′-CCACGGGAACCAATGTACTTC-3′ | ||
| Cchfvf | 5′-ATGAACAGGTGGTTTGAAGAGTT-3′ | Spengler et al., | |
| Cchfvr | 5′-TGGCACTGGCCATCTGA-3′ | ||
| Amaf | 5′-TTATGGCAGACATTTCCATATACTGTGCAG-3′ | Palmer et al., | |
| Amar | 5′-GGAGCGCATCTCTCTTGCC-3′ | ||
| Rcof | 5′-GCTCGATTGRTTTACTTTGCTGTGAG-3′ | Millán et al., | |
| Rcor | 5′-CATGCTATAACCACCAAGCTAGCAATAC-3′ | ||
| Bocf | 5′-GACACAGGGAGGTAGTGACAAG-3′ | Decaro et al., | |
| Bocr | 5′-GATCCTTCYGCAGGTTCACC-3′ |
Viruses and bacteria detected by high-throughput sequencing. The short sequences were assembled to a maximum length of 901 bp, and the resulting sequences were used to design the primers.
Primers designed based on the assembled sequences.
Figure 4Phylogenetic analysis of the isolated bacteria/viruses. The phylogenetic tree was generated using MEGA 5.0 with maximum parsimony and 500 bootstrap replicates. Reference amino acid sequences were selected by BLAST searches of the NCBI nt database. (A) Subtrees of the experimental sequences from the CMM rpoA gene. (B) Subtrees of the experimental sequences from the SCRV SN2 gene. (C) Subtrees of the experimental sequences from the SCRV Vp6 gene.