| Literature DB >> 32318094 |
Fuqiang Ye1, Yifang Han1, Juanjuan Zhu2, Peng Li3, Qi Zhang1, Yanfeng Lin3, Taiwu Wang1, Heng Lv1, Changjun Wang3, Chunhui Wang1, Jinhai Zhang1.
Abstract
Human adenoviruses (HAdVs) have been demonstrated to cause a diversity of diseases among children and adults. The circulation of human adenovirus type 21 (HAdV21) has been mainly documented within closed environments in several countries. Nonetheless, respiratory infections or outbreaks due to HAdV21 have never been reported in China. MinION and Illumina platforms were employed to identify the potential pathogen from a throat swab. Discrepancies between MinION and Illumina sequencing were validated and corrected via polymerase chain reaction (PCR). Genomic characterization and recombinant event detection were then performed. Among the 35,466 high-quality MinION reads, a total of 5,999 reads (16.91%) could be aligned to HAdV21 reference genomes (genome sizes ≈35.3 kb), among which 20 had a length of >30 kb. A genome sequence assembled from MinION reads was further classified as HAdV subtype 21a. Random downsampling revealed as few as 500 nanopore reads could cover ≥96.49% of current genome. Illumina sequencing displayed good consistency (pairwise nucleotide identity = 99.91%) with MinION sequencing but with 31 discrepancies that were further validated and confirmed by PCR coupled with Sanger sequencing. Restriction enzymes such as BamHI and KpnI were able to distinguish the present genome from HAdV21 prototype and HAdV21b. Phylogenetic analysis employing whole-genome sequences placed our genome with members only from subtype 21a. Common features among HAdV21a strains were identified, including polymorphisms discovered in penton and 100 kDa hexon assembly-associated proteins and a recombinant event in the E4 gene. Using MinION and Illumina sequencers, we identified the first HAdV21a strain from China, which could provide key genomic data for disease control and epidemiological investigations.Entities:
Keywords: Illumina sequencing; MinION sequencing; adenovirus infection; human adenovirus subtype 21a; pathogen detection
Year: 2020 PMID: 32318094 PMCID: PMC7155751 DOI: 10.3389/fgene.2020.00285
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1Investigation of the minimal read number to identify the current isolate. (A) The overall genome coverage distribution when 5–5,000 reads were randomly selected. The x axis denotes pseudosequencing depth, and the y axis, the corresponding genome coverages. (B) Genome coverage distribution decomposed by the maximal aligned read length. The dashed line corresponds to 69.10%. (C) Hit ratio distribution decomposed by the maximal aligned read length. The dashed line corresponds to 16.92%. “Y” represents read length being more than 25 kb, and “N” indicates less than 25 kb. Boxes represent the interquartile range (IQR) between the first and third quartiles (25th and 75th percentiles, respectively). Lines inside denote the median, and whiskers denote the most extreme values within 1.5 times IQR from the first and third quartiles. Outlier values are represented as points.
Genomic discrepancies between MinION and Illumina sequencing.
| 1 | C | 584 | T | 31 | T |
| 2 | T | 576 | C | 61 | C |
| 3 | – | – | T | 77 | T |
| 4 | C | 601 | T | 83 | T |
| 5 | – | – | T | 76 | T |
| 6 | A | 554 | G | 76 | G |
| 7 | T | 517 | C | 75 | C |
| 8 | – | – | A | 94 | A |
| 9 | – | – | G | 105 | G |
| 10 | C | 436 | – | – | – |
| 11 | – | – | C | 117 | C |
| 12 | T | 544 | – | – | – |
| 13 | – | – | G | 78 | G |
| 14 | G | 538 | – | – | – |
| 15 | – | – | T | 58 | T |
| 16 | – | – | T | 75 | T |
| 17 | G | 601 | – | – | – |
| 18 | – | – | A | 85 | A |
| 19 | – | – | C | 106 | C |
| 20 | – | – | G | 90 | G |
| 21 | T | 624 | – | – | – |
| 22 | C | 577 | – | – | – |
| 23 | – | – | G | 66 | G |
| 24 | G | 628 | – | – | – |
| 25 | – | – | G | 77 | G |
| 26 | – | – | C | 106 | C |
| 27 | G | 653 | A | 66 | A |
| 28 | A | 623 | – | – | – |
| 29 | – | – | A | 51 | A |
| 30 | – | – | G | 34 | G |
| 31 | – | – | T | 22 | T |
FIGURE 2The read depth across the current genome by two sequencing platforms.
FIGURE 3Genomic characterization of the current isolate. (A,B) Genome type analysis using restriction enzyme BamHI (A) and KpnI (B). The numbers on the far left denote the molecular weight markers (bp). (C) Genomic annotation of the current isolate. Genes were displayed in gray, and CDSs discussed below in plum. “100 KDa protein” indicates the 100 kDa hexon-assembly associated protein.
FIGURE 4Phylogenetic tree constructed via whole-genome sequences. All available HAdV21 and other human mastadenovirus B genomes were used. The sequence displayed in red represents the current isolate.
FIGURE 5Bootscan analysis based on multiple alignment of the current isolate, HAdV3, HAdV21p, and HAdV66 genomes. The current genome was selected as the query sequence, whereas the rest were references. The dashed line represents the approximate position 33,260 in the present genome.