| Literature DB >> 34835128 |
María Cebriá-Mendoza1, María A Bracho2,3, Cristina Arbona4, Luís Larrea4, Wladimiro Díaz1,5, Rafael Sanjuán1,6, José M Cuevas1,6.
Abstract
Metagenomics is greatly improving our ability to discover new viruses, as well as their possible associations with disease. However, metagenomics has also changed our understanding of viruses in general. The vast expansion of currently known viral diversity has revealed a large fraction of non-pathogenic viruses, and offers a new perspective in which viruses function as important components of many ecosystems. In this vein, studies of the human blood virome are often motivated by the search for new viral diseases, especially those associated with blood transfusions. However, these studies have revealed the common presence of apparently non-pathogenic viruses in blood, particularly human anelloviruses and, to a lower extent, human pegiviruses (HPgV). To shed light on the diversity of the human blood virome, we subjected pooled plasma samples from 587 healthy donors in Spain to a viral enrichment protocol, followed by massive parallel sequencing. This showed that anelloviruses were clearly the major component of the blood virome and showed remarkable diversity. In total, we assembled 332 complete or near-complete anellovirus genomes, 50 of which could be considered new species. HPgV was much less frequent, but we, nevertheless, recovered 17 different isolates that we subsequently used for characterizing the diversity of this virus. In-depth investigation of the human blood virome should help to elucidate the ecology of these viruses, and to unveil potentially associated diseases.Entities:
Keywords: anellovirus; blood virome; metagenomics; orphan virus; pegivirus; virus discovery
Mesh:
Year: 2021 PMID: 34835128 PMCID: PMC8621239 DOI: 10.3390/v13112322
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.048
Summary of virome composition for the 60 pools analyzed. Read numbers are given. For comparison, the number of reads for the eight blank controls processed are also shown and subsequently used for computational subtraction of potential contaminants.
| Pool | Anellovirus Reads | Pegivirus Reads | Other Viruses | Pool/Blank | Anellovirus Reads | Pegivirus Reads | Other Viruses |
|---|---|---|---|---|---|---|---|
| SP1 | 101,069 | 25,965 | 64 | SP35 | 419,986 | 0 | 317 |
| SP2 | 1,580,534 | 0 | 3013 | SP36 | 185,281 | 0 | 144 |
| SP3 | 131,969 | 3669 | 421 | SP37 | 666,063 | 0 | 1311 |
| SP4 | 9992 | 4250 | 61 | SP38 | 242,853 | 2261 | 479 |
| SP5 | 47,927 | 0 | 225 | SP39 | 15,756 | 0 | 200 |
| SP6 | 718,633 | 0 | 330 | SP40 | 342,193 | 0 | 3390 |
| SP7 | 63,139 | 0 | 80 | SP41 | 169,614 | 0 | 2815 |
| SP8 | 76,204 | 0 | 5089 | SP42 | 4519 | 0 | 118 |
| SP9 | 153,491 | 0 | 52 | SP43 | 206,185 | 0 | 99 |
| SP10 | 30,175 | 0 | 1649 | SP44 | 7975 | 10,713 | 19 |
| SP11 | 9787 | 5706 | 143 | SP45 | 124,171 | 0 | 210 |
| SP12 | 57,559 | 0 | 15,397 | SP46 | 29,728 | 0 | 431 |
| SP13 | 95,922 | 1173 | 4844 | SP47 | 150,531 | 0 | 3731 |
| SP14 | 271,731 | 0 | 1757 | SP48 | 45,430 | 0 | 676 |
| SP15 | 141,896 | 0 | 37 | SP49 | 94,919 | 5226 | 255 |
| SP16 | 149,985 | 9 | 2610 | SP50 | 0 | 0 | 340 |
| SP17 | 10 | 0 | 74 | SP51 | 299,530 | 0 | 17 |
| SP18 | 24,134 | 0 | 7168 | SP52 | 59,852 | 0 | 16 |
| SP19 | 74,391 | 339 | 9506 | SP53 | 14,323 | 5344 | 68 |
| SP20 | 73,067 | 373 | 21 | SP54 | 2404 | 0 | 7634 |
| SP21 | 124,389 | 0 | 4952 | SP55 | 663 | 0 | 121 |
| SP22 | 51,168 | 0 | 3428 | SP56 | 25,673 | 3523 | 131 |
| SP23 | 51,730 | 0 | 557 | SP57 | 52,296 | 0 | 1737 |
| SP24 | 71,389 | 0 | 7583 | SP58 | 1232 | 2158 | 241 |
| SP25 | 4269 | 0 | 262 | SP59 | 157,753 | 0 | 46 |
| SP26 | 27,676 | 0 | 84 | SP60 | 36,470 | 0 | 324 |
| SP27 | 7659 | 0 | 3030 | C01 | 0 | 0 | 593 |
| SP28 | 96,187 | 0 | 270 | C02 | 0 | 0 | 8022 |
| SP29 | 334,689 | 6606 | 18,366 | C03 | 0 | 0 | 76,410 |
| SP30 | 69,110 | 6924 | 156 | C04 | 0 | 0 | 3589 |
| SP31 | 332,437 | 0 | 816 | C05 | 0 | 0 | 93,531 |
| SP32 | 1011 | 0 | 223 | C06 | 0 | 0 | 4588 |
| SP33 | 72,784 | 2033 | 68 | C07 | 0 | 0 | 2964 |
| SP34 | 270,083 | 0 | 57 | C09 | 0 | 0 | 6731 |
Figure 1Description of the virome characterized in this study. The classification is shown at the family level. Frequencies were obtained excluding spiked viruses.
Summary of anellovirus analysis. 1 Number of reference species currently accepted by ICTV for each genus. 2 Results obtained after reevaluating data from our previous study [18] using the currently accepted species and the recently proposed species demarcation criterion by the ICTV. 3 Results obtained analyzing the newly described sequences. 4 Genus assignment for the described sequences. 5 Number of new species (percentage with respect to the total number of described sequences for each genus is given between brackets). 6 Number of species that cluster with at least one new sequence (percentage with respect to the total number of species is given between brackets). Novel species identified from our previous study were also used as reference species on subsequent phylogenetic and pairwise identity analyses.
| Cebriá et al. (2021) 2 | This Study 3 | ||||||
|---|---|---|---|---|---|---|---|
| Species 1 | Sequences 4 | Novel Species 5 | Coincident Clusters (%) 6 | Sequences 4 | Novel Species 5 | Coincident Clusters (%) 6 | |
|
| 26 | 68 | 6 (8.8) | 13 (50.0) | 160 | 6 (3.8) | 20 (62.5) |
|
| 38 | 29 | 11 (37.9) | 11 (28.9) | 111 | 27 (24.3) | 24 (49.0) |
|
| 15 | 17 | 9 (52.9) | 5 (33.3) | 61 | 17 (27.9) | 16 (66.6) |
|
| 79 | 114 | 26 (22.8) | 29 (36.7) | 332 | 50 (15.1) | 60 (57.1) |
Figure 2Phylogenetic tree of ORF1 sequences belonging to the TTV genus. Sequences described in this study are marked with a green circle. Those sequences that could be considered as new species are labeled in red. Sequences identified as new species after reevaluating data from our previous study [18] are marked with a blue circle. Non-hominid primate isolates are marked with a brown square. Nodes supported by bootstrap values ranging 0.7–0.85 and 0.85–1.0 are indicated with blue and red circles, respectively. The scale bar indicates the evolutionary distance in nucleotide substitutions per site.
Figure 3Phylogenetic tree of ORF1 sequences from the TTMV genus. Sequences described in this study are marked with a green circle. Sequences identified as new species after reevaluating data from our previous study [18] are marked with a blue circle. New species (including one or more new sequences) are indicated with background green or blue color in order to distinguish contiguous clusters. Clusters of representative species including new sequences are indicated with background light or dark grey colors in order to distinguish contiguous clusters. Non-hominid primate isolates are marked with a brown square. Nodes supported by bootstrap values ranging 0.7–0.85 and 0.85–1.0 are indicated with blue and red circles, respectively. The scale bar indicates the evolutionary distance in nucleotide substitutions per site.
Figure 4Phylogenetic tree of ORF1 sequences from the TTMDV genus. Sequences described in this study are marked with a green circle. Sequences identified as new species after reevaluating data from our previous study [18] are marked with a blue circle. New species (including one or more new sequences) are indicated with background green or blue color in order to distinguish contiguous clusters. Clusters of representative species including new sequences are indicated with background light or dark grey colors in order to distinguish contiguous clusters. The non-hominid primate isolate is marked with a brown square. Nodes supported by bootstrap values ranging 0.7–0.85 and 0.85–1.0 are indicated with blue and red circles, respectively. The scale bar indicates the evolutionary distance in nucleotide substitutions per site.
Summary of HPgV analysis. Estimates were obtained using accession U44402 as the reference sequence. * This pool consists of two different HPgV isolates, and individual estimates cannot be independently provided.
| Sample/Pool | # Reads | Average Depth Coverage | Genome Coverage | Polyprotein Coverage |
|---|---|---|---|---|
| SP1 | 25,965 | 1010.7 | 98.4 | 98.6 |
| SP3 | 3669 | 130.1 | 94.2 | 95.6 |
| SP4 | 4250 | 157.7 | 92.5 | 93.8 |
| SP11 | 5706 | 204.0 | 99.2 | 100.0 |
| SP13 | 1173 | 40.7 | 96.4 | 97.4 |
| SP16 | 9 | 2.4 | 5.5 | 6.0 |
| SP19 | 339 | 12.4 | 82.7 | 89.9 |
| SP20 | 373 | 13.5 | 70.2 | 76.4 |
| SP29 | 6606 | 228.7 | 98.2 | 99.2 |
| SP30 | 6924 | 230.1 | 89.7 | 91.1 |
| SP33 | 2033 | 71.1 | 92.6 | 94.8 |
| SP38 | 2261 | 82.7 | 99.6 | 100.0 |
| SP44 | 10,713 | 392.0 | 99.1 | 100.0 |
| SP49 | 5226 | 165.7 | 83.4 | 84.0 |
| SP53 * | 5344 | 181.2 | 97.5 | 99.8 |
| SP56 | 3523 | 121.1 | 91.7 | 93.3 |
| SP58 | 2158 | 73.2 | 93.6 | 94.7 |
Figure 5HPgV phylogeny of the polyprotein nucleotide sequence including all currently available isolates. Sequences described in this study are indicated with a green circle. Isolates previously reported as recombinants are indicated with a red star. For genotype 2, 2a and 2b subtypes are explicitly shown. Bootstrap values ranging 0.75–0.9 and 0.9–1.0 are indicated with blue and red circles, respectively. The scale bar indicates the evolutionary distance in nucleotide substitutions per site.