| Literature DB >> 22848734 |
Huiquan Liu1, Yanping Fu, Jiatao Xie, Jiasen Cheng, Said A Ghabrial, Guoqing Li, Xianhong Yi, Daohong Jiang.
Abstract
Genome sequence of viruses can contribute greatly to the study of viral evolution, diversity and the interaction between viruses and hosts. Traditional molecular cloning methods for obtaining RNA viral genomes are time-consuming and often difficult because many viruses occur in extremely low titers. DsRNA viruses in the families, Partitiviridae, Totiviridae, Endornaviridae, Chrysoviridae, and other related unclassified dsRNA viruses are generally associated with symptomless or persistent infections of their hosts. These characteristics indicate that samples or materials derived from eukaryotic organisms used to construct cDNA libraries and EST sequencing might carry these viruses, which were not easily detected by the researchers. Therefore, the EST databases may include numerous unknown viral sequences. In this study, we performed in silico cloning, a procedure for obtaining full or partial cDNA sequence of a gene by bioinformatics analysis, using known dsRNA viral sequences as queries to search against NCBI Expressed Sequence Tag (EST) database. From this analysis, we obtained 119 novel virus-like sequences related to members of the families, Endornaviridae, Chrysoviridae, Partitiviridae, and Totiviridae. Many of them were identified in cDNA libraries of eukaryotic lineages, which were not known to be hosts for these viruses. Furthermore, comprehensive phylogenetic analysis of these newly discovered virus-like sequences with known dsRNA viruses revealed that these dsRNA viruses may have co-evolved with respective host supergroups over a long evolutionary time while potential horizontal transmissions of viruses between different host supergroups also is possible. We also found that some of the plant partitiviruses may have originated from fungal viruses by horizontal transmissions. These findings extend our knowledge of the diversity and possible host range of dsRNA viruses and offer insight into the origin and evolution of relevant viruses with their hosts.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22848734 PMCID: PMC3407116 DOI: 10.1371/journal.pone.0042147
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Partitivirus-like sequences discovered from cDNA libraries of eukaryotic organisms.
| Organism group | Organism | No. of RdRp | No. of CP | Total | ||
| Contig | Singleton | Contig | Singleton | |||
| Plants | ||||||
| monocots |
| 1 | 1 | |||
|
| 1 | 1 | ||||
|
| 6 (1) | 4 | 3 (2) | 13 | ||
|
| 1 | 1 | ||||
|
| 1 | 2 | 3 | |||
|
| 1 (1) | 1 (1) | 2 | |||
|
| 1 | 1 | 2 | |||
|
| 2 | 1 (1) | 3 | |||
|
| 2 | 3 | ||||
|
| 1 | 1 | ||||
| eudicots |
| 1 | 1 | |||
|
| 2 | 2 | ||||
|
| 1 | 1 | ||||
|
| 1 (1) | 2 | 2 | |||
|
| 1 (1) | 2 (1) | 3 | |||
|
| 2 | 1 | 3 | |||
|
| 1 | 1 | 2 | |||
|
| 1 | 2 | 3 | |||
|
| 1 (1) | 1 | ||||
|
| 1 | 2 | 3 | |||
|
| 1 (1) | 2 | 1 | 4 | ||
|
| 1 (1) | 4 | 5 | |||
|
| 1 (1) | 3 | 4 | |||
|
| 1 | 1 | ||||
|
| 1 | 1 | ||||
|
| 1 | 1 | ||||
|
| 2 | 1 (1) | 3 | |||
|
| 1 | 1 | ||||
|
| 3 | 3 | ||||
| conifers |
| 1 (1) | 2 (1) | 3 | ||
|
| 1 | 1 | ||||
| Fungi |
| 1 | 1 | |||
|
| 1 | 1 | ||||
| Animals |
| 1 | 1 | |||
|
| 1 | 1 | ||||
|
| 1 | 1 | 2 | |||
|
| 1 | 1 | ||||
|
| 1 | 1 | ||||
|
| 1 | 1 | ||||
|
| 1 | 1 | ||||
| Plants/Fungi |
| 1 | 1 | |||
|
| 1 | 1 | 2 | |||
| Total | 43 | 26 | 21 | 27 | 17 | 91 |
Partitivirus-like ESTs were detected from the cDNA libraries of these organisms. Note the possibility that certain viral sequences are possible not from the annotated host organisms.
ESTs from the two species are high identical and are assembled into one contig.
The numbers in parentheses indicate the numbers of contigs which are corresponding to complete or near full-length sequence of RdRp or CP segments. Note the possibility that virus ESTs used to generate a contig are possible not from the same virus.
Several sequences in certain species were possibly corresponding to one viral segment but were not assembled into one contig due to sequencing gaps.
Figure 1Representative full-length sequences of partitiviral RdRp or CP segments assembled from EST libraries of different plants.
Multiple ESTs were used to construct contigs where overlapping regions of EST sequences show over 97% sequence identity. Bracket indicates that the RdRp and CP-like sequences were possibly from one virus. Rectangular boxes indicate ORFs.
Totivirus, chrysovirus, endornavirus and Southern tomato virus (STV)-like sequences discovered from cDNA libraries of eukaryotic organisms.
| Organism group | Organism | No. of totiviral RdRp | No. of totiviral CP | ||
| Contig | Singleton | Contig | Singleton | ||
| Diatoms |
| 1 | |||
| Fungi |
| 1 | |||
|
| 2 | ||||
| Animals |
| 1 | |||
|
| 1 | ||||
|
| 1 | ||||
| Plants |
| 1 | |||
Virus-like ESTs were detected from the cDNA libraries of these organisms. Note the possibility that certain viral sequences are possible not from the annotated host organisms.
Note the possibility that virus ESTs used to generate a contig are possible not from the same virus.
Figure 2Maximum likelihood tree of partitiviral RdRp-like sequences.
The phylogenetic tree were built using PhyML-mixtures based on a multiple sequence alignment generated using COBALT with the Word Size parameter setting to 3. The base tree was drawn using Interactive Tree Of Life Version 2.1.1 (http://itol.embl.de/). The tree is midpoint rooted. The p-values of approximate likelihood ratios (SH-aLRT) plotted as circle marks on the branches (only p-values>0.5 are indicated) and circle size is proportional to the the p-values. The p-values for the branches relevant to our conclusions are shown. Scale bars correspond to 0.2 amino acid substitutions per site.
Figure 3Maximum likelihood tree of partitiviral CP-like sequences.
The phylogenetic tree were built using PhyML-mixtures based on a multiple sequence alignments generated using COBALT with the Word Size parameter setting to 3 and Constraint E-value parameter setting to 0.1. The base tree was drawn using Interactive Tree Of Life Version 2.1.1 (http://itol.embl.de/). The tree is midpoint rooted. The p-values of approximate likelihood ratios (SH-aLRT) plotted as circle marks on the branches (only p-values>0.5 are indicated) and circle size is proportional to the p-values. The p-values for the branches relevant to our conclusions are shown. Scale bars correspond to 0.2 amino acid substitutions per site.
Figure 4Maximum likelihood tree of totiviral, chrysoviral, and STV RdRp-like sequences.
The phylogenetic tree was built using PhyML-mixtures based on a multiple sequence alignment generated using COBALT with the Word Size parameter setting to 3. The base tree was drawn using Interactive Tree Of Life Version 2.1.1 (http://itol.embl.de/). The totivirus-like group represented by Southern tomato virus (STV) was chosen as outgroup for presentation purposes. The p-values of approximate likelihood ratios (SH-aLRT) plotted as circle marks on the branches (only p-values>0.5 are indicated) and circle size is proportional to the p-values. The p-values for the branches relevant to our conclusions are shown. Scale bars correspond to 0.2 amino acid substitutions per site. Asterisk (*) indicates the chrysoviral clade.
Figure 5Maximum likelihood tree of polyproteins of endornavirus-like sequences.
The phylogenetic tree was built using PhyML-mixtures based on a multiple sequence alignment generated using COBALT with the default parameter setting. The tree was rooted with closteroviruses Pineapple mealybug wilt-associated virus 1 and Plum bark necrosis and stem pitting-associated virus, which belong to the “alpha-like” supergroup of ss(+)RNA viruses. Only p values for the approximate likelihood ratios (SH test) of >0.5 (50%) are indicated. Scale bars correspond to 0.5 amino acid substitution per site.
Figure 6A tree of eukaryotes showing known host range of partitvirus-, totivirus-, chrysovirus- and endornavirus-like viruses.
This tree was drawn base on The Tree of Life Web Project (http://tolweb.org/).