| Literature DB >> 19036160 |
Alla G Gagarinova1, Mohan Babu, Martina V Strömvik, Aiming Wang.
Abstract
RNA recombination is one of the two major factors that create RNA genome variability. Assessing its incidence in plant RNA viruses helps understand the formation of new isolates and evaluate the effectiveness of crop protection strategies. To search for recombination in Soybean mosaic virus (SMV), the causal agent of a worldwide seed-borne, aphid-transmitted viral soybean disease, we obtained all full-length genome sequences of SMV as well as partial sequences encoding the N-terminal most (P1 protease) and the C-terminal most (capsid protein; CP) viral protein. The sequences were analyzed for possible recombination events using a variety of automatic and manual recombination detection and verification approaches. Automatic scanning identified 3, 10, and 17 recombination sites in the P1, CP, and full-length sequences, respectively. Manual analyses confirmed 10 recombination sites in three full-length SMV sequences. To our knowledge, this is the first report of recombination between distinct SMV pathotypes. These data imply that different SMV pathotypes can simultaneously infect a host cell and exchange genetic materials through recombination. The high incidence of SMV recombination suggests that recombination plays an important role in SMV evolution. Obtaining additional full-length sequences will help elucidate this role.Entities:
Mesh:
Substances:
Year: 2008 PMID: 19036160 PMCID: PMC2627826 DOI: 10.1186/1743-422X-5-143
Source DB: PubMed Journal: Virol J ISSN: 1743-422X Impact factor: 4.099
Figure 1Recombination in full-length SMV sequences. A. Phylogenetic relationships of SMV isolates to each other and to PPV as an outgroup. Phylogenetic tree was constructed using full-length nucleotide sequences of isolates L [GenBank: EU871724], L-RB [GenBank: EU871725], G2 [GenBank: S42280.1], N [GenBank: D00507.2], Aa [GenBank: AB100442.1], Aa15-M2 [GenBank: AB100443.1], G5 [GenBank: AY294044.1], G7H [GenBank: AY294045.1], G7d [GenBank: AY216987.1], G7 referred as G7x [GenBank: AY216010.1], and G7 referred as G7f [GenBank: AF241739.1], CN18 [GenBank: AJ619757], HH5 [GenBank: AJ310200], HZ [GenBank: AJ312439], as well as PPV [GenBank: M92280.1], as the outgroup, and the Neighbour Joining function of ClustalX [34]. Topologies of the Bayesian [35] as well as the1000 times bootstrapped least squares [36] and maximum likelihood [37] phylogenetic trees were same (data not shown). Bootstrap values for the Neighbour Joining and the maximum likelihood phylogenetic trees, out of 1000 replicates, are given at the nodes before and after the slanted line, respectively. For presentation purposes, the line marked with a star was shortened from 0.35145 to 0.04145. Automated RDP3 recombination analysis identified recombination events in all SMV isolates [please also see Additional file 1]. Filled circles demarcate likely times, when in evolution of SMV manually verified recombination events took place, while empty circles demarcate significant (P-value < 0.05) recombination events where the likely recombinant isolates were determined to be too far diverged from all available SMV sequences for the χ2 analysis of recombination and thus recombination analyses results were considered inconclusive. B. Locations of unique recombination events identified by RDP3, in relation to the full-length sequence alignment [please also see the Additional file 1]. Each full-length genome is represented by a long black bar and the corresponding underlined isolate name, given to the left of the bar. The figure shows a total of 17 unique recombination events, demarcated by the bars below the genomes the recombinant fragments have been integrated into. When an ancestral unique recombination event can be found in more than one daughter sequence, the recombination event is displayed with all corresponding daughter sequences. Locations of the unique recombination events identified by RDP, corresponding to the manually verified recombination sites, are shown with grey bars [please also see Additional file 1].
Figure 2Phylogenetic trees for the alignment regions demarcated by G5 and G7H recombination sites. Non-recombinant, as determined by the manual recombination analysis (see manuscript text), as well as G5 and G7H sequences were included in the phylogenetic tree construction. The designations for the fragments are given at the top, to the left of each tree. Bayesian [35] as well as bootstrapped Neighbour Joining [34], least squares [36], and maximum likelihood [37] trees were constructed for each region. Topologies of the trees generated by the four methods for the same region were same, with exception of how C2 and N sequences related to each other and to L and L-RB isolates from recombination site 'y' to the end of the alignment. Shimodaira-Hasegawa (SH) test [38] was used to select the best of the competing but very similar topologies for each sequence region (date not shown). The tree topology that obtained the highest SH score of 1 is presented for each region. Bootstrap values out of 1000 replicates, produced by the Neighbour Joining and the maximum likelihood methods are given at the nodes, before and after the slanted line, respectively. A star given instead of the number indicates that the respective method did not agree with the topology of the optimal tree identified by the SH test at that particular node. Topologies of all trees were tested against each other and the topologies of the trees presented here were found optimal (SH score: 1). SH scores of 0 were obtained when topologies of the trees from between recombination sites were tested against sequence alignments for the regions on the basis of which the given tree was not generated. The same SH score of 0 was obtained when the tree topologies for the regions from the beginning of the sequence alignment to 'w' and from 'z' to end of the sequence alignment were tested against sequence alignments between the recombination sites. Collectively, these results indicated that the different topologies cannot substitute for each other in explaining the variability of SMV sequences between G5 and G7H recombination sites that we identified.