Yeonhwa Jo1, Hoseong Choi1, Miah Bae1, Sang-Min Kim2, Sun-Lim Kim2, Bong Choon Lee2, Won Kyong Cho1, Kook-Hyung Kim1.
Abstract
Soybean is the most important legume crop in the world. Several soybean diseases cause serious yield losses in major soybean-producing countries, and soybean can be infected by diverse viruses. We recently carried out a large-scale screening of available soybean transcriptome data to identify viruses infecting soybean. Of the screened transcriptomes, one generated for an analysis of soybean seed development contained several virus-associated sequences. In this study, we identified five viruses infecting soybean, including soybean mosaic virus (SMV), by de novo transcriptome assembly followed by a BLAST search. We assembled a nearly complete consensus genome sequence of SMV China from the transcriptome data. Phylogenetic analysis showed that the consensus genome sequence of SMV China was closely related to SMV isolates from South Korea. We examined single nucleotide variations (SNVs) of SMV in the soybean seed transcriptome and found 780 SNVs, which were evenly distributed along the SMV genome. Four SNV types, C-U, U-C, A-G, and G-A, were frequently identified. This result demonstrates the quasispecies variation of the SMV genome. Taken together, this study used bioinformatics analyses to identify viruses from soybean transcriptome data and demonstrated the application of such data to virus genome assembly and SNV analysis.
Keywords: de novo genome assembly; single nucleotide variation; soybean mosaic virus
Year: 2017 PMID: 29018311 PMCID: PMC5624490 DOI: 10.5423/PPJ.OA.03.2017.0060
Source DB: PubMed Journal: Plant Pathol J ISSN: 1598-2254 Impact factor: 1.795
Summary of de novo soybean transcriptome assembly using Trinity
| Accession number | SRR1777405 |
|---|---|
| Total Trinity transcripts | 116108 |
| Percent GC | 43.97 |
| Contig N50 | 710 bp |
| Median contig length | 428 bp |
| Average contig length | 580.18 bp |
| Total assembled bases | 67363642 bp |
We assembled raw data from two different libraries using the Trinity program.
Statistics for the assembled contigs were calculated with the TrinityStats.pl script included with Trinity.
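The summary statistics in the table above can be illustrated with a minimal Python sketch. It is not the TrinityStats.pl implementation; the function name `assembly_stats` and the toy contig lengths are assumptions made for illustration only. The last line checks that the reported average contig length agrees with total assembled bases divided by the number of transcripts.

```python
from statistics import median


def assembly_stats(lengths):
    """Return basic summary statistics for a list of contig lengths (bp)."""
    if not lengths:
        raise ValueError("no contig lengths given")
    total = sum(lengths)
    # N50: walk the contigs from longest to shortest and stop at the
    # contig whose cumulative length first reaches half of all bases.
    running = 0
    n50 = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running * 2 >= total:
            n50 = length
            break
    return {
        "contigs": len(lengths),
        "total_bases": total,
        "n50": n50,
        "median": median(lengths),
        "average": total / len(lengths),
    }


# Toy input (hypothetical lengths, not the SRR1777405 assembly).
toy = [200, 300, 400, 500, 1000]
stats = assembly_stats(toy)
print(stats)

# Consistency check on the published table: total bases / transcripts.
print(round(67363642 / 116108, 2))  # 580.18
```

Because N50 weights long contigs more heavily than the median does, an N50 (710 bp) above the median (428 bp) is expected for a transcriptome assembly with many short contigs.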
Summary of BLAST results to identify virus-associated contigs
| Query id | Subject id | Name of virus | Identity (%) | Alignment length | Mismatches | Gap opens | Query start | Query end | Subject start | Subject end | E value | Bit score |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| TR2274\|c0_g1_i1 | NC_002634.1 | Soybean mosaic virus | 93.13 | 233 | 16 | 0 | 2 | 234 | 8571 | 8803 | 3.00E-93 | 342 |
| TR3618\|c0_g1_i1 | NC_002634.1 | Soybean mosaic virus | 91.02 | 256 | 23 | 0 | 1 | 256 | 1342 | 1597 | 2.00E-94 | 346 |
| TR3618\|c0_g1_i2 | NC_002634.1 | Soybean mosaic virus | 90.58 | 276 | 26 | 0 | 1 | 276 | 1342 | 1617 | 2.00E-100 | 366 |
| TR3858\|c0_g1_i1 | NC_002634.1 | Soybean mosaic virus | 97.35 | 264 | 7 | 0 | 1 | 264 | 910 | 1173 | 2.00E-125 | 449 |
| TR3858\|c0_g1_i2 | NC_002634.1 | Soybean mosaic virus | 96.6 | 235 | 8 | 0 | 1 | 235 | 939 | 1173 | 1.00E-107 | 390 |
| TR4672\|c0_g1_i1 | NC_002634.1 | Soybean mosaic virus | 96.55 | 261 | 9 | 0 | 1 | 261 | 9036 | 9296 | 2.00E-120 | 433 |
| TR4672\|c0_g1_i2 | NC_002634.1 | Soybean mosaic virus | 97.7 | 261 | 6 | 0 | 1 | 261 | 9036 | 9296 | 2.00E-125 | 449 |
| TR5077\|c1_g1_i1 | NC_002634.1 | Soybean mosaic virus | 94.19 | 258 | 15 | 0 | 3 | 260 | 4680 | 4937 | 9.00E-109 | 394 |
| TR5077\|c1_g1_i2 | NC_002634.1 | Soybean mosaic virus | 91.47 | 258 | 22 | 0 | 3 | 260 | 4680 | 4937 | 4.00E-97 | 355 |
| TR5102\|c0_g1_i1 | NC_002634.1 | Soybean mosaic virus | 91.96 | 224 | 18 | 0 | 1 | 224 | 7552 | 7329 | 6.00E-85 | 315 |
| TR5869\|c0_g1_i1 | NC_002634.1 | Soybean mosaic virus | 91.98 | 212 | 17 | 0 | 5 | 216 | 7243 | 7032 | 6.00E-80 | 298 |
| TR5869\|c0_g2_i1 | NC_002634.1 | Soybean mosaic virus | 92.45 | 212 | 16 | 0 | 5 | 216 | 7243 | 7032 | 1.00E-81 | 303 |
| TR5869\|c0_g3_i1 | NC_002634.1 | Soybean mosaic virus | 92.92 | 212 | 15 | 0 | 5 | 216 | 7243 | 7032 | 3.00E-83 | 309 |
| TR5869\|c0_g4_i1 | NC_002634.1 | Soybean mosaic virus | 92.92 | 212 | 15 | 0 | 5 | 216 | 7243 | 7032 | 3.00E-83 | 309 |
| TR7406\|c0_g1_i1 | NC_002634.1 | Soybean mosaic virus | 94.64 | 280 | 15 | 0 | 1 | 280 | 2677 | 2956 | 6.00E-121 | 435 |
| TR7406\|c0_g1_i2 | NC_002634.1 | Soybean mosaic virus | 92.12 | 241 | 19 | 0 | 1 | 241 | 2677 | 2917 | 1.00E-92 | 340 |
| TR7406\|c0_g1_i3 | NC_002634.1 | Soybean mosaic virus | 94.16 | 274 | 16 | 0 | 1 | 274 | 2677 | 2950 | 5.00E-116 | 418 |
| TR7406\|c0_g1_i4 | NC_002634.1 | Soybean mosaic virus | 93.36 | 241 | 16 | 0 | 1 | 241 | 2677 | 2917 | 1.00E-97 | 357 |
| TR8100\|c0_g1_i1 | NC_002634.1 | Soybean mosaic virus | 97.86 | 234 | 5 | 0 | 12 | 245 | 6060 | 6293 | 4.00E-112 | 405 |
| TR9520\|c0_g1_i1 | NC_002634.1 | Soybean mosaic virus | 95.06 | 385 | 19 | 0 | 1 | 385 | 8268 | 7884 | 2.00E-172 | 606 |
| TR9520\|c0_g1_i2 | NC_002634.1 | Soybean mosaic virus | 96.65 | 239 | 8 | 0 | 4 | 242 | 8122 | 7884 | 6.00E-110 | 398 |
| TR9520\|c0_g1_i3 | NC_002634.1 | Soybean mosaic virus | 94.66 | 356 | 19 | 0 | 1 | 356 | 8268 | 7913 | 2.00E-156 | 553 |
| TR9520\|c0_g1_i4 | NC_002634.1 | Soybean mosaic virus | 94.38 | 356 | 20 | 0 | 1 | 356 | 8268 | 7913 | 9.00E-155 | 547 |
| TR9520\|c0_g1_i5 | NC_002634.1 | Soybean mosaic virus | 95.06 | 385 | 19 | 0 | 1 | 385 | 8268 | 7884 | 2.00E-172 | 606 |
| TR9520\|c0_g1_i6 | NC_002634.1 | Soybean mosaic virus | 96.19 | 210 | 8 | 0 | 4 | 213 | 8122 | 7913 | 8.00E-94 | 344 |
| TR9520\|c0_g1_i7 | NC_002634.1 | Soybean mosaic virus | 96.88 | 385 | 12 | 0 | 1 | 385 | 8268 | 7884 | 0 | 645 |
| TR13605\|c0_g1_i1 | NC_002634.1 | Soybean mosaic virus | 92.25 | 400 | 31 | 0 | 10 | 409 | 8665 | 9064 | 8.00E-161 | 568 |
| TR13605\|c0_g1_i2 | NC_002634.1 | Soybean mosaic virus | 94.75 | 400 | 21 | 0 | 10 | 409 | 8665 | 9064 | 2.00E-177 | 623 |
| TR15892\|c0_g1_i1 | NC_002634.1 | Soybean mosaic virus | 92.64 | 231 | 17 | 0 | 2 | 232 | 5845 | 5615 | 2.00E-90 | 333 |
| TR20496\|c0_g1_i1 | NC_002634.1 | Soybean mosaic virus | 96.88 | 224 | 7 | 0 | 1 | 224 | 2087 | 1864 | 3.00E-103 | 375 |
| TR22770\|c0_g1_i1 | NC_002634.1 | Soybean mosaic virus | 91.67 | 240 | 20 | 0 | 1 | 240 | 6413 | 6652 | 2.00E-90 | 333 |
| TR22770\|c0_g1_i2 | NC_002634.1 | Soybean mosaic virus | 92.53 | 281 | 21 | 0 | 2 | 282 | 6372 | 6652 | 2.00E-111 | 403 |
| TR25078\|c0_g1_i1 | NC_002634.1 | Soybean mosaic virus | 88.54 | 253 | 29 | 0 | 1 | 253 | 8730 | 8478 | 1.00E-82 | 307 |
| TR25078\|c0_g2_i1 | NC_002634.1 | Soybean mosaic virus | 94.72 | 246 | 13 | 0 | 16 | 261 | 8627 | 8382 | 2.00E-105 | 383 |
| TR25078\|c0_g2_i2 | NC_002634.1 | Soybean mosaic virus | 93.7 | 349 | 22 | 0 | 1 | 349 | 8730 | 8382 | 2.00E-147 | 523 |
| TR25078\|c0_g2_i3 | NC_002634.1 | Soybean mosaic virus | 95.72 | 187 | 8 | 0 | 43 | 229 | 8568 | 8382 | 5.00E-81 | 302 |
| TR25078\|c0_g2_i4 | NC_002634.1 | Soybean mosaic virus | 90.91 | 253 | 23 | 0 | 1 | 253 | 8730 | 8478 | 1.00E-92 | 340 |
| TR32819\|c0_g1_i1 | NC_002634.1 | Soybean mosaic virus | 91.7 | 265 | 22 | 0 | 2 | 266 | 2515 | 2251 | 6.00E-101 | 368 |
| TR32819\|c0_g2_i1 | NC_002634.1 | Soybean mosaic virus | 92.08 | 265 | 21 | 0 | 2 | 266 | 2515 | 2251 | 1.00E-102 | 374 |
| TR34507\|c0_g1_i1 | NC_002634.1 | Soybean mosaic virus | 87.27 | 377 | 44 | 4 | 4 | 378 | 3523 | 3149 | 1.00E-118 | 427 |
| TR37651\|c0_g1_i1 | NC_002634.1 | Soybean mosaic virus | 87.61 | 218 | 24 | 3 | 2 | 218 | 410 | 625 | 2.00E-65 | 250 |
| TR37651\|c0_g3_i1 | NC_002634.1 | Soybean mosaic virus | 87.27 | 487 | 57 | 4 | 2 | 487 | 410 | 892 | 1.00E-155 | 551 |
| TR37706\|c0_g2_i1 | NC_002634.1 | Soybean mosaic virus | 90.51 | 274 | 24 | 2 | 1 | 273 | 1128 | 1400 | 9.00E-99 | 361 |
| TR41793\|c1_g1_i1 | NC_002634.1 | Soybean mosaic virus | 92.89 | 394 | 28 | 0 | 1 | 394 | 7483 | 7876 | 2.00E-162 | 573 |
| TR41793\|c1_g1_i2 | NC_002634.1 | Soybean mosaic virus | 93.15 | 438 | 29 | 1 | 1 | 437 | 7483 | 7920 | 0 | 641 |
| TR41793\|c1_g1_i3 | NC_002634.1 | Soybean mosaic virus | 91.55 | 213 | 18 | 0 | 23 | 235 | 7486 | 7698 | 8.00E-79 | 294 |
| TR41793\|c1_g1_i4 | NC_002634.1 | Soybean mosaic virus | 93.93 | 445 | 27 | 0 | 1 | 445 | 7483 | 7927 | 0 | 673 |
| TR41793\|c1_g1_i5 | NC_002634.1 | Soybean mosaic virus | 91.59 | 226 | 19 | 0 | 1 | 226 | 7473 | 7698 | 2.00E-84 | 313 |
| TR41793\|c1_g1_i6 | NC_002634.1 | Soybean mosaic virus | 93.03 | 445 | 31 | 0 | 1 | 445 | 7483 | 7927 | 0 | 651 |
| TR41793\|c1_g1_i7 | NC_002634.1 | Soybean mosaic virus | 91.17 | 419 | 37 | 0 | 1 | 419 | 7473 | 7891 | 2.00E-161 | 569 |
| TR44246\|c0_g1_i2 | NC_002634.1 | Soybean mosaic virus | 87.9 | 157 | 18 | 1 | 87 | 242 | 477 | 633 | 2.00E-45 | 183 |
| TR44822\|c4_g1_i1 | NC_002634.1 | Soybean mosaic virus | 97.83 | 460 | 10 | 0 | 2 | 461 | 843 | 384 | 0 | 795 |
| TR44822\|c4_g1_i2 | NC_002634.1 | Soybean mosaic virus | 97.65 | 765 | 18 | 0 | 2 | 766 | 843 | 79 | 0 | 1314 |
| TR44822\|c4_g1_i3 | NC_002634.1 | Soybean mosaic virus | 97.27 | 622 | 14 | 1 | 2 | 623 | 843 | 225 | 0 | 1051 |
| TR44822\|c4_g2_i1 | NC_002634.1 | Soybean mosaic virus | 90.13 | 1256 | 122 | 2 | 1 | 1255 | 1991 | 737 | 0 | 1631 |
| TR44822\|c4_g2_i2 | NC_002634.1 | Soybean mosaic virus | 91.46 | 820 | 70 | 0 | 1 | 820 | 1918 | 1099 | 0 | 1127 |
| TR44822\|c4_g2_i3 | NC_002634.1 | Soybean mosaic virus | 92.75 | 483 | 35 | 0 | 1 | 483 | 1991 | 1509 | 0 | 699 |
| TR44822\|c4_g2_i4 | NC_002634.1 | Soybean mosaic virus | 88.67 | 256 | 29 | 0 | 1 | 256 | 1617 | 1362 | 2.00E-84 | 313 |
| TR44822\|c4_g2_i5 | NC_002634.1 | Soybean mosaic virus | 93.9 | 246 | 15 | 0 | 19 | 264 | 1853 | 1608 | 4.00E-102 | 372 |
| TR44822\|c4_g2_i6 | NC_002634.1 | Soybean mosaic virus | 94.81 | 231 | 12 | 0 | 19 | 249 | 1853 | 1623 | 9.00E-99 | 361 |
| TR44822\|c4_g2_i7 | NC_002634.1 | Soybean mosaic virus | 94.15 | 410 | 24 | 0 | 1 | 410 | 1918 | 1509 | 5.00E-178 | 625 |
| TR44822\|c5_g1_i1 | NC_002634.1 | Soybean mosaic virus | 95.98 | 994 | 40 | 0 | 2 | 995 | 5991 | 6984 | 0 | 1615 |
| TR44822\|c5_g1_i2 | NC_002634.1 | Soybean mosaic virus | 94.11 | 3599 | 207 | 4 | 2 | 3596 | 5991 | 9588 | 0 | 5467 |
| TR44822\|c5_g2_i1 | NC_002634.1 | Soybean mosaic virus | 93.3 | 224 | 15 | 0 | 4 | 227 | 8124 | 8347 | 6.00E-90 | 331 |
| TR44822\|c5_g1_i3 | NC_002634.1 | Soybean mosaic virus | 96.21 | 501 | 19 | 0 | 2 | 502 | 5991 | 6491 | 0 | 821 |
| TR44822\|c5_g1_i4 | NC_002634.1 | Soybean mosaic virus | 92.81 | 292 | 21 | 0 | 2 | 293 | 5991 | 6282 | 1.00E-117 | 424 |
| TR44822\|c6_g1_i1 | NC_002634.1 | Soybean mosaic virus | 95.07 | 1015 | 50 | 0 | 1 | 1015 | 6049 | 5035 | 0 | 1598 |
| TR44822\|c6_g2_i1 | NC_002634.1 | Soybean mosaic virus | 97.64 | 212 | 5 | 0 | 10 | 221 | 4930 | 4719 | 6.00E-100 | 364 |
| TR44822\|c6_g2_i2 | NC_002634.1 | Soybean mosaic virus | 95.83 | 240 | 10 | 0 | 1 | 240 | 5051 | 4812 | 4.00E-107 | 388 |
| TR44822\|c6_g2_i3 | NC_002634.1 | Soybean mosaic virus | 97.52 | 1372 | 34 | 0 | 1 | 1372 | 5146 | 3775 | 0 | 2346 |
| TR44822\|c6_g2_i4 | NC_002634.1 | Soybean mosaic virus | 96.59 | 293 | 10 | 0 | 1 | 293 | 5146 | 4854 | 2.00E-136 | 486 |
| TR44822\|c6_g3_i1 | NC_002634.1 | Soybean mosaic virus | 95.8 | 691 | 27 | 2 | 5 | 694 | 2746 | 2057 | 0 | 1114 |
| TR44822\|c6_g3_i2 | NC_002634.1 | Soybean mosaic virus | 96.69 | 877 | 29 | 0 | 2 | 878 | 2822 | 1946 | 0 | 1459 |
| TR44822\|c6_g4_i1 | NC_002634.1 | Soybean mosaic virus | 95.39 | 1149 | 53 | 0 | 1 | 1149 | 3889 | 2741 | 0 | 1829 |
| TR44822\|c6_g4_i2 | NC_002634.1 | Soybean mosaic virus | 94.89 | 333 | 17 | 0 | 1 | 333 | 3598 | 3266 | 5.00E-147 | 521 |
| TR45256\|c0_g1_i1 | NC_002634.1 | Soybean mosaic virus | 93.49 | 261 | 17 | 0 | 4 | 264 | 6897 | 6637 | 4.00E-107 | 388 |
| TR45256\|c0_g1_i2 | NC_002634.1 | Soybean mosaic virus | 94.32 | 229 | 13 | 0 | 2 | 230 | 6865 | 6637 | 5.00E-96 | 351 |
| TR47685\|c0_g1_i1 | NC_002634.1 | Soybean mosaic virus | 92.53 | 281 | 21 | 0 | 1 | 281 | 5082 | 5362 | 2.00E-111 | 403 |
| TR47685\|c0_g2_i1 | NC_002634.1 | Soybean mosaic virus | 92.53 | 281 | 21 | 0 | 1 | 281 | 5082 | 5362 | 2.00E-111 | 403 |
| TR44246\|c0_g1_i1 | NC_003397.1 | Bean common mosaic virus | 81.86 | 408 | 68 | 6 | 490 | 894 | 458 | 862 | 2.00E-91 | 339 |
| TR19277\|c0_g2_i1 | NC_003617.1 | Lettuce infectious yellows virus RNA1 | 75.34 | 146 | 29 | 6 | 467 | 607 | 6837 | 6694 | 1.00E-08 | 63.9 |
| TR45572\|c0_g2_i1 | NC_012910.1 | Lettuce chlorosis virus RNA2 | 87.96 | 191 | 22 | 1 | 15 | 205 | 8555 | 8366 | 1.00E-57 | 224 |
| TR29303\|c0_g1_i1 | NC_002034.1 | Cucumber mosaic virus RNA1 | 91.28 | 298 | 26 | 0 | 4 | 301 | 1334 | 1631 | 1.00E-112 | 407 |
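One way to read the subject start and end columns above is sketched below in Python. When the subject start is larger than the subject end, the contig aligns to the reverse strand of the reference, and merging the subject intervals shows how much of the reference the contigs span. The four rows are copied from the table; the names `Hit`, `strand`, `covered_intervals`, and `covered_bases` are hypothetical helpers, not part of BLAST or of the original analysis.

```python
from collections import namedtuple

# Subset of the table columns: query id, subject id, virus name,
# percent identity, alignment length, subject start, subject end.
Hit = namedtuple("Hit", "query subject virus identity length s_start s_end")

rows = [
    ("TR44822|c5_g1_i2", "NC_002634.1", "Soybean mosaic virus", 94.11, 3599, 5991, 9588),
    ("TR44822|c6_g2_i3", "NC_002634.1", "Soybean mosaic virus", 97.52, 1372, 5146, 3775),
    ("TR44822|c4_g2_i1", "NC_002634.1", "Soybean mosaic virus", 90.13, 1256, 1991, 737),
    ("TR44822|c6_g1_i1", "NC_002634.1", "Soybean mosaic virus", 95.07, 1015, 6049, 5035),
]
hits = [Hit(*row) for row in rows]


def strand(hit):
    """Report 'minus' when the subject coordinates run backwards."""
    return "minus" if hit.s_start > hit.s_end else "plus"


def covered_intervals(hits):
    """Merge subject intervals (1-based, inclusive) into disjoint spans."""
    spans = sorted(
        (min(h.s_start, h.s_end), max(h.s_start, h.s_end)) for h in hits
    )
    merged = []
    for start, end in spans:
        # Overlapping or directly adjacent spans are joined.
        if merged and start <= merged[-1][1] + 1:
            merged[-1][1] = max(merged[-1][1], end)
        else:
            merged.append([start, end])
    return [tuple(span) for span in merged]


def covered_bases(hits):
    """Count reference positions covered by at least one hit."""
    return sum(end - start + 1 for start, end in covered_intervals(hits))


for hit in hits:
    print(hit.query, strand(hit))
print(covered_intervals(hits))  # [(737, 1991), (3775, 9588)]
print(covered_bases(hits))      # 7069
```

With only these four contigs, positions 1992 to 3774 of the reference remain uncovered; other rows in the table (for example the TR44822|c6_g3 and c6_g4 contigs) align to that region.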
Fig. 1. De novo assembly of the SMV isolate from China using transcriptome data. (A) Size distribution of virus-associated contigs. Red bars indicate SMV-associated contigs. Four viruses are indicated with their respective contig lengths. (B) Alignment of 79 SMV-associated contigs on the assembled genome of the SMV isolate from China using the BWA program. The black bar indicates the reference SMV genome. The sequence alignment was visualized with the Tablet program. (C) Genome organization of the SMV isolate from China. The nucleotide positions of two proteins, GP1 and GP2, are indicated.
Fig. 2. Phylogenetic relationship of the assembled SMV isolate China with known SMV isolates. Phylogenetic trees of SMV isolates based on complete genomes (A), polyproteins (B), and PIPO sequences (C). The respective genome and protein sequences were searched against the NCBI database with BLAST, and highly matched sequences were used to construct phylogenetic trees in the MEGA6 program using the neighbor-joining method with 1000 bootstrap replications. The Kimura 2-parameter and Poisson substitution models were used for nucleotide and protein sequences, respectively.
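For readers unfamiliar with the Kimura 2-parameter model named in the caption, the pairwise distance it defines is d = -(1/2) ln[(1 - 2P - Q) sqrt(1 - 2Q)], where P and Q are the proportions of sites that differ by a transition and by a transversion, respectively. The Python sketch below computes this for two aligned sequences. It is an illustration under simple assumptions (sites with gaps or ambiguous bases are skipped, U is treated as T), not MEGA6 code, and the function name `k2p_distance` is hypothetical.

```python
import math

PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}
BASES = PURINES | PYRIMIDINES


def k2p_distance(seq1, seq2):
    """Kimura 2-parameter distance between two aligned nucleotide sequences."""
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    # Treat RNA and DNA notation alike.
    seq1 = seq1.upper().replace("U", "T")
    seq2 = seq2.upper().replace("U", "T")
    compared = transitions = transversions = 0
    for a, b in zip(seq1, seq2):
        if a not in BASES or b not in BASES:
            continue  # skip gaps and ambiguous bases
        compared += 1
        if a == b:
            continue
        same_class = (a in PURINES) == (b in PURINES)
        if same_class:
            transitions += 1    # purine<->purine or pyrimidine<->pyrimidine
        else:
            transversions += 1  # purine<->pyrimidine
    if compared == 0:
        raise ValueError("no comparable sites")
    p = transitions / compared
    q = transversions / compared
    inner = (1 - 2 * p - q) * math.sqrt(1 - 2 * q)
    return -0.5 * math.log(inner)


# Toy sequences: one transition (A->G) in ten sites.
print(k2p_distance("AAAAAAAAAA", "GAAAAAAAAA"))
```

The formula is undefined when the sequences are so divergent that 1 - 2P - Q or 1 - 2Q is not positive; in that case `math.sqrt` or `math.log` raises `ValueError` in this sketch.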
Fig. 3. SNVs of SMV in the soybean seed transcriptome. (A) Raw data were mapped to the genome sequence of SMV isolate China using BWA and visualized with the Tablet program. (B) Positions of the identified single nucleotide variations on the SMV genome, visualized with the Tablet program. Detailed information on the SNVs can be found in Supplementary Table 1. (C) Numbers of identified SNVs of SMV in the soybean seed transcriptome.
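The four frequently identified SNV types (C-U, U-C, A-G, and G-A) are all transitions, that is, substitutions within the pyrimidines or within the purines. A tally like the one in panel (C) could be produced along the lines of the Python sketch below. The SNV records are invented toy values, not entries from Supplementary Table 1, and `summarize_snvs` is a hypothetical helper.

```python
from collections import Counter

# Transitions in RNA notation: purine<->purine or pyrimidine<->pyrimidine.
TRANSITIONS = {("A", "G"), ("G", "A"), ("C", "U"), ("U", "C")}


def summarize_snvs(snvs):
    """Count SNVs per substitution type and split transitions/transversions.

    Each SNV is a (position, reference_base, variant_base) tuple.
    """
    by_type = Counter(f"{ref}-{alt}" for _, ref, alt in snvs)
    n_transitions = sum(1 for _, ref, alt in snvs if (ref, alt) in TRANSITIONS)
    n_transversions = len(snvs) - n_transitions
    return by_type, n_transitions, n_transversions


# Toy records (hypothetical positions and bases).
toy_snvs = [
    (101, "C", "U"),
    (250, "A", "G"),
    (251, "G", "A"),
    (400, "A", "C"),
    (512, "U", "C"),
]
by_type, n_ts, n_tv = summarize_snvs(toy_snvs)
print(by_type.most_common())
print(n_ts, n_tv)  # 4 1
```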