| Literature DB >> 30367096 |
Susmita Shrivastava1, Vinita Puri1, Kari A Dilley1,2, Erica Ngouajio1,3, Jessica Shifflett4, Lauren M Oldfield1, Nadia B Fedorova1, Lihui Hu1, Torrey Williams5, Alan Durbin5, Paolo Amedeo1, Sujatha Rashid4, Reed S Shabman1,6, Brett E Pickett7.
Abstract
The recent emergence of Zika virus (ZIKV) has been concentrated in the Caribbean, Southeastern United States, and South- and Central America; resulting in travel-based cases being reported around the globe. As multi-disciplinary collaborations are combatting the ZIKV outbreak, the need to validate the sequence of existing strains has become apparent. Here, we report high-quality sequence data for multiple ZIKV strains made publicly available through the National Institutes of Health- (NIH) funded biorepository, BEI Resources (www.beiresources.org). Next-generation sequencing, 3' rapid amplification of cDNA ends (RACE), and viral genome annotation pipelines generated GenBank sequence records for 16 BEI Resources strains. Minor variants, consensus mutations, and consensus insertions/deletions were identified within the viral stocks using next-generation sequencing (NGS) and consensus changes were confirmed with Sanger sequencing. Bioinformatics analyses of the sequencing results confirm that the virus stocks available to the scientific research community through BEI Resources adequately represent the viral population diversity of ZIKV.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30367096 PMCID: PMC6203802 DOI: 10.1038/s41598-018-34147-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Consensus substitutions in the genome of different UGA/MR-766/1947 sequences, with numbers representing the nucleotide position in the genome for each sequence.
| Alignment position | NC_012532.1_NR-50065 | KU720415.1 | KX830960.1 | KX377335.1 | LC002520.1 | JCVI_KU963573.2_NR-50065* | KX601169.1 |
|---|---|---|---|---|---|---|---|
| 118 | 1 base insertion of C at 118; after 117 | ||||||
| 139 | 1 base deletion of T after 138 | ||||||
| 721 | A at 720 | A at 713 | A at 720 | G at 720 | A at 720 | A at 712 | A at 684 |
| 1100 | C at 1099 | T at 1092 | T at 1099 | T at 1099 | T at 1099 | C at 1091 | T at 1063 |
| 1388 | T at 1387 | T at 1380 | T at 1387 | T at 1387 | T at 1387 | C at 1379 | T at 1351 |
| 1432 | T at 1431 | T at 1424 | T at 1431 | T at 1431 | C at 1431 | T at 1423 | T at 1395 |
| 1435.0.1446 | 12 base deletion after 1433 | 12 base insertion at 1427.0.1438 compared to NC_012532.1 | 12 base insertion at 1434.0.1445 compared to NC_012532.1 | 12 base insertion at 1434.0.1445 compared to NC_012532.1 | 12 base insertion at 1434.0.1445 compared to NC_012532.1 and T at 1443 in insertion instead of C | 12 base deletion after 1425 | 12 base deletion after 1397 in consensus; also has reads with no deletion |
| 1464 | G at 1451 | A at 1456 | A at 1463 | A at 1463 | A at 1463 | A at 1443 | A at 1415 |
| 1562 | C at 1549 | C at 1554 | C at 1561 | T at 1562 | C at 1561 | C at 1541 | C at 1513 |
| 1670 | C at 1657 | C at 1662 | C at 1669 | C at 1669 | T at 1669 | C at 1649 | C at 1621 |
| 1825 | G at 1812 | A at 1817 | A at 1824 | A at 1824 | A at 1824 | A at 1804 | A at 1776 |
| 3065 | G at 3052 | G at 3057 | G at 3064 | A at 3064 | G at 3064 | G at 3044 | G at 3016 |
| 4472 | C at 4459 | T at 4464 | T at 4471 | T at 4471 | T at 4471 | T at 4451 | T at 4423 |
| 4520 | C at 4507 | C at 4512 | C at 4519 | C at 4519 | T at 4519 | C at 4499 | C at 4471 |
| 5178 | C at 5165 | C at 5170 | C at 5177 | T at 5177 | C at 5177 | C at 5157 | C at 5129 |
| 5457 | A at 5444 | T at 5449 | T at 5456 | T at 5456 | T at 5456 | T at 5436 | T at 5408 |
| 5513 | C at 5500 | C at 5505 | C at 5512 | C at 5512 | T at 5512 | C at 5492 | C at 5464 |
| 6338 | T at 6325 | C at 6330 | C at 6337 | C at 6337 | C at 6337 | C at 6317 | C at 6289 |
| 6353 | T at 6340 | G at 6345 | G at 6352 | G at 6352 | G at 6352 | G at 6332 | G at 6304 |
| 7173 | 1 bp gap of T after 7159 | T at 7165 | T at 7172 | T at 7172 | T at 7172 | T at 7152 | T at 7124 |
| 7183 | 1 bp insertion of G after 7168 | 1 base gap after 7174 | 1 base gap after 7181 | 1 base gap after 7181 | 1 base gap after 7181 | 1 base gap after 7161 | 1 base gap after 7133 |
| 7852 | A at 7838 | C at 7843 | C at 7850 | C at 7850 | C at 7850 | C at 7830 | C at 7802 |
| 7865 | A at 7851 | T at 7856 | T at 7863 | T at 7863 | T at 7863 | T at 7843 | T at 7815 |
| 8303 | G at 8289 | C at 8294 | C at 8301 | C at 8301 | C at 8301 | C at 8281 | C at 8253 |
| 8394 | A at 8380 | A at 8385 | A at 8392 | G at 8392 | A at 8392 | A at 8372 | A at 8344 |
| 10140 | C at 10126 | C at 10131 | C at 10138 | C at 10138 | C at 10138 | T at 10118 | C at 10090 |
| 10345 | T at 10331 | T at 10336 | T at 10343 | T at 10343 | T at 10343 | C at 10323 | T at 10295 |
| 10696 | A at 10682 | C at 10687 | A at 10694 | A at 10694 | A at 10694 | A at 10674 | A at 10646 |
| Missing 34 bases at 3′ end compared to NC_012532.1 | Missing 110 bases at 3′ end compared to NC_012532.1 |
*Sequenced at JCVI as part of the current work.
Substitutions in the polyprotein of different UGA/MR-766/1947 sequences, with numbers representing the amino acid positions in the polyprotein for each sequence.
| Alignment position | NC_012532.1_NR-50065 | KU720415.1 | KX830960.1 | KX377335.1 | LC002520.1 | JCVI_KU963573.2_NR-50065* | KX601169.1 |
|---|---|---|---|---|---|---|---|
| 6 | E at 6 | K at 6 | K at 6 | K at 6 | K at 6 | K at 6 | K at 6 |
| 7 | E at 7 | K at 7 | K at 7 | K at 7 | K at 7 | K at 7 | K at 7 |
| 8 | I at 8 | S at 8 | S at 8 | S at 8 | S at 8 | S at 8 | S at 8 |
| 9 | R at 9 | G at 9 | G at 9 | G at 9 | G at 9 | G at 9 | G at 9 |
| 10 | R at 10 | G at 10 | G at 10 | G at 10 | G at 10 | G at 10 | G at 10 |
| 11 | I at 11 | F at 11 | F at 11 | F at 11 | F at 11 | F at 11 | F at 11 |
| 205 | H at 205 | H at 205 | H at 205 | R at 205 | H at 205 | H at 205 | H at 205 |
| 442…444 | insertion after 441 of IVND | insertion after 441 of IVND | insertion after 441 of IVND | insertion after 441 of TVND | |||
| 446 | I at 442 | T at 446 | T at 446 | T at 446 | I at 446 | I at 442 | I at 442 |
| 453 | D at 449 | N at 453 | N at 453 | N at 453 | N at 453 | N at 449 | N at 449 |
| 573 | R at 569 | K at 573 | K at 573 | K at 573 | K at 573 | K at 569 | K at 569 |
| 1784 | N at 1780 | Y at 1784 | Y at 1784 | Y at 1784 | Y at 1784 | Y at 1780 | Y at 1780 |
| 2356 | M at 2352 | Y at 2356 | Y at 2356 | Y at 2356 | Y at 2356 | Y at 2352 | Y at 2352 |
| 2357 | H at 2353 | A at 2357 | A at 2357 | A at 2357 | A at 2357 | A at 2353 | A at 2353 |
| 2358 | G at 2353 | W at 2358 | W at 2358 | W at 2358 | W at 2358 | W at 2353 | W at 2353 |
| 2582 | I at 2578 | L at 2582 | L at 2582 | L at 2582 | L at 2582 | L at 2578 | L at 2578 |
| 2586 | E at 2582 | V at 2586 | V at 2586 | V at 2586 | V at 2586 | V at 2582 | V at 2582 |
| 2732 | C at 2728 | S at 2732 | S at 2732 | S at 2732 | S at 2732 | S at 2728 | S at 2728 |
*Sequenced at JCVI as part of the current work.
Consensus substitutions in different PRI/PRVABC59/2015 sequences, with numbers representing the nucleotide position in the genome for each sequence.
| Alignment position | KX377337.1 | KU501215.1_NR-50240 | JCVI_KX087101.3_NR-50240* | KX601168.1 |
|---|---|---|---|---|
| 1965 | G at 1965 | G at 1964 | T at 1937 | T at 1947 |
| 2781 | G at 2781 | T at 2780 | T at 2753 | T at 2763 |
| 8283 | T at 8283 | C at 8282 | C at 8255 | C at 8265 |
| Missing 1 base at 5′ and 131 base at 3′ end compared to KX377337.1 | Missing 28 base at 5′ and 1 base at 3′ end compared to KX377337.1 | Missing 18 base at 5′ and 110 base at 3′ end compared to KX377337.1 |
*Sequenced at JCVI as part of the current work.
Substitutions in the polyprotein of different PRI/PRVABC59/2015 sequences, with numbers representing the amino acid positions in the polyprotein for each sequence.
| Alignment position | KX377337.1 | KU501215.1_NR-50240 | JCVI_KX087101.3_NR-50240* | KX601168.1 |
|---|---|---|---|---|
| 620 | V at 620 | V at 620 | L at 620 | L at 620 |
| 892 | G at 892 | W at 892 | W at 892 | W at 892 |
*Sequenced at JCVI as part of the current work.
Figure 1Minor variants detected in each coding ZIKV stock, by coding region. The number of statistically significant minor variants observed at a minimum of 20% of reads across the ZIKV genome. Values were normalized by the number of nucleotides in each gene.
Figure 2Positions of minor variants detected in Asian-lineage Zika viruses. The location of codons containing minor variants in strains belonging to the Asian lineage with the frequency threshold set at 20% (or more) of reads.
Figure 3Positions of minor variants detected in African-lineage Zika viruses. The location of codons containing minor variants in strains belonging to the African lineage with the frequency threshold set at 20% (or more) of reads.
Minor variants (>20%) observed in codons across Asian and African lineages and the effect on amino acid residue.
| Strain Name | GenBank Accession | Lineage | Codon Position* | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 81 | 168 | 194 | 451 | 533 | 620 | 691 | 1033 | 1263 | 1303 | 2051 | |||
| C | prM | prM | E | E | E | E | NS1 | NS2A | NS2A | NS3 | |||
| Zika virus ZIKV/Macaca mulatta/UGA/MR-766/1947 | KX601169 | African | |||||||||||
| Zika virus ZIKV/Macaca mulatta/UGA/MR-766_SM150-V8/1947 | KU963573 | African | I81M | ||||||||||
| Zika virus ZIKV/Aedes africanus/SEN/DakAr41524/1984 | KX601166 | African | |||||||||||
| Zika virus ZIKV/Aedes africanus/SEN/DAK-AR-41524_A1C1-V5/1984 | KY348860 | African | |||||||||||
| Zika virus ZIKV/Homo sapiens/NGA/IbH-30656_SM21V1-V3/1968 | KU963574 | African | D168N | N/C | |||||||||
| Zika virus ZIKV/Homo Sapiens/PRI/PRVABC59/2015 | KX087101 | Asian | L620V | S1033N | N/C | ||||||||
| Zika virus ZIKV/Aedes sp./MYS/P6-740/1966 | KX601167 | Asian | D168N | N/C | V620L | Y691H | A1303T | ||||||
| Zika virus ZIKV/Aedes aegypti/MYS/P6-740/1966 | KX694533 | Asian | I81M | N/C | H691Y | V1263A | A1303T | N/C | |||||
| Zika virus ZIKV/Aedes.sp/MEX/MEX_2-81/2016 | KX446950 | Asian | |||||||||||
| Zika virus ZIKV/Aedes.sp/MEX/MEX_I-7/2016 | KX446951 | Asian | I81M | N/C | |||||||||
| Zika virus ZIKV/Homo sapiens/THA/PLCal_ZV/2013 | KX694532 | Asian | I81M | A1263V | N/C | ||||||||
| Zika virus ZIKV/Homo Sapiens/COL/FLR/2015 | KX087102 | Asian | D451E | S1033N | |||||||||
| Zika virus ZIKV/Homo sapiens/PAN/BEI-259634_V4/2016 | KX198135 | Asian | S1033N | A1263V | N/C | ||||||||
| Zika virus ZIKV/Homo sapiens/PAN/CDC-259359_V1-V3/2015 | KX156774 | Asian | D451E | S1033N | N/C | ||||||||
| Zika virus ZIKV/Homo sapiens/PAN/CDC-259249_V1-V3/2015 | KX156775 | Asian | S1033N | A1263V | N/C | ||||||||
| Zika virus ZIKV/Homo sapiens/PAN/CDC-259364_V1-V2/2015 | KX156776 | Asian | N/C | N/C | |||||||||
| Zika virus ZIKV/Homo sapiens/HND/R103451/2015 | KX694534 | Asian | I81M | N/C | |||||||||
| Zika virus ZIKV/Aedes aegypti/MEX/MEX_I-44/2016 | KY648934 | Asian | V1263A | ||||||||||
*Positions are codon numbers in the polyprotein with genes indicated inside the parentheses.
N/C = No change in amino acid sequence.