David P Maison, Vivek R Nerurkar.
Abstract
Here, we present a methodology to define the origin of SARS-CoV-2 variants, exemplified by defining the introduction of the B.1.429 variant in Hawai'i. We used 187 B.1.429 variant sequences from Hawai'i deposited in GenBank and GISAID as of March 20, 2021, as an example to develop the methodology. Briefly, the steps are: i) acquire sequences, ii) perform multiple sequence alignment, iii) trim the alignment, iv) remove incomplete sequences, v) remove duplicates, and vi) generate a phylogenetic tree. The tree defined the most recent common ancestor as the origin. Further, the multiple sequence alignment used to generate the phylogenetic tree identified 20 single nucleotide polymorphisms in the B.1.429 variant genome. The Centers for Disease Control and Prevention defines B.1.429 as a variant initially found in California. This variant was introduced in Hawai'i multiple times in early 2021. Based on the phylogenetic tree, we conclude that the B.1.429 variant entered Hawai'i at different timepoints from at least seven different states in the continental United States. This information provides a tool for policy makers and public health officials in applying precision public health genomics.
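Steps iii) to v) of the workflow can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' implementation: the function names are hypothetical, the trim coordinates are chosen by the user, and "incomplete" is assumed here to mean any sequence that still contains a character other than A, C, G, or T after trimming. Alignment (step ii) and tree building (step vi) are left to dedicated tools.

```python
def parse_fasta(text):
    """Parse aligned FASTA text into {name: sequence} (insertion order kept)."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:]
            records[name] = []
        elif name is not None:
            records[name].append(line.upper())
    return {n: "".join(parts) for n, parts in records.items()}


def trim_alignment(aln, start, end):
    """Step iii: keep alignment columns [start, end) for every sequence."""
    return {n: s[start:end] for n, s in aln.items()}


def drop_incomplete(aln, allowed="ACGT"):
    """Step iv (assumed criterion): drop sequences with gaps, N, or other codes."""
    return {n: s for n, s in aln.items() if set(s) <= set(allowed)}


def collapse_duplicates(aln):
    """Step v: group names by identical sequence; one entry per unique sequence."""
    groups = {}
    for name, seq in aln.items():
        groups.setdefault(seq, []).append(name)
    return groups


# Toy alignment (made-up sequences, for illustration only).
demo = """>ref
NNACGTACGTNN
>s1
NNACGTACGTAA
>s2
--ACGTACGTAA
>s3
NNACGNACGTAA
>s4
NNACTTACGTAA
"""

aln = parse_fasta(demo)
trimmed = trim_alignment(aln, 2, 10)   # drop ragged alignment ends
complete = drop_incomplete(trimmed)    # s3 is removed (contains N)
unique = collapse_duplicates(complete)
for seq, names in unique.items():
    print(seq, "=".join(names))
```

Grouping names by identical sequence, rather than discarding the duplicates outright, keeps a record of which sequences were collapsed so they can still be labeled on the tree.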
Year: 2021 PMID: 33821261 PMCID: PMC8020990 DOI: 10.21203/rs.3.rs-378702/v1
Source DB: PubMed Journal: Res Sq
Figure 1. Phylogenetic Tree of B.1.429 Variants Sequenced Worldwide. This figure shows an approximately maximum-likelihood tree generated by FastTree in Geneious Prime 2021.0.3 (http://www.geneious.com). The tree was rooted to the SARS-CoV-2 Lineage A reference sequence (EPI_ISL_406801). Blue text indicates sequences from Hawai’i. Pink text indicates the most recent common ancestor sequence(s) of B.1.429 in Hawai’i. Clusters and sequences shown in blue were evaluated for their most recent common ancestor (pink) to define the origin of the variant sequence. Black text indicates global B.1.429 variant sequences not necessarily directly affiliated with variants found in Hawai’i. Sequence identifiers appended with an equal sign (=) denote sequences that sRNAtoolbox identified as identical. Created with BioRender.com.
Figure 2. Introduction of the SARS-CoV-2 B.1.429 Variant in Hawai’i from the Continental United States. This figure shows the states within the United States from which the SARS-CoV-2 B.1.429 variant has been introduced into Hawai’i. The represented states are Tennessee, California, Utah, New Jersey, Louisiana, Colorado, and Washington (shown in pink). Figure made with a stock image by Clker-Free-Vector-Images from Pixabay. Stock image edited in Adobe Photoshop 21.2.0. Created with BioRender.com.
B.1.429 Consensus (>90%) Single Nucleotide Polymorphisms and Amino Acid Substitutions as Determined From 1,931 Unique and Unambiguous B.1.429 Lineage Sequences
| Gene or region | Nucleotide Loci | Nucleotide Change | Amino Acid Position | Amino Acid Change |
|---|---|---|---|---|
| | 1,059 | C → T | 265 | Thr → Ile |
| | 2,395 | C → T | 710 | Val → Val |
| | 2,597 | T → C | 778 | Leu → Leu |
| | 3,037 | C → T | 924 | Phe → Phe |
| | 8,947 | C → T | 2,894 | Asn → Asn |
| | 12,100 | C → T | 3,945 | Ala → Ala |
| | 12,878 | A → G | 4,205 | Ile → Val |
| | 14,408 | C → T | 4,715 | Pro → Leu |
| | 17,014 | G → T | 5,584 | Asp → Tyr |
| | 21,600 | G → T | 13 | Ser → Ile |
| | 22,018 | G → T | 152 | Trp → Cys |
| | 22,917 | T → G | 452 | Leu → Arg |
| | 23,403 | A → G | 614 | Asp → Gly |
| | 24,349 | T → C | 929 | Ser → Ser |
| | 25,563 | G → T | 57 | Gln → His |
| | 26,681 | C → T | 53 | Phe → Phe |
| | 27,890 | G → T | - | - |
| | 28,272 | A → T | - | - |
| | 28,887 | C → T | 205 | Thr → Ile |
| | 29,362 | C → T | 363 | Phe → Phe |
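The >90% consensus criterion in the table title can be illustrated with a short sketch. It assumes an alignment in which every sequence has the same length as the reference, and it reports 1-based alignment columns, which match genome coordinates only if the reference row contains no gaps. The function name `consensus_snps` and the toy data are hypothetical and are not taken from the paper.

```python
from collections import Counter


def consensus_snps(reference, sequences, threshold=0.9):
    """Return (position, ref_base, alt_base) for columns where a single
    non-reference base is present in more than `threshold` of sequences."""
    snps = []
    total = len(sequences)
    for i, ref_base in enumerate(reference):
        # Count the bases observed in this alignment column.
        counts = Counter(seq[i] for seq in sequences)
        base, count = counts.most_common(1)[0]
        if base != ref_base and count / total > threshold:
            snps.append((i + 1, ref_base, base))  # 1-based column index
    return snps


# Toy data: column 2 differs from the reference in 10/10 sequences,
# column 5 differs in only 8/10 and so falls below the threshold.
reference = "ACGTAC"
sequences = ["ATGTGC"] * 8 + ["ATGTAC"] * 2
print(consensus_snps(reference, sequences))
```

Using a strict "greater than" comparison follows the ">90%" wording of the table title; a variant present in exactly 90% of sequences would not be reported by this sketch.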