| Literature DB >> 31041344 |
Soo-Yon Rhee1, Brittany R Magalis2, Leo Hurley3, Michael J Silverberg3, Julia L Marcus4, Sally Slome5, Sergei L Kosakovsky Pond2, Robert W Shafer1.
Abstract
BACKGROUND: Recent advances in high-throughput molecular epidemiology are transforming the analysis of viral infections.Entities:
Keywords: HIV-1; network analysis; pol sequence; transmission
Year: 2019 PMID: 31041344 PMCID: PMC6483754 DOI: 10.1093/ofid/ofz135
Source DB: PubMed Journal: Open Forum Infect Dis ISSN: 2328-8957 Impact factor: 3.835
Figure 1.Distribution and composition of cluster sizes in the Northern California Cohort (NCC) plus Los Alamos National Laboratories (LANL) transmission network. Figure 1A shows the numbers of clusters according to their sizes. Subtype B sequences are indicated in dark gray and non-subtype B sequences are indicated in yellow. Figure 1B shows the counts of NCC and LANL sequences in clusters grouped by their sizes with NCC sequences indicated in gray and LANL sequences indicated in dark red.
Figure 2.Connection patterns between Northern California Cohort (NCC) and Los Alamos National Laboratories sequences, categorized by country/region of origin. Sequences connected to multiple countries/regions are represented once for each connection.
23 Clusters With Non-B Subtype, and Including Both NCC and LANL Sequences
| Subtype | Class | Size | NCCa | LANL Directb | LANL Indirectc | Median Year NCCd | Median Year LANLd | Countries (LANL) |
|---|---|---|---|---|---|---|---|---|
| 01_AE | International | 3123 | 13 | 58 | 3052 | 2010 | 2009 | China (2093), Vietnam (564), Thailand (179), Japan (95), Czechia (39), Australia (21), and 20 other countries, and 7 with no country data |
| 07_BC | International | 1640 | 3 | 727 | 910 | 2014 | 2011 | China (1619), Japan (10), Hong Kong (3), Poland (1), Australia (1), UK (1), and 2 other countries |
| A6 | International | 1404 | 4 | 250 | 1150 | 2009 | 2005 | Russian Federation (703), Uzbekistan (124), Ukraine (117), Latvia (99), Kazakhstan (83), Czechia (60), and 22 other countries |
| 55_01B | International | 262 | 1 | 1 | 260 | 2015 | 2011 | China (258), Thailand (1), Hong Kong (1), Japan (1) |
| C | International | 50 | 2 | 39 | 9 | 2006 | 2008 | India (34), China (8), Italy (1), Nepal (1), Thailand (1), Czechia (1), and 2 other countries |
| BF1 | International | 45 | 1 | 10 | 34 | 2014 | 2013 | Turkey (42), Cyprus (1), Sweden (1) |
| 24_BG | International | 33 | 2 | 7 | 24 | 2013 | 2003 | Cuba (29), Spain (2) |
| 51_01B | International | 26 | 1 | 1 | 24 | 2011 | 2013 | Mongolia (19), Singapore (4), Thailand (1), Canada (1) |
| 01_AE | International | 24 | 1 | 17 | 6 | 2010 | 2008 | Japan (6), Australia (5), Malaysia (3), Republic of Korea (2), Singapore (1), Belgium (1), and 3 other countries, and 2 with no country data |
| G | National | 10 | 1 | 2 | 7 | 2016 | 2011 | USA (9) |
| D | N/A | 7 | 4 | 3 | 0 | 2005 | 1997 | Uganda (3) |
| 01_AE | N/A | 5 | 1 | 4 | 0 | 2008 | 2010 | Hong Kong (1), Philippines (1), and 2 with no country data |
| 02_AG | National | 3 | 1 | 2 | 0 | 2012 | 2009 | USA (2) |
| 01_AE | N/A | 2 | 1 | 1 | 0 | 2008 | 2009 | USA (1) |
| F1 | N/A | 2 | 1 | 1 | 0 | 2014 | 2013 | Spain (1) |
| 01_AE | N/A | 2 | 1 | 1 | 0 | 2016 | 2013 | Philippines (1) |
| 01_AE | N/A | 2 | 1 | 1 | 0 | 2007 | 2008 | Thailand (1) |
| D | N/A | 2 | 1 | 1 | 0 | 2006 | 2002 | USA (1) |
| BG | N/A | 2 | 1 | 1 | 0 | 2011 | 2009 | USA (1) |
| C | N/A | 2 | 1 | 1 | 0 | 2004 | 2008 | USA (1) |
| 01_AE | N/A | 2 | 1 | 1 | 0 | 2011 | 2013 | Philippines (1) |
| 08_BC | N/A | 2 | 1 | 1 | 0 | 2012 | 2012 | China (1) |
| C | N/A | 2 | 1 | 1 | 0 | 2014 | 2008 | USA (1) |
Abbreviations: LANL, Los Alamos National Laboratories; NA, nonapplicable; NCC, Northern California Cohort.
aNumber of sequences from the Northern California Cohort (NCC).
bNumber of previously published sequences in the LANL HIV Sequence Database directly linked (TN93 genetic distance ≤0.015) to an NCC sequence in the cluster.
cNumber of LANL sequences in the cluster directly linked only to another LANL sequence in the cluster.
dMedian isolation year for the NCC and LANL sequences.
28 Subtype B Clusters Containing 10 or More Virus Sequences, and Including Both NCC and LANL Sequences
| Size | Class | NCCa | LANL Directb | LANL Indirectc | Median Year NCCd | Median Year LANLd | Countries (LANL) |
|---|---|---|---|---|---|---|---|
| 3502 | International | 512 | 1172 | 1818 | 2008 | 2003 | USA (1122), China (638), Canada (248), Japan (241), Switzerland (178), Italy (91), and 38 other countries, and 1 with no country data |
| 1492 | International | 2 | 239 | 1251 | 2007.5 | 2010 | Japan (1471), UK (13), USA (2), China (2), Germany (1), Hong Kong (1) |
| 68 | Local | 65 | 3 | 0 | 2010 | 2009 | USA (3) |
| 37 | Local | 29 | 7 | 1 | 2007 | 2008.5 | USA (7), Hong Kong (1) |
| 35 | National | 3 | 13 | 19 | 2006 | 2004.5 | USA (32) |
| 34 | National | 2 | 7 | 25 | 2008.5 | 2004 | USA (20), Canada (12) |
| 28 | Local | 19 | 8 | 1 | 2008 | 2005 | USA (6), Canada (2), Japan (1) |
| 24 | Local | 21 | 3 | 0 | 2013 | 2005.5 | USA (3) |
| 24 | International | 1 | 3 | 20 | 2013 | 2011 | Philippines (12), Canada (3), Thailand (2), Taiwan (2), Australia (1), Brazil (1), and 2 other countries |
| 22 | International | 1 | 6 | 15 | 2014 | 2004 | Denmark (13), Sweden (7), Norway (1) |
| 21 | Local | 18 | 3 | 0 | 2015 | 2010 | USA (3) |
| 21 | International | 1 | 1 | 19 | 2005 | 2009.5 | Japan (20) |
| 20 | Local | 19 | 1 | 0 | 2009 | 2009 | USA (1) |
| 19 | International | 1 | 4 | 14 | 2007 | 2005.5 | UK (11), Australia (4), Singapore (1), Canada (1), USA (1) |
| 18 | Local | 11 | 6 | 1 | 2006 | 1999 | USA (7) |
| 17 | International | 3 | 7 | 7 | 2010 | 2009 | Canada (13), USA (1) |
| 17 | International | 1 | 13 | 3 | 2011 | 2010.5 | Poland (7), UK (6), USA (2), Germany (1) |
| 16 | Local | 13 | 2 | 1 | 2012 | 2006 | USA (3) |
| 14 | National | 4 | 2 | 8 | 2011 | 2002.5 | USA (10) |
| 13 | National | 3 | 5 | 5 | 2008 | 2005 | USA (10) |
| 12 | International | 1 | 2 | 9 | 2013 | 2008 | UK (5), Germany (2), USA (1), Spain (1), Canada (1), Serbia (1) |
| 11 | Local | 10 | 1 | 0 | 2013 | 2007 | USA (1) |
| 11 | Local | 10 | 1 | 0 | 2007.5 | 2011 | Germany (1) |
| 10 | Local | 9 | 1 | 0 | 2008 | 2008 | USA (1) |
| 10 | National | 4 | 6 | 0 | 2005.5 | 2003 | USA (6) |
| 10 | N/A | 4 | 5 | 1 | 2015 | 2011 | USA (5), Australia (1) |
| 10 | National | 2 | 6 | 2 | 2004 | 2009 | USA (7), Italy (1) |
| 10 | National | 1 | 8 | 1 | 2007 | 2005 | USA (9) |
Abbreviations: LANL, Los Alamos National Laboratories; N/A, nonapplicable; NCC, Northern California Cohort.
aNumber of sequences from the NCC.
bNumber of previously published sequences in the Los Alamos National Laboratories HIV Sequence Database (LANL) directly linked (TN93 genetic distance ≤0.015) to an NCC sequence in the cluster. cNumber of LANL sequences in the cluster directly linked only to another LANL sequence in the cluster. dMedian Isolation year for the NCC and LANL sequences.
Figure 3.Bayesian maximum clade credibility tree for the largest CRF01_AE cluster. Branch lengths are scaled in time and are colored according to region (legend at right) determined using maximum parsimony ancestral state reconstruction. Clades containing Northern California Cohort (NCC) sequences are expanded for clarity, and the 2 highly supported (posterior probability [PP] ≥90%) clades comprised of NCC sequences only are highlighted in green. In addition, the location of NCC sequences is indicated by taxon labeling. Open circles located at interior nodes indicate PP ≥90%.