| Literature DB >> 21036781 |
Abstract
Influenza has been circulating in the human population and has caused three pandemics in the last century (1918 H1N1, 1957 H2N2 and 1968 H3N2). The 2009 A(H1N1) was classified by World Health Organization as the fourth pandemic. Influenza has a high evolution rate, which makes vaccine design challenging. We here consider an approach for early detection of new dominant strains. By clustering the 2009 A(H1N1) sequence data, we found two main clusters. We then define a metric to detect the emergence of dominant strains. We show on historical H3N2 data that this method is able to identify a cluster around an incipient dominant strain before it becomes dominant. For example, for H3N2 as of 30 March 2009, the method detects the cluster for the new A/British Columbia/RV1222/2009 strain. This strain detection tool would appear to be useful for annual influenza vaccine selection.Entities:
Mesh:
Substances:
Year: 2010 PMID: 21036781 PMCID: PMC2978544 DOI: 10.1093/protein/gzq078
Source DB: PubMed Journal: Protein Eng Des Sel ISSN: 1741-0126 Impact factor: 1.650
Fig. 1The evolutionary path of 2009 A(H1N1) influenza. Strain #1: A/California/05/2009. Strain #2: A/California/04/2009. Strain #7: A/California/07/2009. Strain #12: A/Texas/05/2009. Strain #28: A/New York/19/2009. For complete strain names, see Supplementary data. Strains from the Northern and Southern hemisphere are shown as red dots and blue dots, respectively. One branch represents one substitution in the amino acid sequence.
Fig. 2(a) Kernel density estimation for the protein distance map of 2009 A(H1N1) influenza as of 5 December 2009. (b) The protein distance map of 2009 A(H1N1) influenza. The vertical and horizontal axes of both figures represent protein distance as defined in Equation (1). A 0.0018 unit of protein distance equals one substitution in the HA protein sequence of H1N1. The height and colors in (a) both represent the density of isolates.
Fig. 3(a) The protein distance map and (b) corresponding Kernel density estimation of influenza from 1968 to 2007. The vertical and horizontal axes of both figures represent protein distance as defined in Equation (1). A 0.0030 unit of protein distance equals one substitution in the HA1 protein sequence of H3N2. The colors in (a) represent the time of collection of the isolates. The colors and height in (b) represent the density of isolates. Each cluster is named after the first vaccine strain in the cluster. HK68: Hongkong/1/68, EN72: England/42/72, VT75: Victoria/3/75, TX77: Texas/1/77, BK79: Bangkok/1/79, PP82: Philippines/2/82, SC87: Sichuan/2/87, BJ89: Beijing/32/92, SD93: Shandong/9/93, JB94: Johannesburg/33/94, WH95: Wuhan359/95, SN97: Sydney/5/97, PM99: Panama/2007/99, FJ02: Fujian/411/2002.
Summary of results
| Flu season | Vaccine strain from WHO ( | Our prediction | Circulating H3N2 strain | Circulating subtype |
|---|---|---|---|---|
| 1996–1997 | Wuhan/359/95 | Wuhan/359/95 | Wuhan/359/95 | H3 |
| 1997–1998 | Wuhan/359/95 | Wuhan/359/95 | Sydney/5/97 | H3 |
| 1998–1999 | Sydney/5/97 | Sydney/5/97 | Sydney/5/97 | H3 |
| 1999–2000 | Sydney/5/97 | Sydney/5/97 | Sydney/5/97 | H3 |
| 2000–2001 | Panama/2007/1999 | Panama/2007/1999 | N/A | H1 |
| 2001–2002 | Panama/2007/1999 | Panama/2007/1999 | Panama/2007/1999 | H3 |
| 2002–2003 | Panama/2007/1999 | Fujian/411/2002 | N/A | H1 |
| 2003–2004 | Panama/2007/1999 | Fujian/411/2002 | Fujian/411/2002 | H3 |
| 2004–2005 | Fujian/411/2002 | Fujian/411/2002 | Fujian/411/2002 | H3 |
| 2005–2006 | California/7/2004 | California/7/2004 | California/7/2004 | H3 |
| 2006–2007 | Wisconsin/67/2005 | Wisconsin/67/2005 | Wisconsin/67/2005 | H3 |
| 2007–2008 | Wisconsin/67/2005 | Wisconsin/67/2005 | N/A | H1 |
| 2008–2009 | Brisbane/10/2007 | Brisbane/10/2007 | Brisbane/10/2007 | H3 |
| 2009–2010 | Brisbane/10/2007 | BritishColumbia/RV1222/09 | BritishColumbia/RV1222/09 | H1 |
| 2010–2011 | Perth/16/2009 | BritishColumbia/RV1222/09 | N/A | N/A |
This table includes the H3N2 vaccine strains, our prediction of dominant strains, the reported dominant circulating H3N2 strains (World Health Organization, 1995, 1996, 1997, 1998, 1999, 2000, 2001, 2002, 2003, 2004, 2005a, 2006, 2007, 2008, 2009a, 2010) and the circulating subtypes in the northern hemisphere (World Health Organization, 1995, 1996, 1997, 1998, 1999, 2000, 2001, 2002, 2003, 2004, 2005a, 2006, 2007, 2008, 2009a, 2010). Circulating H3N2 strains are absent if the dominant subtype is H1 or influenza B. The reported dominant H3N2 strains and circulating subtypes data are from WHO Weekly Epidemiological Record (http://www.who.int/wer/en/).
Fig. 4(a) Kernel density estimation and (c) protein distance map for H3N2 viruses between 1 October 2002 and 1 February 2003. (b) Kernel density estimation and (d) protein distance map for H3N2 viruses between 1 October 2001 and 9 September 2002. We plot a dotted line to separate the two clusters. The vertical and horizontal axes of all figures represent protein distance as defined in Equation (1). A 0.0030 unit of protein distance equals one substitution of the HA1 protein sequence of H3N2.
Fig. 5(a) Kernel density estimation and (c) protein distance map for H3N2 viruses from 1 October 2008 to 14 June 2009. (b) Kernel density estimation and (d) protein distance map for H3N2 viruses between 1 October 2008 and 30 March 2009. The vertical and horizontal axes of all figures represent protein distance as defined in Equation (1). A 0.0030 unit of protein distance equals one substitution of the HA1 protein sequence of H3N2.
Fig. 6(a) Kernel density estimation for the protein distance map for H3N2 viruses between 1 October 2003 and 30 September 2004. (b) Kernel density estimation for the protein distance map for H3N2 viruses between 1 October 2003 and 1 February 2004. (c) Protein distance map for H3N2 viruses between 1 October 2003 and 30 September 2004. We plot a dotted line to separate the two clusters. (d) Protein distance map for H3N2 viruses between 1 October 2003 and 1 February 2004. The vertical and horizontal axes of all figures represent protein distance. A 0.0030 unit of protein distance equals one mutation of the HA1 protein sequence of H3N2.