Jiani Chen, Xueting Qiu, Vasanthi Avadhanula, Samuel S Shepard, Do-Kyun Kim, James Hixson, Pedro A Piedra, Justin Bahl.
Abstract
BACKGROUND: Human respiratory syncytial virus (RSV) is one of the leading causes of respiratory infections, especially in infants and young children. Previous RSV sequencing studies have primarily focused on partial sequencing of the G gene (200-300 nucleotides) for genotype characterization or diagnostics. However, genotype assignment based on the G gene has not recapitulated the phylogenetic signal of other genes, and there is no consensus on RSV genotype definition.
Keywords: genotype; genotypic classification; label software; phylogenetic analysis
Year: 2021 PMID: 34894077 PMCID: PMC8983899 DOI: 10.1111/irv.12936
Source DB: PubMed Journal: Influenza Other Respir Viruses ISSN: 1750-2640 Impact factor: 4.380
FIGURE 1. Scheme of the respiratory syncytial virus (RSV) genome and comparison of RSV phylogenies inferred from different gene datasets. (A) RSV genome organization with G gene indels. (B) Likelihood scores of phylogenies inferred from different gene sequences, given the whole‐genome sequence (WGS) dataset. W1, the phylogeny inferred from WGS with G gene indels implemented as a single evolutionary event; W2, the phylogeny inferred from WGS with G gene indels implemented as multiple substitution events. *P < 0.005 in the Shimodaira–Hasegawa (SH) test compared with W1; # P < 0.005 in the approximately unbiased (AU) test compared with W1. (C) Comparison of evolutionary rates estimated from different gene datasets. Error bars indicate the confidence intervals of the estimates.
FIGURE 2. Maximum likelihood phylogenies of respiratory syncytial virus (RSV) subtypes RSV‐A (A) and RSV‐B (B) inferred from whole‐genome sequences (WGS), with genotype assignments. The genotype assignments are indicated with vertical black bars and are labeled on the right. The subclade assignments are indicated with vertical gray bars and are labeled on the left. Tip point colors represent the previously defined genotype names based on complete or partial G gene sequences. The nodes that define the genotypes and subclades are indicated with black and gray node points, respectively. Bootstrap support for each ancestral genotype/subclade node is marked. Colored columns on the right side represent G gene duplication and indels. Scale bars indicate 0.01 nucleotide substitutions per site.
Detection times and previously defined genotype names for the new RSV genotype assignments.
| Subtype | Genotype | Subclade | Detection time | Previously defined genotype name |
|---|---|---|---|---|
| A | A.1 | | 1978–1998 | |
| | A.2 | | | |
| | | A.2.1 | 1990–1994 | |
| | | A.2.2 | 2001–2015 | GA5 |
| | | A.2.3 | 2001–2015 | GA5 |
| | | A.2.4 | 1998–2013 | GA5 |
| | A.3 | | 1984–1998 | GA7 |
| | A.4 | | 1998–2009 | GA2 |
| | A.5 | | | |
| | | A.5.1 | 2007–2015 | GA2 |
| | | A.5.2 | 2006–2010 | GA2 |
| | | A.5.3 | 2008–2015 | GA2, NA1 |
| | | A.5.4 | 2008–2015 | GA2 |
| | | A.5.5 | 2011–2015 | NA1 |
| | | A.5.6 | 2011–2015 | GA2 |
| | | A.5.7 | 2012–2016 | ON1 |
| | | A.5.8 | 2012–2016 | |
| | | A.5.9 | 2008–2016 | |
| | | A.5.10 | 2012–2017 | ON1 |
| | | A.5.11 | 2011–2016 | ON1 |
| B | B.1 | | 1979–1987 | GB1 |
| | B.2 | | 1979–1991 | GB2 |
| | B.3 | | 1989–2002 | GB4 |
| | B.4 | | 2008–2012 | |
| | B.5 | | | |
| | | B.5.1 | 1992–1996 | |
| | | B.5.2 | 1997–2013 | |
| | | B.5.3 | 2008–2015 | BA |
| | | B.5.4 | 2004–2009 | BA |
| | | B.5.5 | 2006–2012 | |
| | | B.5.6 | 2006–2015 | BA |
| | | B.5.7 | 2006–2013 | BA |
| | | B.5.8 | 2012–2016 | BA |
| | | B.5.9 | 2008–2013 | BA |
| | | B.5.10 | 2009–2016 | BA |
Abbreviation: RSV, respiratory syncytial virus.
Detection periods were determined from sequences available up to 2017 and may underestimate circulation time because of bias in GenBank deposition practices.
The previously defined genotype names were collected from GenBank.
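The table shows that a single G gene-based name can correspond to several whole-genome designations. As a minimal sketch of how that correspondence could be inspected, the snippet below inverts a subset of the table rows. The names `NEW_TO_LEGACY`, `invert`, and `LEGACY_TO_NEW` are hypothetical and chosen for illustration only; the data are copied from the table, and rows without a previously defined name are omitted.

```python
from collections import defaultdict

# Subset of rows copied from the table above:
# new whole-genome designation -> previously defined (G gene-based) names.
NEW_TO_LEGACY = {
    "A.2.2": ["GA5"],
    "A.2.3": ["GA5"],
    "A.2.4": ["GA5"],
    "A.3": ["GA7"],
    "A.4": ["GA2"],
    "A.5.3": ["GA2", "NA1"],
    "A.5.5": ["NA1"],
    "A.5.7": ["ON1"],
    "A.5.10": ["ON1"],
    "A.5.11": ["ON1"],
    "B.1": ["GB1"],
    "B.2": ["GB2"],
    "B.3": ["GB4"],
}


def invert(mapping):
    """Return legacy name -> list of new designations, in table order."""
    legacy_to_new = defaultdict(list)
    for new_name, legacy_names in mapping.items():
        for legacy in legacy_names:
            legacy_to_new[legacy].append(new_name)
    return dict(legacy_to_new)


LEGACY_TO_NEW = invert(NEW_TO_LEGACY)
```

Looking up `LEGACY_TO_NEW["GA5"]` in this subset returns three subclades of A.2, which illustrates that the older names and the new assignments are not one-to-one.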
FIGURE 3. Spatial and temporal distribution of respiratory syncytial virus (RSV) genotypes for RSV‐A (A) and RSV‐B (B). The distribution is based on the detection year and the WHO region of isolation of the sequences assigned to each genotype. WHO regions: African Region (AFRO), Region of the Americas (PAHO), South‐East Asia Region (SEARO), European Region (EURO), Eastern Mediterranean Region (EMRO), and Western Pacific Region (WPRO).
Comparison of RSV molecular systematic proposals.
| Reference | This study | Peret et al. | Agoti et al. | Goya et al. | Ramaekers et al. |
|---|---|---|---|---|---|
| Dataset | RSV whole‐genome sequence with isolation year in GenBank up to April 2019 | G gene sequence in GenBank and sequences obtained in the laboratory | RSV G gene sequence isolated from 2006 through 2011 available in GenBank and sequences obtained in the laboratory | All RSV G ectodomain sequences available in GenBank up to February 2018 | All RSV whole‐genome sequences available in GenBank up to January 2019 |
| Genotyping region | CDS region of full‐length genome | Second hypervariable region of G ectodomain | G gene | G ectodomain | Full‐length genome |
| Genotyping criteria | Genotype: ≥70% bootstrap and maximal pair‐wise node distance within the clade (0.018 for RSV‐A and 0.024 for RSV‐B); subclade: clade detection time ≥ 5 years within genotype | Genotype: ≥70% bootstrap; subtype: 96% nucleotide similarity within genotype | ≥60% bootstrap and average genetic distance cutoff 1.5% | ≥80% bootstrap and | ≥70% bootstrap and patristic distance > 0.018 subs/site for RSV‐A, patristic distance > 0.026 subs/site for RSV‐B |
| Genotype name | A.1–A.5 (subclade: A.2.1–A.2.4, A.5.1–A.5.11); B.1–B.5 (subclade: B.5.1–B.5.10) | GA1–GA5 (22 subtypes among 5 genotypes); GB1–GB4 (6 subtypes among 4 genotypes) | GA1–GA7, SAA1; GB1–GB4, SAB1–SAB3, BA | GA1–GA3; GB1–GB5 | A1–A23; B1–B6 |
Abbreviation: RSV, respiratory syncytial virus.
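The genotyping criteria listed for this study can be read as simple threshold checks. The sketch below is one possible reading, not the authors' implementation. It assumes that the quoted distances (0.018 for RSV‐A, 0.024 for RSV‐B) are upper bounds on the maximal pairwise distance within a clade, and that detection time is counted inclusively in calendar years. The `Clade` class and both function names are hypothetical.

```python
from dataclasses import dataclass

# Cutoffs quoted in the "This study" column of the table above.
MAX_WITHIN_CLADE_DISTANCE = {"A": 0.018, "B": 0.024}  # substitutions per site
MIN_BOOTSTRAP = 70.0       # percent support for the defining node
MIN_SUBCLADE_YEARS = 5     # detection time within a genotype


@dataclass
class Clade:
    subtype: str                  # "A" or "B"
    bootstrap: float              # support of the defining node, in percent
    max_pairwise_distance: float  # largest pairwise distance within the clade
    first_year: int               # first detection year
    last_year: int                # last detection year


def is_genotype(clade: Clade) -> bool:
    """Assumption: the distance cutoff is an upper bound within the clade."""
    cutoff = MAX_WITHIN_CLADE_DISTANCE[clade.subtype]
    return (clade.bootstrap >= MIN_BOOTSTRAP
            and clade.max_pairwise_distance <= cutoff)


def meets_subclade_detection_time(clade: Clade) -> bool:
    """Assumption: years are counted inclusively (1990-1994 counts as 5)."""
    years_detected = clade.last_year - clade.first_year + 1
    return years_detected >= MIN_SUBCLADE_YEARS
```

Under these assumptions, a clade with 85% bootstrap support and a maximal within-clade distance of 0.020 would pass the RSV‐B cutoff but not the RSV‐A cutoff. Inclusive year counting is chosen because it is consistent with subclades such as A.2.1 (1990–1994) appearing in the detection-time table.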