| Literature DB >> 31296922 |
Charles N Agoti1,2, My V T Phan3,4, Patrick K Munywoki5,6, George Githinji5, Graham F Medley7, Patricia A Cane8, Paul Kellam3,9, Matthew Cotten3,10,11, D James Nokes5,12,13.
Abstract
Infants (under 1-year-old) are at most risk of life threatening respiratory syncytial virus (RSV) disease. RSV epidemiological data alone has been insufficient in defining who acquires infection from whom (WAIFW) within households. We investigated RSV genomic variation within and between infected individuals and assessed its potential utility in tracking transmission in households. Over an entire single RSV season in coastal Kenya, nasal swabs were collected from members of 20 households every 3-4 days regardless of symptom status and screened for RSV nucleic acid. Next generation sequencing was used to generate >90% RSV full-length genomes for 51.1% of positive samples (191/374). Single nucleotide polymorphisms (SNPs) observed during household infection outbreaks ranged from 0-21 (median: 3) while SNPs observed during single-host infection episodes ranged from 0-17 (median: 1). Using the viral genomic data alone there was insufficient resolution to fully reconstruct within-household transmission chains. For households with clear index cases, the most likely source of infant infection was via a toddler (aged 1 to <3 years-old) or school-aged (aged 6 to <12 years-old) co-occupant. However, for best resolution of WAIFW within households, we suggest an integrated analysis of RSV genomic and epidemiological data.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31296922 PMCID: PMC6624209 DOI: 10.1038/s41598-019-46509-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Demographic details of the 20 households, number of positive samples, number of sequenced samples and clade assignment.
| HH€ ID | HH05 | HH06 | HH12 | HH14 | HH17 | HH18 | HH19 | HH25 | HH26 | HH29 | HH31 | HH35 | HH38 | HH40 | HH41 | HH42 | HH45 | HH49 | HH51 | HH57 | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| HH size | 37 | 6 | 20 | 6 | 5 | 8 | 14 | 4 | 5 | 7 | 11 | 8 | 23 | 5 | 8 | 6 | 10 | 12 | 15 | 16 |
|
| Females | 24 | 6 | 10 | 2 | 3 | 4 | 8 | 1 | 4 | 3 | 8 | 5 | 10 | 2 | 5 | 5 | 7 | 5 | 11 | 7 |
|
| In school | 9 | 3 | 6 | 3 | 2 | 4 | 7 | 0 | 3 | 3 | 3 | 4 | 10 | 2 | 5 | 3 | 8 | 3 | 7 | 8 |
|
| NPS¥ collected | 1050 | 229 | 503 | 262 | 208 | 333 | 524 | 166 | 232 | 296 | 216 | 326 | 875 | 226 | 360 | 217 | 389 | 372 | 524 | 387 |
|
| RSV + ve | 71 | 18 | 18£ | 18 | 11 | 11 | 31£ | 10 | 14£ | 25£ | 11 | 14 | 49 | 12 | 13 | 11 | 23£ | 16 | 19 | 20 |
|
| RSVA + ve | 70 | 2 | 1 | 18 | 0 | 0 | 1 | 0 | 9 | 25 | 11 | 0 | 24 | 12 | 0 | 0 | 6 | 2 | 2 | 18 |
|
| RSVB + ve | 1 | 16 | 18 | 0 | 11 | 11 | 31 | 10 | 7 | 6 | 0 | 14 | 25 | 0 | 13 | 11 | 19 | 14 | 17 | 2 |
|
| Genomes sequenced | 24 | 9 | 5 | 12 | 3 | 4 | 16 | 8 | 11 | 12 | 5 | 4 | 38 | 10 | 5 | 4 | 4 | 2 | 6 | 9 |
|
| RSVA/I | 24 | 1 | — | 12 | — | — | — | — | 9 | 12 | 5 | — | 22 | 10 | — | — | — | — | — | 8 |
|
| RSVB/I | — | — | — | — | 3 | 16 | — | — | — | — | 4 | 16 | — | — | 1 | — | — | — | — |
| |
| RSVB/II | — | 8 | 5 | — | 4 | — | — | — | — | — | — | — | — | 1 | — | — | 2 | — | — |
| |
| RSVB/III | — | — | — | — | — | — | — | — | — | — | — | — | — | — | — | 3 | — | — | 6 | — |
|
| RSVB/IV | — | — | — | — | — | — | — | 8 | 2 | — | — | — | — | — | 4 | — | 4 | — | — | — |
|
| RSVB/V | — | — | — | — | — | — | — | — | — | — | — | — | — | — | — | — | — | — | — | 1 |
|
HH = household; ID = Identity; ¥NPS = nasopharyngeal-flocked swab; £some member showed RSVA and B coinfection.
Figure 1Phylogenetic relatedness of the sequenced household RSVA and RSVB genomes. (Panel a) Shows the clustering of household viruses on the global phylogeny. Taxa shapes (all provided as filled circles) of household genomes are shown in green, other available Kenyan genomes in blue and global sequences in red. The clade assignment is shown to the right-hand side of the taxa shapes. (Panel b) Shows clustering patterns of the households RSVA and B genomes on their own. The tip filled circles represent the individual genomes and are given the same colour if they belong to the same household. The clade and sub-clade assignments are shown to the right-hand side of the taxa shapes. Bootstraps support values for each branch are shown as filled circle node shapes in black. The size of the shapes is scaled by the support value that ranged from 0 to 100%.
Figure 2Temporal divergence characteristics of the household genomes. (Panel a) Shows a time-resolved maximum likelihood phylogenetic tree for RSVA and RSVB showing the estimated node ages and the assigned clades and sub-clades. (Panel b) Shows the strength of the phylogenetic signal in relation to sampling date for RSVA and B genomes detected in study and estimated time to the date of their most recent common ancestor (MRCA).
Figure 3The distribution of number of pairwise SNPs of the household study viruses. (Panel a) Shows pairwise SNP count for all sequenced RSVA and B genomes. (Panel b to f) Shows the pairwise SNP count for viruses within the six clades we identified from genomic analysis of the viruses detected in the 20 households.
Figure 4The genetic-spatio-temporal patterns of the RSVA and RSVB viruses identified from households. (Panel a) Shows the POPART minimum spanning networks of viruses within household and across households for RSVA (n = 103) RSVA and RSVB (n = 88). Each coloured vertex represents a sampled viral haplotype, with different colours indicating the different households of origin. The size of the vertex is relative to the number of sampled isolates. Numbers along each edge indicate the number of mutations. Small black circles within the network indicate unobserved internal nodes; (Panel b) illustrates the timeline of virus identified in each household, coloured by virus clade or sub-clade; (Panel c) illustrates the relative spatial locations of household with RSVA circulating with household coloured by virus clades (the longitude and latitude coordinates are removed from both axes for confidentiality). The latitude of household 42 was jittered slightly to better visualize the difference of virus clades circulating in this household and the HHs close by (HH41 and HH25).
Figure 5Patterns of intra-host consensus level SNPs during individual infection episodes. (Panel a) Shows the distribution of the number of SNP positions where we sequenced ≥2 genomes from the same individuals. (Panel b) Shows the relationship between the number SNPs observed and the duration between the first and last sequenced samples for these individuals. Both result panels are stratified by RSV group.
Figure 6Patterns of consensus level single nucleotide polymorphism (SNP) within households during the outbreaks. (Panel a) Shows the distribution of the number of SNP position for the genome sequenced households stratified by RSV group. (Panel b) Shows the relationship between the number SNPs observed and the duration between the first and last sequenced samples for these households stratified by RSV group.
Figure 7Infection and SNP patterns in prolonged virus shedders and overall SNP abundance at various levels of observations. (Panel a) Shows the patterns of positive samples (coloured grey if no sequence was obtained, red if an RSVA genome recovered and blue if an RSV B genome recovered) across time for the nine individuals who shed virus for more than 3 weeks. Note that for subject 4102 the samples available for genome comparison were only three days apart (although the individual was RSV positive for >3 weeks). (Panel b) Shows the nucleotide differences between viruses detected from these individuals. The virus sequences were compared to the earliest virus sequence from the individual. Vertical coloured bars show the nucleotide differences. Red is a change to T, green is a change to A, black is a change to G and blue is a change to C. Grey is a change to a deletion or a non-sequenced portion of the genome. (Panel c) Overall SNP abundance at various levels of observations.
Figure 8Example infection and genomic patterns of an RSV household outbreak. Here we show RSVB infection patterns in household 38 (HH38) in which 10/23 members got infected. (Panel a) Shows the patterns of positive samples across time for the 10 infected individuals. The circles represent positive samples and are coloured blue if we obtained genomes and grey if genomes were unavailable. (Panel b) Shows the nucleotide differences between viruses detected from this household. The viruses were compared to the earliest virus genome sequenced from the household. Vertical coloured bars show the nucleotide differences. Red is a change to T, green is a change to A, black is a change to G and blue is a change to C. Grey is a change to a deletion or a non-sequenced portion of the genome. (Panel c) Shows a maximum likelihood (ML) tree of all 16 genomes from the HH. The different taxon colour symbols indicate the different household members. (Panel d) Shows the POPART minimum spanning network plot of the sequences from the households. Each vertex presents a sampled viral haplotype, with different colours indicating different individuals who provided the sample. The size of each vertex is relative to the number of sampled isolates. Hatch marks indicate the number of mutations along each edge.
Inferring who most likely infected the infant in the household.
| HH€ ID | Infant ID¥ | RSV Group | Infector Identifiable?£ | Infector ID£ | Comment |
|---|---|---|---|---|---|
| HH05 | 0502 | A | No | — | Infant was the 15th person to be positive in HH and carried virus identical to 10 other members. |
| HH05 | 0503 | A | Equivocalß | 0509 or 0511 | Infant carried virus identical to 0509 (a toddler, asymptomatic, non-school-going member) and 0511 (a toddler, symptomatic, non-school going member). |
| HH05 | 0504 | A | Yes | 0518 | Infant was one of the first 2 secondary cases, virus detected identical to index case 0518 (a school-aged and school-going member). |
| HH05 | 0505 | A | No | — | All samples from this infant failed sequencing. |
| HH06 | 0601 | B | No | — | Infant was co-index€ with 0603 (a school-aged, asymptomatic, non-school going member). |
| HH12 | 1201 | B | No | — | Infant was the index case. |
| HH14 | 1401 | A | Equivocal | 1404 or 1402 | Infant virus sequence had one nucleotide difference from 1404 (the index-case, a school-aged, symptomatic, school-going member) and 1402 (a toddler, symptomatic, non-school going member). |
| HH17 | 1701 | B | No | — | Sequencing failed for the positive sample from the potential index case, 1705 (adult) collected 7 days earlier. |
| HH19 | 1901 | B | No | — | Infant was co-index with 1903 (a pre-schooler, mostly asymptomatic, non-school going member). |
| HH25 | 2501 | B | No | — | Infant was co-index with 2504 (an adult, asymptomatic except on one visit that produced a positive sample and was a non-school going member). |
| HH26 | 2601 | A | No | — | Infant was the index case. |
| HH29 | 2901 | A | Yes | 2904 | Infant was one of the three initial secondary cases, the virus sequence was identical to that from the index case 2904 (a school-aged, school-going, symptomatic member). |
| HH31 | 3101 | A | Equivocal | 3103 or 3105 | Infant was one of the two secondary cases in the household. The virus was introduced by 3103 (a toddler, non-school going, symptomatic) member. The other potential infector (3105) was a school-aged, school-going member, asymptomatic in 2/3 positive visits. |
| HH35 | 3501 | B | No | — | Infant was co-index with 3503 (a pre-schooler member, asymptomatic in 2/3 positive visits, school going). |
| HH38 | 3801 | A | Yes | 3821 | Infant among the first 2 secondary cases. Infant virus sequence had one nucleotide difference with the index case 3821 (an adult, asymptomatic, non-school going member). |
| HH38 | 3801 | B | Yes | 3805 | Infant was 6th positive case in HH and carried virus highly similar to the 3805 virus (a school-aged, asymptomatic, non-school going member). |
| HH40 | 4001 | A | Equivocal | 4004 or 4001 | The virus sequence from infant was identical to the index case virus (4004, a school-aged, symptomatic and school-going member) and the first secondary case (4002, a toddler old, symptomatic and non-school going member). |
| HH41 | 4101 | B | No | — | Infant was the index case in this household. |
| HH42 | 4201 | B | No | — | Infant was the index case in this household. |
| HH45 | 4501 | B | No | — | Infant was the third secondary case in the household, two of which yielded no sequence and the other had two nucleotide changes. |
| HH49 | 4901 | B | No | — | Infant was the index case. |
| HH51 | 5101 | B | No | — | Infant was the 5th positive case in the household and the 5101 virus sequence was equidistant genetically from all other members. |
| HH57 | 5701 | A | Equivocal | 5707 or 5702 | This household showed high genetic variation in general. Infant was the third secondary case in the household and virus was close to index case 5707 (a school-aged member) and 5702 (a toddler member). |
¥Infant refers to the persons that were <1 year-old during our RSV surveillance period; Definition of the age defined intervals are: toddler (ages 1 to <3 years); pre-schooler (ages 3 to <6 years); school-aged child (ages 6 to <12 years); adolescent (ages 12 to <18years); adults (aged >18 years); £Infector refers to the person whom most likely infected the infant in the household; €Co-index cases refer to the two individual whom were found concurrently first to be RSV positive in a household. ßEquivocal refers to when two individuals had an equal probability of being the source of the infant infection by our analysis approach and that could not be resolved further. Note that by our diagnostics, the infant in HH18 did not get RSV infected during our surveillance period.