Literature DB >> 28932211

Strain Level Streptococcus Colonization Patterns during the First Year of Life.

Meredith S Wright1, Jamison McCorrison1, Andres M Gomez1, Erin Beck1, Derek Harkins1, Jyoti Shankar1, Stephanie Mounaud1, Edelwisa Segubre-Mercado2, Aileen May R Mojica2, Brian Bacay2, Susan A Nzenze3, Sheila Z M Kimaro3, Peter Adrian3, Keith P Klugman3, Marilla G Lucero2, Karen E Nelson1, Shabir Madhi3, Granger G Sutton1, William C Nierman1, Liliana Losada1.   

Abstract

Pneumococcal pneumonia has decreased significantly since the implementation of the pneumococcal conjugate vaccine (PCV), nevertheless, in many developing countries pneumonia mortality in infants remains high. We have undertaken a study of the nasopharyngeal (NP) microbiome during the first year of life in infants from The Philippines and South Africa. The study entailed the determination of the Streptococcus sp. carriage using a lytA qPCR assay, whole metagenomic sequencing, and in silico serotyping of Streptococcus pneumoniae, as well as 16S rRNA amplicon based community profiling. The lytA carriage in both populations increased with infant age and lytA+ samples ranged from 24 to 85% of the samples at each sampling time point. We next developed informatic tools for determining Streptococcus community composition and pneumococcal serotype from metagenomic sequences derived from a subset of longitudinal lytA-positive Streptococcus enrichment cultures from The Philippines (n = 26 infants, 50% vaccinated) and South African (n = 7 infants, 100% vaccinated). NP samples from infants were passaged in enrichment media, and metagenomic DNA was purified and sequenced. In silico capsular serotyping of these 51 metagenomic assemblies assigned known serotypes in 28 samples, and the co-occurrence of serotypes in 5 samples. Eighteen samples were not typeable using known serotypes but did encode for capsule biosynthetic cluster genes similar to non-encapsulated reference sequences. In addition, we performed metagenomic assembly and 16S rRNA amplicon profiling to understand co-colonization dynamics of Streptococcus sp. and other NP genera, revealing the presence of multiple Streptococcus species as well as potential respiratory pathogens in healthy infants. A range of virulence and drug resistant elements were identified as circulating in the NP microbiomes of these infants. This study revealed the frequent co-occurrence of multiple S. pneumoniae strains along with Streptococcus sp. and other potential pathogens such as S. aureus in the NP microbiome of these infants. In addition, the in silico serotype analysis proved powerful in determining the serotypes in S. pneumoniae carriage, and may lead to developing better targeted vaccines to prevent invasive pneumococcal disease (IPD) in these countries. These findings suggest that NP colonization by S. pneumoniae during the first years of life is a dynamic process involving multiple serotypes and species.

Entities:  

Keywords:  Serotypes; Streptococcus pneumoniae; nasopharyngeal microbiome; pneumococcal conjugate vaccine

Year:  2017        PMID: 28932211      PMCID: PMC5592222          DOI: 10.3389/fmicb.2017.01661

Source DB:  PubMed          Journal:  Front Microbiol        ISSN: 1664-302X            Impact factor:   5.640


Introduction

Invasive pneumococcal disease (IPD) caused by Streptococcus pneumoniae has decreased significantly after implementation of the pneumococcal conjugate vaccine (PCV) (Pilishvili et al., 2010; Tocheva et al., 2011). However, nasopharyngeal carriage of the pneumococcus in children <5 years old appears to continue at roughly 20–30% of the population in the US or Europe (Weatherholtz et al., 2010; Sharma et al., 2013; Fleming-Dutra et al., 2014; Lee et al., 2014). Carriage in low and middle income countries is higher with a pooled average of ~65% (Adegbola et al., 2014) and up to 75% in South Africa (Nzenze et al., 2014). Results from epidemiologic surveys show that the incidence of capsular serotypes targeted by the vaccine (VT) has decreased, while non-VT serotypes have increased (Huang et al., 2005; Pelton et al., 2007; Sharma et al., 2013). In particular, evidence is emerging that the serotypes targeted in the current vaccines include a lower fraction of the serotypes causing IPD in young children particularly in Asia and Africa compared to the protection afforded young children by the vaccines in developed countries (Hausdorff et al., 2000). Detection of S. pneumoniae in clinical samples has traditionally been performed using microbiological cultures (Reller et al., 2008) or more recently, by quantitative PCR targeting the autolysin (lytA) gene (Messmer et al., 2004; WHO and CDC, 2011). In addition to detection of the organism from clinical samples, it is important to characterize the capsular serotype, since it has been shown that VT isolates are more likely to cause invasive disease than non-VT isolates (Weatherholtz et al., 2010; Fleming-Dutra et al., 2014). Capsule type is determined by serology using standardized antisera (Reller et al., 2008) or by multiplex PCR approaches that are able to discriminate between 20 and 37 of the more than 90 known capsule types (Satzke et al., 2013). However, these methods are laborious and expensive, and they have the inherent shortcoming that they cannot easily detect several capsular types in a single sample (Satzke et al., 2013). Methods that use high-throughput DNA sequencing have been presented as alternatives for capsular typing (Leung et al., 2012; Ip et al., 2014). These methods have relied on using a PCR enrichment step where the capsule loci are preferentially amplified directly from clinical samples, and thus suffer from similar limitations as multiplex PCR strategies. A more recent typing scheme using reads from whole genome sequence (WGS) data was developed to assign an in silico serotype (Kapatai et al., 2016). Here, we expand on the WGS approach using whole-metagenome sequencing of Streptococcus-enriched cultures and simultaneous development of bioinformatics approaches that clearly identify the capsular type. Our study demonstrates that metagenomics methods for serotyping S. pneumoniae directly from infant samples provide the potential for determining capsule information, the presence of other NP colonizers, and for providing data relating to virulence and drug resistance carriage.

Materials and methods

Study design and subjects

This study was performed in healthy infants whose mothers delivered at the Research Institute of Tropical medicine associated clinic in Muntinlupa City, Philippines or Chris Hani Baragwanath Hospital in Johannesburg, South Africa between June 2012 and January 2013. All mothers attending the clinics during the recruitment periods at each location were invited to participate in the study and written consent was obtained from all who agreed to participate. The study was approved by the Ethics Committees at both clinical sites and at the J. Craig Venter Institute (JCVI). Children were recruited to participate for 12 months. All of the children in South Africa were vaccinated against pneumococcus using PCV-7 according to the national vaccination schedule (Madhi et al., 2012). The Philippines had not implemented a national vaccination program against pneumococcus so half the children were randomly assigned to receive the PCV-10 vaccine (Rodenburg et al., 2010) vaccine.

Nasopharyngeal sample collection and enrichment protocol

Sampling was performed according to each infant's scheduled visits: at birth (within 6 h), at the time of their first PCV vaccination (usually 6 weeks old), at the time of their second dose (usually at 14 weeks old), at the time of the last dose (40 weeks old), and at 12 months. Maternal samples were obtained at birth (only South Africa) and at 12 months (both sites). NP samples from infants and mothers were collected by pediatricians in the clinics using Copan Eswabs following manufacturer's instructions. After collection, samples were placed in 1 ml liquid Aimes buffer and stored on ice until delivery to the clinical laboratory. A 200 μl aliquot of NP sample was transferred to 6 ml Supplemented Todd-Hewitt Broth (THB) containing 0.5% yeast extract and 17% rabbit (Philippines) or fetal bovine (South Africa) serum and 10 mg/ml colistin and incubated at 37 °C at 5% CO2 without shaking for 6 h. Cells were then centrifuged at 9,000 rpm for 10 min and frozen at −20 C. Metagenomic DNA was extracted from this pellet using Qiagen DNeasy Blood and Tissue kit (Qiagen) following manufacturer's instructions. Purified DNA was transferred to QIAsafe DNA tubes (Qiagen), allowed to dry uncovered for 10–12 h in a laminar flow hood, and shipped to JCVI at ambient temperature.

Definition of carriage by lytA Pcr

The presence of S. pneumoniae was assessed using a lytA qPCR as described (WHO and CDC, 2011) using primers F373: 5′-ACGCAATCTAGCAGATGAAGCA-3′ and R424: 5′ TCGTGCGTTTTAATTCCAGCT-3′. DNA was amplified using the following program: 95°C for 10 min, followed by 95°C for 15 s, 60°C for 1 min using TaqMan Universal Master Mix on a Biorad CFX96 Real-Rime PCR machine (RITM) or Applied Biosystems 7500 Real-Time PCR system(RMPRU). Samples were considered lytA-positive if the Ct value was below 35 (WHO and CDC, 2011).

Metagenomic DNA sequencing

Only a subset of lytA-positive samples was selected for metagenomic sequencing, where infants were sampled at random with the goals to obtain lytA-positive samples for each representative age and following the pneumococcal population in a subset of infants for the duration of the study. Genomic DNA sequencing libraries were generated using standard library construction (Illumina), adding sample specific barcodes. Sequencing was performed by pooling 8–22 samples in a single 2 × 250 or 2 × 300 MiSeq run to obtain ~35 million reads per run.

Metagenomic assembly pipeline

A pipeline to assemble reads and evaluate assembly content was developed as follows: (1) reads were adaptor and quality trimmed using trimmomatic (Bolger et al., 2014); (2) reads that mapped to the human reference genome GRCh38 (GCA_000001405.15) using bowtie2 version 2.2.7 (Langmead and Salzberg, 2012) with “sensitive” settings were removed; (3) filtered reads were then assembled with metaSPAdes version 3.7.1 (arXiv:1604.03071); and (4) BLAST-based evaluation of taxonomic and serotype content (details below) was conducted across metaSPAdes assembled contigs.

Assembly-based and read-based taxonomic analysis

In order of execution, contigs larger than 200 bp from each metagenomic assembly were aligned against (1) a database of common Streptococcus genomes to identify intended host targets (alignments greater than 95% identity); and (2) the human reference GRCh38 to remove ancillary human contigs (alignments greater than 90% identity). Finally, the remaining set of contigs were aligned to the NCBI NT Bacterial Database (ref, link) BLASTN matches with >97% identity over 5% of the contig length were considered a match. The filtered BLASTN output from each sample were combined and then queried to identify the predominant taxa present in the enrichments by compiling all of the occurrences of a given reference genome across the samples. This genome list was then used to build a reference nucleotide database for read-mapping to more quantitatively assess the relative abundance of each taxa in the enrichment samples (Table S1). The database also included all finished S. pneumoniae genomes. Metagenomic reads were mapped using bowtie2 with very-sensitive settings such that reads could only map once to the reference taxonomic database. Counts of mapped reads to each genome were quantified and were used to assess the relative abundance in different samples.

16S rRNA community analysis of the non-enriched NP microbiome

To determine the pre-enrichment NP bacterial community composition, 16S rRNA amplicon profiling was performed on the initial sample before the enrichment step. Operational taxonomic units (OTUs) were generated de novo from raw Illumina sequence reads using an in-house analyses pipleline relying on the UPARSE (Edgar, 2013) and mothur (Schloss et al., 2009) open-source bioinformatics tools. Briefly, paired-end reads were trimmed of adapter sequences, barcodes, and primers prior to assembly, followed by discarding low quality reads and singletons. After a de-replication step and abundance determination, sequences were filtered for chimeras and clustered into OTUs. To assign taxonomy, we used the Wang classifier, and bootstrapped using 100 iterations. We set mothur to report full taxonomies only for sequences where 80 or more of the 100 iterations were the identical (cutoff = 80). Taxonomies were then assigned to the OTUs with mothur using version SSU Ref NR 99 version of the SILVA 16S ribosomal RNA database (Quast et al., 2013) as the reference. Tables with OTUs and the corresponding taxonomy assignments were generated and used in subsequent analyses. The resulting matrices were summarized by frequency across species-level resolution.

Assembly-based in silico capsular and multi-locus sequence typing

The first step for establishing in silico method for serotyping was to create a nucleotide database of serotype sequences. Serotypes were assumed to be predominantly driven by the capsule polysaccharide (cps) locus of the Streptococcus strains. Capsule sequence exemplars were retrieved for all known serotypes from Bentley et al. (2006) and Skov Sorensen et al. (2016). Assemblies were aligned to this reference serotype nucleotide database for in silico serotyping using BLASTn. Sequence alignments greater than 98% identity over 2,000 bp were kept, and top matches of the cumulative alignment length for each serotype were identified via manual curation because in some cases multiple top matches were identified when more than one serotype was present. This was evident by cases in which different contigs had top matches to different serotypes. If no match was identified, metagenomic assemblies were then queried with aliA (NP_357921.1) and dexB (NP_357904.1), the two conserved genes upstream and downstream of cps cluster. The sequence region between these two flanking genes was then extracted from each metagenome assembly and evaluated by BLAST against the nucleotide non-redundant nt/nr database at NCBI to identify the match with the top total score. Multi-locus sequence typing (MLST) was performed on each metagenomic assembly in silico using LOCUST (Brinkac et al., 2017) using the S. pneumoniae MLST scheme at https://pubmlst.org/spneumoniae (Jolley and Maiden, 2010).

Virulence and antibiotic resistance gene analysis

Contigs from metagenomic enrichment analysis were compared using BLAST alignments against a reference databases containing known antibiotic resistance determinants or virulence factors including S. pneumoniae-specific virulence genes (Zhou et al., 2007; Kadioglu et al., 2008; Liu and Pop, 2009; Mitchell and Mitchell, 2010; Chen et al., 2012; Blumental et al., 2015). BLAST results were filtered for hits that were greater than 90% identical over 80% of the reference length.

Results

lytA-positive burden in South Africa and Philippine infants

A total of 393 nasopharyngeal (NP) samples from 203 infants enrolled in our pediatric microbiome study were analyzed for lytA carriage as a proxy for S. pneumoniae colonization (Table 1). Most samples represented the first sample immediately after birth, the 6-, and 14-, 40-week, and 12 months since these corresponded to the pediatric visits when the PCV vaccine was administered or were the end-point of the microbiome project. After culture enrichment, the proportion of lytA-positive samples (CT < 35) increased consistently with infant age, ranging from as low as 23.7% at birth to consistently above 85% after 7 months, with very little difference in the lytA-positive rates between the Philippines and South Africa, irrespective of vaccination status. Mother carriage of lytA-positive samples in South Africa was ~45% while lytA carriage from mothers in the Philippines was nearly 100%.
Table 1

Summary of lytA tested samples.

Infant ageSiteNegativePositivePercent Positive
BirthPhilippines2133.3
South Africa441423.7
6 weeks*Philippines321328.9
South Africa221946.3
3 months*South Africa61470
4 months**Philippines679.3
6 monthsSouth Africa1480
7 monthsSouth Africa72295.6
8 monthsSouth Africa21286
9 monthsSouth Africa2100
10 months*Philippines105985.5
South Africa11091
11 monthsSouth Africa1889
12 monthsPhilippines13095.6
South Africa14100
13 monthsSouth Africa10100
15 monthsSouth Africa1150
Total13725661.5

PCV administration in Philippines and South Africa.

PCV administration in Philippines.

Positive samples were defined as those with qPCR Ct values < 35 cycles.

Summary of lytA tested samples. PCV administration in Philippines and South Africa. PCV administration in Philippines. Positive samples were defined as those with qPCR Ct values < 35 cycles. We obtained longitudinal time points for 93 subjects ranging from 2 to 7 samples per infant (average 3 samples). Thirty-three (35%) of those infants had lytA-positive samples every time they were sampled, including their earliest visit (Table 1). Of the remaining infants with longitudinal samples, 54 had negative lytA samples in their early visits and became lytA-positive over time, following the overall trend described above. The remaining six infants had negative lytA samples each time they were tested, though all but 2 of these samples corresponded to less than 2 months of age, again suggesting that the carriage and abundance of lytA-positive organisms is low at a very young age.

Metagenomic sequencing and analysis of streptococcal carriage

A total of 51 samples were selected for further characterization through metagenomic sequencing in order to identify the various strains colonizing the NP of infants in each country. Samples were selected to represent primarily infants who had the maximum number of longitudinal lytA-positive samples in order to determine the effect of vaccination on pneumococcal population dynamics. Twelve samples were obtained from seven South African infants and 39 samples from 25 Philippine infants. Roughly one-half of the samples belonged to longitudinal samplings (Table 2). The majority of samples encoded multiple lytA genes in the metagenomic assembly of at least 80% nucleotide identity to the S. pneumoniae reference lytA sequence (NP_359346.1) (range: 1–4 copies, Table 2).
Table 2

Metagenomic enrichment sample characteristics.

# of lytA (NP_359346.1) copiesIn silico serotyping (approximate coverage)
SampleInfantAge (months)Total Reads# of S. pneumoniae ContigsS. pneumoniae Assembled LengthMLST ST>95%>90%>80%Serotype 1Serotype 2# non-encapsulated sequences
RITM002I4RITM00242,196,65960623243131223f (34x)15a (19x)0
RITM003I4RITM00341,449,07714244108301323a0
RITM004I4RITM00441,160,99572440317203452
RITM008I4RITM00841,190,82973733053752nt2
RITM009I10RITM00910642,274653325503412141
RITM019I10RITM01910603,97724321694521118c0
RITM020I4RITM02042,267,7334683708044771116a1
RITM020I10RITM020101,591,24114444769031112nt2
RITM020I12RITM020121,217,95483530591232nt1
RITM021I10RITM021106,470,94783038694273831216f1
RITM022I4RITM02241,113,057718410151811215f1
RITM022I12RITM022121,358,394724324584411nt1
RITM023I4RITM02341,533,81552542279051126b0
RITM029I10RITM029102,479,31810004431671125b (151x)15b/c (50x)1
RITM034I10RITM034101,326,646614249107511nt1
RITM034I12RITM034121,508,90543241878071119f1
RITM038I10RITM03810739,843515383100611219f1
RITM042I10RITM042101,380,897805291472912nt1
RITM043I2RITM0436 weeks3,162,0691228346082611~6b0
RITM046I2RITM0466 weeks2,477,87942328366981nt1
RITM052I2RITM0526 weeks3,110,195641264225912nt2
RITM052I4RITM05241,854,279266225224942123a1
RITM052I12RITM05212452,42564126422591223a (40x)45 (40x)0
RITM053I10RITM05310803,3321498311552011nt1
RITM053I12RITM053121,152,46270444678271126a1
RITM059I10RITM05910642,400565337655647316b1
RITM059I12RITM05912566,500481314231147316b1
RITM060I10RITM06010859,2235142462829Novel1123f0
RITM060I12RITM06012474,9331242370725312450
RITM070I10RITM07010347,80487127428041nt1
RITM070I12RITM07012521,88373033083051119a1
RITM071I10RITM071101,400,39382945299391219f0
RITM071I12RITM071121,954,6426525186483111f (109x)19c (22x)0
RITM077I2RITM0776 weeks1,670,01355331595544745120 (103x)19c (11x)1
RITM077I10RITM077102,614,42334831784581116a0
RITM081I10RITM081101,293,683829452993911nt2
RITM081I12RITM081121,205,2775784613175111nt2
RITM084I2RITM0846 weeks1,523,20872928886461nt2
RITM089I4RITM0894373,10061727234921110a0
RMPRU004I4RMPRU00441,813,223167320442891nt0
RMPRU004I7RMPRU00410934,919812110604981111nt1
RMPRU008I4RMPRU00841,645,268120230590093nt0
RMPRU008I7RMPRU00810885,194768245658211nt1
RMPRU010I3RMPRU0106 weeks8,554,2121022088121105501115a0
RMPRU010I7RMPRU010101,733,3291447320690211116f0
RMPRU011I4RMPRU0116 weeks1,688,331125210073640881116f1
RMPRU011I7RMPRU011102,262,50352023292451nt0
RMPRU011I9RMPRU011101,273,109742442806711116f1
RMPRU022I12RMPRU02212467,6551180639982219c1
RMPRU023I2RMPRU0236 weeks677,04446026808988334123f0
RMPRU031I2RMPRU0316 weeks710,7892222147330564711130

RITM, the Philippines; RMPRU, South Africa. S. pneumoniae contigs and assembled length determined by BLAST match to S. pneumoniae reference genomes for each contig (see text for details). Multilocus sequence typing (MLST) sequence types (ST) were assigned using the PubMLST database. The number of lytA copies in each assembly is based on % identity thresholds relative to the S. pneumoniae R6 copy (NP_359346.1). Serotype assignment was generated from in silico typing of WGS metagenomic assemblies (see text for details). nt, non-typeable.

Metagenomic enrichment sample characteristics. RITM, the Philippines; RMPRU, South Africa. S. pneumoniae contigs and assembled length determined by BLAST match to S. pneumoniae reference genomes for each contig (see text for details). Multilocus sequence typing (MLST) sequence types (ST) were assigned using the PubMLST database. The number of lytA copies in each assembly is based on % identity thresholds relative to the S. pneumoniae R6 copy (NP_359346.1). Serotype assignment was generated from in silico typing of WGS metagenomic assemblies (see text for details). nt, non-typeable.

Population structure of nasopharyngeal Streptococcus community

Our metagenomic approach to studying Streptococcus spp. colonizing the nasopharynx allowed a very detailed view of the various organisms that reside in that space. The taxonomic composition of the enriched NP microbiome based on percentage of mapped reads to various reference genomes indicated the predominance of S. pneumoniae in most samples (Figure 1, Table S2, mean: 61.6%, range: 3.7–98.4%). Other common Streptococcus taxa include S. mitis (mean: 14.9%), S. pseudopneumoniae (mean: 14.4%), S. oralis (mean: 1.2%). Other Streptococcus sp. were detected at >5% in a limited number of infants: S. pyogenes (1 infant: 9.4%), S. parasanguinis (1 infant: 11.5%), S. anginosus (1 infant, 12.2%). One NP sample (RMPRU011I9) had the most diverse Streptococcus community comprised of four species with >10% mapped reads, though the previous two samples from that infant were comprised of primarily S. pneumoniae and S. mitis and S. pseudopneumoniae. Other taxa present in the enrichments include Staphylococcus aureus (4 samples >5% reads, range: 0–96%), Gemella haemolytica (5 samples >5%, range: 0–12.8%), and Neisseria lactamica (1 sample >5%, range: 0–7.2%). The diverse Streptococcus sample (RMPRU011I9) also had a substantial number of Gemella reads in stark contrast to the previous two samples from the infant.
Figure 1

Relative abundance of NP microbiome taxa from metagenomic analysis of enrichment cultures. Abundance is based on normalized read counts mapped to a reference database of Streptoccocus species and other taxa detected in the NP enrichment assemblies (Table S1). The sample names that indicate infant and sampling time point are provided under the x-axis. Blue lines connecting sample names highlight longitudinal samples originating from the same infant. In silico serotype classification was assigned using a BLAST-based strategy by aligning metagenomic assemblies against a reference database of capsule biosynthetic loci (see Methods for details). Red-colored serotype text indicates a vaccine-type serotype while a red circle depicts which vaccine-type serotype samples came from vaccinated infants. A count of the number of contigs aligning to the nonecapsulated NT_110_58-like capsule locus is given (see text for details).

Relative abundance of NP microbiome taxa from metagenomic analysis of enrichment cultures. Abundance is based on normalized read counts mapped to a reference database of Streptoccocus species and other taxa detected in the NP enrichment assemblies (Table S1). The sample names that indicate infant and sampling time point are provided under the x-axis. Blue lines connecting sample names highlight longitudinal samples originating from the same infant. In silico serotype classification was assigned using a BLAST-based strategy by aligning metagenomic assemblies against a reference database of capsule biosynthetic loci (see Methods for details). Red-colored serotype text indicates a vaccine-type serotype while a red circle depicts which vaccine-type serotype samples came from vaccinated infants. A count of the number of contigs aligning to the nonecapsulated NT_110_58-like capsule locus is given (see text for details). Streptococcus sp. 16S rRNA amplicon sequences comprised between <1 and 33% of reads (Figure 2, Table S3) from the initial NP microbiome sample. The community composition varied greatly in relative abundance of different taxa, but the primary taxa was largely consistent with Dolosigranulum, Haemophilus, Prevotella, and Moraxella being the most prevalent. Other taxa prevalent in a fewer number of samples include Porphyromonas, Finegoldia, and Johnsonella.
Figure 2

Infant nasopharyngeal microbiome taxonomic composition based on 16S rRNA amplicon sequencing of the pre-enrichment sample. Taxa with >5% relative abundance in at least one sample are depicted.

Infant nasopharyngeal microbiome taxonomic composition based on 16S rRNA amplicon sequencing of the pre-enrichment sample. Taxa with >5% relative abundance in at least one sample are depicted.

Capsular type detection and serotype prediction

We applied in silico BLAST-based methods to ascertain the capsular type(s) present in these metagenomic samples. Using a criteria of >98% nucleotide identity, 33 samples were assigned a serotype. The most common serotype was 16f (four samples) while the following were encountered three times: 6a, 6b, 16f, 19c, 19f, and 23a (Figure 1, Table 2). The presence of more than one serotype was detected in five infants. Ten samples from The Philippines cohort had capsule types belonging to PCV10 vaccine types, and five of those samples originate from vaccinated infants (i.e., RITM009, RITM059, and RITM071), most of which occurred in infants >10 months of age. However, one VT-serotype sample originated from a 6-week-old infant (RITM043I2). One vaccinated South African infant carried a vaccine type serotype (23f) at 6 weeks, the timepoint for the first PCV7 administration. For longitudinal samples originating from the same infant, only three infants had the same capsule type at more than one visit (RITM052:23A, RITM059:6b, and RMPRU011:16f). The samples without in silico serotype matches were further interrogated to determine whether nontypeable Streptococcus capsule biosynthetic genes were present by examining sequence content between aliA and dexB, the conserved genes flanking the capsule biosynthetic cluster. The majority of extracted capsule sequences matched at 94–96% identity to several variants of the capsule locus detailed in Park et al. (2012) including the complete S. pneumoniae NT-110-58 genome (CP007593) (Hilty et al., 2014) (Table 2), as well as the complete genomes of S. mitis B6 (FN568063.1) and S. pseudopneumoniae IS7493 (CP002925.1) (Shahinas et al., 2011). Similar sequences (>94% similarity) were also present in the serotypeable samples as well (Figure 1) indicating that they are prevalent and co-exist with S. pneumoniae serotypes.

In silico MLST analysis

MLST types were definitively assigned from the metagenomes of twelve samples (Table 2, Table S4), two of which were from the same infant (RITM059) with the same MLST type (ST473). One additional samples (RITM060I10) represented a novel sequence type comprised of previously classified alleles. South African sequence types matched other ST from South African in the PubMLST isolate database, while Philippine samples were comprised of sequence types from more diverse locations.

Virulence factors genes

We predicted metagenomes from S. pneumoniae samples to encode core virulence factors lytA, ply (pneumolysin), nanA (neuraminidase A), hyl (hyaluronidase), pspC (pneumococcal surface protein C), and pavA (pneumococcal adhesion and virulence A) (Hiller et al., 2007). All samples encoded at least one S. pneumoniae virulence factor when compared to reference databases (Table S5; Zhou et al., 2007; Chen et al., 2012). The five representative virulence factors examined here were present in >50% of the metagenomes, where ply was present in almost all samples (98%). Several samples encoded more than one sequence distinguishable copy of hyl, ply, and pavA. One sample that contained both S. aureus, and S. pyogenes encoded a total of 62 virulence factors, including both staphylococcal and streptococcal toxins and complement evasion factors (Tables S5, S6). The majority of Staphylococcus-containing samples had more than 10 virulence factors including haemolysins and toxins, indicating the presence of fully virulent S. aureus (Powers and Wardenburg, 2014).

Antibiotic resistance markers

Twelve samples contained antibiotic resistance genetic determinants (Table S7): nine samples from the Philippines and three from South Africa. Seven samples encoded only one antibiotic resistance marker and two samples encoded 6 or more. Metagenome assemblies from two samples encoded the bla(TEM-1) gene, which is the most common β-lactamase in Gram-negative bacteria (Muhammad et al., 2014). The gene was encoded in contigs with relatively low read coverage was highly similar to Neisseria plasmids (Muhammad et al., 2014). Both TEM-1 samples were obtained from Philippine infants, one from a 6-week visit (RITM077), and the other from the 12-month visit (RITM022). One sample from a 6-week-old infant (RMPRU023I2) encoded the methicillin-resistance gene, mecA. The mecA gene was surrounded by sequences homologous to the transponson involved in mecA mobilization (Katayama et al., 2001), suggesting it was encoded by a mobile element.

Discussion

In this study, we report the use of targeted culture enrichment and metagenomic sequencing to study the dynamics of Streptococcus carriage in the infant nasopharynx in the Philippines and South Africa. A total of 393 samples from 203 infants were analyzed, where the majority of early samples were lytA-negative which is consistent with other studies and with colonization occurring later in life (>4 months) (Coles et al., 2001; Ercibengoa et al., 2012; Turner et al., 2012). Broth enrichment culture has been demonstrated to be a powerful approach to increasing the sensitivity for detecting the carriage of S. pneumoniae in the upper respiratory tract. When methods are compared on the same samples, the carrier fraction of the samples and the serotype diversity are maximal for the broth enrichment culture (da Gloria Carvalho et al., 2010). Metagenomic sequencing of the entire enrichment culture allowed us to see the range of bacteria that were selected by the enrichment culture protocol. The assembly data suggested that streptococcal enrichment was successful, with Streptococcus sp. reads accounting for an average of 2% of the 16S rRNA reads from the pre-enriched NP community, to an average of 93% of post-enrichment mapped reads. All samples had more than one Streptococcus sp. present including S. pseudopneumoniae and S. mitis. The detection of multiple lytA sequences of varying nucleotide similarity supports the idea that the NP community is colonized by a complex assemblage of Streptococcus organisms. This observation highlights the potential for genetic exchange among closely related Streptococcus sp. as recombination is a well-characterized mechanism for generating genetic diversity within the species (Hanage et al., 2009; Chaguza et al., 2015, 2016). Among the other taxa identified genera by 16S rRNA gene analysis in the non-enriched primary sample, were common NP microbiome taxa including Dolosigranulum, Haemophilus, Moraxella, and Prevotella sequences (Bogaert et al., 2011; Perez-Losada et al., 2017). Some studies have suggested that Corynebacterium and Dolosigranulum presence are protective from S. pneumoniae colonization (de Steenhuijsen Piters and Bogaert, 2016), but the limited sample size and general low prevalence of S. pneumoniae in this 16S rRNA data precludes much inference about the relationship. Other taxa enriched in the metagenomic analysis include Staphylococcus, Gemella, and Neisseria indicating that the enrichment protocol shifted the community composition substantially. The use of lytA for detecting pneumococcus in community acquired pneumonia cases has been documented and is frequently employed as a rapid assay (Abdeldaim et al., 2010). In this study where the subjects were largely free of respiratory infections, the lytA assay detected the presence of S. pneumoniae as a member of the commensal microbiome but also detected other lytA containing streptococcal species in the commensal NP microbiome. Recent screening assays have in fact documented that lytA is not a specific diagnostic gene for S. pneumoniae (Simoes et al., 2016). Undoubtedly the use of a second pneumococcus selective gene would greatly improve the specificity of the assay for use as a rapid pneumococcus diagnostic tool for respiratory infections. Although the presence of Streptococcus spp. in the nasopharynx of these infant subjects was both common and frequent, it was relatively uncommon for a child to have consistent colonization by the same S. pneumoniae strain. There were only three instances of the same capsular type in samples obtained over 3 months apart. Studies of serotype switching have been focused on such switching events in the context of PCV vaccination (for example see Hanage et al., 2011) but not in such young children. Serotypes related to the vaccine (PCV10 in the Philippines and PCV7 in South Africa) were observed in 11 samples, seven of which came from vaccinated infants. However, two of these samples originated from infants on their first scheduled vaccine administration, while the other five samples came from infants >10 months of age. This highlights the need for further examination of vaccine success in these populations. Multiple samples also had more than one serotype present concurrently, and many encoded both typeable and non-typeable capsule loci. This is consistent with previous studies using different methods, and again highlights the potential for genetic exchange between Streptococcus strains (Kamng'ona et al., 2015). In silico MLST typing indicates that many samples were not typeable, but for those that were, only one infant had the same sequence type more than once (Table 2). The remaining samples could not be specifically assigned to a single MLST type either because the assembly did not resolve all the loci necessary for typing especially in those cases with co-occurring S. pneumoniae, or because loci had no matches compared to known MLST types. The 16S rRNA NP longitudinal sampling demonstrated consequential variation between successive samples for the NP community composition in our infants during their first year of life. It is likely that the serotype variation we are observing is a consequence of the inherent instability of the NP microbiome during this early stage of life (Jebaraj et al., 1999; Hohwy et al., 2001; Turner et al., 2011; Ercibengoa et al., 2012). Another striking observation on the NP microbiomes in these infants is the prevalence of potentially pathogenic species acting as commensal members of the young infant NP microbiome. We have noted the presence of pathogenic bacteria in the respiratory tract microbiome of lung transplant patients in the absence of an infection, and often when these patients did present with a pneumonia, the pathogen was earlier detectable as a prior member of the commensal population before the onset of disease (Shankar et al., 2015). In this context it is not surprising that we detected the presence of at least one S. pneumoniae virulence factor in all of the metagenomic enrichment culture samples, with the majority of Staphylococcus-containing samples exhibiting more than 10 virulence factors. Furthermore, our detection of antibiotic resistance genes and mobile elements that can be easily transferred between strains suggests that the infant NP serves as a reservoir for antibiotic resistant potential. These observations are consistent with a hypothesis that in these young infants, potentially pathogenic bacteria are common members of the commensal microbiome and that bacterial respiratory disease does not simply result from the presence of a bacterial respiratory pathogen but is the result of a more complex interaction between the host immune system status and the respiratory tract microbiome. However, the mechanisms behind the activation and phenotypic manifestation of virulence in the early NP microbiome remain unclear.

Conclusions

The in silico serotype approach here may contribute to serotype analysis of strains isolated from infants that could lead to better data on residual serotypes that constitute the reservoir for future pneumococcal infections post-targeted vaccines to prevent IPD in infants in these countries. In addition, the study revealed the frequent presence of bacterial pathogens in the NP microbiome of these infants with genomes encoding an abundance of virulence and antibiotic resistance elements. Evidence is emerging that the serotypes targeted in the current vaccines are not as protective for young children in developing countries. The serotype tool reported here may contribute to serotype analysis of strains isolated for infants with IPD that could lead to developing better targeted vaccines to prevent IPD in infants in these countries.

Ethics statement

The study was approved by the Ethics Committees at both clinical sites and at the J. Craig Venter Institute (JCVI). For the South African cohort, approval was issued by the University of Witwatersrand, Johannesburg Human Research Ethics committee on 2/24/12 and reviewed with approval on 8/6/2013. The J. Craig Venter Institute Institutional Review Board approval was issued on 2/4/2012. For the Philippine cohort, approval was issued on 2/28/2012 by the Research Institute for Tropical Medicine Institutional Review board, assigned number 2012-002. The J. Craig Venter Institute Institutional Review Board approval was issued on 4/4/2012.

Availability of data

The WGS data supporting the conclusions of this article are available in GenBank under accession number PRJNA31170 http://www.ncbi.nlm.nih.gov/bioproject/PRJNA311705/. Other concluding datasets can be found within article and its additional files.

Author contributions

LL and MW were the major contributors to study design, performed the analysis, and crafted the manuscript. JM, AG, EB, DH, and JS participated in software tool design and data analysis and performed the statistical analysis and interpretation of the data. StM managed the materials and data exchanges and interactions among the clinical site and JCVI and organized the metadata and participated in editing of the manuscript. ES, AM, BB, SN, SK, ML, and ShM Contributed to study design, and sample and data collection. SK and PA participated in laboratory testing. GS participated in software tool design, and data analysis as well as critically reading the manuscript. KK was instrumental in developing the collaborative interactions with the project's South Africa clinical site and contributed to the coordination of the project with the Philippine clinical site. He provided guidance to the serotyping study design and performed a critical review of the manuscript prior to submission. KN and WN participated in the study design, coordinated the project across the three collaborating sites, and participated in editing the manuscript.

Conflict of interest statement

KK declares that he is currently employed by the Bill and Melinda Gates Foundation employee. The other authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. The reviewer AG and handling Editor declared their shared affiliation.
  59 in total

1.  Comparative genomic analyses of seventeen Streptococcus pneumoniae strains: insights into the pneumococcal supragenome.

Authors:  N Luisa Hiller; Benjamin Janto; Justin S Hogg; Robert Boissy; Susan Yu; Evan Powell; Randy Keefe; Nathan E Ehrlich; Kai Shen; Jay Hayes; Karen Barbadora; William Klimke; Dmitry Dernovoy; Tatiana Tatusova; Julian Parkhill; Stephen D Bentley; J Christopher Post; Garth D Ehrlich; Fen Z Hu
Journal:  J Bacteriol       Date:  2007-08-03       Impact factor: 3.490

Review 2.  Standard method for detecting upper respiratory carriage of Streptococcus pneumoniae: updated recommendations from the World Health Organization Pneumococcal Carriage Working Group.

Authors:  Catherine Satzke; Paul Turner; Anni Virolainen-Julkunen; Peter V Adrian; Martin Antonio; Kim M Hare; Ana Maria Henao-Restrepo; Amanda J Leach; Keith P Klugman; Barbara D Porter; Raquel Sá-Leão; J Anthony Scott; Hanna Nohynek; Katherine L O'Brien
Journal:  Vaccine       Date:  2013-12-17       Impact factor: 3.641

Review 3.  Streptococcus pneumoniae: virulence factors and variation.

Authors:  A M Mitchell; T J Mitchell
Journal:  Clin Microbiol Infect       Date:  2010-02-02       Impact factor: 8.067

4.  lytA-based identification methods can misidentify Streptococcus pneumoniae.

Authors:  Alexandra S Simões; Débora A Tavares; Dora Rolo; Carmen Ardanuy; Herman Goossens; Birgitta Henriques-Normark; Josefina Linares; Hermínia de Lencastre; Raquel Sá-Leão
Journal:  Diagn Microbiol Infect Dis       Date:  2016-03-29       Impact factor: 2.803

5.  Post-PCV7 changes in colonizing pneumococcal serotypes in 16 Massachusetts communities, 2001 and 2004.

Authors:  Susan S Huang; Richard Platt; Sheryl L Rifas-Shiman; Stephen I Pelton; Donald Goldmann; Jonathan A Finkelstein
Journal:  Pediatrics       Date:  2005-09       Impact factor: 7.124

6.  Improved detection of nasopharyngeal cocolonization by multiple pneumococcal serotypes by use of latex agglutination or molecular serotyping by microarray.

Authors:  Paul Turner; Jason Hinds; Claudia Turner; Auscharee Jankhot; Katherine Gould; Stephen D Bentley; François Nosten; David Goldblatt
Journal:  J Clin Microbiol       Date:  2011-03-16       Impact factor: 5.948

Review 7.  Medical microbiology: laboratory diagnosis of invasive pneumococcal disease.

Authors:  Anja M Werno; David R Murdoch
Journal:  Clin Infect Dis       Date:  2008-03-15       Impact factor: 9.079

8.  Variability and diversity of nasopharyngeal microbiota in children: a metagenomic analysis.

Authors:  Debby Bogaert; Bart Keijser; Susan Huse; John Rossen; Reinier Veenhoven; Elske van Gils; Jacob Bruin; Roy Montijn; Marc Bonten; Elisabeth Sanders
Journal:  PLoS One       Date:  2011-02-28       Impact factor: 3.240

Review 9.  Igniting the fire: Staphylococcus aureus virulence factors in the pathogenesis of sepsis.

Authors:  Michael E Powers; Juliane Bubeck Wardenburg
Journal:  PLoS Pathog       Date:  2014-02-13       Impact factor: 6.823

10.  Recombination in Streptococcus pneumoniae Lineages Increase with Carriage Duration and Size of the Polysaccharide Capsule.

Authors:  Chrispin Chaguza; Cheryl P Andam; Simon R Harris; Jennifer E Cornick; Marie Yang; Laura Bricio-Moreno; Arox W Kamng'ona; Julian Parkhill; Neil French; Robert S Heyderman; Aras Kadioglu; Dean B Everett; Stephen D Bentley; William P Hanage
Journal:  MBio       Date:  2016-09-27       Impact factor: 7.867

View more
  6 in total

1.  Intranasal Immunization with the Commensal Streptococcus mitis Confers Protective Immunity against Pneumococcal Lung Infection.

Authors:  Sudhanshu Shekhar; Rabia Khan; Karl Schenck; Fernanda Cristina Petersen
Journal:  Appl Environ Microbiol       Date:  2019-03-06       Impact factor: 4.792

Review 2.  A Narrative Review of Pneumococcal Disease in Children in the Philippines.

Authors:  Amgad Gamil; Miriam Y Lalas; Maria Rosario Z Capeding; Anna Lisa T Ong-Lim; Mary Ann C Bunyi; Angelica M Claveria
Journal:  Infect Dis Ther       Date:  2021-04-24

Review 3.  Competence in Streptococcus pneumoniae and Close Commensal Relatives: Mechanisms and Implications.

Authors:  Gabriela Salvadori; Roger Junges; Donald A Morrison; Fernanda C Petersen
Journal:  Front Cell Infect Microbiol       Date:  2019-04-03       Impact factor: 5.293

4.  Longitudinal changes in the nasopharyngeal resistome of South African infants using shotgun metagenomic sequencing.

Authors:  Rendani I Manenzhe; Felix S Dube; Meredith Wright; Katie Lennard; Heather J Zar; Stephanie Mounaud; William C Nierman; Mark P Nicol; Clinton Moodley
Journal:  PLoS One       Date:  2020-04-22       Impact factor: 3.240

5.  Characterization of Pneumococcal Colonization Dynamics and Antimicrobial Resistance Using Shotgun Metagenomic Sequencing in Intensively Sampled South African Infants.

Authors:  Rendani I Manenzhe; Felix S Dube; Meredith Wright; Katie Lennard; Stephanie Mounaud; Stephanie W Lo; Heather J Zar; William C Nierman; Mark P Nicol; Clinton Moodley
Journal:  Front Public Health       Date:  2020-09-22

6.  Strain-level resolution and pneumococcal carriage dynamics by single-molecule real-time (SMRT) sequencing of the plyNCR marker: a longitudinal study in Swiss infants.

Authors:  Oluwaseun Rume-Abiola Oyewole; Philipp Latzin; Silvio D Brugger; Markus Hilty
Journal:  Microbiome       Date:  2022-09-22       Impact factor: 16.837

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.