Literature DB >> 31636302

The FluPRINT dataset, a multidimensional analysis of the influenza vaccine imprint on the immune system.

Adriana Tomic1,2, Ivan Tomic3, Cornelia L Dekker4, Holden T Maecker5, Mark M Davis6,7,8.   

Abstract

Machine learning has the potential to identify novel biological factors underlying successful antibody responses to influenza vaccines. The first attempts have revealed a high level of complexity in establishing influenza immunity, and many different cellular and molecular components are involved. Of note is that the previously identified correlates of protection fail to account for the majority of individual responses across different age groups and influenza seasons. Challenges remain from the small sample sizes in most studies and from often limited data sets, such as transcriptomic data. Here we report the creation of a unified database, FluPRINT, to enable large-scale studies exploring the cellular and molecular underpinnings of successful antibody responses to influenza vaccines. Over 3,000 parameters were considered, including serological responses to influenza strains, serum cytokines, cell phenotypes, and cytokine stimulations. FluPRINT, facilitates the application of machine learning algorithms for data mining. The data are publicly available and represent a resource to uncover new markers and mechanisms that are important for influenza vaccine immunogenicity.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 31636302      PMCID: PMC6803714          DOI: 10.1038/s41597-019-0213-4

Source DB:  PubMed          Journal:  Sci Data        ISSN: 2052-4463            Impact factor:   6.444


Background & Summary

Influenza virus has a devastating societal impact, causing up to 650,000 deaths every year worldwide[1]. Vaccination with the seasonal mixture of strains is only partially effective, even among otherwise-healthy people, leading to serious pandemics. Vaccine efficacy is defined as the ability of a new seasonal influenza vaccine to prevent influenza-like illness compared to the placebo group, according to the US Food and Drug Administration (FDA) in their guideline for vaccine licensure[2]. Young children and elderly, due to high susceptibility to influenza infection[3], are encouraged to be vaccinated annually making a placebo-controlled clinical efficacy study in this population an extremely costly and arduous undertaking. The alternative approach to correlate vaccine-mediated protection in these populations is based on immunogenicity endpoints, recommended by FDA. The appropriate immunogenicity endpoint is the influenza-specific antibody titer measured by a hemagglutination inhibition (HAI) assay to each viral strain included in the vaccine. Vaccine protection is then assessed based on seroconversion (4-fold increase in the HAI antibody titers after vaccination) and seroprotection (geometric mean HAI titer ≥40 after vaccination). The HAI titer ≥40 after vaccination is associated with a 50% reduction in risk of influenza infection or disease[4]. Lack of pre-existing influenza immunity, especially T cells, has been identified as one of the major predispositions for failure to generate antibody response to vaccination[5-7]. However, exact phenotypes of CD4+ and CD8+ T cells, which are important for protective influenza immunity in general and to vaccination with live attenuated influenza vaccine (LAIV) in specific, remain elusive. The application of computational biology and machine learning to clinical datasets holds great promise for identifying immune cell populations and genes that mediate HAI antibody responses to influenza vaccines as a correlate of vaccine protection[8-15]. Unfortunately the correlates of protection identified are not consistent between cohorts and study years[8,9,11,12]. Some of the identified challenges leading to such discrepancy are small sample sizes and analysis of only one aspect of the biology, such as molecular correlates of protection by using transcriptome data[16]. Additionally, comparison of the results of different predictive models is hampered by the lack of a consensus regarding what defines the outcome of vaccination, i.e. high vs. low responders. For these reasons, it is necessary to generate a unified dataset that includes multiple measurements across age, gender and racially diverse populations, including different vaccine types. Specifically, it is of the utmost importance to include single-cell analysis at the protein level, such as mass cytometry combined with multiple high-dimensional biological measurements, since these have power to reveal heterogeneity of the immune system[17-21]. To accomplish that goal, we created FluPRINT, a dataset consisting of 13 data types in standardized tables on blood and serum samples taken from 740 individuals undergoing influenza vaccination with inactivated (IIV) or live attenuated seasonal influenza vaccines (LAIV) (Fig. 1). The FluPRINT dataset contains information on more than 3,000 parameters measured using mass cytometry (CyTOF), flow cytometry, phosphorylation-specific cytometry (phospho-flow), multiplex cytokine assays (multiplex ELISA), clinical lab tests (hormones and complete blood count), serological profiling (HAI assay) and virological tests. In the dataset, vaccine protection is measured using HAI assay, and following FDA guidelines individuals are marked as high or low responders depending on the HAI titers after vaccination. FluPRINT includes fully integrated and normalized immunology measurements from eight clinical studies conducted between 2007 to 2015 and assayed at the Human Immune Monitoring Center (HIMC) of Stanford University. Among those, one contains data from 135 donors enrolled in the 8-year long ongoing longitudinal study following immune responses to seasonal inactivated influenza vaccines. This is particularly interesting set of data that can deepen our understanding how repeated vaccination effects vaccine immunogenicity. The MySQL database containing this immense dataset is publicly available online (www.fluprint.com). The dataset represents a unique source in terms of value and scale, which will broaden our understanding of immunogenicity of the current influenza vaccines.
Fig. 1

Overview of the FluPRINT dataset. The FluPRINT dataset consists of the 740 individuals from 8 clinical studies (SLVP015, SLVP017, SLVP018, SLVP021, SLVO024, SLVP028, SLVP029 and SLVP030) and 8 influenza seasons (from 2017 to 2015). (a) Pie chart shows distribution of donors across clinical studies. The dataset contains harmonized data from different assays, including mass and flow cytometry, phosphorylated cytometry (Phospho-flow), multiplex ELISA (Luminex assay), clinical lab tests, such as complete blood test, analysis of hormones and virological assays (CMV and EBV antibody titers) and serological profiling with hemagglutination inhibition assay, which was used to define high and low responders. (b) Distribution of assays across years available for each clinical study.

Overview of the FluPRINT dataset. The FluPRINT dataset consists of the 740 individuals from 8 clinical studies (SLVP015, SLVP017, SLVP018, SLVP021, SLVO024, SLVP028, SLVP029 and SLVP030) and 8 influenza seasons (from 2017 to 2015). (a) Pie chart shows distribution of donors across clinical studies. The dataset contains harmonized data from different assays, including mass and flow cytometry, phosphorylated cytometry (Phospho-flow), multiplex ELISA (Luminex assay), clinical lab tests, such as complete blood test, analysis of hormones and virological assays (CMV and EBV antibody titers) and serological profiling with hemagglutination inhibition assay, which was used to define high and low responders. (b) Distribution of assays across years available for each clinical study.

Methods

Clinical studies

All studies were approved by the Stanford Institutional Review Board and performed in accordance with guidelines on human cell research. All vaccines used were licensed in the US for use in the populations studied. Peripheral blood samples were obtained at the Clinical and Translational Research Unit at Stanford University after written informed consent/assent was obtained from participants. Samples were processed and cryopreserved by the Stanford HIMC BioBank according to the standard operating protocols available online at the HIMC website (https://iti.stanford.edu/himc/protocols.html).

Data collection

Data involving individuals enrolled in influenza vaccine studies at the Stanford-LPCH Vaccine Program was accessed from the Stanford Data Miner (SDM) which holds data processed by HIMC from 2007 up to date[22]. The FluPRINT cohort was assembled by filtering the SDM for assays available in studies involving influenza vaccination. This resulted in a dataset containing data from 740 healthy donors enrolled in influenza vaccine studies conducted by the Stanford-LPCH Vaccine Program from 2007 to 2015 in the following studies: SLVP015, SLVP017, SLVP018, SLVP021, SLVP024, SLVP028, SLVP029 and SLVP030. Online-only Table 1 provides a summary of all studies including information about clinical trial identification numbers on www.clinicaltrials.gov, clinical protocols, ImmPort accession numbers to access raw data and quality reports, and finally references to published works where data was used. ImmPort is a web portal that contains data from NIAID-funded immunology studies and clinical trials (https://immport.niaid.nih.gov/)[23]. All data contained in the FluPRINT dataset are made freely available through the Shared Data Portal on ImmPort repository. In all studies, except for study SLVP015, vaccine was administrated only once. The study SLVP015 was longitudinal study where 135 participants received vaccine in consecutive years from 2007–2015. In all studies, generally healthy participants were included, and in some studies (SLVP017 for the 2010, 2011 and 2013, SLVP021 and SLVP029) those that were vaccinated in the prior influenza season were excluded. A total of 121 CSV files containing processed data from various assays and studies were downloaded from SDM. The link to the 121 CSV files is provided on Zenodo[24]. Table 1 provides a summary of the demographic characteristics of the FluPRINT study population. The population spans a wide age range, from a 1-year-old to a 90-year-old, with a median age of 27 years. Among 740 individuals with available experimental data, 446 were females and 294 males. The majority (491) of the individuals were Caucasian (European American ancestry). The complete demographic information is available on the Zenodo[25]. Individuals were stratified into high and low responders, depending on their HAI antibody titers measured before and after vaccination, as described below. Figure 2 shows demographic information for the FluPRINT study population, including gender, ethnicity, cytomegalovirus (CMV) status, and age stratified by the outcome to vaccination. Out of 363 individuals with measured HAI responses, 111 were identified as high responders and 252 as low responders. Overall, no major differences in the gender, ethnicity distribution, or CMV status (Fig. 2a) or age (Fig. 2b) were observed between high and low responders.
Online-only Table 1

Characteristics of the clinical studies included in the FluPRINT dataset.

Stanford study IDClinicalTrials.gov IDNameDescriptionVaccinesData in FluPRINTImmPort ID(www.immport.org)Ref.
SLVP015NCT01827462Comparison of immune responses to influenza vaccine in adults of different ages (2007–2017)

Who: 18–100yo healthy participants

How: immunized annually with the seasonal inactivated influenza vaccines from 2007–2017

When: Blood samples acquired before immunization (Day 0), on days 6–8 and 28 after immunization

2007–2013

Seasonal trivalent, inactivated influenza vaccines (Fluzone)

2014–2015

High Dose trivalent Fluzone for participants ≥ 65yo and quadrivalent Fluzone for younger participants

135 donors

Assays:

51-plex Luminex

62-plex Luminex

MSD 4plex

MSD9plex

Other Luminex

HAI

CMV/EBV

Hormones

CyTOF phenotype

Lyoplate

Phospho Cytof pheno

Phospho cytof phospho

Phosphoflow

CBCD

SDY887 (2007)

SDY212 (2008)

SDY312 (2009)

SDY311 (2010)

SDY112 (2011)

SDY315 (2012)

SDY478 (2013)

SDY1464 (2014)

[810, 28, 3746]
SLVP017

NCT02133781

NCT03020498

NCT03020537

B-cell immunity to influenza

(2009–2011 and 2013)

Who:

1–2yo (2013), 8–100yo healthy participants who did not receive the seasonal influenza vaccine in previous years (2010, 2011 and 2013)

How: immunized with either seasonal inactivated or live, attenuated influenza vaccines in 2009, 2010, 2011 and 2013

When: Blood samples acquired before immunization (Day 0) and on day 28 after immunization

2009–2011

Seasonal trivalent, inactivated influenza vaccines (Fluzone) or seasonal live, attenuated influenza vaccine (FluMist)

2013

Seasonal trivalent inactivated influenza vaccine- (Fluzone) - pediatric formulation for 1–2yo children

153 donors

Assays:

51-plex Luminex

62-plex Luminex

HAI

CMV/EBV

CyTOF phenotype

CBCD

SDY1467(2009)

SDY1468(2010)

SDY1469(2011)

SDY1470(2012)

SDY1471(2013)

[27, 4750]
SLVP018

NCT01987349

NCT03022396

NCT03022422

NCT03022435

NCT03023176

T-cell and general immune response to seasonal influenza vaccine

(2009–2013)

Who: 1–8yo (2013), 8–100yo healthy participants

How: immunized with either seasonal inactivated or live, attenuated influenza vaccines from 2009–2013

When: Blood samples acquired before immunization (Day 0), days 7–10 and 28 after immunization

2009–2010

Seasonal trivalent inactivated influenza vaccine (Fluzone) or seasonal trivalent live attenuated influenza vaccine (FluMist)

2010

High Dose trivalent Fluzone for participants ≥ 65yo

2013

Seasonal trivalent, inactivated influenza Pediatric Dose (Fluzone, 0.25 ml) for 1–3yo children

249 donors

Assays:

51-plex Luminex

62-plex Luminex

MSD 4plex

MSD 9plex

HAI

CMV/EBV

Hormones

CyTOF phenotype

Lyoplate

Phospho Cytof pheno

Phospho cytof phospho

Phosphoflow

CBCD

SDY514(2009)

SDY515(2010)

SDY519(2011)

SDY1465(2012)

SDY1466(2013)

[27, 44, 5157]
SLVP021NCT02141581Plasmablast trafficking and antibody response in influenza vaccination (2011–2014)

Who: 8–34yo healthy participants who did not receive the seasonal influenza vaccine in previous years

How: immunized with either seasonal inactivated influenza vaccines, given intramuscularly or intradermally, or live, attenuated influenza vaccines from 2011–2014

When: Blood samples acquired before immunization (Day 0), days 6–8 and 24–32 after immunization

2011–2014

Seasonal trivalent inactivated influenza vaccine (Fluzone) given either intramuscularly or intradermally

2011–2012

Seasonal trivalent live attenuated influenza vaccine (FluMist)

84 donors

Assays:

51-plex Luminex

62-plex Luminex

HAI

CMV/EBV

Hormones

CyTOF phenotype

Phospho Cytof pheno

Phospho cytof phospho

Phosphoflow

CBCD

SDY113 (2011)

SDY305 (2012)

SDY472 (2013)

SDY1479 (2014)

[5860]
SLVP024NCT03023683Protective mechanisms against a pandemic respiratory virus (2012)

Who: 2–49yo healthy participants

How: immunized with the seasonal live, attenuated influenza vaccine

When: Blood samples only from 18–42yo adults acquired before immunization (Day 0), days 7 and 28 after immunization

Seasonal live, attenuated influenza vaccine (FluMist)

Donors: 8

Assays:

HAI

Phosphoflow

SDY1472
SLVP028NCT03088904

Genetic and environmental factors in the response to influenza vaccination

(2014–2018)

Who: 12–49yo healthy participants

How: immunized with either seasonal inactivated or live, attenuated influenza vaccines from 2014–2018

When: Blood samples acquired before immunization (Day 0), days 6–8 and 28 + 7 after immunization

Seasonal quadrivalent inactivated influenza vaccine (Fluzone) or seasonal quadrivalent live attenuated influenza vaccine (FluMist)

Donors: 52

Assays:

62-plex Luminex

HAI

CMV/EBV

Hormones

CyTOF phenotype

SDY1480 (2014)

SDY1481 (2015)

SLVP029NCT03028974

Innate and acquired immunity to influenza infection and immunization

(2014–2017)

Who: 6 mo-49yo healthy participants

(who did not receive LAIV in the prior season nor received influenza immunizations in two or more prior seasons)

How: immunized with either seasonal inactivated or live, attenuated influenza vaccines from 2014–2017

When: Blood samples acquired before immunization (Day 0), days 7 and 28 after immunization. Children <9 yrs received 2 immunizations with the second blood samples acquired 28 days after second immunization

Seasonal quadrivalent inactivated influenza vaccine (Fluzone) or seasonal quadrivalent live attenuated influenza vaccine (FluMist)

Donors: 47

Assays:

62-plex Luminex

HAI

CMV/EBV

Hormones

CyTOF phenotype

SDY1482 (2014)

SDY1483 (2015)

SLVP030NCT03453801

The role of CD4+ memory phenotype, memory, and effector t cells in vaccination and infection

(2014–2019)

Who: 6 mo-10yo healthy participants

How: immunized annually with either seasonal inactivated or live, attenuated influenza vaccines from 2014–2019

When: Blood samples acquired before immunization (Day 0), days 7 and 60 after immunization. Children with no prior influenza vaccine received 2 immunizations with the second blood sample acquired 60 days after second immunization

Seasonal quadrivalent inactivated influenza vaccine (Fluzone) or seasonal quadrivalent live attenuated influenza vaccine (FluMist)

Seasonal trivalent, inactivated influenza Pediatric Dose (Fluzone, 0.25 ml) for 6–35mo children

Donors: 12

Assays:

62-plex Luminex

HAI

CMV/EBV

Hormones

CyTOF phenotype

SDY1484 (2014)
Table 1

Demographic characteristics for the FluPRINT study population.

Age (y)
Mean ± SD38 ± 25
Median (min. to max. range)27 (1–90)
Gender
Male (%)294 (39.7%)
Female (%)446 (60.3%)
Ethnicity
Caucasian (European American) (%)491 (66.35%)
African American (%)13 (1.75%)
American Indian and Alaska Native (%)3 (0.4%)
Asian (%)86 (11.6%)
Hispanic or Latino (%)5 (0.7%)
Other (%)137 (18.5%)
Unknown (%)5 (0.7%)
Fig. 2

Demographic characteristics for the FluPRINT study population stratified by the vaccination outcome. Distribution of individuals in the categories of high (red, n = 111) and low (grey, n = 252) responders regarding the (a) gender, ethnicity and CMV status (b) age distribution between high and low responders. Age is indicated in years.

Demographic characteristics for the FluPRINT study population. Demographic characteristics for the FluPRINT study population stratified by the vaccination outcome. Distribution of individuals in the categories of high (red, n = 111) and low (grey, n = 252) responders regarding the (a) gender, ethnicity and CMV status (b) age distribution between high and low responders. Age is indicated in years.

Assays and data processing

All data used were analysed and processed at the HIMC[26]. The distribution of assays performed across clinical studies and years is illustrated in Fig. 1b. Overall, SLVP015 was the longest study, running from 2007 to 2014, spanning 135 unique individuals, while the majority of samples (249) came from the SLVP018 study (Fig. 1). Raw data, including report files, standards, controls, antibodies used are available at ImmPort (https://immport.niaid.nih.gov/) under identification numbers for each study provided in the Online-only Table 1. Online-only Table 2 provides information about all assays performed, protocols, validations used and references to the published manuscripts using the data. Protocols for all assays are available online at the HIMC website (https://iti.stanford.edu/himc/protocols.html).
Online-only Table 2

Assays performed.

AssaysProtocolsValidationsRef.
51/52-plex Luminex

HIMC website:

“Luminex-Polystyrene bead kits/Luminex”

Report file available at ImmPort (https://immport.niaid.nih.gov/) contains all information of standards, Curve fitting, Bead counts to examine quality of assay and CV of both samples and standards. [9, 10, 22, 27, 28]
62/63-plex Luminex

HIMC website

“Luminex – eBioscience/Affymetrix Magnetic bead Kits”

MSD 4-plexV-PLEX Human Proinflammatory Panel II (Mesoscale, Cat No K15053D)Manufacturer standards [22]
MSD 9-plexHuman ProInflammatory 9-Plex Ultra-Sensitive Kit (Mesoscale, Cat No K15007C)
Other Luminex

Human 42-Plex Polystyrene Kit

(EMD Millipore, H42; MPXHCYTO060KPMX42)

Report file available at ImmPort (https://immport.niaid.nih.gov/) contains all information of standards, Curve fitting, Bead counts to examine quality of assay and CV of both samples and standards. [28]
HAI

HIMC website

“Hemagglutinin inhibition (HAI) assay”

Sample, virus control, HIMC human control serum (CONS2) and control PBS available at ImmPort. [9, 10, 27]
CMV/EBV

CMV IgG ELISA (Calbiotech, Cat No CM027G)

EBV-VCA IgG ELISA (Calbiotech, Cat No EVO10G)

Manufacturer standards [10, 27, 30]
Hormones

Free Testosterone ELISA Kit (Calbiotech) and

Custom Steroid Hormone Panel (human) Assay Kit (Mesosclae, MSD 4-plex)

Manufacturer standards [8]
CyTOF phenotype

HIMC website

“CyTOF Immunophenotyping/

CyTOF phenotyping”

Data analysed using FlowJo software. Gates were adjusted on a donor-specific basis. The statistics for each gated population was exported to an Excel spreadsheet. The percentage of each cell type is determined and reported as a percent of the parent cell type.

[27, 30]

Protocols[31,32]

Lyoplate

HIMC website:

“Flow cytometry phenotyping”

[27, 30]

Phospho-CyTOF

(whole blood)

HIMC website

“Whole blood phospho-CyTOF/Phosphoflow whole blood CYTOF”

The percentage of each cell type was determined and reported as a percent of the parent cell type. Median values were reported to quantitate the level of phosphorylation of each protein in response to stimulation.Protocol[34]
Phospho-flow

HIMC website

“Phospho-flow-cytokine/Phosphoepitope Flow Cytometry (Cytokine stimulation, pSTAT readouts)”

[9, 10, 27, 28, 30]

Blood count

(CBCD)

Clinical haematology test performed on a Coulter counterPerformed at the Stanford Clinical Lab

Multiplex cytokine assay

Multiplex ELISA using Luminex was performed using either polystyrene bead (for 51/52-plex) or magnetic bead kits (62/63-plex) (eBioscience/Affymetrix). The processed Luminex data available in the FluPRINT is normalized at the plate level to mitigate batch and plate effects[26]. The two median fluorescence intensity (MFI) values for each sample for each analyte were averaged, and then log-base 2 transformed. Z-scores ((value–mean)/standard deviation) were computed, with means and standard deviations computed for each analyte for each plate. Thus, units of measurement were Zlog2 for serum Luminex. Some of the Luminex data was used in previous publications[9,10,22,27,28]. In 2009 and 2010, for SLVP015 and SLVP018 studies, serum analytes were analysed using MSD 4- and 9-plex kits (V-PLEX Human Proinflammatory Panel II, Mesoscale, Cat No. K15053D and Human ProInflammatory 9-Plex Ultra-Sensitive Kit, Mesoscale, Cat No. K15007C) as according to the manufacturer’s protocol. The assay named ‘Other Luminex’ was performed only for study SLVP015 in 2007 using the Human 42-Plex Polystyrene Kit (EMD Millipore, H42; MPXHCYTO060KPMX42) and data was processed in the same way as for the Luminex assays described above (measurement units reported were Zlog2)[28].

Hemagglutination inhibition assay

Serum antibody titers before vaccination and day 28 after vaccination were measured using the standard HAI assay[29] using strains of influenza contained in the vaccines[9,10,27]. Geometric mean titers (GMT) were calculated for all strains of the virus contained in the vaccine, while fold change is calculated as: GMT for all vaccine strains on day 28/GMT for all vaccine strains on day 0. High responders were determined as individuals that seroconverted (4-fold or greater rise in HAI titer) and were seroprotected (GMT HAI ≥ 40).

Virological assays

CMV and Epstein-Barr virus (EBV) analysis was performed using CMV IgG ELISA (Calbiotech, Cat No. CM027G) and EBV-VCA IgG ELISA (Calbiotech, Cat No. EVO10G), following manufacturer’s protocols[10,27,30].

Immunophenotyping

Immunophenotyping was performed either with flow cytometry (Lyoplate)[27,30] or mass cytometry (CyTOF)[30-32]. Data was analysed using FlowJo software using the standard templates. Gates were adjusted on a donor-specific basis, if necessary, to control for any differences in background or positive staining intensity. The statistics was exported for each gated population to a spreadsheet. The percentage of each cell type is determined and reported as a percent of the parent cell type.

Phosphorylation-specific cytometry

Phospho-flow assays were performed either using flow cytometry on PBMC (for studies SLVP015, SLVP018 and SLVP021 from 2007 to 2012)[9,10,27,28,30] or mass cytometry on whole blood (for studies SLVP015, SLVP018 and SLVP021 in 2013)[33,34]. The percentage of each cell type is determined and reported as a percent of the parent cell type. Median values are reported to quantitate the level of phosphorylation of each protein in response to stimulation. For phospho-flow data acquired on flow cytometer a fold change value was computed as the stimulated readout divided by the unstimulated readout (e.g. 90th percentile of MFI of CD4+ pSTAT5 IFNα stimulated/90th percentile of CD4+ pSTAT5 unstimulated cells), while for data acquired using mass cytometry a fold change was calculated by subtracting the arcsinh (intensity) between stimulated and unstimulated (arsinh stim – arcsing unstim).

Automated importer and data harmonization

After collecting the data, a custom PHP script was generated to parse each of the 121 CSV files and to import data into the MySQL database. The source code for the script is available online at https://github.com/LogIN-/fluprint. The script optimizes the data harmonization process essential for combining data from different studies. Control and nonsense data were not imported, such as “CXCR3-FMO CD8+ T cells”, “nonNK-nonB-nonT-nonmonocyte-nonbasophils”, “viable”, etc. To standardize data, the original CSV entries were cleaned into the MySQL database readable format (e.g. quotes and parenthesis replaced with underscores, “+” with text “positive”, etc.). Additionally, classifications for ethnicity (Table 2), vaccine names (Table 3) and vaccination history (Table 4) were resolved into standard forms, while assays were numerated (Table 5). For example, “Fluzone single-dose syringe” and “Fluzone single-dose syringe 2009–2010” were mapped to “Fluzone” and given number 4 (Table 3). In all studies, vaccines were given intramuscularly for IIV and intranasally for LAIV, except for one study where a distinct licensed formulation of IIV was given intradermally and this was labelled as Fluzone Intradermal and given number 2. During data merging, we replaced text strings with binary values. For example, for the variable of gender, female and male were replaced with zero and one. To be able to distinguish between visits in consecutive years, a unique visit identification was calculated. For the original internal visit data, each visit in one year was labelled as V1 for day zero and V2 for day seven. However, if the same individual came in the consecutive year, day zero visit would again be labelled V1, and day seven as visit V2, causing repetition of values. To avoid such repetitions in the database, we generated a unique visit ID. Therefore, for the above example, first visit in the first year would be labelled V1 for day zero and V2 for day seven, but for the next year visits would be labelled as V3 for day zero and V4 for day seven. To distinguish between Luminex assays, the prefix L50 was given to each analyte analysed with the 51/52-plex Luminex kit. Finally, we imputed new values and calculated the vaccine outcome parameter using HAI antibody titers. High responders were determined as individuals that have HAI antibody titer for all vaccine strains ≥40 after vaccination and GMT HAI fold change ≥4, following FDA guidelines for evaluation of vaccine efficacy[2]. Vaccine outcome was expressed as a binary value: high responders were given value of one and low responders the value zero.
Table 2

Remapping ethnicity.

OriginalRemapped
Caucasian or WhiteCaucasian
Caucasian or White, AsianOther
Caucasian or White, OtherOther
AsianAsian
Asian, OtherOther
OtherOther
Caucasian or White, Black African American, Asian, OtherOther
Caucasian or White, Black African AmericanOther
NULLOther
Not Hispanic or LatinoOther
Non-HispanicOther
Decline to answerUnknown
Black African AmericanBlack or African American
Black African American, AsianOther
Cauc or White, Black Af AmOther
Caucasian or White, Pacific IslanderOther
Caucasian or White, PacIslanOther
Cauc or White, Pacific IslanderOther
Pacific Islander, AsianOther
American Indian/Alaska native, Caucasian or WhOther
American Indian/Alaska native, Caucasian or WhiteOther
American Indian/Alaska native, Black African AmericanOther
Am In/Alaska native, Cauc or WOther
Am In/AlaskaNative, Black Af AmOther
American Indian/Alaska nativeAmerican Indian or Alaska Native
HispanicHispanic/Latino
Hispanic or LatinoHispanic/Latino
Table 3

Remapping vaccine type.

Vaccine receivedVaccine type IDVaccine type name
FluMist IIV4 0.2 mL intranasal spray1Flumist
FluMist Intranasal spray1Flumist
FluMist Intranasal Spray 2009–20101Flumist
FluMist Intranasal Spray1Flumist
Flumist1Flumist
Fluzone Intradermal-IIV32Fluzone Intradermal
Fluzone Intradermal2Fluzone Intradermal
GSK Fluarix IIV3 single-dose syringe3Fluarix
Fluzone 0.5 mL IIV4 SD syringe4Fluzone
Fluzone 0.25 mL IIV4 SD syringe5Paediatric Fluzone
Fluzone IIV3 multi-dose vial4Fluzone
Fluzone single-dose syringe4Fluzone
Fluzone multi-dose vial4Fluzone
Fluzone single-dose syringe 2009–20104Fluzone
Fluzone high-dose syringe6High Dose Fluzone
Fluzone 0.5 mL single-dose syringe4Fluzone
Fluzone 0.25 mL single-dose syringe5Paediatric Fluzone
Fluzone IIV3 High-Dose SDS6High Dose Fluzone
Fluzone IIV4 single-dose syringe4Fluzone
Fluzone High-Dose syringe6High Dose Fluzone
Table 4

Remapping vaccination history.

OriginalRemapped
No0
Yes1
IIV injection/im2
Doesn’t know/doesn’t remember/na/does not remember3
LAIV4 intranasal/laiv_std_intranasal/laiv_std_ intranasal/nasal/intranasal4
Table 5

Assays in the database.

OriginalRemapped
CMV EBV1
Other immunoassay2
Human Luminex 62–63 plex3
CyTOF phenotyping4
HAI5
Human Luminex 51 plex6
Phospho-flow cytokine stim (PBMC)7
pCyTOF (whole blood) pheno9
pCyTOF (whole blood) phospho10
CBCD11
Human MSD 4 plex12
Lyoplate 113
Human MSD 9 plex14
Human Luminex 50 plex15
Other Luminex16
Remapping ethnicity. Remapping vaccine type. Remapping vaccination history. Assays in the database.

Generating tables

To build FluPRINT database, we generated four tables, as shown in Fig. 3. Table 6 depicts characteristics of the FluPRINT database. In the table , each row represents an individual given a unique encrypted identification number (study donor ID). Other fields provide information about the clinical study in which an individual was enrolled (study ID and study internal ID), gender and race. The second table, named describes information about the donor’s age, CMV and EBV status, Body Mass Index (BMI) and vaccine received on each clinical visit. Each clinical visit was given a unique identification (visit ID) in addition to the internal visit ID (provided by the clinical study) to distinguish between visits in consecutive years. For each visit, we calculated vaccine response by measuring HAI antibody response. Information about vaccine outcome is available as geometric mean titers (geo_mean), difference in the geometric mean titers before and after vaccination (delta_geo_mean), and difference for each vaccine strain (delta_single). In the last field, each individual is classified as high and low responder (vaccine_resp). On each visit, samples were analysed and information about which assays were performed (assay field) and value of the measured analytes (units and data) are stored in the table. Finally, the table describes information connected with each clinical visit about usage of statins (statin_use) and whether an influenza vaccine was received in the past (influenza vaccine history), if yes, how many times (total_vaccines_received, self reported). Also, we provide information which type of influenza vaccine was received in the previous years (1 to 5 years prior enrolment in the clinical study). Lastly, information about influenza infection (report of MD diagnosis) history and influenza-related hospitalization (participant report) is provided.
Fig. 3

The FluPRINT database model. The diagram shows a schema of the FluPRINT database. Core tables, donors (red), donor_visits (yellow), experimental_data (blue) and medical_history (green) are interconnected. Tables experimental_data and medical_history are connected to the core table donor_visits. The data fields for each table are listed, including the name and the type of the data. CHAR and VARCHAR, string data as characters; INT, numeric data as integers; FLOAT, approximate numeric data values; DECIMAL, exact numeric data values; DATETIME, temporal data values; TINYINT, numeric data as integers (range 0–255); BOOLEAN, numeric data with Boolean values (zero/one). Maximal number of characters allowed in the data fields is denoted as number in parenthesis.

Table 6

The characteristics of the FluPRINT database.

Table nameRowsColumnsDescription
donors 7406Each row in this table is one donor. Donor is described with 5 additional parameters that include unique identification (donor_id and study_donor_id), identification for the study (study_id), full internal name of the study (study_internal_id), gender and race.
donor_visits 2,93718Each row represents a donor at the particular visit (visit_id). Additionally, information about internal visit identification (visit_internal_id), date of the visit (visit_year and visit_day), pre- or post-vaccination visit for HAI assay (visit_type_hai), age at the visit (age and age_round), CMV/EBV status, BMI index at the visit are provided. Additionally, type of vaccine received (vaccine) and other calculated measures for HAI assay (geo_mean, d_geo_mean, d_single, vaccine_resp) are provided.
experimental_data 371,2609Each row represents a donor at particular visit (donor_visits_id). At each visit, assay that was performed is listed (assay) along with the names and values for measured analytes (name, name_formatted, subset, units and data).
Medical_history 74018Each row is one donor at first visit described by 15 additional parameters. These include usage of statins (statin_use) and history of receiving influenza vaccines (flu_vaccination_history). If donor received vaccination before enrollment, the survey information is provided about how many vaccines were received (total_vaccines_received), and the type of vaccines for each prior season (fields for the one year before enrolment: vaccinated_1yr_prior and vaccine_type_1yr_prior). This information is provided for up to 5 years prior enrolment in the clinical study and is by report, not record verified.
The FluPRINT database model. The diagram shows a schema of the FluPRINT database. Core tables, donors (red), donor_visits (yellow), experimental_data (blue) and medical_history (green) are interconnected. Tables experimental_data and medical_history are connected to the core table donor_visits. The data fields for each table are listed, including the name and the type of the data. CHAR and VARCHAR, string data as characters; INT, numeric data as integers; FLOAT, approximate numeric data values; DECIMAL, exact numeric data values; DATETIME, temporal data values; TINYINT, numeric data as integers (range 0–255); BOOLEAN, numeric data with Boolean values (zero/one). Maximal number of characters allowed in the data fields is denoted as number in parenthesis. The characteristics of the FluPRINT database.

Data Records

The FluPRINT dataset described herein is available online for use by the research community and can be downloaded directly from a research data repository Zenodo[35]. Additionally, the dataset can be imported in the MySQL database for further manipulation and data extraction. The instructions how to import FluPRINT into the database are available at github (https://github.com/LogIN-/fluprint). The summary of the dataset, including the number of observations, fields and description for each table is provided in Table 6.

Technical Validation

The objective of the current study was to ensure that the FluPRINT dataset accurately reflects processed data available in SDM. Technical data validation was carried in previous published studies referred in the Online-only Table 2. Data was downloaded from the original source, and here we focused on ensuring that data records were accurately harmonized, merged and mapped in the unifying FluPRINT database. The FluPRINT dataset was validated on two levels: (1) upon insertion and (2) after the data was inserted into the database. To validate data on insertion, we created loggers to monitor import of the CSV files into the database. This ensured easier and more effective troubleshooting of potential problems and contributed to the monitoring of the import process. Two different sets were used: (1) informative and (2) error loggers. Informative loggers provided information about which processing step has started or finished and how many samples have been processed in that particular step. This allowed us to monitor that correct number of samples was processed. Error loggers provided exact identification and name of the data which could not be imported into the database, usually caused by missing or incorrect user input, such as “… assay is missing. Skipping … ‘$row’”. This facilitated the process to identify erroneous data, which were then manually reviewed, corrected, and updated. Once the database was built, a manual review of data was performed to ensure accuracy and integrity of the dataset. Several random individuals were chosen and the accuracy of data was evaluated by comparison with the raw data. Additionally, we evaluated total number of all donors, assays performed, clinical studies and years with the raw data available at the SDM.

Usage Notes

Recent advances in the computational biology and the development of novel machine learning algorithms, especially deep learning, make it possible to extract knowledge and identify patterns in an unbiased manner from large clinical datasets. Application of machine learning algorithms to clinical datasets can reveal biomarkers for different diseases, therapies[36], including vaccinations[8,9,12]. The data from the FluPRINT study can be used to identify cellular and molecular baseline biomarkers that govern successful antibody response to influenza vaccines (IIV and LAIV) across different influenza seasons and a broad age population. The HAI antibody response to influenza vaccines is considered as an alternative way to compare immunogenicity of the vaccines in susceptible groups where placebo-controlled clinical efficacy study cannot be performed. Since FluPRINT dataset is provided as a database, this facilitates further analysis. Queries can be easily performed to obtain a single CSV file. For example, researchers interested in understanding which immune cells and chemokines can differentiate between high and low responders that received inactivated influenza vaccine could search the FluPRINT database. In the database, they can find all donors for which flow cytometry or mass cytometry were performed together with Luminex assays, for which donors the HAI response was measured, and all the donors who received inactivated influenza vaccine. The resulting CSV file can then easily be used for downstream analysis. Major advantages of this dataset are the mapping of the vaccine outcome, classifying individuals as high or low responders, standardization of the data from different clinical studies, and from different assays. This data harmonization process allows for direct comparison of immune cell frequency, phenotype, and functionality and quantity of chemokines and cytokines shared between individuals before or after influenza vaccinations. By releasing the FluPRINT database and the source code, we provide users with the ability to continue building upon this resource and to update the database with their data and other databases.
Measurement(s)immune response trait
Technology Type(s)digital curation
Factor Type(s)gender • race • visit_age • bmi • flu_vaccination_history • statin_use • influenza_infection_history • influenza_hospitalization • cmv_status • ebv_status
Sample Characteristic - OrganismHomo sapiens
  56 in total

1.  Systems Analysis of Immunity to Influenza Vaccination across Multiple Years and in Diverse Populations Reveals Shared Molecular Signatures.

Authors:  Helder I Nakaya; Thomas Hagan; Sai S Duraisingham; Eva K Lee; Marcin Kwissa; Nadine Rouphael; Daniela Frasca; Merril Gersten; Aneesh K Mehta; Renaud Gaujoux; Gui-Mei Li; Shakti Gupta; Rafi Ahmed; Mark J Mulligan; Shai Shen-Orr; Bonnie B Blomberg; Shankar Subramaniam; Bali Pulendran
Journal:  Immunity       Date:  2015-12-15       Impact factor: 31.745

Review 2.  Single-cell technologies for monitoring immune systems.

Authors:  Pratip K Chattopadhyay; Todd M Gierahn; Mario Roederer; J Christopher Love
Journal:  Nat Immunol       Date:  2014-02       Impact factor: 25.606

3.  Effects of aging, cytomegalovirus infection, and EBV infection on human B cell repertoires.

Authors:  Chen Wang; Yi Liu; Lan T Xu; Katherine J L Jackson; Krishna M Roskin; Tho D Pham; Jonathan Laserson; Eleanor L Marshall; Katie Seo; Ji-Yeun Lee; David Furman; Daphne Koller; Cornelia L Dekker; Mark M Davis; Andrew Z Fire; Scott D Boyd
Journal:  J Immunol       Date:  2013-12-11       Impact factor: 5.422

4.  Human B-cell isotype switching origins of IgE.

Authors:  Timothy J Looney; Ji-Yeun Lee; Krishna M Roskin; Ramona A Hoh; Jasmine King; Jacob Glanville; Yi Liu; Tho D Pham; Cornelia L Dekker; Mark M Davis; Scott D Boyd
Journal:  J Allergy Clin Immunol       Date:  2015-08-22       Impact factor: 10.793

5.  Increased Proinflammatory Responses of Monocytes and Plasmacytoid Dendritic Cells to Influenza A Virus Infection During Pregnancy.

Authors:  Mathieu Le Gars; Alexander W Kay; Nicholas L Bayless; Natali Aziz; Cornelia L Dekker; Gary E Swan; Mark M Davis; Catherine A Blish
Journal:  J Infect Dis       Date:  2016-09-21       Impact factor: 5.226

6.  Estimates of global seasonal influenza-associated respiratory mortality: a modelling study.

Authors:  A Danielle Iuliano; Katherine M Roguski; Howard H Chang; David J Muscatello; Rakhee Palekar; Stefano Tempia; Cheryl Cohen; Jon Michael Gran; Dena Schanzer; Benjamin J Cowling; Peng Wu; Jan Kyncl; Li Wei Ang; Minah Park; Monika Redlberger-Fritz; Hongjie Yu; Laura Espenhain; Anand Krishnan; Gideon Emukule; Liselotte van Asten; Susana Pereira da Silva; Suchunya Aungkulanon; Udo Buchholz; Marc-Alain Widdowson; Joseph S Bresee
Journal:  Lancet       Date:  2017-12-14       Impact factor: 79.321

7.  Phenotyping of Live Human PBMC using CyTOF™ Mass Cytometry.

Authors:  Michael D Leipold; Holden T Maecker
Journal:  Bio Protoc       Date:  2015-01-20

8.  Systems biology of vaccination for seasonal influenza in humans.

Authors:  Helder I Nakaya; Jens Wrammert; Eva K Lee; Luigi Racioppi; Stephanie Marie-Kunze; W Nicholas Haining; Anthony R Means; Sudhir P Kasturi; Nooruddin Khan; Gui-Mei Li; Megan McCausland; Vibhu Kanchan; Kenneth E Kokko; Shuzhao Li; Rivka Elbein; Aneesh K Mehta; Alan Aderem; Kanta Subbarao; Rafi Ahmed; Bali Pulendran
Journal:  Nat Immunol       Date:  2011-07-10       Impact factor: 25.606

9.  Adjuvanted influenza-H1N1 vaccination reveals lymphoid signatures of age-dependent early responses and of clinical adverse events.

Authors:  Olga Sobolev; Elisa Binda; Sean O'Farrell; Anna Lorenc; Joel Pradines; Yongqing Huang; Jay Duffner; Reiner Schulz; John Cason; Maria Zambon; Michael H Malim; Mark Peakman; Andrew Cope; Ishan Capila; Ganesh V Kaundinya; Adrian C Hayday
Journal:  Nat Immunol       Date:  2016-01-04       Impact factor: 25.606

10.  Characterization of influenza vaccine immunogenicity using influenza antigen microarrays.

Authors:  Jordan V Price; Justin A Jarrell; David Furman; Nicole H Kattah; Evan Newell; Cornelia L Dekker; Mark M Davis; Paul J Utz
Journal:  PLoS One       Date:  2013-05-29       Impact factor: 3.240

View more
  7 in total

1.  Identification of Pathogenic Immune Cell Subsets Associated With Checkpoint Inhibitor-Induced Myocarditis.

Authors:  Han Zhu; Francisco X Galdos; Daniel Lee; Sarah Waliany; Yuhsin Vivian Huang; Julia Ryan; Katherine Dang; Joel W Neal; Heather A Wakelee; Sunil A Reddy; Sandy Srinivas; Lih-Ling Lin; Ronald M Witteles; Holden T Maecker; Mark M Davis; Patricia K Nguyen; Sean M Wu
Journal:  Circulation       Date:  2022-06-28       Impact factor: 39.918

2.  Acute Chelation Therapy-Associated Changes in Urine Gadolinium, Self-reported Flare Severity, and Serum Cytokines in Gadolinium Deposition Disease.

Authors:  Holden T Maecker; Janet C Siebert; Yael Rosenberg-Hasson; Lorrin M Koran; Miguel Ramalho; Richard C Semelka
Journal:  Invest Radiol       Date:  2021-06-01       Impact factor: 10.065

3.  treekoR: identifying cellular-to-phenotype associations by elucidating hierarchical relationships in high-dimensional cytometry data.

Authors:  Adam Chan; Wei Jiang; Emily Blyth; Jean Yang; Ellis Patrick
Journal:  Genome Biol       Date:  2021-11-29       Impact factor: 13.583

Review 4.  Systems Immunology: Revealing Influenza Immunological Imprint.

Authors:  Adriana Tomic; Andrew J Pollard; Mark M Davis
Journal:  Viruses       Date:  2021-05-20       Impact factor: 5.048

Review 5.  Mucosal vaccines - fortifying the frontiers.

Authors:  Ed C Lavelle; Ross W Ward
Journal:  Nat Rev Immunol       Date:  2021-07-26       Impact factor: 108.555

6.  Activated natural killer cells predict poor clinical prognosis in high-risk B- and T-cell acute lymphoblastic leukemia.

Authors:  Caroline Duault; Anil Kumar; Adeleh Taghi Khani; Sung June Lee; Lu Yang; Min Huang; Christian Hurtz; Bryan Manning; Lucy Ghoda; Tinisha McDonald; Norman J Lacayo; Kathleen M Sakamoto; Martin Carroll; Sarah K Tasian; Guido Marcucci; Jianhua Yu; Michael A Caligiuri; Holden T Maecker; Srividya Swaminathan
Journal:  Blood       Date:  2021-10-21       Impact factor: 25.476

7.  Divergent trajectories of antiviral memory after SARS-CoV-2 infection.

Authors:  Adriana Tomic; Donal T Skelly; Ane Ogbe; Daniel O'Connor; Paul Klenerman; Eleanor Barnes; Christina Dold; Susanna J Dunachie; Matthew Pace; Emily Adland; Frances Alexander; Mohammad Ali; Kirk Allott; M Azim Ansari; Sandra Belij-Rammerstorfer; Sagida Bibi; Luke Blackwell; Anthony Brown; Helen Brown; Breeze Cavell; Elizabeth A Clutterbuck; Thushan de Silva; David Eyre; Sheila Lumley; Amy Flaxman; James Grist; Carl-Philipp Hackstein; Rachel Halkerston; Adam C Harding; Jennifer Hill; Tim James; Cecilia Jay; Síle A Johnson; Barbara Kronsteiner; Yolanda Lie; Aline Linder; Stephanie Longet; Spyridoula Marinou; Philippa C Matthews; Jack Mellors; Christos Petropoulos; Patpong Rongkard; Cynthia Sedik; Laura Silva-Reyes; Holly Smith; Lisa Stockdale; Stephen Taylor; Stephen Thomas; Timothy Tipoe; Lance Turtle; Vinicius Adriano Vieira; Terri Wrin; Andrew J Pollard; Teresa Lambe; Chris P Conlon; Katie Jeffery; Simon Travis; Philip Goulder; John Frater; Alex J Mentzer; Lizzie Stafford; Miles W Carroll; William S James
Journal:  Nat Commun       Date:  2022-03-10       Impact factor: 14.919

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.