Literature DB >> 33377042

Comparing Circadian Rhythmicity in the Human Gut Microbiome.

Sandra Reitmeier1,2, Silke Kiessling1,2, Klaus Neuhaus1, Dirk Haller1,2.   

Abstract

Targeted sequencing of 16S rRNA genes enables the analysis of microbiomes. Here, we describe a protocol for the collection, storage, and preparation of fecal samples. We describe how we cluster similar sequences and assign bacterial taxonomies. Using diversity analysis and machine learning, we can extract disease-associated features. We also describe a circadian analysis to identify the presence or absence of rhythms in taxonomies. Differences in rhythmicity between cohorts can contribute to determining disease-associated bacterial signatures. For complete details on the use and execution of this protocol, please refer to Reitmeier et al. (2020).
© 2020 The Author(s).

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 33377042      PMCID: PMC7757335          DOI: 10.1016/j.xpro.2020.100148

Source DB:  PubMed          Journal:  STAR Protoc        ISSN: 2666-1667


Before You Begin

The Study Centre informs participants about the aims of the study and provides a material box which includes everything necessary guaranteeing a clean and sanitized sample collection (e.g., gloves, tearproof stool collector). The participants are asked to collect the sample at the appointment day and store it in the fridge (4°C) until then. The stool collector should be used to avoid any contamination. The questionnaire comprises questions regarding fecal stool collection (date, time, problems etc.), about personal information including health status (age, medication, disease etc.) and about dietary habits. Each participant gets a postal package including: One questionnaire An instruction manual 2 collection containers – one being empty and one containing DNA Stabilizer from Invitek (Stool DNA Stabilizer – Catalog No 1038111100). The collection tubes have unique QR codes each 1 pair of disposable gloves 2 stool collector (one being a replacement) Participants are asked to collect samples, if possible, on the day of the appointment at the Study Centre or earliest 1 day before.

Fecal Sample Collection

The study center prepares sample collection kits which are handed out to the study participants. The kit includes a sample collection instruction guiding the participants through the procedure. According to the instructions the participant is asked to use all provided disposals. Samples should be stored in the household’s fridge at 4°C as short as possible. For a storage >36 h, we recommend storing samples at −20°C. Transport of the two 8-mL tubes (including the sample). Deliver samples to the center during visit (which is the preferred transport). Send sample as soon as possible by postal mailing (a prepaid and addressed envelope might be provided). Samples collected in DNA Stabilizer are stable for at least 3 days at ambient temperature and at least 7 days at 4°C. It was shown that short- and long-term storage have an effect on microbial DNA stability (Carroll et al., 2012; Dominianni et al., 2014) with some bacteria tend to be more sensitive than others (Shaw et al., 2016). DNA stabilization liquid has advantages for preservation of the DNA and facilitates the process of sample collection and storage in studies (Ilett et al., 2019). In a small in-house study, we analyzed the influence of storage (at 20°C–22°C) and showed that samples including DNA stabilizer have increased stability over time (0 h, 24 h, and 48 h) compared to samples without (Figure 1).
Figure 1

.Influence of Storage Time and Number of Observed OTUs (Richness)

Boxplots shows either the changes without DNA stabilizer (left boxplot) or with DNA stabilizer (right boxplot) over time as indicated.

.Influence of Storage Time and Number of Observed OTUs (Richness) Boxplots shows either the changes without DNA stabilizer (left boxplot) or with DNA stabilizer (right boxplot) over time as indicated.

Arrival at the Study Center

The QR code of the 8-mL tubes with the stabilizer liquid is scanned and the tubes are stored at −20°C. For a long-time storage (more than 3 months) it is recommended to store samples at −80°C (Goodrich et al., 2014). The QR code of the 5-mL tubes without the stabilizer liquid are scanned and the tubes are stored at −80°C. CRITICAL: It is important to have unique labels for each sample. We recommend a barcode system which helps in proper sample and data handling. For human studies, an anonymization system with restricted access may be important as well. Information about storage, arrival time, and additional information should be noted in a database. Questionnaires need to be electronically recorded (e.g., scanned for future reference). CRITICAL: Variables, names and information (included in the database) should be formatted smartly in advance in order to avoid later re-formatting of, e.g., identifiers for subsequent analyses (e.g., statistics).

Key Resources Table

Step-By-Step Method Details

Sample processing is divided into four main steps: DNA isolation, library construction by PCR, amplicon cleaning and dilution, and sequencing (Figure 2).
Figure 2

.Workflow of 16S rRNA Gene Sequencing Preparation and Analysis

Steps are structured into three sections: the sample collection and storage, the sample preparation and sequencing, and the sample preprocessing and data analysis. The given time for each step can be seen as a point of reference.

.Workflow of 16S rRNA Gene Sequencing Preparation and Analysis Steps are structured into three sections: the sample collection and storage, the sample preparation and sequencing, and the sample preprocessing and data analysis. The given time for each step can be seen as a point of reference.

DNA Isolation

Timing: approx. 3 h for 24 samples DNA is isolated with a modification of the protocol by Godon etal. (1997). A blank sample, consisting of 600 μL DNA Stabilizer from Invitek, is processed in every second DNA isolation batch (i.e., one blank sample for each 47 samples). Thaw fecal samples (ca. 2 g in 8 mL DNA stabilizer) for approximately 2 h at 20°C–22°C. Vortex until the sample is fully homogenized and let stand for 3 min to sediment debris. For each sample, a volume of 600 μL fecal slurry is transferred into a 2-mL screw cap tube containing 0.1 mm silica beads. Use autoclaved hand-cut blue tips that allow pipetting even in the presence of remaining debris. This aliquot is processed immediately. The remaining sample is frozen at −80°C for long-term storage. Add 250 μL 4 M guanidinium thiocyanate to the sample. This step is necessary to denature proteins. Add 500 μL 5% N-lauroylsarcosine sodium salt, which is an ionic surfactant that separates all cellular components from each other. Incubate the samples for 60 min at 70°C while shaking at 700 rpm. Lyse remaining microbial cells by using a FastPrep-24 fitted with a CoolPrep adapter (filled with a handful of dry ice). The FastPrep instrument performs the lysis of biological samples by using an optimized motion to disrupt cells through beating of beads on the sample material. Program: 5 Cycles: 40 s; 6.5 m/s 3 rounds (add more dry ice between each round) Add 15 mg polyvinylpyrrolidone (PVPP), a polymer used for removing phenolics and other fecal contaminants. After vortexing, centrifuge for 3 min at 15,000 × g and 4°C. 500 μL of the supernatant is transferred to a new 2-mL tube. Add 5 μL RNase A and incubate for 20 min at 37°C while shaking at 700 rpm. The DNA is then purified using a silica membrane-based approach following the manufacturer's instructions of the kit used (NucleoSpin gDNA Clean-up Kit, REF 740230.250 Machery-Nagel). Add 1500 μL Binding Buffer and vortex for 5 s. Transfer each sample to one column: this is performed in three steps with each 650 μL. After each transfer, columns are centrifuged for 30 s (11,000 × g); discard the flow-through. Wash columns by adding 700 μL Washing Buffer. After 2 s vortex, columns are centrifuged for 30 s (11,000 × g); discard the flow-through. Washing is performed three times. Dry the silica membrane by centrifuging the columns for 1 min (11,000 × g) and discard the collection tube. Add 50 μL Buffer DE to elute the DNA. Incubate for 1 min and centrifuge for 1 min (as before). Repeat the elution step and pool the flow-through to obtain a final volume of 100 μL with the isolated DNA. After DNA purification, nucleic acid concentrations are measured by using a NanoDrop. Use a DNA solution of known concentration and measure serial dilutions thereof to check for the accuracy of the NanoDrop.

Library Construction by Polymerase Chain Reaction

Timing: approx. 3 h for 96 samples Dilute isolated DNA of each sample to a final concentration of 12 ng/μL in 20 μL water into a 96-well skirted plate. Prepare the Master Mix (Table 1) for the first (1st) PCR.
Table 1

Master Mix for 1st PCR

ReagentsVolume μL/Sample
Phusion® HF Buffer (without Dye)6
dNTPs (20 μmol)0.6
341F-ovh Primer (20 μM)0.1875
785r-ovh Primer (20 μM)0.1875
Phusion® High-Fidelity DNA Polymerase Hotstart0.15
DMSO (100%)2.25
Water (for molecular biology, DEPC-treated and filter-sterilized)17.625
Master Mix for 1st PCR Transfer 27 μL of the prepared Master Mix (per well) and add 3 μL of the sample (per well) to a new 96-well skirted plate. The well plate with 30 μL sample per well is covered with a foil seal and is centrifuged for 30 s at low speed to collect the liquid at the bottom. Put the plate into the cycler (Biometra TAdvanced) and run the first (1st) PCR program for 15 cycles following the time and temperature settings shown in (Table 2).
Table 2

Settings for 1st PCR. Rows in gray are performed for 15 cycles.

PCR Cycling Conditions
Stepsin °CTimeCycles
Initial Denaturation9830 s1
Denaturation985 s15
Annealing5510 s
Extension7210 s
Final Extension722 min1
Hold10
Settings for 1st PCR. Rows in gray are performed for 15 cycles. Prepare the Master Mix (Table 3) for the second (2nd) PCR including forward index primer. For each 96-well plate, 6 different forward primer and 16 different reverse primer are used. The reverse primer is not included in the Master Mix, they are divided in strips which are placed in the robot working area as well. For each of the six forward primer one separate Master Mix is to be prepared.
Table 3

Master Mix for 2nd PCR

ReagentsVolume in μL/Sample
Phusion® HF Buffer (without Dye)10
dNTPs (20 μmol, Bioline BIO-39043)1
Forward primer (e.g., 341-ovh-HTS- SC501 Primer (20 μM))0.313
Phusion® High-Fidelity DNA Polymerase Hotstart0.2
DMSO (100%)1.5
Water (for molecular biology, DEPC-treated and filter-sterilized)32.487
Master Mix for 2nd PCR After the first PCR the plate returns to the robot. Mix 2 μL of the DNA from 1st PCR, 45.5 μL of the Master Mix (Table 3), and 2.5 μL of one reverse index primer. Primer are combined in order to insert a double index in each sample following the method introduced by Kozich et al. (2013). It is possible to select from 38 forward and 60 reverse primer Table 4.
Table 4

Primer selection for 2nd PCR

Forward primer341-ovh-HTS-SB501-508
341-ovh-HTS-SA502-509
341-ovh-HTS-SD501, 502, 505, 508
341-ovh-HTS-SC502, 505, 507, 508
341-ovh-HTS-i5_1-16
Reverse primer785r-ovh-HTS-SA701-712
785r-ovh-HTS-SB701-711
785r-ovh-HTS-SC701, 703, 704, 706-7012
785r-ovh-HTS-SD703, 705-712
785r-ovh-HTS-i7_02-06, 08-12, 15-18, 20-24
Primer selection for 2nd PCR The plate is covered again with a PCR foil seal and is centrifuges for 30 s as before. The second PCR starts by putting the covered plate into the cycler (Biometra TAdvanced). Run the program for ten cycles following the time and temperature settings shown in (Table 5).
Table 5

Settings for 2nd PCR. Rows in gray are performed for ten cycles.

PCR Cycling Conditions
StepsTemperatureTimeCycles
Initial Denaturation9830 s1
Denaturation985 s10
Annealing5510 s
Extension7210 s
Final Extension722 min1
Hold10
Pause Point: After the second PCR, the plate can be stored at 4°C for 1 day. Settings for 2nd PCR. Rows in gray are performed for ten cycles. Pool the final PCR products of both plates after the second PCR, which results in a total volume of 100 μL per sample. Fifteen μL can be used for quality control issues (e.g., gel electrophoresis).

Library Cleaning

Timing: approx. 1 h 30 min for 96 samples PCR purification is performed with AGENCOURT AMPure XP Beads (Beckman Coulter) and again fully automatized using Beckman Coulter Biomek 4000 robot. Prior to the library cleaning Remove the AMPure XP beads from 4°C storage and let stand for at least 30 min to bring to 20°C–22°C. Vortex the AMPure XP beads until they are well dispersed. Add 1.8 μL AMPure XP beads per 1.0 μL PCR product. Using a P1000 multi-channel pipette, the robot gently pipettes the entire volume up and down 10-times to mix thoroughly. For stool samples, the standard settings are 85 μL PCR product and 153 μL AMPure XP beads resulting in a total volume of 238 μL. Incubate at 20°C–22°C for 5 min. Put the well plate in the magnetic rack and let stand at 20°C–22°C for 5 min or until the liquid becomes clear in appearance. The robot removes the all of the clear supernatant using a P1000 multi-channel pipette. The fragment is bound to the beads and 200 μL freshly prepared 70% EtOH is added to each well using a P250 without barrier. Leave at 20°C–22°C for 30 s and discard the supernatant. Take extra care not to disturb the beads. Steps 31 and 32 are repeated once more, for a total of two 70% EtOH washes. Let the 96-well plate at 20°C–22°C for 4 - 5 min for drying, and then remove from the magnetic rack. Re-suspend the bead pellet in each well in 80 μL BE Elution (recommended volume of AMPure standard protocol). The robot gently pipettes the entire volume up and down 10-times to mix thoroughly using a P250 multi-channel pipette. CRITICAL: The amount of added Elution Buffer depends on the DNA yield of the PCR product. Low amounts of PCR product, i.e., weak bands on the gel, should be re-suspend with amount at or below 20 μL BE Elution. Incubate the 96-well plate at 20°C–22°C for 2 min. Place the 96-well plate on the magnetic rack at 20°C–22°C for 2 min or until the liquid becomes clear in appearance. Seventy μL of the clear supernatant from each well are transferred to an XP plate. Eight μL are transferred to a second plate for DNA measurements by fluorimetry (Qubit measurement according to the manufacturer’s instructions). If not enough volume is available, the total amount is transferred manually. Samples are diluted to a concentration of 2 nM and finally diluted to a concentration of 0.5 nM. From each sample of the 96-well plate, 5 μL are transferred to a low binding tube (pool of all samples of one plate). Pause Point: After the library cleaning the plate can be stored at 4°C for 1 day.

Prepare Samples for 16S rRNA Gene Sequencing

Calculate molarity of each sample based on measured Qubit concentrations for a mean over four measurements: For V3V4, the average library size is 572 bp. Following steps are necessary to denature the DNA and set to a concentration of 20 pM. Create a fresh 0.2 nM NaOH solution and a 0.2 nM Tris HCl solution. Add 40 μL of the 0.5 nM DNA pool and 40 μL of the 0.2 N NaOH solution to a 1.5-mL tube. Vortex the sample and centrifuge for 1 min (280 × g). Leave it in a stand for 5 min at 20°C–22°C. Add 40 μL of the 0.2 nM Tris HCl solution. Vortex the mixture and centrifuge for 1 min (280 × g). Incubate for 5 min at 95°C and for 5 min at 4°C. Add 880 μL cooled HT1-Buffer to the denaturated DNA pool to generate a 20 pM library. Dilute the DNA to get the final pM concentration of 10 pM final library concentration that was spiked-in using 20% (v/v) PhiX. PhiX DNA in a ready to sequence library (Illumina PhiX Control v3, FC-110-3001) is added in order to increase complexity for the first few bases sequenced. Otherwise, the sequencer miscalculates the amount of the dominating base and the sequencing fails. Six-hundred μL of the final pool is transferred to the Illumina MiSeq cartridge v3 with 600 cycles.

Expected Outcomes

After sequencing, the demultiplexed FASTQ files (forward and reverse file for each sample, Illumina bcl2fastq software) are transformed into Operational Taxonomic Unit (OTU) tables using the IMNGS (Lagkouvardos et al., 2016) platform which is based on the UPARSE approach for sequence quality check, chimera filtering, and cluster formation. To avoid spurious OTUs, we recommend a filtering threshold of 0.25% to remove artificial species Reitmeier et al. (2020) For downstream analysis, the generated OTU table is normalized by using the fully modular R pipeline Rhea (Lagkouvardos et al., 2017). The pipeline also provides information about alpha-diversity (within-sample diversity), beta-diversity (between-sample diversity) and generates a taxonomic classification.

Quantification and Statistical Analysis

Quality Control

The quality of the sequencing run is evaluated by FASTQC, which provides a modular set of quality control analysis. Graphical illustration about the quality scores over reads (bp) is used to show any problems occurred during the sequencing run. For human stool samples, it is intended to have for each sample about 10,000 reads (or more) after trimming, filtering, and chimera checking. Samples with too low number of reads should be excluded. However, the exact minimum threshold of reads depends on the studied environment and sequencing technology. Samples with total processed reads below the determined threshold should be re-sequenced.

Statistical Analysis

Descriptive analysis and data handling Handling sparsity in microbial datasets. For the analysis of 16S rRNA gene sequencing data of the large population-based cohort studies, we excluded OTUs with a relative abundance <0.1% and a prevalence <10%. Adjust for confounding and determine effect modifier. Confounders and effect modifiers are determined via a permutational multivariate analysis of variance using a distance matrix. For the confounders, the function is applied on the Bray-Curtis distance matrix considered as independent variable. The dependent variables are known confounding factors for which the data should be stratified anyway and the outcome of interested (e.g., Type 2 Diabetes). Effect modifiers help to explain the variation of the underlying microbial ecosystem. They are not considered as confounders but as contributors to the total variation. Therefore, co-variables are individually tested and ranked according to their significant explained variation. Statistical analysis to determine differences between groups/samples is obtained via linear regression model using lm from the R package vegan adjusted for the previously determined confounding factors. Machine learning – tool for classification and prediction A random forest model is used to classify binary outcome variables based on a combination of BMI and microbial composition with a 5-fold cross validation by using randomForest from the R package randomForest v4.6-14. To receive a robust and generalizable classification model, the machine-learning algorithm is applied 100-times iteratively assigning randomly individuals to either the training (80%) or test set (20%). For the training set, a subset of equally distributed T2D and nonT2D cases is taken to train the model. The model is further validated on the 20% test set. Based on out-of-bag error rates and Gini index, the most important features are selected for each iteration using rfcv from R package randomForest v4.6-14. Features, which appear in at least 50% of all 100 random forest models, are considered as classification feature for the final model (Figure 3)
Figure 3

.Random Forest Model for T2D Classification

Curves of receiver operating characteristics (ROC) for a random forest model using a training set (train set) of 80% of the data (dashed lines in the left panel) as well as using a test set with the remaining 20% of the data (ROC curves in the right panel). The mean AUC over 100 random data splits is shown. The boxplots below the curve panels show the distribution of AUCs across all generated models for the corresponding training and test sets, respectively.

Reused figure from Reitmeier et al. (2020); permission obtained from the corresponding author.

CRITICAL: To avoid overfitting of the classifier the data input needs to be reduced in advance, for example, based on a predefined cutoff for minimum relative abundance and prevalence. Implementation of a Generalized Linear Model .Random Forest Model for T2D Classification Curves of receiver operating characteristics (ROC) for a random forest model using a training set (train set) of 80% of the data (dashed lines in the left panel) as well as using a test set with the remaining 20% of the data (ROC curves in the right panel). The mean AUC over 100 random data splits is shown. The boxplots below the curve panels show the distribution of AUCs across all generated models for the corresponding training and test sets, respectively. Reused figure from Reitmeier et al. (2020); permission obtained from the corresponding author. For the risk prediction of T2D, a generalized linear model (GLM) for binomial distribution and binary outcome (logit) is generated using the previously selected features based on arrhythmic OTUs including BMI as additional variable. Therefore, two approaches are followed. First, the model is tested in a nested 80% - training and 20% - test scenario as described in the previous section for the random forest model. To verify the importance of the selected features, a generalized linear model for control OTUs is implemented repetitively 100-times (Figure 4).
Figure 4

.Generalized Linear Model

ROC curves for classification of T2D. The distribution of AUCs are shown by boxplots and are significantly different between the types of models. Results showed that the classification of T2D in the 20% blind test set performed comparable as the 5-fold cross validated data.

Reused figure from Reitmeier et al. (2020); permission obtained from the corresponding author.

Circadian analysis of human stool samples Identify rhythmic OTUs’ “Pre-filtering” Collection daytime needs to be converted into a 24-h time scale ranging from 0 to 23:59 h (see “Time point” in Table 6).
Table 6

Example of a Raw OTU Table with Assigned Time Points and Intervals

Time
IntervalGroupSubject IDOTU 1OTU 2OTU 3OTU X
DaytimeTime Point
00:01023.5AXXX1A1A2-A4
00:05023.5BYYY1B1-B3B4
00:10023.5CXXX2A1A2A3A4
01:1011.5AXXX3A1A2A3A4
04:2043.5CYYY2-B2B3-
11:201111.5BYYY3B1B2B3B4
...........................
Example of a Raw OTU Table with Assigned Time Points and Intervals The Raw OTU table including “Time point” need to be transferred in GraphPad Prism using an XY table with single Y values for each time point (Figure 5).
Figure 5

.OTU Table in GraphPad

The Excel sheet is transferred in an XZ/Sheet in GraphPad Prism for further analysis.

.OTU Table in GraphPad The Excel sheet is transferred in an XZ/Sheet in GraphPad Prism for further analysis. A cosine-regression can be applied for each single OTU by using the =Analyze button. .Generalized Linear Model ROC curves for classification of T2D. The distribution of AUCs are shown by boxplots and are significantly different between the types of models. Results showed that the classification of T2D in the 20% blind test set performed comparable as the 5-fold cross validated data. Reused figure from Reitmeier et al. (2020); permission obtained from the corresponding author. A nonlinear regression needs to be applied with the following equation (Figure 6):
Figure 6

.OTU Nonlinear Regression Analysis in GraphPad

.OTU Nonlinear Regression Analysis in GraphPad or a double harmonic cosine wave equation:on alpha-diversity and relative abundance, with a fixed 24-h period. The goodness of fit needs to be corrected for multiple comparisons and the significance determined using an F-test. Each p value needs to be Bonferroni-adjusted for multiple testing. A statistically significant difference can be assumed when p ≤ 0.05. Most circadian rhythm detection algorithms were developed to assess the significance of rhythms in large data sets obtained from gene expression analysis (e.g., microarray, in situ hybridization) with relatively low sampling rates (~1 sample/h). Thus, microbiota data collected throughout the 24-h day need to be combined in hourly intervals to be analyzed with different methods. Alternatively to the cosine wave regression fit, which can handle high sampling rates, the rhythmicity detection algorithm named JTK_CYCLE (Hughes et al., 2010) can be used. JTK_CYCLE employs a non-parametric algorithm, detecting sinusoidal signals and, therefore, is more reliable when data are not normally distributed. Importantly, 4-h sampling intervals are a minimum and JTK_CYCLE is not working well with only one daily cycle. Nevertheless, JTK_CYCLE presents the highest false negative rates (Hughes et al., 2009). For example, the OTU table in Table 6 can be transposed as illustrated in Figure 7.
Figure 7

Transposed OTU Table

Transposed OTU Table Although microbiota sequencing data are predominantly sinus shaped, the analysis may certainly profit from adding harmonics in order to describe a more complex microbiota profile. Harmonics are integrated. e.g.. in CircWave or Harmonic cosine wave regression. CircWave is different from JTK_CYCLE in that it uses a parametric approach, e.g., an F-tested forward harmonic regression procedure similar to the Cosine- or Harmonic cosine wave regression, except that CircWave automatically detects how many harmonics can be added by F-test criterion (step forward regression style). Thus, it is likely more powerful to detect rhythmicity in normal distributed data compared to JTK_CYCLE. Unfortunately, in comparison to gene expression data, human sequencing data are particular in multiple ways: (i) the prevalence of OTUs can vary between groups and between individuals within one group, and (ii) the distribution of fecal samples in a human population study varies dramatically over the 24-h day. In particular, defecation occurs in 70% of the people between 5 and 11 am. Consequently, an algorithm assuming equally distributed samples over the course of the day, such as CircWave, would need optimization. A method which works independently of the sample size per time point and which can handle missing values, are the Cosine and Harmonic cosine wave regression. Both are parametric analyses similar to CircWave, which can integrate up to two harmonics. Other possibilities are represented by the online tool Nitecap (unpublished) or RAIN (Thaben and Westermark, 2014), which, similarly to JTK_CYCLE, represents a non-parametric method for the detection of rhythms in biological data sets and, thus, can detect arbitrary wave forms. Nevertheless, RAIN requires a fairly powerful computer, which, at least in our case, must be able to handle data from more than 2,000 subjects. In summary, we highly recommend identifying rhythms in microbiome data sets with multiple tools, including parametric and non-parametric, non-harmonic and harmonic logarithms, depending on the microbiome data set available. There are various analysis tools available, which combine multiple methods, such as MetaCycle (Wu et al., 2016), incorporating JTK_CYCLE, ARSER (Yang and Su, 2010), and Lomb-Scargle (Lomb, 1976). Nevertheless, ARSER does not considers replicates and cannot cope with missing data that are likely present with microbiome data. Time points are named from Row 1B onwards. Single OTU names are found in Column 1B downwards. When saved as txt file, JTK_CYCLE identifies significantly rhythmic OTUs with a p value corrected for multiple regression as illustrated in yellow in the output file (Figure 8).
Figure 8

.JTK Output Table

Columns are referring to adjusted q (BH.Q) and p value (ADJ.P), period (PER), phase (LAG) and amplitude (AMP) as well as the relative abundance values of the corresponding OTU (rows).

.JTK Output Table Columns are referring to adjusted q (BH.Q) and p value (ADJ.P), period (PER), phase (LAG) and amplitude (AMP) as well as the relative abundance values of the corresponding OTU (rows). Importantly, the circadian analysis needs to be performed separately for group A and group B. Thereby, the amount of rhythmic OTUs in group A can be compared to the amount of rhythmic OTUs in group B. However, to compare rhythmicity of a specific OTU directly between the two groups, further analysis, as described in 4a, is necessary. Detection of differential rhythmicity of specific OTUs, e.g., comparing rhythmicity of different genotypes, treatments, or phenotypes The relative abundance of each OTU was assessed for a 24-h rhythmicity in the pre-filtering step 4a using the cosine wave regression, JTK_CYCLE or any other circadian analysis software for each group examined (such as nonT2D or T2D) separately. With this pre-filtering method, the amount of OTUs from all OTUs analyzed will be identified as significantly rhythmic in group A and independently in group B. However, these rhythmic OTUs can differ between the groups. Therefore, all OTUs rhythmic in at least one group need to be further analyzed for differential 24-h time-of-day patterns comparing data from group A with group B using the Detection of Differential Rhythmicity (DODR) R packages (Thaben and Westermark, 2016). These results will determine whether an OTU, which appears rhythmic in group A, also (1) exhibit circadian oscillation, (2) shows a different rhythmicity (i.e., phase and amplitude), or (3) lacks rhythmicity in group B and vice versa. One OTU table per group needs to be generated in txt format. Importantly, the same OTUs need to be listed in both group A and in the file from group B as illustrated in Tables 7 and 8.
Table 7

OTU Table Group A

Time PointOTU 10OTU 15OTU 100OTU 219OTU 412OTU n
000.103057000.2461930.080156
00.0118650000.2194930.065255
000.03011600.0075290.0376450.097877
00.0118850000.1782740
10.0112770.0789350.0338290.1014880.2368060.045106
10.0589030000.0084150
100.0763200.0101760.0407040.055968
100.10221100.0161390.0591750.032277
200.12094800.024190.024190.036284
200.1662070.050840.0156430.1994490.021509
20.30281000.0726740.0968990.084787
3000.18114800.1624090.037479
300.14651600.0705450.0596920.179075
3000.0294830.0412760.0530690.076655
400.0299800.0599590.0839430.077947
400000.0599680.21322
40000.1392970.2940720.023216
500.0178390.0297320.0118930.0535170
500.0307650.00615300.0676840.049225
500.083903000.0239720
n..................
Table 8

OTU Table Group B

Time pointOTU 10OTU 15OTU 100OTU 219OTU 412OTU n
00.5401060.70098800.1034250.0229830.321765
02.300020.0067850.0135690.1696180.1221250.061062
00.7691080.3765420.040058000.176254
01.0712330.6802720.00781900.1798420.062554
03.631990.950090.08418500.4810580.264582
02.4067870.6291120.2943550.0173150.0865750.06926
16.2909060.0097680.01953700.0488420.029305
10.2560821.09901800.2987620.021340.096031
10.472030.93401600.3715980.1606910.230993
20.1978150.847777000.0470990.36737
20.0118630.97277400.2135360.0177950
31.4270670000.0233950.666745
36.6082800.00995200.0099520.159236
37.8885020.1114980.51567900.0975610
42.597180.5993490.3139450.3424850.0799130.022832
41.4538250.0220280.01652100.0495620.033041
52.9316241.1346860.4218160.113890.1771630.054836
50.5494290.75984900.0272770.0155870.066243
52.6440240.010748000.1504730.042992
n..................
The time points may differ between the groups. OTU Table Group A OTU Table Group B In the DODR output table, the results from all applied analysis (described in detail by Thaben and Westermark (2016) are indicated for every specific OTU (see Table 9) including the p value for the robustDODR analysis.
Table 9

DODR Output Table

OTUHANOVAHarmNoisePred1HarmNoisePred2HarmScaleTestrobustDODRrobustHarmScaleTestmeta.p.val
100.35433.33E-1611.33E-154.66E-060.5388332.00E-15
150.00421.11E-1318.38E-126.97E-050.0058146.65E-13
1000.21470100.0002880.1805280
2190.00295.44E-1511.85E-130.0005670.0303483.26E-14
4120.00061.09E-1317.21E-120.000730.0772926.54E-13
N0.00019.26E-1316.59E-110.0020620.0310115.56E-12
DODR Output Table Resulting DODR p-values need to be corrected for multiple comparisons and for significant OTUs that have a corrected p value ≤0.05, a significance level can be identified, e.g., which OTUs appear rhythmic in group A, but show a differential rhythmicity in group B. To address what kind of difference appears between the two groups, such as amplitude or phase, differences can be analyzed by an additionally R package called “HarmonicRegression” (Luck et al., 2014). Illustration of cosine wave-fitted grouped data using GraphPad Prism Grouping of subjects to predefined time intervals. To receive the highest possible resolution of the curve fit, time intervals need to be predefined with the goals to (i) include an equal number of subjects per interval and (ii) group for further circadian rhythm analysis. The higher the frequency of sample collection, the better the resolution. Next, subjects are to be grouped according to the assigned intervals (i.e, bins; see “Interval” in Table 6). For instance, for 2-h intervals, data from subjects collected within 23:00 p.m. and 0:59 a.m. are merged into one bin referred to as “23.5.” CRITICAL: When time intervals are assigned, group sizes should be equal between time points within one group and between groups. For example, when data are obtained from 360 subjects per group, each of the 12 2-h intervals should include 30 ± 5 subjects. Results from the different groups need to be averaged per interval within each group (as illustrated in Table 10) to be transferred to GraphPad Prism using, e.g., an XY table with mean (AVE) values ± standard deviation (SD) and sample size (n) calculated, e.g., in Microsoft Excel.
Table 10

Results from Different Groups Averaged per Interval within Each Group


Group A
Group B
IntervalAverageSDnAverageSDn
1.5AVE1SD130AVE1SD130
2.5AVE2SD230AVE2SD230
5.5AVE3SD330AVE3SD330
7.5AVE4SD430AVE4SD430
9.5AVE5SD530AVE5SD530
11.5AVE6SD630AVE6SD630
13.5AVE7SD730AVE7SD730
15.5AVE8SD830AVE8SD830
17.5AVE9SD930AVE9SD930
19.5AVE10SD1030AVE10SD1030
21.5AVE11SD1130AVE11SD1130
23.5AVE12SD1230AVE12SD1230
Results from Different Groups Averaged per Interval within Each Group As described in paragraph 4aiii, a cosine-wave regression will be applied and the significance of the goodness of fit is evaluated by an F-test. In case the cosine fit is reaching significance, the cosine wave can be illustrated in the graph, whereas a non-significant fit is shown by simply connecting straight lines between data points (see Figure 9).
Figure 9

.Illustration of Cosine-Wave Regression

Diurnal profiles of richness depending of subjects from different groups (red, Group B; black, Group A). Significant rhythms (cosine-wave regression, p ≤ 0.05) are illustrated with fitted cosine-wave curves; data points connected by straight lines indicate no significant cosine fit curves (p > 0.05) and thus no rhythmicity.

.Illustration of Cosine-Wave Regression Diurnal profiles of richness depending of subjects from different groups (red, Group B; black, Group A). Significant rhythms (cosine-wave regression, p ≤ 0.05) are illustrated with fitted cosine-wave curves; data points connected by straight lines indicate no significant cosine fit curves (p > 0.05) and thus no rhythmicity.

Limitations

The sample preparation strongly influences the outcome and quality of the sequencing, which limits the comparability between studies. Bioinformatical methods, clustering approaches, and filtering can influence the abundance of certain taxonomies. Taxonomic classification of 16S rRNA gene sequencing data is limited in its accuracy to assign species or even strains correctly. The taxonomic assignment is only based on a short amplicon, which increases the difficulty to determine correctly the bacterial species found. This designation also depends on the used database, which have differences when comparing them against each other. In human studies, it is almost impossible to cover all daytimes for the analysis of circadian rhythmicity. A minimum of approx. 300 samples distributed across the full day are required within a single group to achieve a resolution necessary to detect significant circadian rhythms.

Troubleshooting

Problem 1

Strikingly low 260/280 values obtained by NanoDrop could be due to mistakes during the DNA cleaning step (e.g., ethanol residuals in cleaning columns) (DNA Isolation, steps 1–16).

Potential Solution

If enough starting material (e.g., stool sample) is available, the sample preparation needs to be repeated. Including one additional washing step during the DNA isolation (DNA Isolation, steps 1–16).

Problem 2

Low biomass samples could result in insufficient PCR products (Library construction by Polymerase Chain Reaction, steps 17–26). Increase the number of the second PCR cycles and/or the dilution of the sample could help to overcome this problem.

Problem 3

Samples with a low number of reads could be caused by problems during the demultiplexing (e.g., misdisposition of indices) (Library construction by Polymerase Chain Reaction, steps 17–26). Double-check the assigned index primers with the sequences provided in the samples sheet. Adjust trimming length of the forward and reverse reads.

Problem 4

Precipitous FASTQ curves could be an indicator for primer dimers, which could be due to poor purification of the sample (e.g., when using magnetic beads) (Library cleaning, steps 27–40). Repeat the purification step (Library cleaning, steps 27–40) with the pooled PCR products using AMPure XP magnetic beads with a lower concentration of 0.6 μL.

Problem 5

Insufficient data to calculate rhythmicity (Statistical analysis - 4. Circadian analysis of human stool samples) Sample numbers within a group needs to be increased. A study with 80 people requires approximately four samples per person, which results in 320 samples in total distributed across the day, to find diurnal rhythms comparable to results obtained from a cohort with more than 1,900 subjects from whom a single sample per person was collected (Reitmeier et al., 2020). If an increase in sample size is not possible the distribution of collection times needs to be expanded. For example, the number of samples need to be spread across the daytime, such as 20–30 samples per daytime hours to receive a resolution to detect significant rhythms.

Resource Availability

Lead Contact

Further information and requests for resources and reagents should be directed to and will be fulfilled by the Lead Contact, Prof. Dr. Dirk Haller

Materials Availability

This study did not generate any unique materials or reagents.

Data and Code Availability

Sequence data, analyses, and resources related to the 16S rRNA gene sequencing of human cohort (N = 8), and data from human cohort are available upon request from the corresponding author. Software used to analyze the data are either freely or commercially available. Source code data are available from the corresponding author on request.
REAGENT or RESOURCESOURCEIDENTIFIER
Chemicals, Peptides, and Recombinant Proteins

polyvinylpyrrolidone (PVPP)SigmaCat# 77627 100 g
guanidinium thiocyanateSigmaCat# G9277 500 g
N-lauroylsarcosine sodiumSigmaCat# L5125 100 g
Phusion Hot Start II High fidelityThermo FisherCat# F-549L
HF Puffer PackThermo FisherCat# F-518L
dNTP Mix, 10 mM each, 2 × 0.5 mLBiozymCat# 331520
100 bp DNA LadderNEBCat# N3231S
GelRed Nucleic Acid Gel Stain, 10,000× in water; 0.5 mLVWRCat# 41003
dNTPsSigmaCat# D7295 20 × 0.2 mL
AgaroseSigmaCat# A9539 500 g
DMSOSigmaCat# D2650 5 × 10 mL
Lysing Matrix BMP BiomedicalsCat# 116911500
16S rRNA gene Illumina sequencing primers (V3V4)Kozich et al., 2013341F-ovh and 785r-ovh
AMPure XP beadsBeckmanCat# A63881
PhiX Control v3 LibraryIlluminaFC-110-3001
RNase AThermo FisherCat# EN0531

Critical Commercial Assays

Nucleo Spin gDNA clean-up (250)Machery-NagelCat# 740230250
Binding Buffer DBMachery-NagelCat# 740323.1
Qubit 1 × dsDNAhs Kit 500 assays REF Q32854 (Life Technologies)Fisher ScientificCat# 15860210
MiSeq® Reagent Kit v3 (600 cycle)Illumina IncCat# MS-102-3003
Mock communityZymoBIOMICSCat# D6300

Software and Algorithms

bcl2fastqbcl2fastqhttps://support.illumina.com/sequencing/sequencing_software/bcl2fastq-conversion-software.htmlRRID:SCR_015058
GraphPad Prism v8.0.2Graphpad Softwarehttps://www.graphpad.com/scientific-software/prism/RRID:SCR_002798
RStudioRStudiohttps://rstudio.com/products/rstudio
BLASTAltschul et al. (1990)https://blast.ncbi.nlm.nih.govRRID:SCR_007190
IMNGSLagkouvardos et al. (2016)https://www.imngs.org/
EvolViewHe et al. (2016)https://www.evolgenius.info/
FASTQChttp://www.bioinformatics.babraham.ac.uk/projects/fastqc/RRID:SCR_014583
EzBiocloudYoon et al. (2017)https://www.ezbiocloud.net/
KEGGKanehisa and Goto (2000)https://www.genome.jp/kegg/RRID:SCR_001120
HeatmapperBabicki et al. (2016)http://www.heatmapper.ca;
GraPhlAnSegata et al. (2013)https://github.com/biobakery/graphlan
RheaLagkouvardos et al. (2017)https://github.com/Lagkouvardos/Rhea
JTK_CYCLEHughes et al. (2010)https://www.r-project.org/
HUMAnN2Franzosa et al. (2018)https://github.com/biobakery/humann
Psych R packageRevelle (2020)https://cran.r-project.org/web/packages/psych/index.html
randomForest R packageLiaw and Wiener (2002)https://cran.r-project.org/web/packages/randomForest/randomForest.pdfRRID:SCR_015718
metaphlan2Segata et al. (2012)https://github.com/biobakery/metaphlanRRID:SCR_004915

Oligonucleotides

341F-ovh Primer: CCTACGGGNGGCWGCAGKlindworth et al. (2013)N/A
785R-ovh Primer: GACTACHVGGGTATCTAATCCKlindworth et al. (2013)N/A

Biological Samples

Healthy adults (N = 8), stool samples (n = 24) for the analysis of storage effectTechnical University Munich, Chair of Nutrition and ImmunologyAvailable upon request

Other

DNA-Stool-StabilizerINVITEKCat# 1038111100
Stool Collection Tubes with StabilizerINVITEKCat# 1038111300
Combitips advanced, 5 mLdiagonalCat# 30089812
Combitips advanced, 25 mLdiagonalCat# 30089839
Micro tube, 2.0 mL, SafeSealsarstedtCat# 72695400
Micro tube, 1.5 mL, SafeSealsarstedtCat# 72706400
Micro tube, 2.0 mL, PPsarstedtCat# 72693005
96-Well Skirted PCR Plate4ti-tudeCat# 4ti-0960
PCR Foil Seal4ti-tudeCat# 4ti-0550
Microplate Seals for Aqueous Sample Storage4ti-tudeCat# 4ti-0510
Adhesive Seals for PCR Plates4ti-tudeCat# 4ti-0500
1,000-μL tips with barrierbeckmanCat# B01124
50-μL tips with barrierbeckmanCat# A21586
250-μL tips with barrierbeckmanCat# 717253
250-μL tips without barrierbeckmanCat# 717252
AMPure XP beadsbeckmanCat# A63881
Deep-well Plate (AB-1127)Fisher ScientificCat# 10243223
PCR Tubes, 0.5 mL for Qubit (AXYGEN) PCR-05-CFisher ScientificCat# 11331974
Tips GP LTS, 20 μLMettler-ToledoCat# 30389274
Tips GP LTS, 200 μLMettler-ToledoCat# 30389276
Tips GP LTS, 1,000 μLMettler-ToledoCat# 30389272
10/20 μL RPT XL Graduated Filter Tip (Sterile)StarLabCat# S1180-3710-C
0.2 m: 8-Strip “Non-Flex” Natural PCR Tubes, Ind-Attached Flat Caps (Xtra-Clear)StarLabCat# I1402-3700
200 μL RPT Graduated Filter Tip (Sterile)StarLabCat# S1180-8710-C
10/20 μL RPT XL Graduated Filter Tip (Sterile)StarLabCat# S1180-3710-C
FastPrep-24MP BiomedicalsCat# 15260488
CoolPrep adapterMP BiomedicalsCat # 6002528
Biomek 4000 Automated Liquid HandlerBeckman coulterCat # C23350
Biometra TAdvancedAnalytik Jena AGCat # 846-x-070-211
Qubit-1 (ng/μL)Qubit-2 (ng/μL)Qubit-3 (ng/μL)Qubit-4 (ng/μL)Mean (ng/μL)nM
Final PoolPool_10.180.180.190.180.180.49
Pool_20.170.180.180.180.18
Pool_30.170.170.180.180.18
  26 in total

1.  KEGG: kyoto encyclopedia of genes and genomes.

Authors:  M Kanehisa; S Goto
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Basic local alignment search tool.

Authors:  S F Altschul; W Gish; W Miller; E W Myers; D J Lipman
Journal:  J Mol Biol       Date:  1990-10-05       Impact factor: 5.469

3.  Development of a dual-index sequencing strategy and curation pipeline for analyzing amplicon sequence data on the MiSeq Illumina sequencing platform.

Authors:  James J Kozich; Sarah L Westcott; Nielson T Baxter; Sarah K Highlander; Patrick D Schloss
Journal:  Appl Environ Microbiol       Date:  2013-06-21       Impact factor: 4.792

4.  Molecular microbial diversity of an anaerobic digestor as determined by small-subunit rDNA sequence analysis.

Authors:  J J Godon; E Zumstein; P Dabert; F Habouzit; R Moletta
Journal:  Appl Environ Microbiol       Date:  1997-07       Impact factor: 4.792

5.  Arrhythmic Gut Microbiome Signatures Predict Risk of Type 2 Diabetes.

Authors:  Sandra Reitmeier; Silke Kiessling; Thomas Clavel; Markus List; Eduardo L Almeida; Tarini S Ghosh; Klaus Neuhaus; Harald Grallert; Jakob Linseisen; Thomas Skurk; Beate Brandl; Taylor A Breuninger; Martina Troll; Wolfgang Rathmann; Birgit Linkohr; Hans Hauner; Matthias Laudes; Andre Franke; Caroline I Le Roy; Jordana T Bell; Tim Spector; Jan Baumbach; Paul W O'Toole; Annette Peters; Dirk Haller
Journal:  Cell Host Microbe       Date:  2020-07-02       Impact factor: 21.023

6.  Evaluation of general 16S ribosomal RNA gene PCR primers for classical and next-generation sequencing-based diversity studies.

Authors:  Anna Klindworth; Elmar Pruesse; Timmy Schweer; Jörg Peplies; Christian Quast; Matthias Horn; Frank Oliver Glöckner
Journal:  Nucleic Acids Res       Date:  2012-08-28       Impact factor: 16.971

7.  Conducting a microbiome study.

Authors:  Julia K Goodrich; Sara C Di Rienzi; Angela C Poole; Omry Koren; William A Walters; J Gregory Caporaso; Rob Knight; Ruth E Ley
Journal:  Cell       Date:  2014-07-17       Impact factor: 41.582

8.  Evolview v2: an online visualization and management tool for customized and annotated phylogenetic trees.

Authors:  Zilong He; Huangkai Zhang; Shenghan Gao; Martin J Lercher; Wei-Hua Chen; Songnian Hu
Journal:  Nucleic Acids Res       Date:  2016-04-30       Impact factor: 16.971

9.  Heatmapper: web-enabled heat mapping for all.

Authors:  Sasha Babicki; David Arndt; Ana Marcu; Yongjie Liang; Jason R Grant; Adam Maciejewski; David S Wishart
Journal:  Nucleic Acids Res       Date:  2016-05-17       Impact factor: 16.971

10.  Species-level functional profiling of metagenomes and metatranscriptomes.

Authors:  Eric A Franzosa; Lauren J McIver; Gholamali Rahnavard; Luke R Thompson; Melanie Schirmer; George Weingart; Karen Schwarzberg Lipson; Rob Knight; J Gregory Caporaso; Nicola Segata; Curtis Huttenhower
Journal:  Nat Methods       Date:  2018-10-30       Impact factor: 28.547

View more
  8 in total

1.  Impact of sucroferric oxyhydroxide on the oral and intestinal microbiome in hemodialysis patients.

Authors:  Mohamed M H Abdelbary; Christoph Kuppe; Sareh Said-Yekta Michael; Thilo Krüger; Jürgen Floege; Georg Conrads
Journal:  Sci Rep       Date:  2022-06-10       Impact factor: 4.996

2.  Gut bacterial dysbiosis and instability is associated with the onset of complications and mortality in COVID-19.

Authors:  David Schult; Sandra Reitmeier; Plamena Koyumdzhieva; Tobias Lahmer; Moritz Middelhoff; Johanna Erber; Jochen Schneider; Juliane Kager; Marina Frolova; Julia Horstmann; Lisa Fricke; Katja Steiger; Moritz Jesinghaus; Klaus-Peter Janssen; Ulrike Protzer; Klaus Neuhaus; Roland M Schmid; Dirk Haller; Michael Quante
Journal:  Gut Microbes       Date:  2022 Jan-Dec

3.  Offering Fiber-Enriched Foods Increases Fiber Intake in Adults With or Without Cardiometabolic Risk: A Randomized Controlled Trial.

Authors:  Beate Brandl; Rachel Rennekamp; Sandra Reitmeier; Katarzyna Pietrynik; Sebastian Dirndorfer; Dirk Haller; Thomas Hofmann; Thomas Skurk; Hans Hauner
Journal:  Front Nutr       Date:  2022-02-16

4.  Preservation by lyophilization of a human intestinal microbiota: influence of the cultivation pH on the drying outcome and re-establishment ability.

Authors:  Regina Haindl; Lisa Totzauer; Ulrich Kulozik
Journal:  Microb Biotechnol       Date:  2022-02-06       Impact factor: 5.813

5.  Establishment of an In Vitro System of the Human Intestinal Microbiota: Effect of Cultivation Conditions and Influence of Three Donor Stool Samples.

Authors:  Regina Haindl; Julia Engel; Ulrich Kulozik
Journal:  Microorganisms       Date:  2021-05-13

6.  Full-Length SSU rRNA Gene Sequencing Allows Species-Level Detection of Bacteria, Archaea, and Yeasts Present in Milk.

Authors:  Isabel Abellan-Schneyder; Annemarie Siebert; Katharina Hofmann; Mareike Wenning; Klaus Neuhaus
Journal:  Microorganisms       Date:  2021-06-09

7.  Processing Matters in Nutrient-Matched Laboratory Diets for Mice-Microbiome.

Authors:  Jasmin Wenderlein; Linda F Böswald; Sebastian Ulrich; Ellen Kienzle; Klaus Neuhaus; Ilias Lagkouvardos; Christian Zenner; Reinhard K Straubinger
Journal:  Animals (Basel)       Date:  2021-03-18       Impact factor: 2.752

8.  A protocol for characterization of extremely preterm infant gut microbiota in double-blind clinical trials.

Authors:  Magalí Martí; Johanne E Spreckels; Maria C Jenmalm; Thomas Abrahamsson
Journal:  STAR Protoc       Date:  2021-07-09
  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.