Literature DB >> 31143306

Characteristics of Gut Microbiota in Patients With Diabetes Determined by Data Mining Analysis of Terminal Restriction Fragment Length Polymorphisms.

Yuta Nakamura1, Yoshio Nagai1, Toshio Kobayashi2, Kentaro Furukawa3, Yoichi Oikawa4,5, Akira Shimada5, Yasushi Tanaka1.   

Abstract

BACKGROUND: This study was performed to clarify whether gut microbiota obtained from fecal samples could identify the type of diabetes in patients of each gender by using a combination of terminal restriction fragment length polymorphism (T-RFLP) analysis and data mining.
METHODS: A cross-sectional study was performed at three centers. Fecal samples were collected from 12 Japanese patients with type 1 diabetes mellitus (T1D), 18 patients with type 2 diabetes mellitus (T2D), and 31 subjects without diabetes mellitus (non-DM). Amplification of fecal 16S rRNA was carried out. After digestion of the amplification products with restriction enzymes (AluI, BslI, HaeIII, and MspI), terminal restriction fragments (T-RFs) of DNA were detected. A data mining algorithm (classification and regression tree (CART) modeling system) provides a decision tree that classifies subjects into various groups according to pre-assigned characteristics.
RESULTS: Among men, the error rate was 2.4% with MspI, while error rates were 0.0% with other restriction enzymes. Among women, the error rate was 0.0% with all restriction enzymes. The operational taxonomic units (OTUs) incorporated into the decision tree differed between men and women.
CONCLUSIONS: We were able to classify the 16SrRNA gene amplification products obtained from fecal samples of T1D patients, T2D patients, and non-DM subjects with a high level of precision by combining T-RFLP analysis and data mining. Specific gut microbiota patterns were found for T1D and T2D patients, as well as a sex difference of the patterns.

Entities:  

Keywords:  Data mining; Diabetes; Gut microbiota; T-RFLP analysis

Year:  2019        PMID: 31143306      PMCID: PMC6522229          DOI: 10.14740/jocmr3791

Source DB:  PubMed          Journal:  J Clin Med Res        ISSN: 1918-3003


Introduction

Gut microbiota are increasingly recognized as having an important role in regulation of energy homeostasis and the immune system [1], and analysis of gut microbiota is becoming instrumental for assessment of the etiology and pathogenesis of some diseases, especially inflammatory bowel disease [2, 3]. The human gut contains approximately 100 trillion bacteria, including more than 1,000 different bacterial species [4, 5]. Conventional culture techniques can only detect 30% of the total gut microbiota [6], resulting in difficulty with performing meaningful analysis. Recently, a new method has been proposed for analyzing the gut microbiota. In the field of molecular microbiology, identification of bacteria by high throughput DNA amplification and subsequent 16S rRNA gene sequencing has been used as a replacement for conventional culture techniques [7], and next-generation sequencing has been suggested as a suitable tool for studying the human microbiota [8]. However, the high cost and complexity of such methods are major obstacles to widespread adoption. Terminal restriction fragment length polymorphisms (T-RFLP) analysis is another molecular biological method that could be used to obtain quantitative data on the gut microbiota [9]. This method is based on enzymatic digestion of PCR products and is suitable for handling many samples cost-effectively, although it not useful for identifying bacterial species. Thus, research using T-RFLP analysis needs to be based on a new concept. Data mining is a process of sorting and analyzing large data sets to identify patterns and establish relationships. This technique could allow us to recognize even unknown bacteria from certain patterns. Using a data mining approach, we were previously successful in identifying smokers by examining fecal samples from 92 Japanese men [10]. Diabetes mellitus is a metabolic disorder characterized by chronic hyperglycemia due to insufficient action of insulin and there are two major types. Type 1 diabetes (T1D) is caused by autoimmune destruction of pancreatic β cells, whereas type 2 diabetes (T2D) arises from the combination of relatively insufficient insulin secretion and obesity-associated insulin resistance. A previous study of gender differences in the gut microbiota showed that the microbiota may regulate sex hormone levels and influence individual susceptibility to autoimmunity [11]. Based on the above considerations, this study was performed to clarify whether gut microbiota obtained from fecal samples could identify the type of diabetes in patients of each gender by using a combination of T-RFLP analysis and data mining.

Methods

Study design

A cross-sectional study was performed at three centers in Japan: St. Marianna University School of Medicine Hospital (Kawasaki, Japan), Tokyo Saiseikai Central Hospital (Tokyo, Japan), and Furukawa Hospital (Yokohama, Japan).

Subjects

Between April 2015 and December 2016, 12 Japanese patients with T1D, 18 patients with T2D, and 31 subjects without diabetes mellitus (non-DM) were enrolled in this study. The subjects included 37 men (four patients with T1D, 14 patients T2D, and 19 non-DM subjects) and 24 women (eight patients with T1D, four patients with T2D, and 12 non-DM subjects). Diabetes was diagnosed and classified as type 1 or type 2 by diabetologists. The exclusion criteria for this study were as follows: 1) Antimicrobial therapy; 2) Prior gastrointestinal surgery; and 3) Severe liver or kidney disease. The non-DM group was recruited from among persons undergoing regular health screening tests. This study was conducted in accordance with the ethical principles of the Declaration of Helsinki, and was approved by the ethics committee of St. Marianna University School of Medicine. All patients gave written informed consent prior to participation.

Materials

Fecal samples were collected from the subjects and bacterial DNA was isolated from 100 mg of feces using the method described by Takahashi et al [12]. Amplification of fecal 16S rRNA was carried out as described by Nagashima et al [13]. After digestion of the amplification products with restriction enzymes (AluI, BslI, HaeIII, and MspI), terminal restriction fragments (T-RFs) of DNA were detected. T-RF lengths were determined with an ABI PRISM® 3130xl Genetic Analyzer (Applied Biosystems, Foster City, CA, USA). As the apparent size of identical T-RFs may vary by 1 - 2 bp among different gels and/or different lanes of the same gel, major T-RFs having a size within 1 - 2 bp were summarized as operational taxonomic units (OTUs). Each OTU was designated on the basis of the restriction enzyme and restriction fragment length (bp), for example the AluI 44 bp OTU was abbreviated as A44. OTUs were quantified by determining the percentage of each individual OTU relative to the total OTU area, and this was expressed as the peak percent area under the curve (% area under the curve (AUC)).

Statistical analysis

A data mining algorithm (classification and regression tree (CART) modeling system), which is the usual method of data mining, provides a decision tree that classifies subjects into various groups according to pre-assigned characteristics. CART divides subjects into two subsets by comparing the Gini coefficient according to the OTU data, such that the subjects within each subset are more homogeneous than in the previous subset. The CART system is flexible and allows unequal misclassification costs to be considered, unlike the other data mining systems. A major point of data mining and the decision tree thus constructed is that a single selected OTU is used for each step of the tree. The default setting of the CART system allows a decision tree to grow up to five steps. Balance nodes [14], which are boosting and duplicating a minor data, are used to correct imbalances in the dataset, which readily develop with higher nominal partitions, and we confirmed that more accurate results could be obtained with the specified test criteria. If necessary, balancing is carried out by boosting infrequent values at the time of decision tree construction.

Results

Baseline characteristics of the subjects are shown in Table 1. In the present study, all patients diagnosed with T1D showed an acute-onset pattern of hyperglycemia and were positive for glutamic acid decarboxylase (GAD) antibody by radioimmune assay. The median (interquartile range: IQR) age of the T1D group, T2D group, and non-DM group was 46 (38 - 60) years, 45 (40 - 63) years, and 47 (41 - 53) years, respectively (P = 0.833). Body mass index (BMI) was 21.7 (19.8 - 23.8), 28.9 (25.0 - 32.0), and 22.0 (19.8 - 24.8) kg/m2, respectively (P < 0.001), being higher in the T2D group than in the other groups. The estimated duration of diabetes was 4.0 (1.0 - 5.0) years in the T1D group and 3.0 (1.0 - 4.3) years in the T2D group (P = 0.232). Hemoglobin A1c (HbA1c) was 7.5% (6.7-8.6%) and 10.1% (9.2-12.0%), respectively (P < 0.001), being higher in the T2D group than in T1D group. Antidiabetic drugs are also shown in Table 1.
Table 1

Characteristics of the Subjects (Median (IQR))

T1D groupT2D groupNon-DM group
Number121831
Gender (women/men)8/44/1412/19
Age (years)46(38 - 60)45(40 - 63)47(41 - 53)P = 0.833
Weight (kg)52.5(51.0 - 66.8)73.8(68.0 - 92.1)a61(49.0 - 68.0)bP < 0.001
BMI (kg/m2)21.7(19.8 - 23.8)28.9(25.0 - 32.0) a2.0(19.8 - 24.8) bP < 0.001
Duration of DM (years)4.0(1.0 - 5.0)3.0(1.0 - 4.3)-P = 0.232
Neuropathy (+/-)0/121/17-
Retinopathy (+/-)0/122/16-
Nephropathy (stage 1/2/3/4/5)12/0/0/0/013/5/0/0/0-
HbA1c (%)7.5(6.7 - 8.6)10.1(9.2 - 12.0)-P < 0.001
Antidiabetic drugs
  Insulin1213
  Metformin113
  DPP4-I010
  αGI16
  GLP-1RA04
  Glinide02
  SGLT2-I01

IQR: interquartile range; T1D: type 1 diabetes mellitus; T2D: type 2 diabetes mellitus; non-DM: non-diabetes mellitus; BMI: body mass index; DM: diabetes mellitus; HbA1c: hemoglobin A1c; αGI: α-glucosidase inhibitor, DPP4-I: dipeptidyl peptidase-4 inhibitor; GLP-1RA: glucagon-like peptide-1 receptor agonist; SGLT2: sodium-glucose cotransporter 2 inhibitor. avs. T1D; bvs. T2D.

IQR: interquartile range; T1D: type 1 diabetes mellitus; T2D: type 2 diabetes mellitus; non-DM: non-diabetes mellitus; BMI: body mass index; DM: diabetes mellitus; HbA1c: hemoglobin A1c; αGI: α-glucosidase inhibitor, DPP4-I: dipeptidyl peptidase-4 inhibitor; GLP-1RA: glucagon-like peptide-1 receptor agonist; SGLT2: sodium-glucose cotransporter 2 inhibitor. avs. T1D; bvs. T2D. Data mining provided the decision trees displayed in Figure 1. Figure 1a, which was applied the balance node (i.e. using T1D twice), shows the decision tree obtained by using the restriction enzyme BslI in men. Node 0 (at the left side of the tree) was divided into nodes 1 and 2 by using B366 with a cut-off value of 1.334. In node 1 (B366 ≤ 1.334), all nine subjects were from the non-DM group, while node 2 (B366 > 1.334) included eight patients with T1D, 14 patients with T2D, and 10 subjects from the non-DM group. After performing five similar steps, all of the subjects were classified and the decision tree was constructed. Six of the eight T1D patients were classified at node 8 (B366 > 1.334, B749 > 3.032, and B968 > 0.980), while 12 of 14 T2D patients were classified at node 9 (B366 > 1.334, B749 ≤ 3.032, B338 ≤ 1.613, and B494 ≤ 8.932). In the T1D group, six patients were classified at node 8 and two were classified at node 12. In the T2D group, 12 patients were classified at node 9 and two were classified at node 14. In the non-DM group, nine subjects were classified at node 1, three were classified at node 6, one was classified at node 10, and six were classified at node 13.
Figure 1

Decision trees created by the classification and regression tree (CART) method based on BslI digestion of 16SrRNA gene amplification products from fecal samples. Each operation taxonomic unit (OTU) is named by the combination of a restriction enzyme and restriction fragment length (bp). For example, the BslI 366 bp OTU is designated as B366. The cut-off value for each OTU was calculated from the OTU data of all subjects by the CART method using the Gini coefficient. Similar steps were repeated to construct the decision tree. Node 0 (at the left side of the decision tree) is the root node, namely the starting point for construction of the tree. The decision tree pathways indicate the species, quantities, and OTUs used to divide the different groups of subjects. (a) Decision tree for men. (b) Decision tree for women. non-DM: non-diabetes mellitus group; T1D: type 1 diabetes mellitus group; T2D: type 2 diabetes mellitus group.

Decision trees created by the classification and regression tree (CART) method based on BslI digestion of 16SrRNA gene amplification products from fecal samples. Each operation taxonomic unit (OTU) is named by the combination of a restriction enzyme and restriction fragment length (bp). For example, the BslI 366 bp OTU is designated as B366. The cut-off value for each OTU was calculated from the OTU data of all subjects by the CART method using the Gini coefficient. Similar steps were repeated to construct the decision tree. Node 0 (at the left side of the decision tree) is the root node, namely the starting point for construction of the tree. The decision tree pathways indicate the species, quantities, and OTUs used to divide the different groups of subjects. (a) Decision tree for men. (b) Decision tree for women. non-DM: non-diabetes mellitus group; T1D: type 1 diabetes mellitus group; T2D: type 2 diabetes mellitus group. Figure 1b, which was applied the balance node (i.e. using T2D twice), shows the results obtained in women using the restriction enzyme BslI. Node 0 was divided into nodes 1 and 2 by using B366 with a cut-off value of 5.342. Node 1 (B366 ≤ 5.342) contained two patients with T1D, eight patients with T2D, and 12 non-DM subjects, while node 2 (B366 > 5.342) contained six T1D patients. The decision tree for women only required four similar steps to classify all of the female subjects, so it was simpler than the decision tree for men. In the T2D group, all eight patients were classified at node 7 (B366 ≤ 5.342, B110 > 1.751, B853 ≤ 1.567, and B317 ≤ 2.612). In the T1D group, six patients were classified at node 2 and two patients were classified at node 10. In the non-DM group, 10 subjects were classified at node 3, one was classified at node 8, and one was classified at node 9. The results of decision tree analysis using four restriction enzymes are shown in Table 2. Among men, the error rate was 2.4% with MspI, while error rates were 0.0% with other restriction enzymes. Among women, the error rate was 0.0% with all restriction enzymes. The OTUs incorporated into the decision tree differed between men and women. In both men and women, node 0 was divided into nodes 1 and 2 by using B366. But the cut-off values were different between them (1.334 in men, 5.342 in women). Even after the second step, men and women showed different OTUs and the cut-off values (Fig. 1a, b). As shown in Table 2, the OTUs incorporated with other restriction enzymes were also totally different between men and women.
Table 2

Comparison of the Results Obtained With Each Restriction Enzyme

Restriction enzymeSexBalance nodeErrorError rate (%)Dt first stepDt second stepDt steps
AluIWomenT2D × 20/280A216A233, A584
MenT1D × 20/410A830A819, A1794
BslIWomenT2D × 20/280B366B110, -4
MenT1D × 20/410B366-, B7495
HaeIIIWomenT2D × 20/280H58H93, -5
MenT1D × 20/410H45H160, -5
MspIWomenT2D × 20/280M494M494, M4763
MenT1D × 21/412.44M45M81, -5

T2D: type 2 diabetes mellitus; T1D: type 1 diabetes mellitus; Dt: decision tree.

T2D: type 2 diabetes mellitus; T1D: type 1 diabetes mellitus; Dt: decision tree.

Discussion

In the present study, the combination of T-RFLP analysis and data mining was able to precisely classify the 16S rRNA gene amplification products obtained from fecal samples of the non-DM, T1D, and T2D groups. To our knowledge, no widely available data mining algorithm has previously been able to classify the type of diabetes by analysis of fecal samples. Our results suggest that non-DM persons, T1D patients, and T2D patients have specific gut microbiota patterns. It is interesting to note that T1D and T2D patients showed different patterns, even though both groups had metabolic disorders associated with hyperglycemia. Pathophysiology of T1D might not be homogeneous; therefore, we included acute onset T1D with GAD antibody and relatively recent onset (disease duration < 5 years) in this study. The reason why we could see clear difference between T1D and controls may be due to focusing on this “pure” population in T1D. A previous study of T1D suggested that human gut microbiota have a role in the onset of this autoimmune disorder in young children with a high genetic risk profile [15]. A striking result of that study was the increase of Bacteroidetes and decline of Firmicutes in the gut microbiome over time as autoimmunity developed. In the present study, most subjects with T1D were assigned to the node with a higher OTU of B366 which contains Bacteroidetes, indicating that our findings were consistent with the previous report [15]. It is possible that our method could become a bacterial marker for early diagnosis of T1D or for predicting its onset among persons with a high risk of developing T1D. Many studies have investigated associations between the gut microbiota and T2D, but few have found a causal relationship between gut microbiota and the onset of T2D. The changes of gut microbiota have been different in every study, suggesting that short-chain fatty acid (SCFA) or micro-inflammation might explain the association between gut microbiota and T2D [16, 17]. On the other hand, a decrease in SCFA is associated with intestinal barrier failure and translocation of lipopolysaccharide and gut bacteria into the circulation, leading to development of insulin resistance [18]. Changes of lipopolysaccharide and SCFA were not confirmed in the present study. However, the gut microbiota pattern which is characteristic of T2D may cause such changes. Data mining by the CART method achieves reproducibility. After it has been used to create a decision tree, other individuals can be classified by applying the same process [19]. For example, a man would be classified at node 8 of Figure 1a if the OTUs of his gut microbiota digested by BslI showed B366 > 1.334, B749 > 3.032, and B968 > 0.980. In the present study, most men with T1D were classified at node 8 using BslI, suggesting that this gut microbiota pattern is characteristic of T1D. Accordingly, a male individual classified at node 8 may be likely to have T1D or an increased risk of developing it. Accordingly, it is possible that this method could be used to support the diagnosis of diabetes. The main method currently employed for analysis of the gut microbiota is next generation sequencing. However, it has the weaknesses of ignoring unknown microbiota and high cost. We combined T-RFLP analysis with data mining in the present study, allowing us to recognize the overall pattern of gut microbiota and to evaluate the influence of unknown microbiota. In addition, the cost of our method is relatively low. This study had several limitations. First, the number of the subjects was small. Second, the subjects were all Japanese, so we were unable to assess the influence of ethnic differences. Therefore, further large-scale studies will be needed to confirm our results. In conclusion, we were able to classify the 16SrRNA gene amplification products obtained from fecal samples of T1D patients, T2D patients, and non-DM subjects with a high level of precision by combining T-RFLP analysis and data mining. Specific gut microbiota patterns were found for T1D and T2D patients, as well as a sex difference of the patterns.
  19 in total

1.  Terminal restriction fragment length polymorphism analysis for human fecal microbiota and its application for analysis of complex bifidobacterial communities.

Authors:  Mitsuo Sakamoto; Hidenori Hayashi; Yoshimi Benno
Journal:  Microbiol Immunol       Date:  2003       Impact factor: 1.955

2.  Application of new primer-enzyme combinations to terminal restriction fragment length polymorphism profiling of bacterial populations in human feces.

Authors:  Koji Nagashima; Takayoshi Hisada; Maremi Sato; Jun Mochizuki
Journal:  Appl Environ Microbiol       Date:  2003-02       Impact factor: 4.792

3.  Towards a natural system of organisms: proposal for the domains Archaea, Bacteria, and Eucarya.

Authors:  C R Woese; O Kandler; M L Wheelis
Journal:  Proc Natl Acad Sci U S A       Date:  1990-06       Impact factor: 11.205

Review 4.  A microbial world within us.

Authors:  Erwin G Zoetendal; Elaine E Vaughan; Willem M de Vos
Journal:  Mol Microbiol       Date:  2006-03       Impact factor: 3.501

Review 5.  Microbial influences in inflammatory bowel diseases.

Authors:  R Balfour Sartor
Journal:  Gastroenterology       Date:  2008-02       Impact factor: 22.682

Review 6.  Microbes in gastrointestinal health and disease.

Authors:  Andrew S Neish
Journal:  Gastroenterology       Date:  2008-11-19       Impact factor: 22.682

7.  Toward defining the autoimmune microbiome for type 1 diabetes.

Authors:  Adriana Giongo; Kelsey A Gano; David B Crabb; Nabanita Mukherjee; Luis L Novelo; George Casella; Jennifer C Drew; Jorma Ilonen; Mikael Knip; Heikki Hyöty; Riitta Veijola; Tuula Simell; Olli Simell; Josef Neu; Clive H Wasserfall; Desmond Schatz; Mark A Atkinson; Eric W Triplett
Journal:  ISME J       Date:  2010-07-08       Impact factor: 10.302

8.  Metagenomic analysis of the human distal gut microbiome.

Authors:  Steven R Gill; Mihai Pop; Robert T Deboy; Paul B Eckburg; Peter J Turnbaugh; Buck S Samuel; Jeffrey I Gordon; David A Relman; Claire M Fraser-Liggett; Karen E Nelson
Journal:  Science       Date:  2006-06-02       Impact factor: 47.728

9.  Sex differences in the gut microbiome drive hormone-dependent regulation of autoimmunity.

Authors:  Janet G M Markle; Daniel N Frank; Steven Mortin-Toth; Charles E Robertson; Leah M Feazel; Ulrike Rolle-Kampczyk; Martin von Bergen; Kathy D McCoy; Andrew J Macpherson; Jayne S Danska
Journal:  Science       Date:  2013-01-17       Impact factor: 47.728

Review 10.  Obesity, diabetes, and gut microbiota: the hygiene hypothesis expanded?

Authors:  Giovanni Musso; Roberto Gambino; Maurizio Cassader
Journal:  Diabetes Care       Date:  2010-10       Impact factor: 19.112

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.