Literature DB >> 28190657

Cross-sectional analysis of BioBank Japan clinical data: A large cohort of 200,000 patients with 47 common diseases.

Makoto Hirata1, Yoichiro Kamatani2, Akiko Nagai3, Yutaka Kiyohara4, Toshiharu Ninomiya5, Akiko Tamakoshi6, Zentaro Yamagata7, Michiaki Kubo8, Kaori Muto3, Taisei Mushiroda9, Yoshinori Murakami10, Koichiro Yuji11, Yoichi Furukawa12, Hitoshi Zembutsu13, Toshihiro Tanaka14, Yozo Ohnishi15, Yusuke Nakamura16, Koichi Matsuda17.   

Abstract

BACKGROUND: To implement personalized medicine, we established a large-scale patient cohort, BioBank Japan, in 2003. BioBank Japan contains DNA, serum, and clinical information derived from approximately 200,000 patients with 47 diseases. Serum and clinical information were collected annually until 2012.
METHODS: We analyzed clinical information of participants at enrollment, including age, sex, body mass index, hypertension, and smoking and drinking status, across 47 diseases, and compared the results with the Japanese database on Patient Survey and National Health and Nutrition Survey. We conducted multivariate logistic regression analysis, adjusting for sex and age, to assess the association between family history and disease development.
RESULTS: Distribution of age at enrollment reflected the typical age of disease onset. Analysis of the clinical information revealed strong associations between smoking and chronic obstructive pulmonary disease, drinking and esophageal cancer, high body mass index and metabolic disease, and hypertension and cardiovascular disease. Logistic regression analysis showed that individuals with a family history of keloid exhibited a higher odds ratio than those without a family history, highlighting the strong impact of host genetic factor(s) on disease onset.
CONCLUSIONS: Cross-sectional analysis of the clinical information of participants at enrollment revealed characteristics of the present cohort. Analysis of family history revealed the impact of host genetic factors on each disease. BioBank Japan, by publicly distributing DNA, serum, and clinical information, could be a fundamental infrastructure for the implementation of personalized medicine.
Copyright © 2017 The Authors. Production and hosting by Elsevier B.V. All rights reserved.

Entities:  

Keywords:  BioBank Japan Project; Biobank; Clinical information; Common disease; Family history

Mesh:

Year:  2017        PMID: 28190657      PMCID: PMC5363792          DOI: 10.1016/j.je.2016.12.003

Source DB:  PubMed          Journal:  J Epidemiol        ISSN: 0917-5040            Impact factor:   3.211


Introduction

BioBank Japan (BBJ) was established with the cooperation of 12 medical institutes, consisting of over 60 hospitals, as a leading project of the Ministry of Education, Culture, Sports, Science and Technology in 2003.1, 2 As a disease-oriented biobank, BBJ collected DNA and serum samples from approximately 200,000 patients with 47 diseases. BBJ annually updates clinical information, which is another essential element of biobanks. The clinical information associated with the biospecimens was utilized in previous studies to select or stratify the participant group. Samples and their clinical information were used for over 200 studies. However, so far, a comprehensive analysis of the clinical information of the BBJ cohorts has not been conducted. Here, we analyzed clinical information including age, sex, body mass index (BMI), hypertension, smoking, and drinking status across 47 diseases, and compared the results with the Japanese database. In addition, we assessed the association between target diseases and positive family history.

Materials and methods

Study design

In the present cohort, we focused on 47 common diseases (Table 1). Patients diagnosed with any one of the 47 diseases were recruited from 66 hospitals affiliated with 12 medical institutes between fiscal year of 2003 and 2007. The detailed protocol of the recruitment process has been described elsewhere. Written informed consent was obtained from all participants. The study protocol was reviewed and approved by the Ethics Committees of all participating institutions, including the Institute of Medical Science, the University of Tokyo, and the Center for Integrative Medical Sciences, RIKEN.
Table 1

Baseline characteristics of participants with 47 diseases in the present cohort.

47 DiseasesNumber of SubjectsMean (SD) age at registration (y)
% of male subjects% of male patients (Patient survey)
MaleFemale
Whole cohort199,98262.6614.6661.5516.0253.05N/A
Lung cancer377967.649.5466.079.8164.2550.51
Esophageal cancer129165.668.0665.5610.4486.2984.00
Gastric cancer632267.019.9065.1811.7773.3966.27
Colorectal cancer675967.109.9566.4210.8662.7655.54
Liver cancer192467.378.4769.978.1575.6868.18
Pancreatic cancer39266.029.8066.2111.0264.5450.85
Gallbladder/cholangiocarcinoma39267.719.2268.759.0562.5051.02
Prostate cancer506672.607.46N/A100.00100.00
Breast cancer633663.7411.2157.6711.980.731.33
Uterine cervical cancer1218N/A51.8313.330.000.00
Uterine corpus cancer1026N/A58.9310.650.000.00
Ovarian cancer888N/A56.3911.910.000.00
Hematological cancer130760.9915.0860.2616.6554.3253.97
Cerebral infarction16,53468.829.9071.6810.6062.2744.37
Cerebral aneurysm271060.5211.5162.8410.7835.24N/A
Epilepsy230346.5621.7543.3121.9857.2754.42
Bronchial asthma870051.8923.1153.5420.6849.3251.51
Pulmonary tuberculosis86362.1416.8262.4319.3471.3864.10
Chronic obstructive pulmonary disease277472.338.5772.719.8286.8168.28
Interstitial lung disease/pulmonary fibrosis80868.7411.4168.1111.9758.0455.32
Myocardial infarction13,27265.9210.3771.199.9080.9864.32
Unstable angina433066.769.7171.269.1573.7055.20
Stable angina14,80767.869.8171.059.7169.3955.20
Arrhythmia15,91267.0311.6769.2712.5264.3852.24
Heart failure761066.0112.6371.4612.7261.8138.18
Peripheral arterial diseases268370.849.0271.709.9778.1261.97
Chronic hepatitis B134654.5713.2155.6214.9762.6362.50
Chronic hepatitis C581963.3711.8464.6411.9253.7052.92
Liver cirrhosis251962.7411.5065.3814.1762.2949.52
Nephrotic syndrome105647.4522.8848.2321.7460.3258.06
Urolithiasis630753.0213.7256.9014.4275.6067.42
Osteoporosis674372.2812.8973.779.577.597.23
Diabetes mellitus39,69763.3111.3365.8012.0063.2352.74
Dyslipidemia43,81262.1511.9766.2610.7950.7633.55
Graves' disease232349.8614.2349.0415.7527.8522.22
Rheumatoid arthritis413964.0512.1262.3912.2920.2518.87
Hay fever565846.3917.6344.9415.8442.9346.74
Drug eruption58560.5316.1754.8217.4645.81N/A
Atopic dermatitis293829.9814.8529.7413.5453.1351.61
Keloid80948.5319.9743.3119.6038.94N/A
Uterine fibroid5904N/A44.699.490.000.00
Endometriosis1843N/A38.938.220.000.00
Febrile seizure3334.163.574.355.0960.96N/A
Glaucoma475566.8712.4370.0310.9546.7941.98
Cataract20,00270.4310.3172.919.5244.8136.83
Periodontitis389858.2015.9256.5916.0043.6941.03
Amyotrophic lateral sclerosis78260.8610.2161.0310.7664.32N/A
We included patients who had been diagnosed with the diseases by physicians at the cooperating hospitals (eTable 1). As this project registered not only patients with newly developed diseases but also patients who were diagnosed and treated before starting the project, some participants were enrolled several years after disease onset or diagnosis. We excluded patients who had received a bone marrow transplant and those who were not of East Asian descent.

Clinical information

Clinical information including common clinical variables, disease-specific variables, prescriptions, and drug side-effect information, was collected from each participant. The detailed methods of the collection of clinical information has been described elsewhere. The clinical database was updated every year until 2012. After a thorough review and data-cleansing of clinical variables, clinical information of 199,982 participants with 47 diseases at enrollment was established on March 31 2015 and used in the current study.

Japanese database

The Ministry of Health, Labour and Welfare in Japan conducts a Patient Survey every three years and a National Health and Nutrition Survey every year. We obtained the results of the Patient Survey of 2005 and those of the National Health and Nutrition Survey of 2006. Table 65 in the Patient Survey was used to estimate Japanese patient numbers, stratified by sex and age for each disease. Distributions of BMI categories, hypertension prevalence, smoking history, and alcohol intake history in the general Japanese population were calculated from Tables 23, 49-2, 97, and 91 of the National Health and Nutrition Survey, respectively.

Analysis of clinical information

The distributions of BMI, hypertension prevalence, smoking history, and alcohol intake history in the BBJ cohort were adjusted for sex and age group for each table in the national public survey when we compared the distributions among the 47 diseases and Japanese database. BMI category and hypertension were defined according to World Health Organization (WHO) criteria as follows: BMI < 18.5 was defined as underweight, 18.5 ≤ BMI < 25 as normal, 25 ≤ BMI < 30 as overweight, and 30 ≤ BMI as obese; hypertension was defined as systolic blood pressure ≥140 mmHg and diastolic blood pressure ≥90 mmHg or when participants were prescribed antihypertensive drugs. Multivariate logistic regression analyses were performed to assess the association between each target disease and positive family history associated with the target disease, adjusted for sex and age. SAS 9.4 software was used for the data analysis. A p-value of <0.05 was considered statistically significant.

Results

Basic characteristics (Age and sex)

We characterized the BioBank Japan cohort at enrollment by analyzing common clinical variables of age and sex across the target diseases. Mean age at enrollment, across the entire cohort or for each disease, was comparable between both sexes, but varied among the diseases (Table 1). The highest mean age was observed in men with prostate cancer and in women with osteoporosis (72.60 and 73.77 years, respectively), while the youngest mean age was observed in men and women with febrile seizures (4.16 and 4.35 years, respectively), reflecting the typical age of onset of each disease. A greater number of men were registered in the BBJ cohort compared to women (53.05% vs. 46.95%), while sex ratios varied according to the diseases (Table 1). To highlight sex and age characteristics of the BBJ cohort, we further compared the sex and age distribution for each disease with the Patient Survey. We included participants with 42 out of the 47 diseases for the comparison, as we obtained the relevant clinical data from the Patient Survey (eTable 2). Almost all diseases displayed equivalent age distributions, while lower proportions of participants <20 years of age were observed in three diseases (bronchial asthma, atopic dermatitis and hay fever), which are likely to occur in younger populations (eFigs. 1.1–1.5 and eTable 3). The proportion of male participants with dyslipidemia was considerably higher in the BBJ cohort (50.76%) than in the Patient Survey (33.55%), although both age distributions appeared equivalent. The low proportion of female patients with heart failure aged ≥80 years resulted in a lower proportion of female participants in the BBJ cohort. We also observed a low proportion of elderly female participants with cerebral infarction, chronic obstructive pulmonary disease (COPD), peripheral arterial diseases (PAD), unstable angina, stable angina, and myocardial infarction in the BBJ cohort. Varied distributions between the BBJ cohort and Patient Survey were observed in pulmonary tuberculosis and nephrotic syndrome.

Basic characteristics (Lifestyle and physical status)

We also evaluated life style including smoking and alcohol intake history, and physical status including BMI and blood pressure, at enrollment in the BBJ cohort. We included participants ≥20 years of age in this analysis because the frequency of smoking, alcohol intake, and hypertension among individuals under 20 years of age is quite low, and the criteria for underweight or obesity according to BMI in children and teenagers are different from those applied to adults. Furthermore, we compared the BBJ cohort and the National Health and Nutrition Survey 2006 for physical and life style, after adjusting for sex and age, because sex- and age-distribution varied among diseases. Smoking history at enrollment (including subjects both with and without information on current smoking status) was positive in 74.98% of male subjects and 21.24% of female subjects in the BBJ cohort, while current smokers accounted for 27.78% of male subjects and 10.45% of female subjects (Table 2). The highest frequency of positive smoking history in both sexes was observed in COPD, followed by PAD in male subjects, and esophageal cancer in female subjects (Table 2). The highest proportion of ex-smokers for both sexes was observed in participants with lung cancer, esophageal cancer and COPD (71.45%, 64.88% and 64.68% in male subjects, and 21.75%, 30.86% and 39.44% in female subjects, respectively), while the highest proportion of current smokers for both sexes was observed in participants with Graves' disease (49.84% in male subjects and 24.92% in female subjects) (Table 2). We then compared age-adjusted smoking history among the 47 diseases. The frequency of smokers was highest among participants with COPD, esophageal cancer, interstitial lung diseases/pulmonary fibrosis, pancreatic cancer, and cardiovascular diseases, in which smoking was shown to be a critical risk factor (Fig. 1 and eTable 4).
Table 2

Baseline smoking status of participants with 47 diseases in the present cohort.

47 DiseasesSmoking status
Male subjects
Female subjects
Never smokerEx-smokerCurrent smokerSmoker with unknown statusNever smokerEx-smokerCurrent smokerSmoker with unknown status
Whole cohort25.0243.7527.783.4578.769.3710.451.42
Lung cancer11.7371.4514.052.7873.3921.753.890.97
Esophageal cancer10.9264.8818.745.4656.5730.869.143.43
Gastric cancer18.5355.0122.723.7476.1615.337.361.15
Colorectal cancer22.8749.8123.783.5579.5412.146.941.38
Liver cancer20.3945.1326.847.6477.2910.489.173.06
Pancreatic cancer15.6055.6028.000.8066.9116.5514.392.16
Gallbladder/cholangiocarcinoma26.3448.1521.404.1280.6910.346.902.07
Prostate cancer31.8846.5417.194.39N/A
Breast cancer42.2242.2215.560.0078.2413.047.631.08
Uterine cervical cancerN/A63.6014.3619.092.96
Uterine corpus cancerN/A80.809.358.161.69
Ovarian cancerN/A80.099.558.851.51
Hematologic cancer27.9945.6020.206.2081.4110.156.022.41
Cerebral infarction24.8647.6924.363.1082.978.966.771.30
Cerebral aneurysm19.0648.9927.804.1569.6815.6912.592.04
Epilepsy36.1228.2131.454.2276.197.5214.791.50
Bronchial asthma26.4440.0830.962.5365.9013.4819.241.39
Pulmonary tuberculosis19.3150.1729.041.4978.8410.798.302.07
Chronic obstructive pulmonary disease7.3164.6825.812.2035.7739.4422.542.25
Interstitial lung disease/pulmonary fibrosis13.0762.9620.043.9272.5918.078.730.60
Myocardial infarction17.9656.9821.563.5070.3617.6210.561.45
Unstable angina21.6055.6120.262.5276.5913.858.580.98
Stable angina21.7054.3621.951.9979.5611.648.070.74
Arrhythmia25.6150.0121.422.9583.209.586.500.72
Heart failure23.6849.6323.053.6379.3212.067.531.09
Peripheral arterial disease10.3952.6631.745.2264.6720.9112.651.76
Chronic hepatitis B28.6931.1134.875.3377.308.7912.471.43
Chronic hepatitis C22.3238.4034.494.7974.759.2113.712.33
Liver cirrhosis19.7434.0540.655.5673.9410.8013.252.00
Nephrotic syndrome26.9739.8930.522.6269.2514.4012.743.60
Urolithiasis29.1728.6838.463.6976.985.4915.611.92
Osteoporosis32.6038.8325.153.4287.795.275.990.95
Diabetes mellitus23.0841.2432.453.2478.499.2611.111.15
Dyslipidemia24.4243.8327.524.2381.618.268.801.33
Graves' disease20.2026.0649.843.9160.6812.9024.921.50
Rheumatoid arthritis17.3041.8435.465.4078.098.5311.411.97
Hay fever42.4028.3024.434.8877.018.9112.181.89
Drug eruption25.0042.5829.692.7372.379.5416.451.64
Atopic dermatitis48.1213.7536.531.6070.647.0620.911.39
Keloid38.0835.4325.171.3272.697.9618.061.29
Uterine fibroidN/A73.669.3414.682.31
EndometriosisN/A70.968.8616.673.52
Febrile seizure100.000.000.000.00100.000.000.000.00
Glaucoma30.3142.4924.103.1087.765.984.871.39
Cataract28.6943.6124.223.4786.666.235.951.17
Periodontitis35.5527.7636.060.6378.786.1014.630.49
Amyotrophic lateral sclerosis37.1017.9741.943.0084.672.3013.030.00
Fig. 1

Age-adjusted ratios of participants with a smoking history for each disease. The distributions of male (A) and female (B) participants with a smoking history in the BBJ cohort and in the National Health and Nutrition Survey (Japan, 2006) were compared. Age-adjustment was performed according to the age distribution of the National Health and Nutrition Survey (Japan, 2006).

A positive alcohol history at enrollment (including those with and without current drinking status) was found in 69.68% of male subjects and 28.20% of female subjects (Table 3). The proportion of current drinkers in the whole cohort was much higher than that of ex-drinkers in both sexes: 52.24% and 13.35% of male subjects and 21.70% and 3.99% of female subjects were current and ex-drinkers, respectively. Among the 47 diseases, the proportion of ex-drinkers was relatively high among participants with liver cirrhosis (34.05% in male subjects and 10.80% in female subjects), liver cancer (34.83% and 10.94%), pulmonary tuberculosis (33.17% and 11.20%), esophageal cancer (25.59% and 16%), and pancreatic cancer (29.72% and 10.14%) (Table 3). Age-adjusted alcohol intake history showed that the frequency of drinkers in esophageal cancer was remarkably higher than that in other diseases for male and female subjects (Fig. 2 and eTable 5). To highlight the smoking and drinking status in the BBJ cohort, the frequency of smokers or drinkers, stratified by sex and age group, was compared between the BBJ and the National Health and Nutrition Survey. The BBJ cohort had a higher frequency of smokers among female subjects across all age groups and among elderly male subjects, particularly among those >60 years of age; the frequency of drinkers was almost equivalent between the BBJ and the National Health and Nutrition Survey for both sexes and across all age groups (eFig. 2A and B and eTables 6 and 7).
Table 3

Baseline alcohol intake status of participants with 47 diseases in the present cohort.

47 DiseasesAlcohol intake
Male subjects
Female subjects
Never drinkerEx-drinkerCurrent drinkerDrinker with unknown statusNever drinkerEx-drinkerCurrent drinkerDrinker with unknown status
Whole cohort30.3213.3552.244.0971.803.9921.702.52
Lung cancer26.7315.8554.213.2169.946.3721.811.87
Esophageal cancer8.2925.5961.384.7447.4316.0032.004.57
Gastric cancer25.1918.9651.494.3673.127.5717.621.69
Colorectal cancer23.5315.8856.154.4573.115.8318.992.08
Liver cancer23.9034.8333.018.2776.5910.949.413.06
Pancreatic cancer25.7029.7242.971.6165.2210.1423.910.72
Gallbladder/cholangiocarcinoma30.5827.2738.433.7284.832.0713.100.00
Prostate cancer29.3313.0551.476.16N/A
Breast cancer28.8911.1160.000.0063.675.2129.022.10
Uterine cervical cancerN/A58.905.5730.804.73
Uterine corpus cancerN/A71.023.5922.512.89
Ovarian cancerN/A68.913.8324.133.13
Hematologic cancer29.0512.2850.727.9572.765.6918.103.45
Cerebral infarction28.5118.3349.383.7879.644.9713.671.73
Cerebral aneurysm23.7215.8155.345.1366.866.6924.052.40
Epilepsy37.3516.1142.034.5066.544.6725.882.90
Bronchial asthma34.169.4453.532.8769.263.8325.291.62
Pulmonary tuberculosis30.6933.1734.321.8274.6911.2012.032.07
Chronic obstructive pulmonary disease38.4817.8642.011.6674.506.8017.001.70
Interstitial lung disease32.0217.3246.713.9576.744.5316.622.11
Myocardial infarction41.6813.7041.293.3279.485.4213.481.62
Unstable angina39.2713.5944.103.0478.804.5614.761.88
Stable angina34.0413.7449.552.6778.714.6815.271.35
Arrhythmia26.3714.1756.053.4275.694.3318.601.37
Heart failure32.3117.0846.344.2779.825.7813.301.09
PAD31.0119.9443.805.2477.468.1011.622.82
Chronic hepatitis B27.1818.0847.457.2868.435.7024.031.83
Chronic hepatitis C30.6126.6037.505.2972.358.9015.513.25
Liver cirrhosis19.2335.2038.926.6570.8611.3514.243.56
Nephrotic syndrome39.7014.4241.953.9367.596.3723.822.22
Urolithiasis32.495.4456.805.2772.962.7222.002.32
Osteoporosis39.6314.4342.893.0582.072.4713.701.76
Diabetes mellitus31.4715.1749.883.4979.105.2914.021.59
Dyslipidemia30.7810.7154.084.4476.033.4018.522.06
Graves' disease38.248.8248.374.5866.735.0225.932.32
Rheumatoid arthritis35.4713.7944.955.7975.464.1117.892.54
Hay fever31.184.3158.226.2959.312.8032.505.40
Drug eruption30.9815.2949.024.7171.572.6821.744.01
Atopic dermatitis45.443.6847.043.8457.372.4436.623.57
Keloid39.337.3350.672.6763.711.5133.691.08
Uterine fibroidN/A53.522.2338.575.67
EndometriosisN/A55.512.2634.567.66
Febrile seizure100.000.000.000.0066.670.0033.330.00
Glaucoma27.4212.4656.143.9877.612.9817.162.25
Cataract31.0514.4250.583.9581.002.9514.491.57
Periodontitis29.827.2061.531.4563.542.7532.141.57
Amyotrophic lateral sclerosis28.570.0057.1414.29100.000.000.000.00
Fig. 2

Age-adjusted ratio of participants with alcohol history in each disease. The distributions of male (A) and female (B) participants with a drinking history in the BBJ cohort and in the National Health and Nutrition Survey (Japan, 2006) were compared. Age-adjustment was performed according to the age distribution of the National Health and Nutrition Survey (Japan, 2006).

Mean BMI at enrollment in the BBJ cohort was 23.51 in male subjects and 22.94 in female subjects. Analysis of BMI in each disease revealed that underweight participants (BMI<18.5) had an increased association of various cancers, while overweight or obese participants (BMI ≥ 25) had an increased association of metabolic and cardiovascular diseases (Table 4, Fig. 3 and eTable 8). When comparing the National Health and Nutrition Survey and the BBJ, there was a greater proportion of participants with overweight or obesity in the BBJ, among male and female subjects and across all age-groups; conversely, similar distribution patterns were found when comparing the BBJ cohort and the Survey, by sex and age-group (eFig. 2C and eTables 6 and 7). In contrast, in the BBJ cohort, there were fewer underweight participants in their twenties (for both sexes) but more underweight participants >60 years (among male subjects) and >50 years (among female subjects) (eFig. 2D and eTables 6 and 7).
Table 4

Baseline BMI and hypertension of participants with 47 diseases in the present cohort.

47 DiseasesBMI
%Hypertension
Male subjects
Female subjects
Male subjectsFemale subjects
Mean(SD)Mean(SD)
Whole cohort23.513.4722.943.8951.5241.11
Lung cancer22.293.0522.053.3736.7433.83
Esophageal cancer20.532.9619.773.2527.2924.86
Gastric cancer21.253.0420.343.2630.9124.26
Colorectal cancer22.663.1722.003.5138.0030.31
Liver cancer22.683.2722.823.9644.6445.51
Pancreatic cancer20.443.1919.903.0330.8329.50
Gallbladder/cholangiocarcinoma21.463.2922.203.8933.4731.29
Prostate cancer23.282.86N/A38.00N/A
Breast cancer23.873.7522.743.6052.1722.82
Cervical cancerN/A21.933.29N/A19.23
Uterine cancerN/A23.744.37N/A25.83
Ovarian cancerN/A22.043.38N/A19.21
Hematopoietic tumor23.113.2321.873.3330.9426.71
Cerebral infarction23.533.1923.393.8667.1265.34
Cerebral aneurysm23.863.3623.113.6465.4559.52
Epilepsy23.473.8422.704.1936.3625.64
Bronchial asthma23.793.7123.784.5541.1935.14
Pulmonary tuberculosis20.823.2820.263.2632.3137.80
Chronic obstructive pulmonary disease21.303.3720.334.0846.0544.66
Interstitial lung disease/pulmonary fibrosis23.023.2122.623.7542.3740.65
Myocardial infarction24.043.2323.403.7473.1177.01
Unstable angina24.023.2123.743.7474.3374.80
Stable angina23.853.1623.633.6677.1777.45
Arrhythmia23.533.3022.903.8066.7565.33
Heart failure23.503.8922.644.3178.7078.10
Peripheral arterial diseases22.523.2522.443.8070.2169.17
Chronic hepatitis B23.323.1122.553.5140.6733.60
Chronic hepatitis C22.863.1522.543.6546.1940.71
Liver cirrhosis22.883.5123.034.0552.1849.41
Nephrotic syndrome23.003.3422.503.9462.3250.68
Urolithiasis24.433.3923.594.1637.2735.61
Osteoporosis21.963.5222.263.6249.2244.46
Diabetes mellitus24.033.7224.574.4160.3262.82
Dyslipidemia24.783.4524.113.8864.4759.48
Graves' disease23.553.6322.353.6141.0332.00
Rheumatoid arthritis22.493.2921.853.6940.5733.56
Hay fever23.663.1722.063.5124.7014.77
Drug eruption23.273.3922.624.0749.8130.87
Atopic dermatitis23.013.5121.483.7013.245.75
Keloid23.953.2922.714.1127.6018.82
Uterine fibroidN/A22.293.51N/A14.09
EndometriosisN/A21.433.25N/A7.93
Febrile seizure28.730.0021.264.80N/AN/A
Glaucoma23.053.1822.903.6643.8741.08
Cataract23.083.1523.053.8549.5345.68
Periodontitis23.273.1422.253.3727.3315.90
Amyotrophic lateral sclerosis21.141.6828.003.14N/AN/A
Fig. 3

Age-adjusted ratio of participants with overweight or underweight in each disease. The distributions of obese or underweight participants among male (A) and female (B) subjects in the BBJ cohort and in the National Health and Nutrition Survey (Japan, 2006) were compared. Age-adjustment was performed according to the age distribution of the National Health and Nutrition Survey (Japan, 2006). BMI ≥25 was defined as overweight and BMI less than 18.5 was defined as underweight.

Nearly half of the participants of the BBJ cohort had hypertension (51.52% of male subjects and 41.11% of female subjects, Table 4) at enrollment. The frequency of hypertension in cardiovascular diseases, particularly in coronary diseases, was higher than that in other diseases, while the frequency of hypertension among cancer participants tended to be low (Table 4, Fig. 4 and eTable 9). The frequency of hypertension increased with age, similarly to the increase observed in the Survey. However, the frequency of hypertension among subjects <50 years of age was higher and subjects >60 years of age was lower in the BBJ cohort than in the Survey (eFig. 2E and eTables 6 and 7).
Fig. 4

Age-adjusted ratio of participants with hypertension in each disease. The distributions of male (A) and female (B) participants with hypertension in the BBJ cohort and in the National Health and Nutrition Survey (Japan, 2006) were compared. Age-adjustment was performed according to the age distribution of the National Health and Nutrition Survey (Japan, 2006). Participants with a systolic blood pressure ≥135-mmHg, a diastolic blood pressure ≥90-mmHg, or participants prescribed antihypertensive medication, were diagnosed with hypertension.

Family history

Finally, we performed multivariate logistic-regression analysis using age and sex status as covariates to assess the association between positive family history and disease risk. We were able to obtain the questionnaire-based information regarding family history of 45 diseases out of the 47 diseases (eTable 10). For all the diseases, except for PAD, there was a significant association with a positive family history, with an odds ratio of >1.7 (Fig. 5 and eTable 11). Notably, the odds ratios for keloid, chronic hepatitis B, and Grave's disease were relatively high (149.417, 53.474, and 23.751, respectively) indicating the strong impact of genetic and familial factors on disease onset.
Fig. 5

Sex- and age-adjusted odds ratios in family history, related with the 47 diseases. Dots represent odds ratios and bars represent 95% CIs by logistic regression analysis. The list of family histories, associated with the 47 diseases, is set out in eTable 2.

Discussion

We analyzed common clinical variables at enrollment, across the whole BBJ cohort, as well as for each target disease, and we compared these results with those of the Japanese database to highlight the characteristics of the BBJ cohort. Statistical analyses were not conducted in this study, as the large-scale cohort sample in the BBJ would yield relatively low-p-values, even when absolute differences were very small. The distribution of age, life style, and physical status, showed that the characteristics of each disease group could generally be explained. It is an established fact that smoking and/or alcohol intake are risk factors for various diseases including cancer, cardiovascular disease, hepatic disease, and respiratory disease.9, 10 In fact, these diseases showed a higher frequency of participants with a positive smoking or drinking history at enrollment in the BBJ cohort (Fig. 1, Fig. 2 and eTables 4 and 5). Although we cannot estimate the odds ratios of smoking and drinking status due to the lack of control data in the present cohort, age-adjusted distributions of the smoking and drinking histories of participants suggest that these lifestyle factors have a significant impact on disease onset. Analysis of BMI at enrollment indicated that lower BMI was more prevalent among participants with malignant tumors, while higher BMI was common among participants with metabolic and cardiovascular disease (Fig. 3 and eTable 8). Obesity could be a risk factor for dyslipidemia, type 2 diabetes, coronary disease, while cancer can induce weight loss. Therefore, we need to be cautious in the interpretation of the association between diseases and lifestyle or physical factors. To highlight the characteristics of the BBJ cohort, we compared the age and sex distributions of the BBJ cohort with those of the Patient Survey for each disease, and the distributions of smoking and drinking history, BMI and hypertension in the BBJ cohort with those of the National Health and Nutrition Survey. It is difficult to discuss the discrepancy or consistency between the BBJ cohort and the Japanese database, because backgrounds of the subjects and methods to determine the numbers of patients or the distributions of life style and physical status were different. However, the comparisons between the BBJ cohort and the Japanese database gave us better insight about the characteristics of the BBJ cohort, contributing to utmost utilization of the biobank samples. As one of our main aims was to identify genetic factors causing susceptibility to diseases, we analyzed the association between positive family history and disease onset to evaluate the impact of host genetic factors. It has been reported that a positive family history is an important risk factors for many common chronic diseases,12, 13, 14, 15, 16, 17, 18, 19 and keloid, chronic hepatitis B, and Graves' disease showed the highest odds ratios for a positive family history (Fig. 5). While it is important to consider the possibility that perinatal transmission, a major route of hepatitis B virus transmission, resulted in the high odds ratio observed in chronic hepatitis B, several genome-wide association studies (GWAS), which identified some single nucleotide polymorphism loci significantly associated with these diseases in Japan,21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32 support the finding that genetic factors are associated with these diseases. However, the odds ratios, calculated in the previous genomic studies, were not as high as in the present analysis, suggesting the possibility that further genomic analysis could identify novel genomic loci. In addition, the fact that common clinical variables were consistently identified across the 47 diseases enabled us to evaluate and compare the risk significance of the positive family history on the diseases and to perform further genomic or other “omics” analyses based on these results. This study has some limitations. We could not eliminate the possibility of reporting bias, causing significantly higher odds ratio of positive family history in almost all target diseases, as the information on family history was mainly based on participants' interviews, although this was completed by certified medical coordinators. Another limitation of this analysis is that the reference population for each logistic analysis was not the disease-free general population but the participants with the other diseases in the cohort. Therefore, again, we need to take into account selection bias. In conclusion, we have established a large biobank cohort, consisting of approximately 200,000 patients with 47 diseases. Analysis of the clinical dataset and comparisons between the present cohort and the Japanese database largely revealed consistent trends in common clinical variables, particularly among participants aged ≥40 years, suggesting that the sampling is representative for the general patient population in Japan. Further analysis, combined with various high-throughput ‘omics’ technologies, using their DNA and serum samples, will aid us to identify novel genomic variants or biomarkers associated with disease progression or drug efficacy, contributing to the implementation of personalized medicine.

Conflicts of interest

None declared.
  25 in total

1.  Association of a C/T single-nucleotide polymorphism in the 5' untranslated region of the CD40 gene with Graves' disease in Japanese.

Authors:  Yoshiyuki Ban; Teruaki Tozaki; Matsuo Taniyama; Motowo Tomita; Yoshio Ban
Journal:  Thyroid       Date:  2006-05       Impact factor: 6.568

Review 2.  The BioBank Japan Project.

Authors:  Yusuke Nakamura
Journal:  Clin Adv Hematol Oncol       Date:  2007-09

3.  The replication of the association of the rs9355610 within 6p27 with Graves' disease.

Authors:  Yoshiyuki Ban; Teruaki Tozaki; Matsuo Taniyama
Journal:  Autoimmunity       Date:  2013-09       Impact factor: 2.815

4.  Family history is a coronary heart disease risk factor in the Second Northwick Park Heart Study.

Authors:  E Hawe; P J Talmud; G J Miller; S E Humphries
Journal:  Ann Hum Genet       Date:  2003-03       Impact factor: 1.670

5.  A C/T polymorphism in the 5' untranslated region of the CD40 gene is associated with later onset of Graves' disease in Japanese.

Authors:  Tokunori Mukai; Yuji Hiromatsu; Tomoka Fukutani; Michiko Ichimura; Hiroo Kaku; Ikuyo Miyake; Kentaro Yamada
Journal:  Endocr J       Date:  2005-08       Impact factor: 2.349

6.  Multiple SNPs in intron 7 of thyrotropin receptor are associated with Graves' disease.

Authors:  Hitomi Hiratani; Donald W Bowden; Satoshi Ikegami; Senji Shirasawa; Akira Shimizu; Yoshinori Iwatani; Takashi Akamizu
Journal:  J Clin Endocrinol Metab       Date:  2005-03-01       Impact factor: 5.958

7.  Biobanks: transnational, European and global networks.

Authors:  Martin Asslaber; Kurt Zatloukal
Journal:  Brief Funct Genomic Proteomic       Date:  2007-10-04

Review 8.  Family history questionnaires designed for clinical use: a systematic review.

Authors:  G T Reid; F M Walter; J M Brisbane; J D Emery
Journal:  Public Health Genomics       Date:  2008-10-02       Impact factor: 2.000

9.  Association of the T-cell regulatory gene CTLA4 with Graves' disease and autoimmune thyroid disease in the Japanese.

Authors:  Koichi Furugaki; Senji Shirasawa; Naofumi Ishikawa; Kunihiko Ito; Koichi Ito; Sumihisa Kubota; Kanji Kuma; Hajime Tamai; Takashi Akamizu; Hitomi Hiratani; Masao Tanaka; Takehiko Sasazuki
Journal:  J Hum Genet       Date:  2004-02-20       Impact factor: 3.172

Review 10.  Overview of the BioBank Japan Project: Study design and profile.

Authors:  Akiko Nagai; Makoto Hirata; Yoichiro Kamatani; Kaori Muto; Koichi Matsuda; Yutaka Kiyohara; Toshiharu Ninomiya; Akiko Tamakoshi; Zentaro Yamagata; Taisei Mushiroda; Yoshinori Murakami; Koichiro Yuji; Yoichi Furukawa; Hitoshi Zembutsu; Toshihiro Tanaka; Yozo Ohnishi; Yusuke Nakamura; Michiaki Kubo
Journal:  J Epidemiol       Date:  2017-02-08       Impact factor: 3.211

View more
  57 in total

1.  Integration of genetics and miRNA-target gene network identified disease biology implicated in tissue specificity.

Authors:  Saori Sakaue; Jun Hirata; Yuichi Maeda; Eiryo Kawakami; Takuro Nii; Toshihiro Kishikawa; Kazuyoshi Ishigaki; Chikashi Terao; Ken Suzuki; Masato Akiyama; Naomasa Suita; Tatsuo Masuda; Kotaro Ogawa; Kenichi Yamamoto; Yukihiko Saeki; Masato Matsushita; Maiko Yoshimura; Hidetoshi Matsuoka; Katsunori Ikari; Atsuo Taniguchi; Hisashi Yamanaka; Hideya Kawaji; Timo Lassmann; Masayoshi Itoh; Hiroyuki Yoshitomi; Hiromu Ito; Koichiro Ohmura; Alistair R R Forrest; Yoshihide Hayashizaki; Piero Carninci; Atsushi Kumanogoh; Yoichiro Kamatani; Michiel de Hoon; Kazuhiko Yamamoto; Yukinori Okada
Journal:  Nucleic Acids Res       Date:  2018-12-14       Impact factor: 16.971

2.  Genetics: An integrated genetic analysis of disease.

Authors:  David L Mattson
Journal:  Nat Rev Nephrol       Date:  2018-04-09       Impact factor: 28.314

3.  Novel Risk Loci Identified in a Genome-Wide Association Study of Urolithiasis in a Japanese Population.

Authors:  Chizu Tanikawa; Yoichiro Kamatani; Chikashi Terao; Masayuki Usami; Atsushi Takahashi; Yukihide Momozawa; Kichiya Suzuki; Soichi Ogishima; Atsushi Shimizu; Mamoru Satoh; Keitaro Matsuo; Haruo Mikami; Mariko Naito; Kenji Wakai; Taiki Yamaji; Norie Sawada; Motoki Iwasaki; Shoichiro Tsugane; Kenjiro Kohri; Alan S L Yu; Takahiro Yasui; Yoshinori Murakami; Michiaki Kubo; Koichi Matsuda
Journal:  J Am Soc Nephrol       Date:  2019-04-11       Impact factor: 10.121

4.  Trans-biobank analysis with 676,000 individuals elucidates the association of polygenic risk scores of complex traits with human lifespan.

Authors:  Saori Sakaue; Masahiro Kanai; Juha Karjalainen; Masato Akiyama; Mitja Kurki; Nana Matoba; Atsushi Takahashi; Makoto Hirata; Michiaki Kubo; Koichi Matsuda; Yoshinori Murakami; Mark J Daly; Yoichiro Kamatani; Yukinori Okada
Journal:  Nat Med       Date:  2020-03-23       Impact factor: 53.440

Review 5.  Electronic health records: the next wave of complex disease genetics.

Authors:  Brooke N Wolford; Cristen J Willer; Ida Surakka
Journal:  Hum Mol Genet       Date:  2018-05-01       Impact factor: 6.150

6.  Genome-wide association study identifies 112 new loci for body mass index in the Japanese population.

Authors:  Masato Akiyama; Yukinori Okada; Masahiro Kanai; Atsushi Takahashi; Yukihide Momozawa; Masashi Ikeda; Nakao Iwata; Shiro Ikegawa; Makoto Hirata; Koichi Matsuda; Motoki Iwasaki; Taiki Yamaji; Norie Sawada; Tsuyoshi Hachiya; Kozo Tanno; Atsushi Shimizu; Atsushi Hozawa; Naoko Minegishi; Shoichiro Tsugane; Masayuki Yamamoto; Michiaki Kubo; Yoichiro Kamatani
Journal:  Nat Genet       Date:  2017-09-11       Impact factor: 38.330

7.  Genome-wide association study identifies seven novel susceptibility loci for primary open-angle glaucoma.

Authors:  Yukihiro Shiga; Masato Akiyama; Koji M Nishiguchi; Kota Sato; Nobuhiro Shimozawa; Atsushi Takahashi; Yukihide Momozawa; Makoto Hirata; Koichi Matsuda; Taiki Yamaji; Motoki Iwasaki; Shoichiro Tsugane; Isao Oze; Haruo Mikami; Mariko Naito; Kenji Wakai; Munemitsu Yoshikawa; Masahiro Miyake; Kenji Yamashiro; Kenji Kashiwagi; Takeshi Iwata; Fumihiko Mabuchi; Mitsuko Takamoto; Mineo Ozaki; Kazuhide Kawase; Makoto Aihara; Makoto Araie; Tetsuya Yamamoto; Yoshiaki Kiuchi; Makoto Nakamura; Yasuhiro Ikeda; Koh-Hei Sonoda; Tatsuro Ishibashi; Koji Nitta; Aiko Iwase; Shiroaki Shirato; Yoshitaka Oka; Mamoru Satoh; Makoto Sasaki; Nobuo Fuse; Yoichi Suzuki; Ching-Yu Cheng; Chiea Chuen Khor; Mani Baskaran; Shamira Perera; Tin Aung; Eranga N Vithana; Jessica N Cooke Bailey; Jae H Kang; Louis R Pasquale; Jonathan L Haines; Janey L Wiggs; Kathryn P Burdon; Puya Gharahkhani; Alex W Hewitt; David A Mackey; Stuart MacGregor; Jamie E Craig; R Rand Allingham; Micheal Hauser; Adeyinka Ashaye; Donald L Budenz; Stephan Akafo; Susan E I Williams; Yoichiro Kamatani; Toru Nakazawa; Michiaki Kubo
Journal:  Hum Mol Genet       Date:  2018-04-15       Impact factor: 6.150

8.  Genetic analysis of quantitative traits in the Japanese population links cell types to complex human diseases.

Authors:  Masahiro Kanai; Masato Akiyama; Atsushi Takahashi; Nana Matoba; Yukihide Momozawa; Masashi Ikeda; Nakao Iwata; Shiro Ikegawa; Makoto Hirata; Koichi Matsuda; Michiaki Kubo; Yukinori Okada; Yoichiro Kamatani
Journal:  Nat Genet       Date:  2018-02-05       Impact factor: 38.330

9.  Association of HLA-A*31:01 Screening With the Incidence of Carbamazepine-Induced Cutaneous Adverse Reactions in a Japanese Population.

Authors:  Taisei Mushiroda; Yukitoshi Takahashi; Teiichi Onuma; Yoshiaki Yamamoto; Tetsumasa Kamei; Tohru Hoshida; Katsuya Takeuchi; Kotaro Otsuka; Mitsutoshi Okazaki; Masako Watanabe; Kosuke Kanemoto; Tomohiro Oshima; Atsushi Watanabe; Shiro Minami; Kayoko Saito; Hisashi Tanii; Yasushi Shimo; Minoru Hara; Shinji Saitoh; Toshihiko Kinoshita; Masaki Kato; Naoto Yamada; Naoki Akamatsu; Toshihiko Fukuchi; Shigenobu Ishida; Shingo Yasumoto; Atsushi Takahashi; Takeshi Ozeki; Takahisa Furuta; Yoshiro Saito; Nobuyuki Izumida; Yoko Kano; Tetsuo Shiohara; Michiaki Kubo
Journal:  JAMA Neurol       Date:  2018-07-01       Impact factor: 18.302

10.  Large-scale genome-wide association study in a Japanese population identifies novel susceptibility loci across different diseases.

Authors:  Soumya Raychaudhuri; Johji Inazawa; Toshimasa Yamauchi; Takashi Kadowaki; Michiaki Kubo; Yoichiro Kamatani; Kazuyoshi Ishigaki; Masato Akiyama; Masahiro Kanai; Atsushi Takahashi; Eiryo Kawakami; Hiroki Sugishita; Saori Sakaue; Nana Matoba; Siew-Kee Low; Yukinori Okada; Chikashi Terao; Tiffany Amariuta; Steven Gazal; Yuta Kochi; Momoko Horikoshi; Ken Suzuki; Kaoru Ito; Satoshi Koyama; Kouichi Ozaki; Shumpei Niida; Yasushi Sakata; Yasuhiko Sakata; Takashi Kohno; Kouya Shiraishi; Yukihide Momozawa; Makoto Hirata; Koichi Matsuda; Masashi Ikeda; Nakao Iwata; Shiro Ikegawa; Ikuyo Kou; Toshihiro Tanaka; Hidewaki Nakagawa; Akari Suzuki; Tomomitsu Hirota; Mayumi Tamari; Kazuaki Chayama; Daiki Miki; Masaki Mori; Satoshi Nagayama; Yataro Daigo; Yoshio Miki; Toyomasa Katagiri; Osamu Ogawa; Wataru Obara; Hidemi Ito; Teruhiko Yoshida; Issei Imoto; Takashi Takahashi; Chizu Tanikawa; Takao Suzuki; Nobuaki Sinozaki; Shiro Minami; Hiroki Yamaguchi; Satoshi Asai; Yasuo Takahashi; Ken Yamaji; Kazuhisa Takahashi; Tomoaki Fujioka; Ryo Takata; Hideki Yanai; Akihide Masumoto; Yukihiro Koretsune; Hiromu Kutsumi; Masahiko Higashiyama; Shigeo Murayama; Naoko Minegishi; Kichiya Suzuki; Kozo Tanno; Atsushi Shimizu; Taiki Yamaji; Motoki Iwasaki; Norie Sawada; Hirokazu Uemura; Keitaro Tanaka; Mariko Naito; Makoto Sasaki; Kenji Wakai; Shoichiro Tsugane; Masayuki Yamamoto; Kazuhiko Yamamoto; Yoshinori Murakami; Yusuke Nakamura
Journal:  Nat Genet       Date:  2020-06-08       Impact factor: 38.330

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.