| Literature DB >> 31699749 |
Janet E Olson1, Euijung Ryu2, Matthew A Hathcock2, Ruchi Gupta2, Joshua T Bublitz2, Paul Y Takahashi3, Suzette J Bielinski4, Jennifer L St Sauver4, Karen Meagher5, Richard R Sharp5, Stephen N Thibodeau6, Mine Cicek6, James R Cerhan4.
Abstract
PURPOSE: The Mayo Clinic Biobank was established to provide a large group of patients from which comparison groups (ie, controls) could be selected for case-control studies, to create a prospective cohort with sufficient power for common outcomes and to support electronic health record (EHR) studies. PARTICIPANTS: A total of 56 862 participants enrolled (21% response rate) into the Mayo Clinic Biobank from Rochester, Minnesota (77%, n=43 836), Jacksonville, Florida (18%, n=10 368) and La Crosse, Wisconsin (5%, n=2658). Participants were all Mayo Clinic patients, 18 years of age or older and US residents. FINDINGS TO DATE: Overall, 43% of participants were 65 years of age or older and female participants were more frequent (59%) than males at all sites. Most participants resided in the Upper Midwest regions of the USA (Minnesota, Iowa, Illinois or Wisconsin), Florida or Georgia. Self-reported race among Biobank participants was 90% white. Here we provide examples of the types of studies that have successfully utilised the resource, including (1) investigations of the population itself, (2) provision of controls for case-control studies, (3) genotype-driven research, (4) EHR-based research and (5) prospective recruitment to other studies. Over 270 projects have been approved to date to access Biobank data and/or samples; over 200 000 sample aliquots have been approved for distribution. FUTURE PLANS: The data and samples in the Mayo Clinic Biobank can be used for various types of epidemiological and clinical studies, especially in the setting of case-control studies for which the Biobank samples serve as control samples. We are planning cohort studies with additional follow-up and acquisition of genetic information on a large scale. © Author(s) (or their employer(s)) 2019. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ.Entities:
Keywords: cohort; electronic health records; mayo clinic biobank
Year: 2019 PMID: 31699749 PMCID: PMC6858142 DOI: 10.1136/bmjopen-2019-032707
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 2.692
Figure 1Modified Consolidated Standards of Reporting Trials flow diagram. EHR, electronic health records.
Demographic characteristics of Mayo Clinic Biobank stratified by recruitment locations and overall
| Biobank, | Biobank, MC Health System, Wisconsin | Biobank, | Entire biobank (n=56 862) | |
| Age at enrolment, N (%) | ||||
| 18–44 | 1054 (10%) | 489 (18%) | 8106 (19%) | 9649 (17%) |
| 45–54 | 1374 (13%) | 460 (17%) | 7636 (17%) | 9470 (17%) |
| 55–64 | 2511 (24%) | 685 (26%) | 10 371 (24%) | 13 567 (24%) |
| 65 or older | 5429 (52%) | 1024 (39%) | 17 723 (40%) | 24 176 (43%) |
| Female gender, N (%) | 5782 (56%) | 1867 (70%) | 25 738 (59%) | 33 387 (59%) |
| Residence at recruitment, N (%) | ||||
| MC catchment areas in Upper Midwest* | 5 (0.0%) | 2106 (79%) | 23 550 (54%) | 25 661 (45%) |
| Other Upper Midwest | 41 (0.4%) | 544 (21%) | 14 125 (32%) | 14 710 (26%) |
| FL/GA | 9508 (92%) | 2 (0.1%) | 515 (1.2%) | 10 025 (18%) |
| Remainder of USA | 814 (8%) | 6 (0.2%) | 5646 (13%) | 6466 (11%) |
| Race (n=55 590), N (%) | ||||
| White | 8832 (87%) | 2450 (93%) | 40 166 (94%) | 51 448 (90%) |
| Black/African American | 353 (4%) | 8 (0.3%) | 275 (0.6%) | 636 (1%) |
| Asian | 87 (10%) | 19 (0.7%) | 465 (1.1%) | 571 (1%) |
| Native American/Alaskan Native | 23 (0.2%) | 4 (0.2%) | 69 (0.2%) | 96 (0.2%) |
| Others | 825 (8%) | 147 (6%) | 1869 (4%) | 2841 (5%) |
| Missing | 248 | 30 | 992 | 1270 |
| Hispanic ethnicity (n=55 322), N (%) | 334 (3%) | 22 (0.8%) | 486 (1.1%) | 842 (1%) |
| Missing | 282 | 55 | 1203 | 1540 |
| BMI (n=54 123), N (%) | ||||
| Underweight | 188 (2%) | 31 (1.2%) | 364 (0.9%) | 583 (1%) |
| Normal | 3193 (33%) | 642 (26%) | 11 231 (27%) | 15 066 (26%) |
| Overweight | 3550 (37%) | 847 (34%) | 15 006 (36%) | 19 403 (34%) |
| Obese | 2640 (28%) | 992 (40%) | 15 439 (37%) | 19 071 (34%) |
| Missing | 797 | 146 | 1796 | 2739 |
| Education (n=55 022), N (%) | ||||
| Less than High School | 127 (1.3%) | 54 (2%) | 833 (2%) | 1014 (2%) |
| High School Graduate | 1006 (10%) | 509 (19%) | 6600 (16%) | 8115 (14%) |
| Associate Degree/Technical School | 3147 (31%) | 1051 (40%) | 13 581 (32%) | 17 779 (31%) |
| College Graduate or Higher | 5778 (57%) | 1005 (38%) | 21 333 (50%) | 28 116 (49%) |
| Missing | 310 | 39 | 1489 | 1838 |
| Smoking status (n=54 891), N (%) | ||||
| Never | 5205 (52%) | 1476 (57%) | 25 209 (60%) | 31 890 (56%) |
| Former | 4418 (44%) | 906 (35%) | 14 787 (35%) | 20 111 (35%) |
| Current | 386 (4%) | 216 (8%) | 2288 (5%) | 2890 (5%) |
| Missing | 359 | 60 | 1552 | 1971 |
| Alcohol (n=55 647), N (%) | ||||
| Never | 2593 (26%) | 563 (21%) | 9192 (21%) | 12 348 (22%) |
| Once a month or less | 1733 (17%) | 584 (22%) | 8943 (21%) | 11 260 (20%) |
| 2–4 times a month | 1719 (17%) | 659 (25%) | 9807 (23%) | 12 185 (21%) |
| 2–5 times a week or more | 4076 (40%) | 829 (32%) | 14 949 (35%) | 19 854 (35%) |
| Missing | 247 | 23 | 945 | 1215 |
*27 counties in Minnesota and Wisconsin described by Rocca et al.4
BMI, body mass index; FL, Florida; GA, Georgia; MC, Mayo Clinic.
Figure 2Geographical distribution (state-level) of home residences among participants at the time of enrolment into the Mayo Clinic Biobank.
Figure 3Geographical distribution (county-level) of home residences among Mayo Clinic Biobank participants from Upper Midwest regions (A) and Florida/Georgia (B).
Figure 4Depth of electronic health records of Mayo Clinic Biobank participants since 1994.
Top 15 self-reported prevalent conditions at enrolment into Mayo Clinic Biobank among all 56 862 participants. Incident conditions are those reported by the 24 016 subjects with data available on both the enrolment (baseline) and at 4-year follow-up surveys through the end of 2017
| Self-reported disease | Prevalent cases at enrolment (%) | Incident cases at follow-up (%) |
| Hyperlipidaemia | 23 121 (41%) | 1885 (14%) |
| Hypertension | 22 373 (40%) | 1615 (11%) |
| Osteoarthritis | 19 393 (35%) | 3066 (19 %) |
| Cancer* | 18 497 (33%) | 2007 (12%) |
| Non-melanoma skin cancer | 8825 (16%) | 1583 (8%) |
| Prostate cancer (men only) | 2655 (12 %) | 318 (4%) |
| Breast cancer | 3003 (9%) | 293 (2%) |
| Melanoma | 2118 (4%) | 510 (2%) |
| Other cancer | 1262 (2%) | 1044 (4%) |
| Gastro-oesophageal reflux disorder | 16 846 (30%) | 2098 (12%) |
| Cataracts | 15 845 (28%) | 2966 (17%) |
| Depression | 13 648 (24%) | 1175 (6%) |
| Anxiety | 11 500 (21%) | 1412 (7%) |
| Migraine headaches | 10 787 (19%) | 765 (4%) |
| Sleep apnoea | 9486 (17%) | 1481 (7%) |
| Hyper/hypothyroidism | 8344 (15%) | 928 (5%) |
| Asthma | 7568 (14%) | 501 (2%) |
| Diabetes | 6516 (12%) | 626 (3%) |
| Irritable bowel disease | 6108 (11%) | 783 (4%) |
| Atrial fibrillation/arrhythmia | 23 121 (41%) | 1885 (14%) |
*Incident cases were defined as those who reported to have a condition at follow-up survey, but either reported ‘no’ at the baseline or did not answer the particular question. Among those who completed both questionnaires, 58.7% (n=14 091) were females. For prostate cancer, number of males was used as a denominator.
Prevalence (at enrolment) and cumulative incidence through the last follow-up period (up to 8.5 years after enrolment) of the 20 US DHHS chronic conditions among all consented subjects and the subset eligible for clinical follow-up via electronic health records
| US DHHS 20 chronic conditions | Eligible participants for follow-up (n=40 978) | All consented | ||
| Prevalent cases (%) | Incident case (%*) | Prevalent cases (%) | Incident case (%*) | |
| Hypertension | 17 976 (44%) | 2034 (12%) | 20 876 (37%) | 2835 (11%) |
| Congestive heart failure | 1394 (3%) | 1121 (4%) | 1592 (3%) | 1259 (4%) |
| Coronary artery disease | 7388 (18%) | 1373 (6%) | 8492 (15%) | 1742 (5%) |
| Cardiac arrhythmias | 13 508 (33%) | 2158 (10%) | 15 349 (27%) | 2762 (9%) |
| Hyperlipidaemia | 21 817 (53%) | 1773 (12%) | 25 038 (44%) | 2596 (11%) |
| Stroke | 2689 (7%) | 975 (4%) | 2969 (5%) | 1116 (3%) |
| Arthritis | 17 944 (44%) | 2861 (16%) | 21 173 (37%) | 3847 (14%) |
| Asthma | 3874 (9%) | 489 (2%) | 4331 (8%) | 637 (2%) |
| Autism | 3 (<1%) | 6 (<1%) | 3 (<1%) | 6 (<1%) |
| Cancer (selected†) | 13 648 (33%) | 2681 (13%) | 15 946 (28%) | 3379 (12%) |
| Chronic kidney disease | 3604 (9%) | 2094 (8%) | 4153 (7%) | 2413 (7%) |
| Chronic obstructive pulmonary disease | 3600 (9%) | 728 (3%) | 3932 (7%) | 897 (2%) |
| Dementia | 714 (2%) | 813 (3%) | 893 (2%) | 906 (3%) |
| Depression | 7368 (18%) | 1452 (6%) | 8040 (14%) | 1804 (5%) |
| Diabetes | 11 779 (29%) | 2473 (11%) | 13 351 (23%) | 3051 (9%) |
| Hepatitis | 574 (1%) | 126 (<1%) | 706 (1%) | 174 (<1%) |
| HIV | 27 (<1%) | 3 (<1%) | 35 (<1%) | 8 (<1%) |
| Osteoporosis | 4063 (10%) | 926 (3%) | 4405 (8%) | 1102 (3%) |
| Schizophrenia | 153 (<1%) | 116 (<1%) | 168 (<1%) | 127 (<1%) |
| Substance Abuse | 1305 (3%) | 662 (3%) | 1461 (3%) | 752 (2%) |
*Cumulative incidence was calculated based on the Kaplan-Meier (KM) method, which is a sum of the KM estimate of incidence of a given condition at a specified time point over time. The KM estimate at a specified time point is the number of participants having a given condition at that time, divided by the number of patients at risk (alive, disease-free and not lost to follow-up), multiplied by the probability of disease-free survival just prior to that time. Note that this estimate cannot be calculated as a simple proportion of the number of participants with a given condition, divided by the number of participants at risk at the biobank entry.
†Cancer category includes breast, colorectal, lung and prostate cancers.
US DHHS, United States Department of Health and Human Services.