| Literature DB >> 33181822 |
Mark J Berger1, Hannah E Williams1, Ryan Barrett1, Anjali D Zimmer1, Wendy McKennon1, Huy Hong1, Jeremy Ginsberg1, Alicia Y Zhou1, Cynthia L Neben1.
Abstract
Publicly available genetic databases promote data sharing and fuel scientific discoveries for the prevention, treatment and management of disease. In 2018, we built Color Data, a user-friendly, open access database containing genotypic and self-reported phenotypic information from 50 000 individuals who were sequenced for 30 genes associated with hereditary cancer. In a continued effort to promote access to these types of data, we launched Color Data v2, an updated version of the Color Data database. This new release includes additional clinical genetic testing results from more than 18 000 individuals who were sequenced for 30 genes associated with hereditary cardiovascular conditions as well as polygenic risk scores for breast cancer, coronary artery disease and atrial fibrillation. In addition, we used self-reported phenotypic information to implement the following four clinical risk models: Gail Model for 5-year risk of breast cancer, Claus Model for lifetime risk of breast cancer, simple office-based Framingham Coronary Heart Disease Risk Score for 10-year risk of coronary heart disease and CHARGE-AF simple score for 5-year risk of atrial fibrillation. These new features and capabilities are highlighted through two sample queries in the database. We hope that the broad dissemination of these data will help researchers continue to explore genotype-phenotype correlations and identify novel variants for functional analysis, enabling scientific discoveries in the field of population genomics. Database URL: https://data.color.com/.Entities:
Year: 2020 PMID: 33181822 PMCID: PMC7661094 DOI: 10.1093/database/baaa083
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
New filter categories and filter values on the hereditary cancer dashboard.
| Filter categories | Filter values |
|---|---|
| Gail Risk Score | Elevated risk (age 35–49 years and risk ≥1.67%; age 50–59 years and risk ≥2.0%; age 60–69 years and risk ≥3.0%; age 70–74 years and risk ≥4.0%), Ineligible |
| Claus Risk Score | Elevated (≥20%), Ineligible |
| BC Polygenic Risk Score | Calculated, Unknown |
Ineligible includes females who did not meet the model criteria and males.
Unknown includes females who did not provide enough information to calculate risk scores.
Males included. Only displays normalized risk scores between − 3 and 3.
Filter categories and filter values on hereditary cardiovascular conditions dashboard.
| Filter categories | Filter values |
|---|---|
| Sex | Female, Male |
| Age | 18–25, 26–30, 31–35, 36–40, 41–45, 46–50, 51–55, 56–60, 61–65, 66–70, 71–75, 76–80, 81–85, 86–89, ≥90 |
| Ethnicity | African, Ashkenazi Jewish, Asian, not specified; Caucasian, Chinese, Filipino, Hispanic, Indian, Japanese, Multiple ethnicities, Native American, Pacific Islander, Unknown |
| Personal Health History | Aneurysm, Angioplasty, Bundle branch or heart block, Bypass, Cardiac arrest, Cardiomegaly, Heart attack, Heart failure, No cardiac events, Stent, Stroke |
| Family Health History | Aneurysm, Angioplasty, Bundle branch or heart block, Bypass, Cardiac arrest, Cardiomegaly, Heart attack, Heart failure, Stent, Stroke |
| Classification | Benign, Likely Benign, Likely Pathogenic, Pathogenic, VUS |
| Gene |
|
| Variant | (Search by HGVS Nomenclature) |
| Zygosity | Heterozygous, Homozygous |
| CHARGE-AF Risk Score | Borderline (5%–7.5%), High (>10%), Ineligible |
| Framingham Risk Score | Borderline (5%–7.5%), High (>10%), Ineligible |
| Polygenic Risk Score | Calculated, Unknown |
Unknown includes information not reported.
Filter values for ‘Variant’ can only be selected by text typing with auto complete using Human Genome Variation Society (HGVS) nomenclature.
CHARGE-AF ineligible includes individuals <46 or ≥90 years. Framingham ineligibility includes individuals <30 or >74 years.
Unknown includes individuals who did provide enough information to calculate risk scores.
Only displays normalized risk scores between − 3 and 3.
Figure 1.Screenshots of query results for frequency of pathogenic and likely pathogenic variants in genes associated with hereditary cardiovascular conditions. (A–E) Filter by ‘Classification: Pathogenic or Likely pathogenic’. Query URL: https://data.color.com/v2/cardio.html#classification=Likely%20pathogenic&classification=Pathogenic (F, G) Remove ‘Classification: Pathogenic or Likely pathogenic’ and filter by ‘Gene: APOB’ and ‘Variant: c.10580G>A’. Query URL: https://data.color.com/v2/cardio.html#gene=APOB&variant=c.10580G%3EA.
Figure 2.Screenshots of query results for monogenic and polygenic breast cancer risk in women with a personal history of breast cancer. (A–E) Filter by ‘Sex: Female’, ‘Personal health history: Breast’ and ‘BC Polygenic Risk Score: Calculated’. Query URL: https://data.color.com/v2/cancer.html#sex=Female&personal_health_history=Breast&bc_polygenic_risk_score=Calculated.