| Literature DB >> 29308445 |
Paul J Newey1, Jonathan N Berg2, Kaixin Zhou1, Colin N A Palmer1, Rajesh V Thakker3.
Abstract
CONTEXT: Genetic testing is increasingly used for clinical diagnosis, although variant interpretation presents a major challenge because of high background rates of rare coding-region variation, which may contribute to inaccurate estimates of variant pathogenicity and disease penetrance.Entities:
Keywords: ExAC; genetic testing; germline; mutation; penetrance; single nucleotide variant
Year: 2017 PMID: 29308445 PMCID: PMC5740525 DOI: 10.1210/js.2017-00330
Source DB: PubMed Journal: J Endocr Soc ISSN: 2472-1972
Details of the 38 Genes Studied and Their Associated Endocrine Disorders
| Familial isolated pituitary adenoma | Het (LOF, MS) | AD (RP) | ENST00000279146 | |
| Sporadic pituitary adenomas | Het (LOF, MS) | Sporadic | ||
| Familial hypocalciuric hypercalcemia | Het (MS) | AD | ENST00000263270 | |
| Familial macronodular adrenal hyperplasia | Het (MS) | AD (RP) | ENST00000268314 | |
| Sporadic macronodular adrenal hyperplasia | Het (MS) | Sporadic | ||
| Familial hypocalciuric hypercalcemia | Het (MS) | AD | ENST00000498619 | |
| Familial isolated hyperparathyroidism | Het (MS) | AD | ||
| Autosomal dominant hypocalcemia | Het (MS) | AD | ||
| Hyperparathyroidism-jaw tumor syndrome | Het (LOF, MS) | AD | ENST00000367435 | |
| Sporadic parathyroid carcinoma | Het (LOF, MS) | Sporadic | ||
| Sporadic parathyroid adenoma | Het (MS) | Sporadic | ENST00000405375 | |
| Multiple endocrine neoplasia type 4 | Het (LOF, MS) | AD | ENST00000228872 | |
| Sporadic parathyroid and pituitary adenoma | Het (MS) | Sporadic | ||
| Sporadic parathyroid adenoma | Het (MS) | Sporadic | ENST00000276925 | |
| Sporadic parathyroid adenoma | Het (MS) | Sporadic | ENST00000262662 | |
| Hypogonadotropic hypogonadism type 5/CHARGE | Het (LOF, MS) | AD (RP) | ENST00000423902 | |
| Sporadic pheochromocytoma/paraganglioma | Het (MS) | Sporadic | ENST00000366641 | |
| Sporadic pheochromocytoma/paraganglioma | Het (MS) | Sporadic | ENST00000263734 | |
| Autosomal dominant hypophosphatemic rickets | Het (MS) | AD | ENST00000237837 | |
| Hereditary leiomyomatosis/renal cell carcinoma | Het (LOF, MS) | AD | ENST00000366560 | |
| Sporadic pheochromocytoma/paraganglioma | Het (MS) | Sporadic | ||
| Hypoparathyroidism/deafness/renal dysplasia | Het (LOF, MS) | AD | ENST00000379328 | |
| Laron dwarfism | Homo (LOF, MS) | AR | ENST00000230882 | |
| Idiopathic short stature | Het | AD (?) | ||
| Familial hypocalciuric hypercalcemia | Het (MS) | AD | ENST00000078429 | |
| Autosomal dominant hypocalcemia | Het (MS) | AD | ||
| Pseudohypoparathyroidism type 1a | Het (LOF, MS) | AD | ENST00000371100 | |
| Sporadic acromegaly | Het/Hemi (MS) | Sporadic | ENST00000298110 | |
| Hypogonadotropic hypogonadism type 1 | Hemi (LOF, MS) | XLD | ENST00000262648 | |
| Familial hypertension | Het (MS) | AD | ENST00000529694 | |
| Familial pheochromocytoma/paraganglioma | Het (MS) | AD | ENST00000263934 | |
| Sporadic pheochromocytoma/paraganglioma | Het (MS) | Sporadic | ||
| Familial pheochromocytoma/paraganglioma | Het (LOF, MS) | AD (RP) | ENST00000358664 | |
| Sporadic pheochromocytoma/paraganglioma | Het (LOF, MS) | Sporadic | ||
| Multiple endocrine neoplasia type 1 | Het (LOF, MS) | AD | ENST00000337652 | |
| Neurofibromatosis | Het (LOF, MS) | AD | ENST00000358273 | |
| X-linked hypophosphatemic rickets | Het/Hemi (LOF, MS) | XLD | ENST00000379374 | |
| Carney complex | Het (LOF, MS) | AD | ENST00000589228 | |
| Hereditary hyperprolactinemia | Het (MS) | AD | ENST00000382002 | |
| Multiple endocrine neoplasia type 2 | Het (MS) | AD | ENST00000355710 | |
| Familial medullary thyroid cancer | Het (MS) | AD | ||
| Familial pheochromocytoma/paraganglioma | Het (LOF, MS) | AD (RP) | ENST00000264932 | |
| Sporadic pheochromocytoma/paraganglioma | Het (LOF, MS) | Sporadic | ||
| Familial pheochromocytoma/paraganglioma | Het (MS) | AD (RP) | ENST00000301761 | |
| Sporadic pheochromocytoma/paraganglioma | Het (LOF) | Sporadic | ||
| Familial pheochromocytoma/paraganglioma | Het (LOF, MS) | AD (RP) | ENST00000375499 | |
| Sporadic pheochromocytoma/paraganglioma | Het (LOF, MS) | Sporadic | ||
| Sporadic pheochromocytoma/paraganglioma | Het (LOF, MS) | Sporadic | ENST00000367975 | |
| Familial pheochromocytoma/paraganglioma | Het (LOF, MS) | AD (RP) | ENST00000375549 | |
| Sporadic pheochromocytoma/paraganglioma | Het (LOF, MS) | Sporadic | ||
| Thyroid hormone resistance | Het (LOF, | AD | ENST00000450525 | |
| Thyroid hormone resistance | Het (MS) | AD | ENST00000264637 | |
| Thyroid hormone resistance | Het (LOF, MS) | AD | ENST00000396671 | |
| Familial pheochromocytoma/paraganglioma | Het (LOF, MS) | AD (RP) | ENST00000258439 | |
| Sporadic pheochromocytoma/paraganglioma | Het (LOF, MS) | Sporadic | ||
| Von Hippel-Lindau (VHL) | Het (LOF, MS) | AD | ENST00000256474 | |
| Sporadic pheochromocytoma/paraganglioma | Het (LOF, MS) | Sporadic |
Abbreviations: AD, autosomal dominant; AR, autosomal recessive; Hemi, hemizygous; Het, heterozygous; MS, missense; RP, reduced penetrance; XLD, X-linked dominant.
Genes reported to be associated with endocrine disease, although evidence supporting pathogenicity may be limited.
Evidence supporting heterozygous GHR mutations in idiopathic short stature remains unclear.
Diseases associated with genomic imprinting.
THRA_1 refers to THRA isoform 1 encoded by the noncanonical transcript ENST00000450525. Disease-associated nonsense and missense mutations in final exon of isoform 1 are reported. THRA_2 refers to THRA isoform 2 encoded by the canonical transcript (ENST00000450525), in which a missense mutation, also present in isoform 1, has been reported.
Figure 1.Rare variant frequency, evolutionary conservation, and constraint metrics of genes associated with hereditary endocrine disease. (A) Proportion of all nonsynonymous SNVs occurring in the selected genes as singletons or with an AF <0.05%. Across the 38 genes, an average of 59.8% (range, 42.6% to 100%) of individual missense/LOF SNVs occurred as singletons, whereas 91.8% (range, 83% to 100%) had an AF <0.05%. (B) Rare SNV frequency was correlated with evolutionary conservation of the encoded protein. Individual genes were ranked according to both their size-corrected cumulative SNV frequency (for SNVs with AF <0.05%) and the degree of amino acid conservation between human and zebrafish (Danio rerio) orthologs. A significant correlation was observed (r = 0.69; P < 0.0001), such that genes with a high degree of conservation harbored the lowest rates of rare SNVs. Of note, a marked overlap was observed between genes with high conservation/low variation and those categorized as intolerant of both missense/LOF variation using constraint metrics (genes marked with open circles). All other genes are represented by closed circles. (C) Missense and LOF constraint metrics for the study genes. A z score >3.09 is reported to represent significant missense intolerance, whereas a pLI score >0.9 is indicative of extreme LOF intolerance and suggestive of a haploinsufficient function (i.e., gene intolerant to heterozygous LOF). Of note, 45% (17 of 38) of study genes could be classified as extreme LOF intolerant, whereas 32% (12 of 38) were missense intolerant. Several genes in which both missense and LOF mutations were responsible for penetrant monogenic disorders (e.g., MEN1, CDC73, and NF1) clustered in the combined LOF/missense intolerant group. In contrast, several genes in which the role of heterozygous germline variation in disease pathogenesis was less well defined were categorized as LOF (pLI score <0.1) and/or missense tolerant (e.g., CDKN1A, SDHA, and GPR101). However, the reliability of the pLI constraint metric was reduced for genes of small size where few LOF variants were predicted (e.g., <10), and these genes are identified by an open circle (e.g., CDKN1B, VHL). All other genes are represented by closed circles. pLI, probability of LOF intolerance.
Gene-Level Estimates of Cumulative Rare Nonsynonymous SNV Carrier Frequencies in the Control Cohort
| 2997 | 5.2 | 19 | 3.0 | 34 | 0.9 | 115 | |
| 2839 | 2.1 | 47 | 1.5 | 67 | 0.5 | 187 | |
| 1037 | 2.6 | 38 | 1.4 | 71 | 0.5 | 208 | |
| 1770 | 2.0 | 51 | 1.2 | 81 | 0.4 | 226 | |
| 1114 | 3.0 | 34 | 1.4 | 71 | 0.4 | 231 | |
| 935 | 2.8 | 35 | 1.2 | 82 | 0.3 | 289 | |
| 870 | 2.1 | 47 | 1.1 | 87 | 0.3 | 310 | |
| 1078 | 1.5 | 67 | 0.7 | 149 | 0.3 | 354 | |
| 510 | 0.9 | 117 | 0.5 | 191 | 0.3 | 392 | |
| 213 | 0.7 | 138 | 0.5 | 206 | 0.2 | 429 | |
| 638 | 1.7 | 58 | 0.7 | 141 | 0.2 | 474 | |
| 622 | 1.6 | 61 | 0.5 | 190 | 0.2 | 504 | |
| 749 | 0.8 | 132 | 0.5 | 199 | 0.2 | 514 | |
| 664 | 1.3 | 78 | 0.9 | 110 | 0.2 | 516 | |
| 680 | 2.4 | 42 | 0.8 | 131 | 0.2 | 536 | |
| 426 | 0.6 | 180 | 0.3 | 294 | 0.2 | 545 | |
| 444 | 0.5 | 181 | 0.3 | 286 | 0.2 | 644 | |
| 615 | 0.4 | 254 | 0.4 | 270 | 0.1 | 658 | |
| 419 | 0.5 | 195 | 0.4 | 228 | 0.1 | 734 | |
| 508 | 2.0 | 50 | 0.7 | 138 | 0.1 | 809 | |
| 251 | 0.5 | 213 | 0.3 | 288 | 0.1 | 891 | |
| 330 | 1.4 | 70 | 0.4 | 272 | 0.1 | 923 | |
| 461 | 0.6 | 156 | 0.3 | 379 | 0.1 | 963 | |
| 490 | 0.3 | 283 | 0.3 | 367 | 0.1 | 1034 | |
| 280 | 0.6 | 168 | 0.4 | 319 | 0.1 | 1048 | |
| 198 | 0.9 | 109 | 0.4 | 264 | 0.1 | 1124 | |
| 531 | 0.2 | 468 | 0.2 | 468 | 0.1 | 1208 | |
| 238 | 0.4 | 258 | 0.2 | 544 | 0.1 | 1217 | |
| 138 | 0.6 | 161 | 0.2 | 439 | 0.1 | 1336 | |
| 410 | 0.2 | 517 | 0.1 | 679 | 0.1 | 1509 | |
| 381 | 0.2 | 432 | 0.1 | 828 | 0.1 | 1514 | |
| 359 | 0.1 | 691 | 0.1 | 802 | 0.1 | 1594 | |
| 164 | 1.4 | 69 | 0.3 | 358 | 0.1 | 1598 | |
| 166 | 0.4 | 227 | 0.2 | 502 | 0.05 | 1890 | |
| 159 | 0.2 | 433 | 0.1 | 659 | 0.04 | 2245 | |
| 168 | 0.2 | 593 | 0.1 | 728 | 0.04 | 2302 | |
| 169 | 0.2 | 625 | 0.2 | 625 | 0.04 | 2757 | |
| 160 | 0.1 | 837 | 0.1 | 1380 | 0.03 | 3318 | |
| 142 | 0.01 | 12,066 | 0.01 | 12,066 | 0.01 | 12,066 | |
The number needed to sequence (NNS) equates to the mean number of individuals (reported to the nearest whole number) requiring sequencing to identify a rare variant of each type (i.e., AF <0.5%, AF <0.05%, or singleton). This was determined by taking the reciprocal of the estimated cumulative variant frequency for each category of rare variant per individual (i.e., taking into account the presence of two alleles per gene). Genes are arranged in decreasing frequency of singleton variants.
Abbreviation: AA, amino acid.
Missing data for part of gene and/or reduced reliability of estimates due to reduced exon coverage.
For X-linked disorders, the NNS is stated for females (i.e., accounting for each allele).
Two transcripts are reported for THRA as described in the footnote in Table 1.
Figure 2.Utility of computational tools to predict rare variant effects and frequency of disease-associated RET alleles in the ExAC cohort. (A) Graph shows the proportion of rare SNVs (AF <0.5%) predicted to result in deleterious effects using common computational prediction tools. The number of rare SNVs evaluated for each gene is shown above the respective gene column. In the subset of 12 genes, a mean of 20.5% (range, 9.5% to 37.9%) of gene-specific rare SNVs were categorized as deleterious (i.e., meeting the criteria: AF <0.5%, SIFT <0.05, Polyphen2 probably damaging, and scaled CADD >20), whereas 52.7% (range, 37.9% to 73.7%) were categorized as possibly deleterious (i.e., AF <0.5% and either SIFT ≤0.05 or Polyphen2 probably damaging or possibly damaging). (B) Frequency of RET alleles reported as pathogenic in the ExAC cohort. Twenty-two individuals harbored eight different MEN2/familial medullary thyroid cancer‒associated RET mutations corresponding to a prevalence of approximately one in 1750 (accounting for incomplete genotyping at the Val804Met locus). Only established MEN2/familial medullary thyroid cancer‒associated RET mutations were included in the analysis (Supplemental Table 7). Of these alleles, the majority are classified as moderate risk in the 2015 American Thyroid Association Medullary Thyroid Cancer Guidelines, whereas the two variants predicted to affect the cysteine residue at codon 634 (Cys634Arg, Cys634Phe) are categorized as high risk [27]. The Val804Met variant (observed 13 times) arose in four different ethnic populations. Excluding the Val804Met variant, approximately one in 6000 individuals in the ExAC cohort harbored a pathogenic RET mutation. *Two different SNVs predicted a Leu790Phe amino acid substitution.
Comparison of Frequencies of Missense Germline
| All (n = 1866) | |||||||
| Observed in sporadic pituitary tumor cohorts | 38 | 2 | 12 | 9 | 0 | 7 | 8 |
| Predicted from Global ExAC variant frequencies | 26.7 | 0.8 | 7.3 | 5.4 | 1.6 | 9.4 | 2.0 |
| Odds ratio (95% CI) observed vs predicted | 1.4 (1.0–2.0) | — | 1.6 (0.9–2.9) | 1.6 (0.8–3.2) | — | — | 4.1 (1.9–8.5) |
| Observed in sporadic pituitary tumor cohorts | 38 | 2 | 12 | 9 | 0 | 7 | 8 |
| Predicted from European ExAC variant frequencies | 30.2 | 1.1 | 11.2 | 9.1 | 2.5 | 4.7 | 1.6 |
| Odds ratio (95% CI) observed vs predicted | 1.2 (0.9–1.7) | — | 1.1 (0.6–1.9) | 1.0 (0.5–1.9) | — | — | 5.2 (2.3–11.4) |
| Acromegaly (n = 935) | |||||||
| Observed in acromegaly cohort | 18 | 0 | 6 | 5 | 0 | 1 | 6 |
| Predicted from global ExAC variant frequencies | 13.2 | 0.4 | 3.6 | 2.7 | 0.8 | 4.7 | 1 |
| Odds ratio (95% CI) observed vs predicted | 1.3 (0.8–2.1) | — | — | — | — | — | — |
| Observed in acromegaly cohort | 18 | 0 | 6 | 5 | 0 | 1 | 6 |
| Predicted from European ExAC variant frequencies | 15.1 | 0.5 | 5.6 | 4.6 | 1.2 | 2.4 | 0.8 |
| Odds ratio (95% CI) observed vs predicted | 1.2 (0.7–1.9) | — | — | — | — | — | — |
| Prolactinoma (n = 359) | |||||||
| Observed in prolactinoma cohort | 13 | 1 | 1 | 3 | 0 | 6 | 2 |
| Predicted from global ExAC variant frequencies | 5.1 | 0.2 | 1.4 | 1 | 0.3 | 1.8 | 0.4 |
| Odds ratio (95% CI) observed vs predicted | 2.6 (1.5–4.5) | — | — | — | — | — | — |
| Observed in prolactinoma cohort | 13 | 1 | 1 | 3 | 0 | 6 | 2 |
| Predicted from European ExAC variant frequencies | 5.8 | 0.2 | 2.1 | 1.8 | 0.5 | 0.9 | 0.3 |
| Odds ratio (95% CI) observed vs predicted | 2.2 (1.3–3.9) | — | — | — | — | — | — |
Germline missense variants reported in 1866 individuals (representing 3732 alleles) with apparently sporadic pituitary tumors in whom the AIP gene was sequenced. Patient groups represented in the respective studies include those with sporadic child gigantism, sporadic acromegaly presenting in young adulthood and sporadic acromegaly presenting at any age, and individuals with other forms of apparently sporadic pituitary adenomas [including prolactinomas (predominantly macroprolactinomas), nonfunctioning adenomas, and Cushing disease]. This analysis did not include AIP sequence analysis from those individuals with apparent familial isolated pituitary adenoma syndromes or those individuals with MEN1-like disorders. A separate subanalysis comparing allele frequencies between the pituitary tumor cohort and just the European subset of the ExAC cohort (n = 33,370 individuals) was performed, deemed to be the most suitable comparator group for the disease cohorts reported in the literature. Estimates of odds ratios and CIs were calculated at http://www.hutchon.net/confidor.htm. For individual variants, odds ratios are provided only where sufficient numbers of alleles were observed to enable meaningful comparison. Some of the studies included in the analysis, reported individuals with unclassified pituitary tumor subtypes. These individuals are not included in the subgroup analysis of prolactinoma and acromegalic cases.
Abbreviation: CI, confidence interval.
ExAC other: Observed at least once in the ExAC cohort but with an AF <0.5% and excluding R9Q, R16H, R304Q, and A229V variants. AIP variants occurring as singletons in ExAC are also reported in this group.
Novel Singleton: not observed in the ExAC cohort. Predicted number deduced from prevalence of singletons in the ExAC cohort.
Predicted Population-Level Prevalence of Rare Nonsynonymous SNVs Employing Disease-Relevant Gene Panels
| Pheochromocytoma/paraganglioma
| 13.9 | 7.2 | 8.7 | 11.6 | 3.0 | 32.8 |
| Pituitary tumor
| 4.9 | 20.5 | 1.9 | 51.3 | 0.54 | 185 |
| Hypercalcemia/hyperparathyroid
| 8.1 | 12.3 | 3.8 | 26.6 | 1.3 | 77.9 |
| Multiple endocrine neoplasia
| 6.3 | 15.8 | 3.8 | 26.1 | 1.4 | 70.8 |
Hypothetical gene panel to include sequencing of coding region of all genes listed under each respective heading. The number needed to sequence represents an estimation of the average number of individuals requiring sequencing to identify a variant in each relevant group.
Calculation based on occurrence of a rare variant (of the relevant frequency classification) in at least one of the genes in the respective panel.
Germline variants reported in a single or a very low number of individuals/kindreds.
Figure 3.Illustrative workflow outlining considerations for genetic testing and variant interpretation in clinical and research settings. Evaluating the spectrum and frequency of rare variations in large population cohorts may enhance the process of genetic testing at different stages of the clinical workflow. For example, as the content of genetic testing increases, the likelihood of identifying rare coding variants increases (e.g., VUSs and IFs). The current study illustrates how gene-specific cumulative rare variant frequencies may be used to establish pretest estimates for identifying such variants. This information could be incorporated into the informed consent process to alert patients to the likelihood of ambiguous test results. Although guidelines exist to standardize the process of ascribing pathogenicity to germline variants, these typically focus on variant-specific features. The current study highlights how gene-level factors, including estimates of rare missense/LOF variation together with metrics of constraint, may aid variant classification. For example, when the cumulative frequency of LOF variants in the control population exceeds the prevalence of the disease under investigation, the likelihood that such a variant is a high penetrance disease-allele is substantially reduced. Conversely, variants in genes with very low rare variation burden potentially have a higher likelihood of pathogenicity. In the future, it is possible that this information may contribute to Bayesian models for disease in which the likelihood of variant pathogenicity and/or disease expression is adjusted according to clinical factors as well as gene- and variant-level metrics. Furthermore, for potential disease alleles, refined estimates of disease penetrance may be established by evaluating the frequency of the variant in disease and control populations, and such accurate estimates are essential for appropriate genetic counseling (e.g., to determine the value of implementing treatment/surveillance programs and/or screening of first-degree relatives). In addition, when ambiguous test results have been obtained (e.g., VUSs), further refinement of risk may be established by relating the test result to both the pretest estimate of detecting such variation and constraint metrics, which together may aid the clinician and patient in making informed decisions regarding future care. Finally, these studies illustrate the need for transparent and accurate reporting of genetic data coupled to phenotypes (i.e., avoiding positive reporting bias) to improve the accuracy of existing disease/mutation databases. IF, incidental finding; NGS, next-generation sequencing; VUS, variant of uncertain significance.