Literature DB >> 34941562

Identification of NID1 as a novel candidate susceptibility gene for familial non-medullary thyroid carcinoma using whole-exome sequencing.

Luis Eduardo Barbalho de Mello1,2, Thaise Nayane Ribeiro Carneiro1, Aline Neves Araujo1, Camila Xavier Alves2, Pedro Alexandre Favoretto Galante3, Vanessa Candiotti Buzatto3, Maria das Graças de Almeida2,4, Karina Marques Vermeulen-Serpa2, Sancha Helena de Lima Vale4,5, Fernando José de Pinto Paiva2, José Brandão-Neto2, Janete Maria Cerutti1.   

Abstract

The genetics underlying non-syndromic familial non-medullary thyroid carcinoma (FNMTC) is still poorly understood. To identify susceptibility genes for FNMTC, we performed whole-exome sequencing (WES) in a Brazilian family affected by papillary thyroid carcinoma (PTC) in three consecutive generations. WES was performed in four affected and two unaffected family members. Manual inspection in over 100 previously reported susceptibility genes for FNMTC showed that no variants in known genes co-segregated with disease phenotype in this family. Novel candidate genes were investigated using PhenoDB and filtered using Genome Aggregation (gnomAD) and Online Archive of Brazilian Mutations (ABraOM) population databases. The missense variant p.Ile657Met in the NID1 gene was the only variant that co-segregated with the disease, while absent in unaffected family members and controls. The allele frequency for this variant was <0.0001 in the gnomAD and ABbraOM databases. In silico analysis predicted the variant to be deleterious or likely damaging to the protein function. Somatic mutations in NID1 gene were found in nearly 500 cases of different cancer subtypes in the intOGen platform. Immunohistochemistry analysis showed NID1 expression in PTC cells, while it was absent in normal thyroid tissue. Our findings were corroborated using data from the TCGA cohort. Moreover, higher expression of NID1 was associated with higher likelihood of relapse after treatment and N1b disease in PTCs from the TCGA cohort. Although replication studies are needed to better understand the role of this variant in the FNMTC susceptibility, the NID1 variant (c.1971T>G) identified in this study fulfills several criteria that suggest it as a new FNMTC predisposing gene.

Entities:  

Keywords:  FNMTC; NID1; papillary thyroid carcinomas; whole-exome sequencing

Year:  2022        PMID: 34941562      PMCID: PMC8859953          DOI: 10.1530/EC-21-0406

Source DB:  PubMed          Journal:  Endocr Connect        ISSN: 2049-3614            Impact factor:   3.335


Introduction

Thyroid cancers arising from follicular cells are referred to as non-medullary thyroid cancer (NMTC), while those arising from parafollicular cells are referred to as medullary thyroid cancer (MTC). Although the majority (~95%) of thyroid cancers are sporadic, a familial form has been described for both NMTC and MTC. The familial non-medullary thyroid carcinoma (FNMTC) was further divided into two groups: syndromic and non-syndromic (1). In the syndromic group, which comprises Carney complex, Werner’s syndrome, phosphatase and tensin homologue gene (PTEN) hamartoma tumour syndrome (PHTS) and familial adenomatous polyposis (FAP), NMTC occurs as a minor component of familial cancer syndromes (2, 3). As the predisposing genes for the syndromic cases have been characterized, at-risk patients can be readily identified based on genetic test. In the non-syndromic form, thyroid cancer is the primary feature of the disease and the predisposing genes are not well defined. Even if the genetic basis of non-syndromic FNMTC has not been well established, the thyroid gland shows the highest familial relative risk compared to many other well-characterized cancers with Mendelian pattern of inheritance such as breast, colon and prostate. In fact, several studies demonstrated an eight- to ten-fold increased risk of developing NMTC in first-degree relative of affected individuals than the risk observed in the general population (2, 4, 5, 6, 7, 8). Consequently, the diagnosis of non-syndromic FNMTC is primarily based on family history and it is currently defined as the occurrence of the thyroid cancer of follicular cell origin in two or more first-degree relatives in absence of other extra-thyroidal clinical manifestations (9, 10, 11, 12). Nevertheless, this classification is under debate as families with only two first-degree relatives affected by thyroid cancer may represent a casual association of the disease (13). In fact, it is estimated that nearly 62–69% of families with two first-degree relatives affected by thyroid cancer are sporadic. Though the presence of only two first-degree affected members could represent a sporadic disease, the diagnosis of FNMTC should be considered and the age of onset and evidence of genetic anticipation are important parameters to account (2, 9, 10, 11, 14). Nevertheless, when three or more first-degree relatives are affected by thyroid cancer, the risk of including sporadic cases into genetic evaluation of FNMTC is less than 6% (9, 13, 14). The analysis of different kindreds with FNMTC advocates for an autosomal dominant inheritance with incomplete penetrance and variable expressivity (1, 4, 5, 6, 7, 8). In the past years, several studies reported FNMTC-associated chromosomal loci (3, 15, 16, 17, 18) and predisposing risk variants in over 100 genes, including SRGAP1, CHEK2, SRRM2, TIFF-1/NKX2, FOXE1, NOP53, HABP2, ANO7, CAV2, KANK1, PIK3CB, PKD1L1, PTPRF, BROX, RHBDD2, ATM, MAP2K5, EWSR1, POT1, TIAM1and SPRY4 (1, 3, 7, 12, 15, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37). Recently, using whole-genomic sequencing, a group has identified variants in genes that were enriched in tumourigenic signalling pathways such as MAPK/ERK and PI3K/AKT in families with NMTC (19). As FNMTC has shown a high genetic heterogeneity, the identification of the susceptibility genes or chromosomal loci for most of FNMTC cases remains to be described. Additionally, as many of these loci or susceptibility genes were not further validated in subsequent studies, genetic testing is still not recommended for diagnosis, therapeutic and follow-up decisions. However, the detection of predisposing genes or susceptibility loci is highly important, since they will undoubtedly help to identify those at high risk for FNMTC, establish management strategies and assist genetic counselling. Here we report a family with four patients affected by PTC in three generations and evidence of genetic anticipation, suggesting a true non-syndromic FNMTC. Whole-exome sequencing (WES) was applied to identify a germline variant associated with an increased PTC predisposition in this family. We believe that our findings will add to FNMTC diagnosis, improving cancer prevention and genetic counselling protocols.

Materials and methods

Family history

We here report a five-generation Brazilian family with four family members affected by PTC in three consecutive generations in the absence of other extra-thyroidal clinical manifestations, suggesting a true non-syndromic FNMTC (Fig. 1). The family, from Natal, Rio Grande do Norte state, Brazil, was refereed to our institution for genetic investigation. Family history and blood samples were obtained from consenting affected and unaffected individuals. The pathology reports confirmed the diagnosis of PTC. The proband (III-2) was diagnosed with PTC at the age of 51. The proband’s mother (II-10) and proband’s uncle (II-7) were diagnosed with PTC at the age of 75 and 76, respectively. Proband’s son (IV-2) was diagnosed with PTC at the age of 29. Therefore, earlier age at onset of disease was observed in three successive generations, suggesting genetic anticipation. Controls, healthy individuals or individuals with PTC living in the same geographic region were included in the study. Healthy individuals were selected based on clinical and standard laboratory tests. The study was approved by the ethical and research committees of the Norte-Rio-Grandense League against Cancer (approval number 860.934) with written informed consent obtained from each participant and/or their legal representative.
Figure 1

Pedigree chart of the five-generation Brazilian family with a positive history of papillary thyroid carcinoma (PTC). Circle represents female family members, square male members, black-shaded symbols refer to individual affected by PTC and grey-shaded symbols individuals with a benign thyroid nodule. The black arrow indicates the proband. Slashes indicate deceased members. Numbers below symbols indicate age at disease onset.

Pedigree chart of the five-generation Brazilian family with a positive history of papillary thyroid carcinoma (PTC). Circle represents female family members, square male members, black-shaded symbols refer to individual affected by PTC and grey-shaded symbols individuals with a benign thyroid nodule. The black arrow indicates the proband. Slashes indicate deceased members. Numbers below symbols indicate age at disease onset.

DNA isolation

Blood samples were collected from affected and unaffected family members and controls (12 healthy individuals and 33 patients affected by PTC from the same geographic region). Genomic DNA was isolated from peripheral blood using a standard phenol/chloroform method, as described previously (38). The DNA concentration was measured using a Qubit 2.0 fluorometer (Life Technologies Corp.).

Whole-exome sequencing

In order to identify genes associated with non-syndromic FNMTC, DNA samples were isolated from four family members affected by PTC (II-7, II-10, III-2 and IV-2), one family member diagnosed with a thyroid benign disease (IV-3) and one unaffected member (III-1). DNA samples from two unaffected family members (III-3 and V-1) were not available for WES test. Whole exome libraries were prepared using SureSelectXT Human all exons V6 capture reagent, according manufactures’ recommendations (Agilent Technologies Inc.). Libraries were sequenced on an Illumina NextSeq550 system using NextSeq 500 High output Kit (300 cycles) (Illumina Inc., San Diego, CA, USA), targeting a mean coverage of 300×. Library preparation and sequencing were performed at the Centro de Oncologia Molecular, Hospital Sírio-libanês, São Paulo, Brazil (Fig. 2).
Figure 2

Flow chart showing the strategy and pipeline used in this study.

Flow chart showing the strategy and pipeline used in this study.

Next-generation sequencing analysis

Raw reads were initially processed for quality checking and trimming using FastQC (Babraham Institute, Cambridge) and Trimmomatic, respectively. The reads were aligned to the human reference genome (GRCh37/hg19) using Burrows–Wheeler Aligner (BWA-MEM; default parameters). Duplicates were marked by Picard (v.2.17.6; default parameters). Single nucleotides variants (SNVs) and insertions or deletions (InDels) were called using Genome Analyses Toolkit (GATK) HaplotyperCaller (version v.3.7.0; Broad Institute, Boston, MA, USA) (Fig. 2).

Visual inspection of previously described FNMTC-associated variants

Over the past decade many studies have identified candidate genes related to non-syndrome FNMTC. As a first step, PubMed database (‘familial non-medullary thyroid cancer’, ‘FNMTC and predisposing genes’, ‘hereditary and familial thyroid tumors’) was used to create a comprehensive list of FNMTC predisposing genes. Next, variant call format (VCF) files were manually inspected to identify variants in genes known to be associated with FNMTC (Fig. 2).

Variant annotation and filtering

Analysis of VCF files was performed using PhenoDB analysis tool (https://phenodb.org/) (39). PhenoDB was used to analyse frame shift, nonsense or missense variants that were shared between the four affected family members based on dominant or recessive modes of inheritance. Variants were filtered out according to read depth (<10 reads), genotype quality (<20) and frequency in the Genome Aggregation Database (gnomAD) and Online Archive of Brazilian Mutations (ABraOM). The gnomAD (https://gnomad.broadinstitute.org) spans data from 125,748 exomes and 15,708 genomes from healthy unrelated individuals from 60 populations. The ABraOM (http://abraom.ib.usp.br) database was also used to assess the frequency of the annotated variants. This database is composed of an elderly cohort of over 1500 Brazilian individuals aged 60 or older, including immigrant descendants from different continents and individuals from various Brazilian states (40). Only variants with MAF (minor allele frequency) <1% (gnomAD and ABraOM) were annotated (Fig. 2).

Sanger sequencing analysis

Sanger sequencing was used to validate the selected variant and to investigate its segregation in this family. Whole blood DNA from four family members affected by PTC (II-7, II-10, III-2 and IV-2) and three unaffected (III-1, IV-3 and V-1) were submitted to Sanger sequencing. Although the DNA of the 1-year-old child (V-1) was not available for WES test, her DNA was available for the validation analysis. The selected variant was also investigated in 12 healthy individuals and 33 patients affected by PTC from the same geographic region. Primers were designed to amplify the target region of NID1 gene using Primer3 v.0.4.0 (http://primer3.ut.ee) (F: CAATGTCAAAGAGTGGGGTTT; R: GCAGCTCTCGGTGGACAG). Amplified PCR product was purified and sequenced using the BigDye Terminator (Life Technologies) in the ABI 3100 Genetic Analyzer (Applied Biosystems).

In silicoanalysis

The effect of the amino acid substitution on NID1 protein structure and function was performed using various available plugins via the Ensembl Variant Effect Predictor interface (ensembl.org/info/docs/tools/vep) such as Sorting Intolerant From Tolerant (SIFT, http://sift.jcvi.org), Polymorphism Phenotyping v2 (Polyphen-2, http://genetics.bwh.harvard.edu/pph2), Loss-of-function (LoFtool), Combined Annotation-Dependent Depletion (CADD, https://cadd.gs.washington.edu), phred score (which is normalized to all potential SNVs providing an externally comparable unit for analysis) and Consensus Deleteriousness (Condel) scores. The online software HOPE (http://www.cmbi.ru.nl/hope/method) was used to analyse the structural effect of the amino acid change in the protein. Psipred (http://bioinf.cs.ucl.ac.uk/psipred) was used to predict amino acid changes in the protein secondary structure. Uniprot database (https://www.uniprot.org/) and Jalview software (version 2.11.1.3) were used to analyse the evolutionary conservation of amino acids by calculating the conservation score. This software uses a calculation based on AMAS (analysis of multiply aligned sequences) to determine the amino acid conservation score. To obtain the value, ten amino acid properties were evaluated to create an index that ranges from 1 to 11 (hydrophobic, polar, small, proline, tiny, aliphatic, aromatic, positive, negative and charged). A maximum score (score 11) is represented by ‘*’; if the entire properties match but the amino acid is different across species, the signal ‘+’ is used.

Immunohistochemistry analysis of available PTC samples

Formalin-fixed, paraffin embedded (FFPE) thyroid samples from two family members affected by PTC (II-7 and IV-2) were available for immunohistochemistry (IHC) analysis. Tissue slides (4 µm) were deparaffinized with xylene and hydrated through a graded alcohol series. Endogenous peroxidase activity was blocked by placing slides in 15% hydrogen peroxide solution for 10 min. Antigen retrieval was performed in Tris/EDTA buffer pH 9.0 (Envision Flex Target Retrieval Solution, Dako Agilent) in a steamer for 20 min. After cooling to room temperature for 1 h, nonspecific binding sites were blocked with 5% BSA in PBS. The slides were washed twice with 1× PBS and incubated with mouse MAB against NID1 (R&D Systems, Minneapolis, MN, USA; Cat no. MAB2570). The samples were incubated with NID1 antibody (1:50) overnight at 4°C, followed by incubation with secondary antibody-polymer horseradish peroxidase (HRP) for 60 min (VisUCyteTMMouse, Cat no. VCTS001). Slides were stained with DAB chromogen solution (VisUCyteTMMouse). The nuclei was counterstained with haematoxylin and analysed using a light microscope.

NID1 analysis in sporadic cancers

The prevalence of the selected variant was assessed in nearly 500 sporadic cases of PTC from thyroid cancer database of The Cancer Genome Atlas (TCGA; https://www.cancer.gov/about-nci/organization/ccg/research/structural-genomics/tcga). Additionally, the incidence of the selected variant was also assessed in 1,500,000 human cancers present in the Catalogue of Somatic Mutations in Cancer database (COSMIC v92; https://cancer.sanger.ac.uk/cosmic) and in the Integrative OncoGenomics analysis (intOGen; https://www.intogen.org/search). The intOGen evaluated the mutational status in 28,076 samples of 66 cancer types, identified cancer genes and pinpointed their putative mechanism of action across cancer types. The IntOGen comprised 684 thyroid cancers from four different cohorts, including 488 PTC cases from TCGA cohort. The expression of NID1 was also evaluated in PTCs from TCGA cohort, and its expression was correlated with available clinical and pathological data. DESeq2 was used to test for differential expression (41). For statistical analysis we used Kruskal–Wallis test followed by Dunn post hoc test.

Results

Screening of variants in candidate predisposing genes for non-syndromic FNMTC

To identify genetic variants associated with FNMTC, WES analysis was performed in six members (four affected and two unaffected) from a five-generation Brazilian family with PTC and a Mendelian-like inheritance (Fig. 1). We obtained on average 57 million reads per sample (49–59 million reads). After filtering, the reads were mapped to the human reference genome (hg19) and variant calling was performed using GATK programme (Fig. 2). As first step, we mined the literature for candidate non-syndromic FNMTC susceptibility genes. In total, we identified 114 candidate genes across 32 independent studies (15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 30, 31, 33, 34, 35, 36, 37, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60). A list of included genes and studies is provided in Supplementary data (Supplementary Table 1, see section on supplementary materials given at the end of this article). Next, WES data from affected and unaffected family members were manually inspected for variants in all candidate genes. No germline variant in the protein-coding regions of 114 pre-selected FNMTC susceptibility genes was found to co-segregate with the disease phenotype in this family, suggesting that a novel candidate variant could be implicated in the genetic predisposition to FNMTC.

The p.Ile657Met NID1 variant co-segregates with the phenotype in a five-generation family

Novel susceptibility variants for FNMTC were inspected using PhenoDB, a robust database that enables efficient and adjustable sequence analysis coupled to the phenotypic and other clinical information. We identified 15 missense variants located in 13 genes in all affected family members (Table 1). To identify if any biological pathway is enriched among the 13 candidate genes, the list of genes was used as input for enrichment (maayanlab.cloud/Enrichr). Enrichment analysis identified that three genes (ZNF717, ZNF578 and ZNF667) were mapped to Herpes simplex virus 1 infection KEGG pathway (P-adj < 0.05).
Table 1

Candidate susceptibility variants identified in this study.

Gene namePositionNucleotide changeAminoacid changedbSNPMutation classificationSIFTPolyPhen2LoFtoolCADD phredCondel
CAPN9chr1:230907829G>Ap.G224Rrs143186556Nonsynonymous SNVDeleterious (0)Probably damaging (1)Probably damaging (0.126)29.7Deleterious (0.945)
NID1chr1:236189209T>Gp.I657Mrs199655594Nonsynonymous SNVDeleterious (0.02)Probably damaging (0.98)Probably damaging (0.0768)12.35Deleterious (0.807)
TRANK1chr3:36887865T>Cp.T1645A-Nonsynonymous SNVDeleterious (0)Probably damaging (0.995)-25.0Deleterious (0.902)
POMGNT2chr3:43121810C>Tp.D372Nrs147129872Nonsynonymous SNVTolerated (0.39)Benign (0.159)-23.2-
ZNF717chr3:75786355T>Ap.T807Srs151138057Nonsynonymous SNVTolerated (0.09)Benign (0.214)-17.65-
ZNF717chr3:75790516T>Ap.H63Lrs201105907Nonsynonymous SNVTolerated (0.67)Benign (0.121)-5.565-
DGKQchr4:961091C>Tp.R349Qrs183334682Nonsynonymous SNVTolerated (0.58)Benign (0)Probably damaging (0.367)4.343-
MUC5Bchr11:1260217C>Ap.D1138Ers79638064Nonsynonymous SNVTolerated (0.13)Benign (0.229)Probably damaging (0.243)2.651-
MUC5Bchr11:1260219C>Tp.A1139Vrs75935363Nonsynonymous SNVDeleterious (0.05)Probably damaging (0.93)Probably damaging (0.243)17.78Deleterious (0.722)
CACNA1Hchr16:1265318G>Ap.A1700Trs761499667Nonsynonymous SNVDeleterious (0.03)Probably damaging (0.934)Probably damaging (0.0051)23.3Deleterious (0.744)
C16orf96chr16:4644482G>Ap.G1000Rrs142136567Nonsynonymous SNVDeleterious (0.03)Possibly damaging (0.851)-25.0Deleterious (0.692)
IL4I1chr19:50399161G>Ap.R55Wrs140225154Nonsynonymous SNVTolerated (0.05)Possibly damaging (0.727)Probably damaging (0.119)23.4Deleterious (0.613)
ZNF578chr19:53015322G>Ap.G563Ers566285018Nonsynonymous SNVDeleterious (0.03)Possibly damaging (0.627)Benign (0.881)21.2Deleterious (0.583)
ZNF667chr19:56953511T>Cp.K285Ers61740676Nonsynonymous SNVTolerated (0.79)Benign (0)Benign (0.804)0.009-
CTDSP2chr12:58217744>TCp.R211fs-Frameshift insertion-----

SIFT score ranges from 0.0 (deleterious) to 1.0 (tolerated). Variants with score from 0.0 to 0.05 are considered deleterious. PolyPhen-2 score ranges from 0.0 (tolerated) to 1.0 (deleterious), variants with score from 0.85 to 1 are more confidently predicted to be damaging. LoFtool provides a rank of genic intolerance and consequent susceptibility to disease based on the ratio of loss-of-function; the lower the LoFtool gene score percentile the most intolerant is the gene to functional variation. CADD phred score of 20 means that a variant is amongs the top 1% of deleterious variants in the human genome. Condel calculates consensus deleteriousness score of missense mutations; variants with score above 0.49 are considered deleterious.

Candidate susceptibility variants identified in this study. SIFT score ranges from 0.0 (deleterious) to 1.0 (tolerated). Variants with score from 0.0 to 0.05 are considered deleterious. PolyPhen-2 score ranges from 0.0 (tolerated) to 1.0 (deleterious), variants with score from 0.85 to 1 are more confidently predicted to be damaging. LoFtool provides a rank of genic intolerance and consequent susceptibility to disease based on the ratio of loss-of-function; the lower the LoFtool gene score percentile the most intolerant is the gene to functional variation. CADD phred score of 20 means that a variant is amongs the top 1% of deleterious variants in the human genome. Condel calculates consensus deleteriousness score of missense mutations; variants with score above 0.49 are considered deleterious. However, 14 out of 15 missense variants were also reported in healthy controls from the same geographic region. The c.1971T>G heterozygous missense variant (NM_002508) in the exon 8 of the Nidogen-1 (NID1) gene, which resulted in an isoleucine (ATT) to methionine (ATG) substitution at residue 657, is the unique variant that co-segregated with PTC phenotype in this family (Fig. 3).
Figure 3

(A) The graphic shows the p.Ile675Met variant in the NID1 gene and its localization relative to the respective protein domains. (B) Sanger sequencing confirmed the c.1971T>G substitution highlighted in yellow (antisense strand), located within exon 8 of the NID1 gene in all individuals affected by PTC (II-7, II-10, III-2 and IV-2) and its absence in an unaffected family member (III-1) and in the patient with benign thyroid lesion (IV-3). The 1-year-old child (V-1) is a carrier for the c.1971T>G substitution. (C) Sanger sequencing analysis confirming the absence of NID1 c.1971T>G variant in unaffected individuals from the same geographic region.

(A) The graphic shows the p.Ile675Met variant in the NID1 gene and its localization relative to the respective protein domains. (B) Sanger sequencing confirmed the c.1971T>G substitution highlighted in yellow (antisense strand), located within exon 8 of the NID1 gene in all individuals affected by PTC (II-7, II-10, III-2 and IV-2) and its absence in an unaffected family member (III-1) and in the patient with benign thyroid lesion (IV-3). The 1-year-old child (V-1) is a carrier for the c.1971T>G substitution. (C) Sanger sequencing analysis confirming the absence of NID1 c.1971T>G variant in unaffected individuals from the same geographic region.

Sanger analysis confirmed the presence of NID1 variant in affected family members

Sanger analysis confirmed the presence of the p.Ile657Met NID1 variant in all family members affected with PTC, and it was absent in unaffected family members (III-1 and IV-3) (Fig. 3B). Validation analysis revealed that the 1-year-old child (V-1) was a carrier of the p.Ile657Met variant, while it was absent in healthy controls and individuals affected by PTC from the same geographic region (Fig. 3C). Longer follow-up will help determine the pathogenicity of the p.Ile657Met NID1 genetic variant in this family, particularly in the 1-year-old unaffected child who is a carrier of the NID1 variant.

Filtering with gnomAD and ABraOM population databases

The gnomAD data set (v2.1.1) showed that the NID1 p.Ile657Met variant is extremely rare as it was found at a MAF of 0.00007798. The gnomAD v3.1 (non-cancer) spans 74,023 whole-genome sequences from unrelated individuals from over 60 distinct populations, who were not ascertained for having cancer in a cancer study (Table 2). The p.Ile657Met variant was reported in gnomAD v3.1 (non-cancer) also at exceptionally low allele frequency (0.00009476). Although the gnomAD v3.1 covers the data from 1000 Genomes (1KGP), we examined it independently as it lists human genetic variations in 3202 normal, healthy individuals from 26 different populations from many different locations around the globe (61). The NID1 p.Ile657Met variant was not reported in 1KGP.
Table 2

The p.Ile657Met allele frequency in different populations.

ABraOM SABE WES (n  = 609)AbraOM SABE-WGS (n  = 1171)gnomAD (non-cancer) v3.1* (n  = 74,023)gnomAD v2.1.1** (n  = 141,456)1KGP (n  = 3202)
European (non-Finnish)NDND0.00016990.0001478ND
LatinoNDND0.00013270.00005645ND
South AsianNDND0.0000000.000032660.000
AfricanNDND0.000024360.0000.000
Ashkenazi JewishNDND0.0000.000ND
East AsianNDND0.0000.0000.000
European (Finnish)NDND0.0000.0000.000
Middle Eastern0.0000.000ND
Brazilian0.0008210.000427NDNDND
Male0.000055760.000091520.000
Female0.00013160.000061940.000
Allele frequency0.0008210.0004270.000094760.000077980.000
The p.Ile657Met allele frequency in different populations. We also assessed the ABraOM repository of variant frequencies, a census-based cohort of over 1500 elderly individuals from São Paulo, Brazil (40). Notably, Brazilian population is constituted by nearly 500 years of admixture between Africans, Europeans and native Americans (Table 2). The p.Ile657Met NID variant was found at very low prevalence (<0.01%) and, therefore, considered as a extremely rare variant in the Brazilian population.

Pathogenicity prediction scores

To predict the likely functional impact of the missense p.Ile657Met variation on NID1 protein stability, activity or folding, we used computational prediction methods (Table 1). Interestingly, most of the prediction scores presumed that the amino acid change, which is located at a well-conserved position at the NID1 protein, is deleterious (SIFT, Condel) or probably damaging (PolyPhen-2, LoFtoll to protein function (Table 1). The HOPE analysis showed that the original WT residue and newly introduced mutant residue often differ in size, charge and hydrophobicity value properties. Not only the methionine residue is bigger than the isoleucine but also has different properties (Fig. 4A and B). The residue is buried in the core of a domain within the protein, and the differences between the WT and mutant residue are likely to disturb the core structure of this domain and alter its function (Fig. 4C and D).
Figure 4

Modelling of NID1 variant on 3D structure of the protein. (A) The 3D folding shows the isoleucine to methionine change (magenta) at position 657 in the NID1 protein (grey). The mutant amino acid differs in its hydrophobic and charge proprieties. The residue in the reference protein (WT) is smaller than the mutant residue (http://www.cmbi.ru.nl/hope). (B) The protein structure (grey) was zoomed in to show the WT residue (green) and the mutant residue (red). (C and D) Secondary structure map of the WT and mutant NID1 protein, respectively. Magenta represents α-helix structures and yellow represents β-sheets. (E) Amino acid sequence alignment of NID1 with ten orthologous proteins. The conserved isoleucine residue at position 657 is highlighted. Conservation scores is 9, which means that amino acids in this positions match in nine of ten properties across species. A maximum score is represented by ‘*’, the signal ‘+’ indicates that the entire properties match but the amino acid is different across species. (F) Phylogenetic tree of 11 sequences of NID1 is depicted. This tree was constructed based on amino acids alignments. NID1 human protein share stronger sequence similarity with Pan troglodytes (UniProt accession H2Q1E0) and Macaca mulatta (UniProt accession F7DF16).

Modelling of NID1 variant on 3D structure of the protein. (A) The 3D folding shows the isoleucine to methionine change (magenta) at position 657 in the NID1 protein (grey). The mutant amino acid differs in its hydrophobic and charge proprieties. The residue in the reference protein (WT) is smaller than the mutant residue (http://www.cmbi.ru.nl/hope). (B) The protein structure (grey) was zoomed in to show the WT residue (green) and the mutant residue (red). (C and D) Secondary structure map of the WT and mutant NID1 protein, respectively. Magenta represents α-helix structures and yellow represents β-sheets. (E) Amino acid sequence alignment of NID1 with ten orthologous proteins. The conserved isoleucine residue at position 657 is highlighted. Conservation scores is 9, which means that amino acids in this positions match in nine of ten properties across species. A maximum score is represented by ‘*’, the signal ‘+’ indicates that the entire properties match but the amino acid is different across species. (F) Phylogenetic tree of 11 sequences of NID1 is depicted. This tree was constructed based on amino acids alignments. NID1 human protein share stronger sequence similarity with Pan troglodytes (UniProt accession H2Q1E0) and Macaca mulatta (UniProt accession F7DF16).

Evolutionary conservation analysis

The Uniprot database and Jalview software (version 2.11.1.3) were used to compare the NID1 residues conservation across species. Amino acid sequences of NID1 or the orthologous protein were obtained for the following species: Homo sapiens (UniProt accession P14543), Mus musculus (UniProt accession P10493), Caenorhabditis elegans (UniProt accession Q93791), Pan troglodytes (UniProt accession H2Q1E0), Macaca mulatta (UniProt accession F7DF16), Canis lupus familiaris (UniProt accession A0A5F4CIR1), Bos taurus (UniProt accession A6QNS6), Rattus norvegicus (UniProt accession F1LM84), Felis catus (UniProt accession M3WED3), Sus scorfa (UniProt accession F1RGY5) and Equus caballus (UniProt accession F6XH19). The conservation, quality and consensus tracks from Jalview were calculated for each column showed in the histogram. As observed for selected regions of the alignments, the annotation scores (ranging from 1 to 11) reflect conservation of the physicochemical properties in the particular column of the alignment. The absolutely conserved residues are marked with a yellow asterisk. Other amino acids are highlighted with different colours, according to the conservation scores in the consensus sequence. Less conserved positions are shown in darker colours with decreasing score. The residue I (Ile) at position 657 displayed a score of 9 across the different species, which means a conservation of nine among ten amino acid properties is considered to create this index (Fig. 4E). It is well known that residues more relevant for functioning of the protein are more evolutionary conserved (i.e. preserved). As the evolutionary conservation of the corresponding site correlates with the effect of mutation, this result suggested that the mutation found in this residue of NID1 protein likely impacts the function of the protein. Considering conservation and orthologue structure of NID1 protein across species, a phylogenetic tree was constructed using neighbour joining. This methodology implemented in jalview applies a greedy algorithm to find the tree with the shortest branch lengths. As expected, NID1 human protein share higher similarity with Pan troglodytes (UniProt accession H2Q1E0) and Macaca mulatta (UniProt accession F7DF16) (Fig. 4F).

NID1 p.Ile657Met variant was absent in sporadic human cancers

The analysis of the NID1 p.Ile657Met variant in nearly 500 sporadic PTC cases from TCGA and in nearly 1,500,000 human cancers present in COSMIC showed an absence of this variant. Although the p.Ile657Met variant was absent in sporadic PTC cases, we looked for variants within NID1 gene in the IntOGen mutation platform, which collect and analyse somatic mutations in thousands of tumour genome from different cancers subtypes, including COSMIC, TCGA and International Cancer genomes Consortiums. Although no somatic mutation was in found at codon 657 of NID1 gene, the analysis disclosed 495 coding variants throughout the NID1 gene in different cancer types. Most of them (65%) are missense mutations, including the p.Thr863Ala variant in one PTC from the TCGA cohort.

NID1 expression in PTC samples

Representative haematoxylin and eosin (H&E) stained sections used to confirm the diagnosis of PTC are showed (Fig. 5A and B). IHC analysis performed in the available PTC thyroid tissue samples showed NID1 staining (brown) in the tumour cells (TC) (Fig. 5C and D), while it was negative in adjacent normal thyroid (NT) tissue (Fig. 5C).
Figure 5

Haemotoxylin and eosin staining of two affected family members II-7 (A) and IV-2 (B) confirmed the diagnosis of papillary thyroid cancer. NID1 expression analysis in thyroid sections of two family members affected with PTC. Staining of tumour cells (TC) was observed in thyroid sections of cases II-7 (C) IV-2 (D), while adjacent thyroid normal (NT) cells showed negative staining (C).

Haemotoxylin and eosin staining of two affected family members II-7 (A) and IV-2 (B) confirmed the diagnosis of papillary thyroid cancer. NID1 expression analysis in thyroid sections of two family members affected with PTC. Staining of tumour cells (TC) was observed in thyroid sections of cases II-7 (C) IV-2 (D), while adjacent thyroid normal (NT) cells showed negative staining (C).

NID1 expression in PTC from TCGA cohort was associated with more aggressive phenotype

As NID1 was expressed in PTC samples from two affected members and it was absent in normal thyroid by IHC, we next investigated the mRNA expression of NID1 in PTC from TCGA cohort and correlated with available clinical and pathological parameters. NID1 expression was higher in PTC and paired lymph node metastases samples compared to its expression in normal thyroid tissues (P-value and P-adj. < 0.05) (Fig. 6A). Moreover, NID1 expression was higher in PTC samples from patients who showed microscopic (R1) and R2 (macroscopic) residual tumours than the expression observed in PTC samples from patients who showed no residual disease present or complete remission (R0) and patients in whom the presence of residual cancer could not be assessed (RX) (P-value < 0.05) (Fig. 6B). Therefore, higher expression of NID1 was found in PTC with higher likelihood of relapse after treatment. We subsequently correlated its expression with lymph node (N) stage. No significant difference was observed in the expression of NID1 in patients with no evidence of regional lymph node metastasis (N0) and those with lymph node disease (N1). When N1 group was further divided into N1a (central node metastasis only) and lateral neck lymph nodes (N1b), higher expression of NID1 was detected in patients in the N1b group than its expression in patients in the group N0 (P-value and P-adj. < 0.05) (Fig. 6C). No difference was observed in other available clinical pathological features.
Figure 6

Analysis of NID1 expression in PTC from TCGA cohort. (A) Analysis showed higher expression of NID1 in primary tumour and metastatic thyroid cancer samples when compared to normal thyroid tissues. (B) NID1 expression was higher in PTC from patients with microscopic residual tumour (R1) and macroscopic residual tumour (R2) than in PTC from patients with no residual disease (R0). (C) Increased expression of NID1 was observed in PTC from patients with N1b disease than in PCT from patients with no evidence of regional lymph node metastasis (N0). P-value < 0.05, **P-adj<0.005. **P-adj. < 0.005, ***P-adj.< 0.0005.

Analysis of NID1 expression in PTC from TCGA cohort. (A) Analysis showed higher expression of NID1 in primary tumour and metastatic thyroid cancer samples when compared to normal thyroid tissues. (B) NID1 expression was higher in PTC from patients with microscopic residual tumour (R1) and macroscopic residual tumour (R2) than in PTC from patients with no residual disease (R0). (C) Increased expression of NID1 was observed in PTC from patients with N1b disease than in PCT from patients with no evidence of regional lymph node metastasis (N0). P-value < 0.05, **P-adj<0.005. **P-adj. < 0.005, ***P-adj.< 0.0005. Remarkably, NID1 amplification was found in 4/496 (0,81%) PTC cases from the TCGA cohort. All together, these results not only supported our findings but also suggested that NID1 expression was associated with a more aggressive and diffuse PTC.

Discussion

Despite the advances in understanding the genetic basis of non-syndromic FNMTC, no susceptibility genes have been identified. Therefore, no genetic test is currently available to identify at-risk patients and the diagnosis is based on the family history. We herein report a five-generation Brazilian family with positive family history of PTC generations in the absence of other extra-thyroidal clinical manifestations, suggesting a true non-syndromic FNMTC. Regarding disease presentation, we observed the occurrence of the disease in three consecutive generations in both sexes, which is consistent with an autosomal dominant inheritance pattern. Although no significant difference was observed in the age of onset of the affected family member in the second generation (IV-2 and IV-3), individuals in third and fourth generations manifested the disease at an earlier age. The observed decreased age at onset in successive generations suggested genetic anticipation and reinforced the premise that is a true familial disease rather than fortuitous association. To identify the underlying genetic basis of the disease in this family, WES was performed. As part of the strategy, affected and unaffected family members were initially screened for variants in 114 genes previously described as associated with non-syndromic FNMTC. As variants in the candidate genes were excluded, we next explored the WES data for novel candidate genes for non-syndromic FNMTC. We identified a novel missense variant within NID1gene co-segregating with the disease in this family. Sanger sequencing confirmed that the NID1 p.Ile675Met variant co-segregated with the disease phenotype. Remarkably, the only unaffected carrier was the 1-year-old child. As penetrance is age dependent, longer follow-up is needed to better define the penetrance of PTC in this carrier. NID1 gene is located at 1q42.3 and encodes a member of the nidogen family of basement membrane glycoproteins. The protein interacts with several other components of basement membranes and may play a role in cell interactions with the extracellular matrix. Gene Ontology (GO) annotations related to this gene include calcium ion binding and laminin binding. Although several lines of evidence support that NID1 variant is a new candidate variant associated with FNMTC, this study has three main limitations. First, NID1 variant was not identified in other families with non-syndromic FNMTC. However, it is important to mention that using WES and comprehensive pathway enrichment analysis, NID1 was previously reported as one of the mutated genes associated with extracellular matrix organization pathway in a family with PTC, breast cancer and benign lesions in Brazil. The authors suggested that this gene might have pathogenic effects in the risk of familial PTC development (53). Secondly, although all affected member of this family underwent WES, we did not test some unaffected individuals, as DNA was not available. Thirdly, no functional analysis was performed for this specific variant found in exon 8 of NID1 gene. Although functional analysis would help to define the role of NID1 as predisposing gene for FNMTC, it is important to notice that though functional studies can help supporting pathogenicity, not all analysis are effective in predicting the impact of a variant in a protein function. In an attempt to overcome these limitations, several parameters commonly used to classify variants were considered such as MAF, in silico tools analysis for interpretation of pathogenicity of the variant (functional prediction, structural effect and evolutionary conservation analysis) and NID1 expression in thyroid tumour samples from affected members and in PTC samples from TCGA cohort. According to the parameters used to categorize variants by frequency, the p.Ile675Met NID1 variant is extremely rare in control populations from gnomAD data set (MAF < 0.0001) and from elderly individuals from São Paulo, Brazil (MAF < 0.001) and it is absent in the 1000 Genomes project. In silico analysis is a rapid and a reliable way to improve our understanding of how an amino acid substitution could affect the protein structure and function. Using different platforms, we attempted to cover the major aspects influencing protein function. The variant was classified by in silico tools most commonly used in clinical laboratories as probably pathogenic (PolyPhen-2 and LoFtool) or deleterious (SIFT and Condel) to the NID1 protein function or structure. Although the impact of an amino acid substitution on protein function differs across predictor methods, as they are based on different criteria such as the evolutionary conservation of the amino acid, the impact of the context and the biochemical consequence of the amino acid substitution were rather similar. Additionally, amino acid alignment confirmed that the residue Ile at position 675 of the NID1 protein is conserved across multiple species (score 9). Amino acid substitution at conserved positions more likely leads to deleterious effects on gene functions. In summary, these data combined suggest that the missense variant likely has an effect on protein function and it is potentially associated with phenotype in this family. Expression analysis performed in thyroid sections obtained from affected family members suggests that NID1 might play a role in the pathogenesis of PTC, as it is expressed in thyroid cancer cells from PTC samples but is not expressed in normal thyroid follicular cells. Moreover, the results of PTC from TCGA cohort showed that compared with normal thyroid tissues, the expression of NID1 was higher in cancer cells. Higher expression of NID1 was associated with higher likelihood of relapse after treatment and N1b disease, suggesting that NID1 expression is associated with a more aggressive and diffuse PTC. Data from literature also recognized the role of NID1 in the pathogenesis and progression of different cancer subtypes (62, 63, 64). As example, it was showed that high expression of NID1 significantly correlated with shorter overall survival of ovarian cancer patients. The authors demonstrated that the ectopic expression of NID1 in an adenocarcinoma cell line promoted cell migration, invasion and chemoresistance and that the underlying mechanism involves, at least in part, the activation of ERK/MAPK signalling pathway (62). In vitro analysis further revealed multiple prometastatic functions of NID1, including enhanced cell migration and invasion, and improved vascular tube formation capacity. Importantly, higher expression of NID1 was associated with an increased risk of developing distant metastasis and poor clinical outcomes. The authors suggested that NID1 is a prometastatic-secreted protein that may develop as a new biomarker for disease progression and therapeutic target in breast cancer and melanoma (65). Higher levels of NID1 expression was also correlated with the advanced stages of oral cavity squamous cell carcinoma (OSCC) and facilitated gastric cancer cell migration and invasion in vitro and poor survival outcome in gastric cancer patients, highlighting the role of NID1 in gastric cancer progression (66). Interestingly, NID1 signalling pathway was also associated with the pathogenesis of Hirschsprung disease (HSCR) (67). Recently, it was demonstrated that the loss of NDRG4 (N-MYC downstream-regulated gene 4) promotes colorectal cancer via increased release of NID1 and FBLN2 (68). Recently, the thyroglobulin interactome map showed that NID1 is one of the proteins that comprise the secretory proteostasis network associated with thyroglobulin mutation-specific changes that impair proper production, folding or processing leading to dyshormonogenesis (69). However, further analysis is needed to better understand the role of NID1 in thyroid cancer process. In conclusion, we report individuals with a family history of PTC and evidence of clinical anticipation in successive generations, suggesting a true non-syndromic FNMTC. Although the affected members were negative for variants in genes previously reported as associated with FNMTC, WES analysis identified a novel p.I657M variant within NID1 gene that segregates with the phenotype and is absent in controls from same geographic region. The p.Ile657Met allele frequency is extremely low in control populations, and multiple lines of computational evidence support deleterious/damaging effect of the amino acid substitution on NID1 protein. Although further analysis will help to better understand the role of this variant in the pathogenesis of FNMTC, NID1 variant fulfills several criteria that suggest it as a new candidate FNMTC predisposing gene.

Declaration of interest

The authors declare that there is no conflict of interest that could be perceived as prejudicing the impartiality of the research reported.

Funding

This work was supported by grants from the São Paulo Research Foundation (FAPESP 2014/06570-6 and 2018/23497-1). P A F G is funded by FAPESP (grant 2018/15579-8). J M C and P A F G are investigators of the Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) (grant 304534/2018-8 for J M C).
  68 in total

1.  Germline PARP4 mutations in patients with primary thyroid and breast cancers.

Authors:  Yuji Ikeda; Kazuma Kiyotani; Poh Yin Yew; Taigo Kato; Kenji Tamura; Kai Lee Yap; Sarah M Nielsen; Jessica L Mester; Charis Eng; Yusuke Nakamura; Raymon H Grogan
Journal:  Endocr Relat Cancer       Date:  2015-12-23       Impact factor: 5.678

2.  A genome-wide association study yields five novel thyroid cancer risk loci.

Authors:  Julius Gudmundsson; Gudmar Thorleifsson; Jon K Sigurdsson; Lilja Stefansdottir; Jon G Jonasson; Sigurjon A Gudjonsson; Daniel F Gudbjartsson; Gisli Masson; Hrefna Johannsdottir; Gisli H Halldorsson; Simon N Stacey; Hannes Helgason; Patrick Sulem; Leigha Senter; Huiling He; Sandya Liyanarachchi; Matthew D Ringel; Esperanza Aguillo; Angeles Panadero; Enrique Prats; Almudena Garcia-Castaño; Ana De Juan; Fernando Rivera; Li Xu; Lambertus A Kiemeney; Gudmundur I Eyjolfsson; Olof Sigurdardottir; Isleifur Olafsson; Hoskuldur Kristvinsson; Romana T Netea-Maier; Thorvaldur Jonsson; Jose I Mayordomo; Theo S Plantinga; Hannes Hjartarson; Jon Hrafnkelsson; Erich M Sturgis; Unnur Thorsteinsdottir; Thorunn Rafnar; Albert de la Chapelle; Kari Stefansson
Journal:  Nat Commun       Date:  2017-02-14       Impact factor: 14.919

3.  Absence of the MAP2K5 germline variants c.G961A and c.T1100C in a wide series of familial nonmedullary thyroid carcinoma Italian families.

Authors:  Valentina Cirello; Carla Colombo; Luca Persani; Laura Fugazzola
Journal:  Int J Cancer       Date:  2019-03-19       Impact factor: 7.396

4.  C14orf93 (RTFC) is identified as a novel susceptibility gene for familial nonmedullary thyroid cancer.

Authors:  Chang Liu; Yang Yu; Guangliang Yin; Junxia Zhang; Wei Wen; Xianhui Ruan; Dapeng Li; Shuang Zhang; Wenjun Cai; Ming Gao; Lingyi Chen
Journal:  Biochem Biophys Res Commun       Date:  2016-11-15       Impact factor: 3.575

5.  Results of Screening in Familial Non-Medullary Thyroid Cancer.

Authors:  Joanna Klubo-Gwiezdzinska; Lily Yang; Roxanne Merkel; Dhaval Patel; Naris Nilubol; Maria J Merino; Monica Skarulis; Samira M Sadowski; Electron Kebebew
Journal:  Thyroid       Date:  2017-07-19       Impact factor: 6.568

6.  Short telomeres, telomerase reverse transcriptase gene amplification, and increased telomerase activity in the blood of familial papillary thyroid cancer patients.

Authors:  Marco Capezzone; Silvia Cantara; Stefania Marchisotta; Sebastiano Filetti; Maria Margherita De Santi; Benedetta Rossi; Giuseppe Ronga; Cosimo Durante; Furio Pacini
Journal:  J Clin Endocrinol Metab       Date:  2008-07-29       Impact factor: 5.958

7.  Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2.

Authors:  Michael I Love; Wolfgang Huber; Simon Anders
Journal:  Genome Biol       Date:  2014       Impact factor: 13.583

8.  Identification of susceptibility gene mutations associated with the pathogenesis of familial nonmedullary thyroid cancer.

Authors:  Junwei Zhu; Kaile Wu; Zhangying Lin; Suwen Bai; Jing Wu; Peikun Li; Haowei Xue; Juan Du; Bing Shen; Huiyin Wang; Yehai Liu
Journal:  Mol Genet Genomic Med       Date:  2019-10-22       Impact factor: 2.183

9.  The variant rs1867277 in FOXE1 gene confers thyroid cancer susceptibility through the recruitment of USF1/USF2 transcription factors.

Authors:  Iñigo Landa; Sergio Ruiz-Llorente; Cristina Montero-Conde; Lucía Inglada-Pérez; Francesca Schiavi; Susanna Leskelä; Guillermo Pita; Roger Milne; Javier Maravall; Ignacio Ramos; Víctor Andía; Paloma Rodríguez-Poyo; Antonino Jara-Albarrán; Amparo Meoro; Cristina del Peso; Luis Arribas; Pedro Iglesias; Javier Caballero; Joaquín Serrano; Antonio Picó; Francisco Pomares; Gabriel Giménez; Pedro López-Mondéjar; Roberto Castello; Isabella Merante-Boschin; Maria-Rosa Pelizzo; Didac Mauricio; Giuseppe Opocher; Cristina Rodríguez-Antona; Anna González-Neira; Xavier Matías-Guiu; Pilar Santisteban; Mercedes Robledo
Journal:  PLoS Genet       Date:  2009-09-04       Impact factor: 5.917

10.  Whole Genome Sequencing Prioritizes CHEK2, EWSR1, and TIAM1 as Possible Predisposition Genes for Familial Non-Medullary Thyroid Cancer.

Authors:  Aayushi Srivastava; Sara Giangiobbe; Diamanto Skopelitou; Beiping Miao; Nagarajan Paramasivam; Chiara Diquigiovanni; Elena Bonora; Kari Hemminki; Asta Försti; Obul Reddy Bandapalli
Journal:  Front Endocrinol (Lausanne)       Date:  2021-02-22       Impact factor: 5.555

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.