Literature DB >> 30809968

Exome sequencing in 51 early onset non-familial CRC cases.

Jessada Thutkawkorapin1, Annika Lindblom2, Emma Tham2.   

Abstract

BACKGROUND: Colorectal cancer (CRC) cases with an age of onset <40 years suggests a germline genetic cause. In total, 51 simplex cases were included to test the hypothesis of CRC as a mendelian trait caused by either heterozygous autosomal dominant or bi-allelic autosomal recessive pathogenic variants.
METHODS: The cohort was whole exome sequenced (WES) at 100× coverage. Both a dominant- and recessive model were used for searching predisposing genetic factors. In addition, we assayed recessive variants of potential moderate risk that were enriched in our young-onset CRC cohort. Variants were filtered using a candidate cancer gene list or by selecting variants more likely to be pathogenic based on variant type (e.g., loss-of-function) or allele frequency.
RESULTS: We identified one pathogenic variant in PTEN in a patient subsequently confirmed to have a hereditary hamartoma tumor syndrome (Cowden syndrome) and one patient with a pathogenic heterozygous variant in PMS2 that was originally not identified by WES due to low quality reads resulting from pseudogenes. In addition, we identified three heterozygous candidate missense variants in known cancer susceptibility genes (BMPR1A, BRIP1, and SRC), three truncating variants in possibly novel cancer genes (CLSPN, SEC24B, SSH2) and four candidate missense variants in ACACA, NR2C2, INPP4A, and DIDO1. We also identify five possible autosomal recessive candidate genes: ATP10B, PKHD1, UGGT2, MYH13, TFF3.
CONCLUSION: Two clear pathogenic variants were identified in patients that had not been identified clinically. Thus, the chance of detecting a hereditary cancer syndrome in patients with CRC at young age but without family history is 2/51 (4%) and therefore the clinical benefit of genetic testing in this patient group is low. Of note, using stringent filtering, we have identified a total of ten candidate heterozygous variants and five possibly biallelic autosomal recessive candidate genes that warrant further study.
© 2019 The Authors. Molecular Genetics & Genomic Medicine published by Wiley Periodicals, Inc.

Entities:  

Keywords:  colorectal cancer; early onset; exome sequencing; non-familial

Mesh:

Year:  2019        PMID: 30809968      PMCID: PMC6503031          DOI: 10.1002/mgg3.605

Source DB:  PubMed          Journal:  Mol Genet Genomic Med        ISSN: 2324-9269            Impact factor:   2.183


INTRODUCTION

Colorectal cancer (CRC) is the third most common cancer diagnosed in Sweden (“Cancer fact sheet. World Health Organization”, 2010). In Sweden, familial CRC represents approximately 13% of all CRC cases (Frank, Sundquist, Yu, Hemminki, & Hemminki, 2017), while the cases with age of onset <40 years account for 1.4% (“Statistics on Cancer Incidence in Sweden 2016,” 2017). Known CRC susceptibility genes are responsible for 3.5% of all cases (Aaltonen, Johns, Jarvinen, Mecklin, & Houlston, 2007). In a study with over 10,000 CRC cases, Lynch syndrome was responsible for 3.1% overall (Moreira et al., 2012) and in 11/140 (8%) of CRC before 50 years (Giraldez et al., 2010). The same study detected 4/140 (3%) with bi‐allelic MUTYH mutations. Factors other than monogenic variants that contribute to cancer development are somatic genetic and epigenetic aberrations, combinations of low‐ and moderate‐risk genetic variation, individual life style (Lichtenstein et al., 2000), chronic bowel inflammation (Beaugerie & Itzkowitz, 2015), and environmental factors (Bodmer & Bonilla, 2008). However, CRC development before the age of 40 years and without a family history of CRC is unusual, suggesting either a de novo monogenic factor in a high‐risk autosomal dominant cancer gene, an autosomal dominant syndrome with decreased penetrance or a high‐risk autosomal recessive gene, or complex disease. Here, we used exome sequencing of 51 simplex CRC cases with age of onset <40 years in order to search for novel monogenic cancer genes that cause a rare autosomal dominant or autosomal recessive colorectal cancer syndrome. In addition, we looked for moderate recessive risk variants that were enriched in our young‐onset CRC cohort.

MATERIALS AND METHODS

Ethical compliance

All patients gave written informed consent in accordance with Swedish legislation (2003:460) and the study was approved by the Regional Research Ethics Committee, Dnr: 2002‐20489, 2008/125‐2031/2, and 2014/1326‐32.

Early‐onset CRC cohort

CRC patients with age of onset before 40 years and without a family history of CRC were recruited through the Department of Clinical Genetics, Karolinska University Hospital Solna (Sweden) (n = 17) or recruited in a nationwide study, the Swedish Low‐risk Colorectal Cancer Study (n = 34). The latter consisted of more than 3,300 consecutive patients operated on for CRC in 14 hospitals in and around Stockholm and Uppsala between 2004 and 2009. All patients gave written informed consent to participate in the study and DNA was isolated from peripheral blood. The cancer diagnosis and histopathology were verified in all cases through medical records (Ghazi, 2010). For the present study, all simplex CRC cases diagnosed before 40 years were selected.

Familial Breast cancer (BRC) cohort used as a comparison group in autosomal recessive gene analysis

The familial BRC cohort was defined as individuals from families where at least two first or second‐degree relatives were affected with BRC. The families were recruited through the Department of Clinical Genetics, Karolinska University Hospital Solna (Sweden).

Whole exome sequencing of early onset CRC and familial BRC samples

DNA was quantified using a Qubit Fluorometer (Life Technologies, USA). Sequencing libraries were prepared according to the TruSeq DNA Sample Preparation Kit EUC 15005180 or EUC 15026489 (Illumina, USA). Briefly, 1–1.5 μg of genomic DNA was fragmented using the Covaris 400 bp protocol (Covaris, Inc., USA). After fragmentation, all samples were subjected to end‐repair, A‐tailing, and adaptor ligation of Illumina Multiplexing PE adaptors. An additional gel‐based size selection step was performed and the adapter‐ligated fragments were subsequently enriched by PCR followed by purification using Agencourt AMPure Beads (Beckman Coulter, Sweden). Exome capture was performed by pre‐pooling equimolar amounts and performing enrichment in 5‐ or 6‐plex reactions according to the TruSeq Exome Enrichment Kit Protocol (EUC 15013230). Library size was checked on a Bioanalyzer High Sensitivity DNA chip (Agilent Technologies, Sweden) while concentration was calculated by quantitative PCR. The pooled DNA libraries were clustered on a cBot instrument (Illumina) using the TruSeq PE Cluster Kit v3. Paired‐end sequencing was performed for 100 cycles using a HiSeq 2000 instrument (Illumina) with TruSeq SBS Chemistry v3, according to the manufacturer's protocol. Base calling was performed with RTA (1.12.4.2 or 1.13.48) and the resulting BCL files were filtered, de‐multiplexed, and converted to FASTQ format using CASAVA 1.7 or 1.8 (Illumina). The sequencing was performed at an average coverage of 100x.

Bioinformatics workflow

Sequencing reads were aligned to the reference genome GRCh37 using BWA (H. Li & Durbin, 2010). Aligned reads were sorted and PCR‐duplicated reads were removed using Picard (http://broadinstitute.github.io/picard/). The calculation of mapping and enrichment statistics were done with Picard and GATK (McKenna et al., 2010). Variants were called using GATK by following the best practice procedure implemented at the Broad Institute (DePristo et al., 2011). Variant Quality Score Recalibration from GATK was used for quality control of the variants. Variant annotation was done by ANNOVAR (Wang, Li, & Hakonarson, 2010). The annotated information includes RefSeq gene annotation (O'Leary et al., 2016), dbSNP rs number (Sherry et al., 2001), COSMIC (Forbes et al., 2017), ClinVar (Landrum et al., 2018), SPIDEX (Xiong et al., 2015), ExAC conservative constraint (Lek et al., 2016), UniProt (Chen, Huang, & Wu, 2017). Background allele frequencies are from SweGen (Ameur et al., 2017), ExAC (Lek et al., 2016), gnomAD (Lek et al., 2016), and 1000 Genomes Project allele frequencies (1000 Genomes Project Consortium et al., 2012). In silico predictors used for predicting pathogenic effects include SIFT (Kumar, Henikoff, & Ng, 2009), PolyPhen2 (Adzhubei et al., 2010), Phylop (Cooper et al., 2005), LRT (Chun & Fay, 2009), Mutation Taster (Schwarz, Rodelsperger, Schuelke, & Seelow, 2010), Mutation Assessor (Reva, Antipin, & Sander, 2011), FATHMM (Shihab et al., 2015), GERP++ (Davydov et al., 2010), and CADD (Kircher et al., 2014).

Allele frequency database

In this study, we have used maximum minor allele frequency (MMAF) from 21 populations (SweGen (Ameur et al., 2017), 1000G (The 1000 Genomes Project Consortium, 2012), ExAC (Lek et al., 2016), gnomAD (Lek et al., 2016), 200Danes (Y. Li et al., 2010), and 249Swedes (http://neotek.scilifelab.se/hbvdb/)) to identify common variants.

Analysis workflows

Rare autosomal dominant and autosomal recessive analysis in cancer susceptibility gene list

To search for causative mutations in autosomal dominant‐ or autosomal recessive cancer susceptibility genes, we first used an in silico cancer gene list modified from (Vogelstein et al., 2013) in the analysis of 51 early‐onset CRC samples. The gene list contains 244 known cancer‐related genes (known somatic cancer driver genes as well as hereditary cancer genes). The candidate variants were primarily selected by filtering splicing and non‐silent variants with MMAF <1% in autosomal‐recessive cancer genes and <0.1% in autosomal‐dominant cancer genes. The variants with MMAF higher than the prevalence of cancer syndrome suggested by the gene were manually excluded. The resulting candidate variants were classified as of uncertain significance, likely pathogenic, or pathogenic using American College of Medical Genetics and Genomics and the Association for Molecular Pathology (ACMG‐AMP) guidelines criteria (Richards et al., 2015) (Figure 1). The candidate variants were then confirmed by Sanger sequencing. Manual analysis of poor quality PMS2 variants using the sequencing file (BAM) was performed.
Figure 1

Autosomal dominant and autosomal recessive analysis in cancer susceptibility gene list

Autosomal dominant and autosomal recessive analysis in cancer susceptibility gene list

Frameshift‐, nonsense‐, and splice variants analysis

We selected all frameshift‐, nonsense‐, and splice variants in the exome which have MMAF < 0.1% (for heterozygous variants) and <1% (for recessive variants) (Figure 2). The candidate variants were then confirmed by Sanger sequencing.
Figure 2

Frameshift‐, nonsense‐, and splice variants analysis

Frameshift‐, nonsense‐, and splice variants analysis

Analysis of missense variants

In this analysis, we selected all missense variants in the exomes of the 51 CRC patients with MMAF < 0.1% in all public databases. Then, we grouped the variants using CADD score (Kircher et al., 2014) (“more than 20”, “more than 25”, and “more than 30”) and ExAC missense Z‐score (Lek et al., 2016) (“less than or equal to 3” and “more than 3”) (Figure 3).
Figure 3

Analysis of missense variants

Analysis of missense variants

Autosomal recessive genes analysis (rare monogenic and less common risk genes)

To pinpoint autosomal recessive genes predisposing to early‐onset colorectal cancer, we hypothesized that the patients inherited biallelic risk variants, one from each parent in a homozygous or compound heterozygous state. A gene was assumed to be associated with an increased risk of CRC if the number of cases with the possible biallelic variants was more prevalent than in a control population. Due to an absence of individual genotyping data of a normal population, we used a cohort of familial breast cancer processed in the same way as the CRC samples as a comparison group, assuming that the possible biallelic variants predisposing to early‐onset colorectal cancer are not associated with familial breast cancer. With such a hypothesis, we searched for possible biallelic variants in the exome in our 51 early‐onset CRC cases with 56 familial BRC as a comparison group. During the first step of filtering, genes with splicing and non‐silent variants, with a MMAF < 20% and predicted to be pathogenic by more than 4 out of 9 in silico effect predictors (SIFT (Kumar et al., 2009), PolyPhen2 HDIV (Adzhubei et al., 2010), PolyPhen2 HVAR (Adzhubei et al., 2010), Phylop (Cooper et al., 2005), LRT (Chun & Fay, 2009), Mutation Taster (Schwarz et al., 2010), Mutation Assessor (Reva et al., 2011), FATHMM (Shihab et al., 2015), GERP++ (Davydov et al., 2010)), were selected. The next filtering step was to select genes with possible biallelic variants (possible compound heterozygous or homozygous) in at least two CRC cases and where no possible compound or homozygous case was found among the familial breast cancer cohort. If any two variants always had a similar MAF among population allele frequency databases or the variants showed up together in multiple samples, the variants were considered likely to be on the same allele and were excluded (Figure 4). The variants were prioritized based on MMAF and the observed frequency in CRC compared to the statistical likelihood of occurring together (computed by multiplying the MMAF of the two alleles).
Figure 4

Autosomal recessive genes analysis

Autosomal recessive genes analysis

Sanger sequencing

Primer pairs were designed using Primer3Plus (http://www.bioinformatics.nl/primer3plus) to amplify the candidate pathogenic variants suggested by exome sequencing data. PCR was performed in 25 μl reaction volumes containing AmpliTaq Gold (Life Technologies, Carlsbad, CA), 1.25 μg DNA and 10 μmol l−1 of primers. Excess single‐stranded DNA, primers, and dNTPs were removed using ExoSAP. The sequences were visualized and analyzed using CodonCode Aligner (http://www.codoncode.com).

Database submission of novel variants

Relevant variant information has been uploaded to the Leiden Open Variation Database (LOVD), http://www.lovd.nl/3.0

RESULTS

We first searched for variants in known cancer susceptibility genes (Figure 1), resulting in eight heterozygous variants in eight genes (Table 1, Figure 1). Among them, the variant in PTEN, p.Ser59Ter, has been reported to be pathogenic in ClinVar (Landrum et al., 2018) and had been reported as a somatic mutation 15 times in the TCGA dataset in various tumor types in cBioportal (Cerami et al., 2012; Gao et al., 2013). The patient was subsequently examined at the Department of Clinical Genetics at an age of 50 years and found to have macrocephaly (OFC: 65 cm), papillomatous lesions on hands and oral mucosa, intellectual disability (IQ = 84), and hamartomatous intestinal polyps. He was sent to surveillance of the thyroid according to clinical guidelines and multinodular goiter was detected. In all, these findings confirmed the clinical diagnosis of Cowden syndrome/Hamartoma tumor syndrome (MIM #601728). His parents were both 23 years of age when he was born and neither had had any cancer. Genetic counseling was offered to his family members. Unfortunately, no family members have contacted us and we have therefore not been able to perform genetic testing of the parents/siblings in order to confirm a de novo or familial variant. There were six candidate missense variants in the following cancer genes: MSH2, APC, PTPN12, BMPR1A, POLE, and SRC, and one inframe duplication in BRIP1. All were classified as variants of unknown significance according to ACMG criteria (Table 1). All of these genes are expressed in colon tissue (www.genecards.org), but none of these specific variants have been reported as somatic variants in 3,473 colorectal cancer samples or in any other tumor types in cBioPortal. After our study, immunohistochemistry of mismatch repair genes was performed as part of another research project on one patient with CRC at 37 years. His tumor showed loss of PMS2 protein. Targeted sequencing of PMS2 (based on a clinical nested PCR approach) detected a NM_000535.5:c.2113G>A, p.Glu705Lys variant that is known to be pathogenic and has been reported as causative in many families, even though the INSIGHT expert panel have interpreted it as a variant of unknown significance (ClinVar). Thereafter, we manually checked the sequencing files of the other patients for poor quality PMS2 variants and no other possible pathogenic variant in PMS2 was found. No homozygous/compound heterozygous variants were found among the 244 cancer genes.
Table 1

Candidate variants from cancer‐gene‐list study

Gene nameProtein changecDNA changeProtein domaina MMAFb ClinVarCADDc Samples
PTEN NP_000305.3:p.Ser59Ter NM_000314.3:c.176C>A Phosphatase tensin‐type NA Pathogenic50 48‐07LF
APCNP_000029.4:p.Asn944AspNM_000038.4:c.2830A>GNo special domain0.05%VUS10.93329‐09F
MSH2NP_000242.2:p.Val722PheNM_000251.2:c.2164G>TNo special domain0.02%VUS341068‐05o
PTPN12NP_002826.6:p.Asp26GluNM_002835.6:c.78C>GNo special domain0.01%NA11.9323‐09F
POLENP_006222.5:p.Pro13SerNM_006231.5:c.37C>TNo special domain0.01%NA10.64Co‐121
BMPR1ANP_004320.1:p.Ala293ValNM_004329.3:c.878C>TCytoplasmic and protein kinase domainNANA27.2Co‐286
BRIP1NP_114432.2:p.Asn1161_Asp1163dupNM_032043.2:c.3481_3489dupNo special domainNANANA4283‐13D
SRCNP_938033.4:p.Gln112HisNM_198291.4:c.336G>CSH3 domainNANA12.2259‐04o

VUS, variant of unknown significance.

Bold font indicates a known pathogenic variant based on it being a nonsense mutation, not previously reported in healthy individuals, but reported in patients with Cowden syndrome.

UniProt protein domain.

Maximum minor allele frequency from ExAC ALL, ExAC NFE, ExAC AFR, ExAC AMR, ExAC EAS, ExAC FIN, ExAC SAS, ExAC OTH, gnomAD ALL, gnomAD AFR, gnomAD AMR, gnomAD ASJ, gnomAD EAS, gnomAD FIN, gnomAD NFE, gnomAD OTH, 1000G ALL, 1000G EUR, SweGen 249DANES, 249SWEDES.

CADD score.

Candidate variants from cancer‐gene‐list study VUS, variant of unknown significance. Bold font indicates a known pathogenic variant based on it being a nonsense mutation, not previously reported in healthy individuals, but reported in patients with Cowden syndrome. UniProt protein domain. Maximum minor allele frequency from ExAC ALL, ExAC NFE, ExAC AFR, ExAC AMR, ExAC EAS, ExAC FIN, ExAC SAS, ExAC OTH, gnomAD ALL, gnomAD AFR, gnomAD AMR, gnomAD ASJ, gnomAD EAS, gnomAD FIN, gnomAD NFE, gnomAD OTH, 1000G ALL, 1000G EUR, SweGen 249DANES, 249SWEDES. CADD score. In addition to the cancer gene list, we also performed an analysis on frameshift‐, nonsense‐, and splice variants in the exome (Figure 2). After Sanger verification, apart from the variant in PTEN, there were 10 possible candidate variants from 10 genes (CLSPN, CELSR2, ADAM17, BIRC6, SEC24B, RBM27, PPARGC1B, NCOA7, SSH2, and MYO9B) (1 frameshift deletion, 7 nonsense‐, and 2 splice variants) (Table 2). All these genes are expressed in the normal colon tissue (genecards). None of the variants were found in cBioPortal.
Table 2

Candidate truncating variants

Gene nameProtein changecDNA changeProtein domaina MMAFb Samples
PTEN NP_000305.3:p.Ser59Ter NM_000314.3:c.176C>A Phosphatase tensin‐typeNA48‐07LF
CLSPNNP_071394.5:p.Arg1040TerNM_022111.5:c.3118C>TNo special domainNA2750‐13D
CELSR2NP_001399.3:p.Asp843IlefsTer65NM_001408.3:c.2527delCadherin 9NA4270‐11D
ADAM17NP_003174.3:p.Gln471TerNM_003183.3:c.1411C>TPeptidase M12BNA760‐04o
BIRC6NANM_016252.2:c.12811‐1G>ANo special domain0.05%1307‐09D
SEC24BNP_006314.1:p.Arg307TerNM_006323.2:c.919C>TNo special domainNACo‐286
RBM27NANM_018989.1:c.590‐2A>CArg‐rich domain0.05%4735‐11D
PPARGC1BNP_573570.5:p.Trp328TerNM_133263.5:c.983G>ANo special domainNA294‐07LF
NCOA7NP_001186548.3:p.Arg913TerNM_001199619.3:c.2737C>TTLD domain0.02%664‐08F
SSH2NP_203747.1:p.Tyr1292TerNM_033389.1:c.3876C>GNo special domainNA49‐06o
MYO9BNP_004136.1:p.Cys1646TerNM_004145.1:c.4938C>APhorbol‐ester/DAG‐typeNACo‐489

All the variants in this table are heterozygous and were not reported in ClinVar. FIN, ExAC SAS, ExAC OTH, gnomAD ALL, gnomAD AFR, gnomAD AMR, gnomAD ASJ, gnomAD EAS, gnomAD FIN, gnomAD NFE, gnomAD OTH, 1000G ALL, 1000G EUR, SweGen 249DANES, 249SWEDES.

Bold font indicates a known pathogenic variant based on it being a nonsense mutation, not previously reported in healthy individuals, but reported in patients with Cowden syndrome.

UniProt protein domain.

Maximum minor allele frequency from ExAC ALL, ExAC NFE, ExAC AFR, ExAC AMR, ExAC EAS, ExAC.

Candidate truncating variants All the variants in this table are heterozygous and were not reported in ClinVar. FIN, ExAC SAS, ExAC OTH, gnomAD ALL, gnomAD AFR, gnomAD AMR, gnomAD ASJ, gnomAD EAS, gnomAD FIN, gnomAD NFE, gnomAD OTH, 1000G ALL, 1000G EUR, SweGen 249DANES, 249SWEDES. Bold font indicates a known pathogenic variant based on it being a nonsense mutation, not previously reported in healthy individuals, but reported in patients with Cowden syndrome. UniProt protein domain. Maximum minor allele frequency from ExAC ALL, ExAC NFE, ExAC AFR, ExAC AMR, ExAC EAS, ExAC. In missense variants analysis (Figure 3), there were in total 3,800 missense variants with MMAF < 0.1% in all public databases. In order to select candidate variants, we filtered the variants with CADD score (Kircher et al., 2014) more than 20 (n = 2301) (Supporting Information, Table S1). Among them, there were 248 variants in genes in which the normal population has fewer missense variants than expected, ExAC missense Z‐score > 3 (Lek et al., 2016), (Supporting Information, Table S2), suggesting that missense variants might be detrimental to the normal gene function. If we instead categorize the 2,301 variants based on higher CADD score, 1,140 variants had a CADD score >25 and 367 variants had a CADD score of more than 30 (Supporting Information, Table S2). 669/2301 variants had never been reported in a normal population and, of those with CADD >30, 82 had never been reported and eight of these were in a gene where missense variants were expected to be deleterious based on the conservative constraint, ExAC missense Z‐score (Lek et al., 2016) resulting in the following candidate gene list: ACACA, KIAA1109, GPHN, NR2C2, GPR25, ZNF462, INPP4A, and DIDO1 (Table 3).
Table 3

Candidate missense variants from whole exome sequencing

Gene nameProtein changeProtein domaina MMAFb Mis Zc CADDd Samples
KIAA1109NM_015312:p.S3578FNo special domain0.00011.21017728.1711‐14D
GPR25NM_005298:p.Arg228CysCytoplasmic domain03.4183516354270‐11D
INPP4ANM_001134224:p.Arg901HisNo special domain03.190765734295‐08F
NR2C2NM_003298:p.Arg411HisNuclear receptor (NR) ligand‐binding (LBD) domain03.48155435329‐09F
ZNF462NM_021224:p.Arg2068GlnIn zinc finger: C2H2‐type 2003.224178335679‐05o
GPHNNM_020806:p.Gly472AspNo special domain03.57193432376‐05o
ACACANM_198834:p.Arg2208GlnNo special domain0.00008997.5240764351307‐09D
DIDO1NM_033081:p.Arg1145HisNo special domain03.0966303351068‐05o

UniProt protein domain.

Maximum minor allele frequency from ExAC ALL, ExAC NFE, ExAC AFR, ExAC AMR, ExAC EAS, ExAC FIN, ExAC SAS, ExAC OTH, gnomAD ALL, gnomAD AFR, gnomAD AMR, gnomAD ASJ, gnomAD EAS, gnomAD FIN, gnomAD NFE, gnomAD OTH, 1000G ALL, 1000G EUR, SWEGEN 249DANES, 249SWEDES.

ExAC missense Z Score.

Combined Annotation Dependent Depletion (CADD) score.

Candidate missense variants from whole exome sequencing UniProt protein domain. Maximum minor allele frequency from ExAC ALL, ExAC NFE, ExAC AFR, ExAC AMR, ExAC EAS, ExAC FIN, ExAC SAS, ExAC OTH, gnomAD ALL, gnomAD AFR, gnomAD AMR, gnomAD ASJ, gnomAD EAS, gnomAD FIN, gnomAD NFE, gnomAD OTH, 1000G ALL, 1000G EUR, SWEGEN 249DANES, 249SWEDES. ExAC missense Z Score. Combined Annotation Dependent Depletion (CADD) score. In the autosomal recessive gene analysis (Figure 4), there were 16 genes where at least two CRC cases had homozygous or possible compound heterozygous variants and none in the breast cancer cohort that we used as a comparison group (Table 4). Six genes (ATP10B, PKHD1, PTPRQ, UGGT2, MYH13, TFF3) had variants with MMAF <5%, which were observed together in CRC cases twenty times more than expected by the computed likelihood. Therefore, these six genes are more likely to be candidate autosomal recessive risk factors.
Table 4

Sixteen candidate autosomal recessive genes where at least two CRC cases had homozygous or possible compound heterozygous variants and none in the breast cancer cohort that we used as a comparison group

Gene nameProtein changecDNA changeCoordinatea Likelihoodb obs gnomADc obs CRCd MMAFe CRCf BRCg Patient1h Patient2i
ATP10B NP_079429.2:p.Asn865LysNM_025153.2:c.2595C>A5:g.160042903G>T0.04300.0294NA48‐07LF (het)569‐04o (het)
NP_079429.2:p.Gly393TrpNM_025153.2:c.1177G>T5:g.160061565C>A0.0008NA0.04000.01990.01960.017948‐07LF (het)569‐04o (het)
PKHD1 NP_619639.3:p.Thr2869LysNM_138694.3:c.8606C>A6:g.51637536G>T0.02370.04900.0179566‐04o (het)
NP_619639.3:p.Ser1833LeuNM_138694.3:c.5498C>T6:g.51882310G>A0.0000NA0.02000.00050.0098NA566‐04o (het)
NP_619639.3:p.Leu1709PheNM_138694.3:c.5125C>T6:g.51889483G>A0.00530.0098NA1199‐05o (het)
NP_619639.3:p.Cys843SerNM_138694.3:c.2527T>A6:g.51910867A>T0.0000NA0.02000.00000.0098NA1199‐05o (het)
PTPRQ NP_001138498.1:p.Pro98LeuNM_001145026.1:c.293C>T12:g.80839400C>T0.00150.0098NA4773‐11D (het)
NP_001138498.1:p.Gln425GluNM_001145026.1:c.831A>G12:g.80878310A>G0.06250.0294NA386‐04o (het)
NP_001138498.1:p.Thr1967MetNM_001145026.1:c.5458C>T12:g.81028849C>T0.0001NA0.02000.00120.0098NA386‐04o (het)
NP_001138498.1:p.Ile2207ValNM_001145026.1:c.6177A>G12:g.81066976A>G0.0001NA0.02000.04850.0098NA4773‐11D (het)
UGGT2 NP_064506.3:p.Asn1268TyrNM_020121.3:c.3802A>T13:g.96511868T>A0.0002NA0.02000.03100.03920.0089Co‐1524 (het)
NP_064506.3:p.Arg480HisNM_020121.3:c.1439G>A13:g.96601605C>T0.00060.00020.02000.00780.0294NACo‐1524 (het)61‐09F (hom)
MYH13 NP_003793.2:p.Arg1438CysNM_003802.2:c.4312C>T17:g.10215944G>A0.01690.01960.0179784‐05o (het)
NP_003793.2:p.Ala1128ValNM_003802.2:c.3383C>T17:g.10222462G>A0.01490.00980.0089Co‐286 (het)
NP_003793.2:p.Lys807delNM_003802.2:c.2420_2422del17:g.10233717_10233719del0.0001NA0.02000.00650.00980.0268Co‐286 (het)
NP_003793.2:p.Gly701ArgNM_003802.2:c.2101G>A17:g.10236464C>T0.0006NA0.02000.03310.0392NA784‐05o (het)
TFF3 NP_003217.2:p.Val116MetNM_003226.2:c.346G>A21:g.43733628C>T0.00010.00010.04000.00890.0568NA1205‐05o (hom)760‐04o (hom)
PCDHGB4NP_115269.1:p.Pro15_Val16delNM_032098.1:c.44_49del5:g.140767492_140767497del0.03600.01800.04000.19050.07840.0909294‐07LF (hom)Co‐286 (hom)
BCLAF1NP_001070908.1:p.Glu401TerNM_001077440.1:c.1201G>T6:g.136597456C>A0.03930.0196NA2014‐04018 (het)91‐04o (het)
NP_001070908.1:p.Arg45MetNM_001077441.1:c.134G>T6:g.136599885C>A0.0030NA0.04000.10490.05880.08932014‐04018 (het)91‐04o (het)
ZNRF2NP_667339.3:p.Asp123dupNM_147128.3:c.368_370dup7:g.30325341_30325343dup0.00270.00060.04000.05260.05880.0577813‐06o (hom)91‐04o (hom)
AGPAT2NP_006403.3:p.Val18dupNM_006412.3:c.50_52dup9:g.139581758_139581760dup0.01100.00860.04000.10540.17650.0545749‐08F (hom)760‐04o (hom)
MRGPRX3NP_473372.3:p.Trp307TerNM_054031.3:c.920G>A11:g.18159669G>A0.00700.00600.04000.08550.12750.08931307‐09D (hom)784‐05o (hom)
TPSG1NP_036599.3:p.Arg194TrpNM_012467.3:c.580C>T16:g.1272273G>A0.0016NA0.02000.01320.0098NA4270‐11D (het)
NP_036599.3:p.Ser138PheNM_012467.3:c.413C>T16:g.1272750G>A0.01500.00900.02000.12250.07840.04461094‐08F (hom)4270‐11D (het)
NP_036599.3:p.Thr75LysNM_012467.3:c.224C>A16:g.1273444G>T0.02400.00700.02000.15460.01960.0364711‐14D (hom)
FBXW10NP_001254514.1:p.Arg275LeuNM_001267585.1:c.824G>T17:g.18653188G>T0.01200.00700.02000.11340.08820.0804Co‐1190 (hom)1199‐05o (het)
NP_001254514.1:p.Arg607CysNM_001267585.1:c.1819C>T17:g.18671961C>T0.0150NA0.02000.13200.06860.02681199‐05o (het)
ZNF730NP_001264332.1:p.Thr196AlaNM_001277403.1:c.586A>G19:g.23328432A>G0.09210.01960.0446771‐08F (het)
NP_001264332.1:p.Ser296PheNM_001277403.1:c.887C>T19:g.23328733C>T0.0060NA0.02000.06490.03920.0446711‐14D (het)771‐08F (het)
NP_001264332.1:p.Ile479AsnfsTerNM_001277403.1:c.1429_1430insA19:g.23329275_23329276insA0.0004NA0.02000.00650.0098NA711‐14D (het)
TARM1NP_001129158.1:p.Arg111HisNM_001135686.1:c.332G>A19:g.54578105C>T0.01000.02000.04000.09790.11760.06385053‐10D (hom)Co‐489 (hom)
RFPL1NP_066306.2:p.Gln243TerNM_021026.2:c.727C>T22:g.29837884C>T0.03700.03000.04000.19310.18630.125258‐06o (hom)4283‐13D (hom)

Six genes (ATP10B, PKHD1, PTPRQ, UGGT2, MYH13 and TFF3) had variants, which were observed together in CRC cases twenty times more than expected by the computed likelihood.

Bold font indicates variants that are rare in the normal population and have a low liekliheood of occuring together and are therefore possible candidate variants.

Genomics coordinate based on GRCh37.

Likelyhood to observe homozygous‐ or possible compound heterozygous variants based on MMAF of the two alleles.

Frequency of observed homozygous‐ or possible compound heterozygous variants in 51 early onset colorectal cancer samples.

Maximum obsevered frequency of homozygous samples in gnomAD ALL, gnomAD AFR, gnomAD AMR, gnomAD ASJ, gnomAD EAS, gnomAD FIN, gnomAD NFE, gnomAD OTH.

Maximum minor allele frequency from ExAC ALL, ExAC NFE, ExAC AFR, ExAC AMR, ExAC EAS, ExAC FIN, ExAC SAS, ExAC OTH, gnomAD ALL, gnomAD AFR, gnomAD AMR, gnomAD ASJ, gnomAD EAS, gnomAD FIN, gnomAD NFE, gnomAD OTH, 1000G ALL, 1000G EUR, SweGen 249DANES, 249SWEDES.

Allele frequency in 51 early onset colorectal cancer samples.

Allele frequency in 56 breast cancer samples.

Patients with homozygous‐ and possible compound heterozygous variants.

Sixteen candidate autosomal recessive genes where at least two CRC cases had homozygous or possible compound heterozygous variants and none in the breast cancer cohort that we used as a comparison group Six genes (ATP10B, PKHD1, PTPRQ, UGGT2, MYH13 and TFF3) had variants, which were observed together in CRC cases twenty times more than expected by the computed likelihood. Bold font indicates variants that are rare in the normal population and have a low liekliheood of occuring together and are therefore possible candidate variants. Genomics coordinate based on GRCh37. Likelyhood to observe homozygous‐ or possible compound heterozygous variants based on MMAF of the two alleles. Frequency of observed homozygous‐ or possible compound heterozygous variants in 51 early onset colorectal cancer samples. Maximum obsevered frequency of homozygous samples in gnomAD ALL, gnomAD AFR, gnomAD AMR, gnomAD ASJ, gnomAD EAS, gnomAD FIN, gnomAD NFE, gnomAD OTH. Maximum minor allele frequency from ExAC ALL, ExAC NFE, ExAC AFR, ExAC AMR, ExAC EAS, ExAC FIN, ExAC SAS, ExAC OTH, gnomAD ALL, gnomAD AFR, gnomAD AMR, gnomAD ASJ, gnomAD EAS, gnomAD FIN, gnomAD NFE, gnomAD OTH, 1000G ALL, 1000G EUR, SweGen 249DANES, 249SWEDES. Allele frequency in 51 early onset colorectal cancer samples. Allele frequency in 56 breast cancer samples. Patients with homozygous‐ and possible compound heterozygous variants.

DISCUSSION

Since onset of colorectal cancer before the age of 40 years is infrequent, genetic predisposition may be suspected. Our cohort of 51 simplex cases with early‐onset CRC tested two hypotheses: first a pathogenic variant in an autosomal dominant cancer gene; and second an autosomal recessive cancer syndrome. In order to address the first hypothesis, we searched for heterozygous variants in genes known to be important in hereditary cancer or known somatic driver genes according to the classification by the Vogelstein group (Vogelstein et al., 2013) (Table 1). We found one pathogenic variant in PTEN, p.Ser59Ter, known to cause Cowden/hamartoma tumor syndrome and this syndrome could later be confirmed clinically in the patient and genetic counseling could be offered to his relatives. While the other variants (Table 1) suggested by the cancer gene list approach have an unknown clinical significance (Landrum et al., 2018), the genes are known to be cancer‐related. As we did not have access to tumor material or DNA from relatives, unfortunately further analyses to test the gene expression in tumors and/or segregation in the family could not be performed. One patient demonstrated loss of PMS2 protein in his tumor and harbored a pathogenic variant in exon 12 in PMS2 that was detected by a clinical lab. This variant was initially missed by our WES due to poor quality in the mapping step as a result of the pseudogene PMS2CL that has a high homology to exon 9 and 11–15 (Takeda et al., 2014). Loss of function in APC gene leads to the development of adenomas, a precursor lesion to CRC, in familial adenomatous polyposis (MIM #175100). This condition occurs in <1/10,000, thus the p.Asn944Asp variant with a MMAF of 0.05% is also probably too common to be disease causing. MSH2 is known to cause Lynch syndrome (MIM #120435). The incidence of Lynch syndrome has been estimated to be 1/660–1/2,000 (de la Chapelle, 2005) and approximately 40% of cases are caused by MSH2 mutation (i.e., incidence of MSH2Lynch syndrome: 1/1,650–1/5,000) (Moller et al., 2017). In the ClinVar database, there are 850 pathogenic variants in MSH2 and thus the p.Val722Phe MSH2 variant with an incidence of 1/5,000 (0.02%) is probably too common to be disease‐causing. PTPN12 is critical for the regulation of cell proliferation, differentiation, and neoplastic transformation (Takekawa et al., 1992) but is not associated with a known clinical cancer syndrome. POLE encodes the catalytic subunit of DNA polymerase epsilon and mutations cause susceptibility to CRC and other tumor types (MIM # 615083). Both of these genes are somatically mutated in CRC, but these variants were also present in 0.01% of a normal population and are likely too common to be cancer predisposition variants. The BMPR1A, BRIP1 and SRC variants have never been reported in healthy individuals which makes them highly interesting. BMPR1A variants can cause juvenile intestinal polyposis (MIM #174900). However, the only clinical information we had available from the patient was microsatellite‐stable CRC at 35 years with no information regarding possible polyps. We cannot exclude or verify this potential diagnosis clinically. BRIP1 has been shown to be a moderate‐risk ovarian cancer gene (Ramus et al., 2015; Weber‐Lassalle et al., 2018), most often caused by truncating variants (ClinVar). Two CRC patients with truncating BRIP1 variants and either familial CRC or polyposis have recently been reported (Martin‐Morales et al., 2018; Rosenthal et al., 2018). However, the pathogenicity of our in‐frame BRIP1 duplication is not known. SRC, encoding a tyrosine kinase receptor, is frequently implicated in cancer (Turro et al., 2016). It was suggested as a putative CRC gene based on proteomic analysis (Zhang et al., 2014) and may have a role in colon cancer progression (Irby et al., 1999). In an attempt to search for novel cancer genes, we identified 10 truncating variants (in addition to the pathogenic PTEN variant) (Table 2) in genes which are not known to be susceptibility genes for cancer: CLSPN, CELSR2, ADAM17, BIRC6, SEC24B, RBM27, PPARGC1B, NCOA7, SSH2, and MYO9B. All these genes are expressed in colon tissue (genecards.org). Seven of these have never been reported in normal databases, supporting their role as rare risk variants. Three have a function highly related to cancer. CLSPN is involved in the DNA damage checkpoint (Chini & Chen, 2003) and is required for the activation of CHK1 during a DNA replication checkpoint response (Kumagai & Dunphy, 2000). It is also involved in DNA replication and repair of DNA damage repair and may function as a tumor suppressor gene (Azenha, Lopes, & Martins, 2017). Nine somatic truncating mutations in CLSPN have been reported in CRC in cBioPortal including one at position 1139, that is, more distal to our variant. Therefore, this gene is the most likely candidate for loss‐of‐function mutations. The CELSR2 protein was identified to contain epidermal growth factor‐like domains (Nakayama et al., 1998). It is important for the development of normal cilia and neuronal migration (Goffinet & Tissir, 2017). The expression of CELSR2 protein was increased in the cytoplasm of breast cancer cells (Jiang et al., 2018), but its possible function in cancer is unclear. The ADAM17 protein is a metalloprotease first identified as an enzyme that cleaves tumor necrosis factor‐alpha (Adrain, Zettl, Christova, Taylor, & Freeman, 2012; Black et al., 1997). Later it was found to have more than 80 substrates and is important for releasing ligands of EGFR from the membrane and thus activating EGFR (Zunke & Rose‐John, 2017). ADAM17 is upregulated in colorectal cancer cells (Blanchot‐Jossic et al., 2005) and blocking ADAM17 in mouse colorectal cancer xenografts inhibited tumor growth (Rios‐Doria et al., 2015). In humans, loss of both alleles of ADAM17 can cause a rare neonatal autosomal recessive inflammatory bowel and skin disease (MIM #603639). As both CELSR2 and ADAM17 seem to have growth‐stimulating functions, the loss‐of‐function mutations are less likely candidates for cancer predisposition. The variants in BIRC6, RBM27, and NCOA7 were found in 0.02%–0.05% of the reference population which reduces the likelihood of them being pathogenic. Of these, BIRC6 exhibited a role in resistance to apoptosis (Chen et al., 1999; Hao et al., 2004). The gene has been suggested to be a prognosis predictor in colorectal cancer (Hu et al., 2015) and BIRC6 mutations have been found in colorectal adenomas (Zhou et al., 2013) and are common in carcinomas (Wolff et al., 2018). NCOA7/ERAP140 is a nuclear receptor coactivator that binds to the estrogen receptor among others. It has been implicated as a risk factor for breast cancer (Higginbotham et al., 2011) but has as yet no known association to CRC. It has been shown to be over‐expressed in oral squamous cell carcinoma where it promotes cell proliferation (Xie et al., 2016). Two genes have some association to cancer but not to CRC: PPARGC1B is a co‐activator for ESR1 and variants in the gene have been associated with increased risk for estrogen‐positive breast cancer (Li et al., 2011). One study demonstrated that MYO9B was involved in the migration of prostate cancer cell lines and might be important for metastasis (Makowska, Hughes, White, Wells, & Peckham, 2015). To date, no studies have implicated RBM27, SEC24B, or SSH2 in CRC or cancer development. As the variants in SEC24B and SSH2 have never been reported in the normal population databases, they are also possible candidate genes. In total, 3,800 rare missense variants were found in the exomes of the 51 young CRC cases. In order to try to reduce the number of candidates to consider, we filtered using CADD score > 20 which left 2301 missense variants (Supporting Information, Table S2). In order to pinpoint the candidate variants, we used even more stringent criteria (higher CADD, variants that were never previously reported and in genes that normally do not have many missense variants) giving a short‐list of eight variants. CADD represents multiple lines of in silico evidence of pathogenicity and can predict pathogenicity reasonably well, however, in silico tools (such as CADD and the ExAC missense Z‐score) will also miss clinically relevant pathogenic variants (van der Velde et al., 2015) and therefore this way of filtering is not optimal. In this way, we have probably excluded several variants that may have a pathogenic effect and in the future when many hundred thousand individuals have been sequenced in population databases, we may be able to filter solely on MMAF. Among the eight top candidate variants, one has been reported to be a possible tumor suppressor gene: INPP4A. INPP4A has been shown to be downregulated in pancreatic cancer and to inhibit cell proliferation and promote apoptosis in bladder and pancreatic cancer cells (Wang, Feng, Jiang, & Zuo, 2017; Wang, Wu, Huang, & Chen, 2018). Three have been described in cancer, but their function remains unclear. ACACA (acetyl‐CoA carboxylase or ACC1) catalyses the rate‐limiting reaction in the biogenesis of long‐chain fatty acids and is thus vital for cancer cell survival during hypoxia (Gao et al., 2016). Inhibition of ACACA can lead to either decreased cell proliferation (Jones et al., 2017; Singh, Yadav, Kumar, & Saini, 2015) or decreased apoptosis (Keenan et al., 2015) and increased risk of metastasis/tumor recurrence (in mice) (Rios Garcia et al., 2017). NR2C2 can inhibit cancer initiation, but promote cancer progression (Lin et al., 2017). In colon cancer cell lines, NR2C2 is required for cell survival and its inhibition induces cell death (McNew et al., 2016; Singh et al., 2012). DIDO1 regulates embryonic stem cell renewal (Liu et al., 2014) and is upregulated in melanoma tissues and cell lines as well as colorectal tumors (Braig & Bosserhoff, 2013; Sillars‐Hardebol et al., 2012), although in the latter, there was no correlation to gene copy number and DIDO1 was suggested to be a passenger on the common 20q duplication seen in CRC. The remaining four have no known cancer‐related function (KIAA1109, GPHN, GPR25, ZNF462). Thus, whole exome sequencing has demonstrated its capability in identifying a large amount of possible candidate variants in this study. By using very stringent criteria, we could identify four possible candidate genes with a putative role in cancer, however we have probably excluded several candidate variants in the filtering. Thus, the challenge is how to rank the variants in the candidate list to obtain an optimal balance between sensitivity and specificity. In the search for a recessive syndrome, we found no cases of rare biallelic variants in any known cancer genes in our patients. We also searched the entire exome for more common autosomal recessive alleles and listed 16 possible candidate genes (Table 4) with six of them (ATP10B, PKHD1, PTPRQ, UGGT2, MYH13, TFF3) more likely than the others based on their MMAF <5% and observed frequency 20 times higher than the expected likelihood of occurring together. ATP10B is the catalytic part of a complex which catalyzes the hydrolysis of ATP coupled to the transport of aminophospholipids from the outer to the inner leaflet of the late endosomes (UniProt). It has been reported to be mutated somatically in 1.2% of all CRC in cBioportal (accessed 20181218), but its possible function in cancer is not yet known. PKHD1 is known to cause autosomal recessive Polycystic Kidney Disease (MIM #606702). It encodes a protein which is important for correct mitotic spindle formation and function and inhibition leads to mitotic defects (Zhang et al., 2010). PKHD1 is mutated in 5% of all CRC in cBioportal and in an analysis of 13,023 genes in 11 colorectal cases, PKHD1 was ranked as the seventh most common somatically mutated gene (Sjoblom et al., 2006). However, there was a debate whether the finding reached statistical significance (Forrest & Cavet, 2007; Ward et al., 2011). PTPRQ, Protein tyrosine phosphatase receptor‐like type Q, was reported to be involved in phosphorylation/dephosphorylation signaling pathways and metastasis (Laczmanska et al., 2014, 2016; Sato et al., 2017). 1.2% of all CRC have somatic mutations in this gene (cBioportal, accessed 181218), however, as a suspected oncogene, this gene is a less likely candidate for an autosomal recessive predisposition gene. UGGT2 is a uridine diphosphate‐glucose:glycoprotein glucosyltransferase (Takeda et al., 2014). It has no known function in cancer although 2.4% of all CRC carry somatic mutations in UGGT2 (cBioportal). MYH13 also has no known role in cancer, but somatic mutations in CRC are found in 2.3% (cBioportal). TFF3, Trefoil factor 3, is a secreted protein which stimulates cell migration and prevention of apoptosis, enabling repair of the intestinal mucosa. It was suggested to be a risk factor for early recurrence of CRC and to be involved in promoting lymph node metastasis (Huang, Li, Wang, & Zhang, 2013; Morito et al., 2015). although TFF3 is very rarely mutated in CRC tissue (0.1%, cBioportal). In all, the relevance of our detected possibly bi‐allelic variants is unknown. In summary, we have detected six putative candidate genes, of which five (ATP10B, PKHD1, UGGT2, MYH13, TFF3) may be candidate risk factor recessive genes for CRC. Using the breast cancer cohort as a comparison to search for autosomal recessive genes may not be a good approach since there are genes that are predispose to both breast‐ and colorectal cancer. However, to look for possible compound heterozygous variants, we need a cohort where we have all the genotyping information. We have considered using genotyping data from public databases but there are a few issues. First, the number of samples with available genotyping information is far fewer than those that provide only allele frequency, due to data privacy. The second issue is about platform errors. Not all the variants we see in the analysis are from the samples. The errors can happen in any steps after DNA isolation: errors caused by contamination, by library preparation, by sequencing machines, or errors caused by computational tools. An advantage of our inhouse BRC cohort is that they are Swedish, the same population as the study group. Also, the DNA from the BRC cohort has been collected and processed in the same way as the study group. These advantages have eliminated the artifacts caused by the differences in populations and the difference in platforms. The number of our proposed candidate truncating variants (10 variants in 51 cases) are slightly fewer than expected compared to those with similar study (26 of the 102 cases) (Adam et al., 2016), considering that our frequency cut‐off is less stringent (0.1% vs. 0.01%). This discrepancy can likely be explained by number of databases and number of sub‐populations used during filtering of rare variants. Adam et al. filtered their variants using 12 population databases including the seven ExAC cohorts. Our proposed candidate variants were required to have a MMAF <0.1% in 21 population databases. Among them, SweGen (Ameur et al., 2017) and our in‐house database of 249 Swedish samples played a major role in removing normal variants belonging to the Swedish population. The ALL population in ExAC and gnomAD (Lek et al., 2016), together with their other seven sub populations, even though they overlapped, contributed significantly with variants belonging to other populations. Databases of 200 Danish samples (Li et al., 2010) and 1000Genomes (ALL and NFE populations) (1000 Genomes Project Consortium et al., 2012) were also included to increase the sensitivity. This shows the importance of using population‐specific databases of normal variants in the filtering stage. Our results (2/51 = 4% with a hereditary cancer syndrome: Cowden and PMS2Lynch syndrome) are comparable to others. For example Pearlman et al. showed that 2% of all CRC cases with onset before 50 years had a hereditary cancer syndrome other than Lynch syndrome (APC, MUTYH, SMAD4). They detected Lynch syndrome in 8%, especially in those with a family history (Pearlman et al., 2017). In our cohort, we selected patients with no family history, which largely excludes Lynch syndrome, although CRC caused by PMS2‐mutation with its low penetrance can occur in the absence of family history. As PMS2 has multiple pseudogenes, pathogenic variants are often missed by WES and need to be manually analysed. Simplex CRC cases are often caused by other mechanisms such as somatic mutations (Haraldsdottir et al., 2014; Mensenkamp et al., 2014) or epigenetic aberrations (Moreira et al., 2015). In the near future, patients with cancer will likely undergo paired tumor and germline testing in order to detect somatic aberrations that can guide therapy as part of genomic medicine initiatives across the Western world. As a bonus, both hereditary variants and somatic driver mutations can be identified in the same analysis. Until such pipelines are established, our results and those of others suggest that, in the absence of family history, or other suggestive features suggestive of hereditary syndromes such as polyposis or multiple tumors, genetic testing for gemline mutations in young CRC patients is of limited clinical benefit.

CONCLUSION

Whole exome sequencing in early onset non‐familial CRC patients only identified a causative germline mutation in two of 51 (4%). These patients had Cowden syndrome, that had not been diagnosed clinically or PMS2Lynch syndrome that was difficult to detect using WES. Our results suggest germline analysis using WES or broad gene panels in simplex cases with CRC at young age is of limited value in the clinic. In addition, we propose three candidate variants in known cancer susceptibility genes (BMPR1A, BRIP1, and SRC), up to three truncating variants in possibly novel cancer genes (CLSPN possibly SEC24B, SSH2), four missense variants in genes involved in cancer initiation or progression (ACACA, NR2C2, INPP4A, and DIDO1), and five candidate risk factor recessive genes (ATP10B, PKHD1, UGGT2, MYH13, TFF3). Further studies are needed to find support for the pathogenicity of these variants in novel Mendelian or complex disease in early onset non‐familial CRC.

CONFLICT OF INTEREST

The authors declare no conflict of interest. Click here for additional data file. Click here for additional data file.
  95 in total

Review 1.  Seven pass Cadherins CELSR1-3.

Authors:  Andre M Goffinet; Fadel Tissir
Journal:  Semin Cell Dev Biol       Date:  2017-07-15       Impact factor: 7.727

Review 2.  Protein Bioinformatics Databases and Resources.

Authors:  Chuming Chen; Hongzhan Huang; Cathy H Wu
Journal:  Methods Mol Biol       Date:  2017

3.  A metalloproteinase disintegrin that releases tumour-necrosis factor-alpha from cells.

Authors:  R A Black; C T Rauch; C J Kozlosky; J J Peschon; J L Slack; M F Wolfson; B J Castner; K L Stocking; P Reddy; S Srinivasan; N Nelson; N Boiani; K A Schooley; M Gerhart; R Davis; J N Fitzner; R S Johnson; R J Paxton; C J March; D P Cerretti
Journal:  Nature       Date:  1997-02-20       Impact factor: 49.962

Review 4.  Cancer genome landscapes.

Authors:  Bert Vogelstein; Nickolas Papadopoulos; Victor E Velculescu; Shibin Zhou; Luis A Diaz; Kenneth W Kinzler
Journal:  Science       Date:  2013-03-29       Impact factor: 47.728

5.  The cBio cancer genomics portal: an open platform for exploring multidimensional cancer genomics data.

Authors:  Ethan Cerami; Jianjiong Gao; Ugur Dogrusoz; Benjamin E Gross; Selcuk Onur Sumer; Bülent Arman Aksoy; Anders Jacobsen; Caitlin J Byrne; Michael L Heuer; Erik Larsson; Yevgeniy Antipin; Boris Reva; Arthur P Goldberg; Chris Sander; Nikolaus Schultz
Journal:  Cancer Discov       Date:  2012-05       Impact factor: 39.397

6.  Evaluation of CADD Scores in Curated Mismatch Repair Gene Variants Yields a Model for Clinical Validation and Prioritization.

Authors:  K Joeri van der Velde; Joël Kuiper; Bryony A Thompson; John-Paul Plazzer; Gert van Valkenhoef; Mark de Haan; Jan D H Jongbloed; Cisca Wijmenga; Tom J de Koning; Kristin M Abbott; Richard Sinke; Amanda B Spurdle; Finlay Macrae; Maurizio Genuardi; Rolf H Sijmons; Morris A Swertz
Journal:  Hum Mutat       Date:  2015-05-20       Impact factor: 4.878

7.  Acetate functions as an epigenetic metabolite to promote lipid synthesis under hypoxia.

Authors:  Xue Gao; Shu-Hai Lin; Feng Ren; Jin-Tao Li; Jia-Jia Chen; Chuan-Bo Yao; Hong-Bin Yang; Shu-Xia Jiang; Guo-Quan Yan; Di Wang; Yi Wang; Ying Liu; Zongwei Cai; Ying-Ying Xu; Jing Chen; Wenqiang Yu; Peng-Yuan Yang; Qun-Ying Lei
Journal:  Nat Commun       Date:  2016-06-30       Impact factor: 14.919

8.  BRIP1 loss-of-function mutations confer high risk for familial ovarian cancer, but not familial breast cancer.

Authors:  Nana Weber-Lassalle; Jan Hauke; Juliane Ramser; Lisa Richters; Eva Groß; Britta Blümcke; Andrea Gehrig; Anne-Karin Kahlert; Clemens R Müller; Karl Hackmann; Ellen Honisch; Konstantin Weber-Lassalle; Dieter Niederacher; Julika Borde; Holger Thiele; Corinna Ernst; Janine Altmüller; Guido Neidhardt; Peter Nürnberg; Kristina Klaschik; Christopher Schroeder; Konrad Platzer; Alexander E Volk; Shan Wang-Gohrke; Walter Just; Bernd Auber; Christian Kubisch; Gunnar Schmidt; Judit Horvath; Barbara Wappenschmidt; Christoph Engel; Norbert Arnold; Bernd Dworniczak; Kerstin Rhiem; Alfons Meindl; Rita K Schmutzler; Eric Hahnen
Journal:  Breast Cancer Res       Date:  2018-01-24       Impact factor: 6.466

9.  ClinVar: improving access to variant interpretations and supporting evidence.

Authors:  Melissa J Landrum; Jennifer M Lee; Mark Benson; Garth R Brown; Chen Chao; Shanmuga Chitipiralla; Baoshan Gu; Jennifer Hart; Douglas Hoffman; Wonhee Jang; Karen Karapetyan; Kenneth Katz; Chunlei Liu; Zenith Maddipatla; Adriana Malheiro; Kurt McDaniel; Michael Ovetsky; George Riley; George Zhou; J Bradley Holmes; Brandi L Kattman; Donna R Maglott
Journal:  Nucleic Acids Res       Date:  2018-01-04       Impact factor: 16.971

10.  COSMIC: somatic cancer genetics at high-resolution.

Authors:  Simon A Forbes; David Beare; Harry Boutselakis; Sally Bamford; Nidhi Bindal; John Tate; Charlotte G Cole; Sari Ward; Elisabeth Dawson; Laura Ponting; Raymund Stefancsik; Bhavana Harsha; Chai Yin Kok; Mingming Jia; Harry Jubb; Zbyslaw Sondka; Sam Thompson; Tisham De; Peter J Campbell
Journal:  Nucleic Acids Res       Date:  2016-11-28       Impact factor: 16.971

View more
  6 in total

1.  High-throughput gene screen reveals modulators of nuclear shape.

Authors:  Andrew C Tamashunas; Vincent J Tocco; James Matthews; Qiao Zhang; Kalina R Atanasova; Lauren Paschall; Shreya Pathak; Ranjala Ratnayake; Andrew D Stephens; Hendrik Luesch; Jonathan D Licht; Tanmay P Lele
Journal:  Mol Biol Cell       Date:  2020-04-22       Impact factor: 4.138

2.  Exome sequencing in 51 early onset non-familial CRC cases.

Authors:  Jessada Thutkawkorapin; Annika Lindblom; Emma Tham
Journal:  Mol Genet Genomic Med       Date:  2019-02-27       Impact factor: 2.183

Review 3.  Candidate Gene Discovery in Hereditary Colorectal Cancer and Polyposis Syndromes-Considerations for Future Studies.

Authors:  Iris B A W Te Paske; Marjolijn J L Ligtenberg; Nicoline Hoogerbrugge; Richarda M de Voer
Journal:  Int J Mol Sci       Date:  2020-11-19       Impact factor: 5.923

4.  Potential Involvement of NSD1, KRT24 and ACACA in the Genetic Predisposition to Colorectal Cancer.

Authors:  Isabel Quintana; Pilar Mur; Mariona Terradas; Sandra García-Mulero; Gemma Aiza; Matilde Navarro; Virginia Piñol; Joan Brunet; Victor Moreno; Rebeca Sanz-Pamplona; Gabriel Capellá; Laura Valle
Journal:  Cancers (Basel)       Date:  2022-01-29       Impact factor: 6.639

5.  miR-31-5p-DMD axis as a novel biomarker for predicting the development and prognosis of sporadic early-onset colorectal cancer.

Authors:  Changqin Liu; Wei Wu; Wenju Chang; Ruijin Wu; Xiaomin Sun; Huili Wu; Zhanju Liu
Journal:  Oncol Lett       Date:  2022-03-17       Impact factor: 2.967

6.  Hormone receptor expression of colorectal cancer diagnosed during the peri-partum period.

Authors:  Jordyn Silverstein; Wesley Kidder; Susan Fisher; Thomas A Hope; Samantha Maisel; Dianna Ng; Jessica Van Ziffle; Chloe E Atreya; Katherine Van Loon
Journal:  Endocr Connect       Date:  2019-08       Impact factor: 3.335

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.