Literature DB >> 30952955

Linkage analysis revealed risk loci on 6p21 and 18p11.2-q11.2 in familial colon and rectal cancer, respectively.

Susanna von Holst1, Xiang Jiao1, Wen Liu1, Vinaykumar Kontham1, Jessada Thutkawkorapin1, Jenny Ringdahl1, Patrick Bryant1, Annika Lindblom2.   

Abstract

Colorectal cancer (CRC) is one of the major cancer types in the western world including Sweden. However, known genetic risk factors could only explain a limited part of heritability of the disease. Moreover, colon and rectal cancers are habitually discussed as one entity, colorectal cancer, although different carcinogenesis has been recognized. A genome-wide linkage scan in 32 colon- and 56 rectal cancer families from Sweden was performed based on 475 non-FAP/HNPCC patients genotyped using SNP arrays. A maximum HLOD of 2.50 at locus 6p21.1-p12.1 and a HLOD of 2.56 at 18p11.2 was obtained for colon and rectal cancer families, respectively. Exome sequencing over the regions of interest in 12 patients from six families identified 22 and 25 candidate risk variants for colon and rectal cancer, respectively. Haplotype association analysis in the two regions was carried out between additional 477 familial CRC cases and 4780 controls and suggested candidate haplotypes possibly associated with CRC risk. This study suggested two new linkage regions for colon cancer and rectal cancer with candidate predisposing variants. Further studies are required to elucidate the pathogenic mechanism of these regions and to pinpoint the causative genes.

Entities:  

Mesh:

Year:  2019        PMID: 30952955      PMCID: PMC6777498          DOI: 10.1038/s41431-019-0388-3

Source DB:  PubMed          Journal:  Eur J Hum Genet        ISSN: 1018-4813            Impact factor:   4.246


Introduction

More than 1.2 million new cases are diagnosed with colorectal cancer (CRC) in the world yearly, primarily in the western world [1]. In Sweden, CRC is the third most common cancer type among women and men, and affects 4000–6000 individuals each year [2]. Less than 5% of CRC cases are caused by known genes, such as those causing familial adenomatous polyposis (FAP) and hereditary non-polyposis colorectal cancer (HNPCC) [3]. Previous CRC genes mapped using linkage analysis include APC [4], MSH2 [5], and MLH1 [6]. Other linkage studies have suggested potential CRC loci at 2q, 3q, 4q, 8q, 9q, 10p, 12q, 14q, and 15q [7-14]. Moreover, hundreds of common variants have been reported by genome-wide association studies (GWAS) to be associated to CRC, but they only describe a limited portion of disease risk [15, 16]. Altogether, germline variants in known genes and moderate- and low risk variants were suggested to explain 10–15% of the genetic CRC contribution [17]. Although parts of the causative CRC genetic factors are known, further investigation to learn about the missing genetics is important, since up to 35% of all CRC cases could be explained by hereditable factors [18]. Colon cancer and rectal cancer are habitually discussed as colorectal cancer. The question whether it is one single or two different entities has been under debate. Some studies have presented possible differences and recognized that colon and rectal cancer have different carcinogenesis. Bufill et al. reported that the location of the tumor might be a marker for the clinical feature [19]. Tumors arise predominantly distal to the splenic flexure in adenomatous polyposis, while in HNPCC, most tumors arise proximal to the splenic flexure [19]. One study disclosed a higher frequency in genetic alteration and allelic losses on chromosome 5q, 17p and 18 among distal compared to proximal colorectal tumors [20]. Kapiteijn et al. reported that rectal cancers had a significantly higher immunohistochemical expression of TP53 and nuclear β-catenin compared to colon cancers and that TP53 mutation rate was higher in rectal cancer cases [21]. However, no significant differences were seen for clinical and histopathological data [21]. Another study showed that KRAS variants in stool DNA were associated with tumors in the sigmoid colon and rectum but not with tumors from other parts of the colon [22]. Elevated expression of the oncogene MYC was seen more often in the left-side (rectum, sigmoid and descending colon) compared to the right-side (caecum and ascending colon) of colorectum [23]. Therefore, novel loci harboring predisposing genes could possibly be found by analyzing colon and rectal cancer families separately. Thus, we performed a new linkage scan on 32 colon cancer and 56 rectal cancer families corresponding to 169 and 306 individuals, respectively. These families were included in a genome-wide linkage analysis of 121 families conducted previously [9]. In order to find evidence supporting the candidate regions revealed by linkage analysis and to further pinpoint genes and variants that potentially affect functions, we performed targeted exome sequencing over the two regions in the families mostly contributing to the positive LOD scores and haplotype association analysis in additional CRC cases and controls.

Materials and methods

Study subjects

Cancer families were recruited through the Department of Clinical Genetics at the Karolinska University Hospital in Stockholm, Sweden between 1990 and 2005. FAP was excluded using medical records from affected individuals and HNPCC was excluded using our current clinical protocols [24]. Families were included in the study if there were at least two affected relatives informative for linkage analysis (i.e., at least a sib-pair). A family was included in the linkage analysis if the family could be classified to have a risk for colon or rectal cancer. Eighty-eight of the previously analyzed 121 families fulfilled the criteria above and were included in the linkage analysis (Table 1).
Table 1

Cancer families included in the linkage analysis

No. of familiesNo. of individuals genotypedNo. of affected genotypedMean ageYoungest age at diagnosis
Colon3230610865.833
Rectum56169676431
Cancer families included in the linkage analysis A case–control study used 477 familial CRC cases from the Swedish Colorectal Cancer Low-risk Study and 4780 control individuals from the Swedish Twin Registry [25, 26]. The 477 CRC cases were from a cohort of more than 3300 consecutive patients operated on for CRC in 14 hospitals in and around Stockholm and Uppsala between 2004 and 2009. For the twin controls, phenotypic data on cancer had previously been obtained through linking the twins to the Swedish Cancer Registry using the unique person identification number available for all Swedish citizens. Only one twin from each twin pair where none was affected was considered eligible for serving as control in the association analysis. The study was undertaken in agreement with the Swedish legislation of ethical permission (2003:460) and according to the decision in the Stockholm regional ethical committee (2008/125-31.2 and 02-489). All participants had given informed consent to participate in the study.

Genotyping and quality control (QC)

Genomic DNA was extracted from peripheral blood using standard procedures. Genotyping was performed as previously described [9]. In order to generate haplotypes for CRC families, a total of 60 parent–child pairs from 10 colon cancer families, 17 rectal cancer families as well as 33 CRC families without a clear tumor location predominance from this study was genotyped with the Illumina Infinium HumanOmniExpress-12v1 BeadChip (730,525 markers) at the SNP&SEQ Technology Platform in Uppsala, Sweden. The overall reproducibility of the genotype data was 99.996% based on 1.53% of duplicated genotyping, with an average call rate per SNP of 99.43%. The 477 additional familial CRC cases were genotyped using the Illumina Infinium OncoArray-500K BeadChip at the Center for Inherited Disease Research at Johns Hopkins University, MD, US [16]. The 4780 controls from the Swedish TwinGene registry were genotyped using the Illumina OmniExpress BeadChip in Uppsala, Sweden. All samples went through initial QC at their corresponding centers before being merged on the 235,573 SNPs that were shared between the two platforms. QC of the merged dataset excluded variants from analysis if call rate was ≤97%, minor allele frequency was <1% or if the variant deviated significantly from Hardy–Weinberg equilibrium (p ≤ 1E−7). Samples were removed in case of genotyping success rate was <97%, gender discrepancy between reported and X-chromosome heterozygosity-predicted, abnormal heterozygosity (>3 standard deviations from mean) or detection of cryptic relatedness.

Linkage analysis

Pedcheck [27] was used to check for the initial Mendelian inheritance analysis among the families. The family-based genetic model was used for parametric linkage analysis for all chromosomes including chromosome X. Non-parametric analysis was performed as a supplement. LOD scores as well as heterogeneity LOD scores were computed using MERLIN (version 1.1.2) [28] and was given for all genotyped positions. Analyses were done assuming both dominant and recessive traits and the parameters were set as described by our previously published paper [9]. Individuals with CRC or a polyp with high degree dysplasia were coded as affected. All subjects in the 88 families from the two genotyping sessions were included in the analysis. Due to two genotyping sets, two map files were merged and 7256 markers were used in the analysis. As a consequence of limitations in MERLIN, four large families had to be split when running the analysis.

Exome sequencing and variant calling

Twelve patients from six families, three colon (110, 301, 350) and three rectal (8, 918, 1213) cancer families, respectively, were selected for exome sequencing based on their major contribution to the LOD scores in the linkage regions. In four families two affected sibs were sequenced. In one family a single patient was sequenced and in the last family three sibs were subject to sequencing. Sequencing libraries were prepared from genomic DNA using TruSeq DNA Sample Preparation Kit (Illumina, San Diego, CA, USA) or SureSelectXT Reagent HSQ 96 Auto kit (Agilent, Santa Clara, CA, USA) according to manufacturers’ instructions. Exome enrichment was performed using TruSeq Exome Enrichment Kit (Illumina) or SureSelect XT Human All Exon V5 library (Agilent). Multiplexed paired-end libraries were pooled in equal molar and sequenced on an Illumina HiSeq 2000 or HiSeq 2500 system (Illumina) according to manufacturer’s instructions. Base calling was performed on the instrument with RTA (1.12.4.2 or 1.13.48) and the resulting BCL files were filtered, de-multiplexed, and converted to FASTQ format using CASAVA 1.7 or 1.8 (Illumina). Raw reads were mapped to the hg19 GRCh37 reference genome sequence using bwa (0.5.9), and variants were called using GATK (1.0.5974) following realignment and recalibration. Variant annotation was performed using ANNOVAR (released 2013-08-23).

Haplotype association analysis

Association analysis were carried out between 477 familial CRC cases and 4780 controls over the two regions of interest revealed by linkage analysis in sliding windows containing 1–25 consecutive markers. In short, haplotype frequency was estimated for each window and p-values were calculated using Plink v.1.07 [29].

Data deposition

Non-synonymous coding sequence variants with a MAF < 0.20 that segregated in at least one of the six selected families with corresponding disease information were deposited to the gene variant database of Leiden Open Variation Database (https://databases.lovd.nl/shared/genes). Individual IDs were #00208599, #00208600, #00208601, #00208603, #00208611, and #00208612 for one representative from each of the families 310, 110, 350, 8, 918, and 1213, respectively.

Results

Linkage analysis suggested candidate regions for colon and rectal cancer separately

A total of 88 families were genotyped and analyzed in two groups, comprising of 32 colon and 56 rectal cancer families with 306 and 169 individuals, respectively (Table 1). No LOD or HLOD score above three was observed. However, suggestive linkage could be demonstrated for colon as well as rectal cancer families (Fig. 1). Regions with HLODs above 1.0 are summarized in Table 2. A maximum HLOD of 2.5 was observed for a 6 Mb region at locus 6p21.1-p12.1 in the colon cancer families. The highest HLOD was 2.6 for the rectal cancer families at locus 18p11.2 with about 10 Mb in length.
Fig. 1

LOD/HLOD score plots for colon and rectal cancer families. a LOD/HLOD plot for 32 colon cancer families. b LOD/HLOD plot for 56 rectal cancer families. LODs are represented in red and HLODs are represented in cyan

Table 2

Linked regions with maximum observed HLODs above 1.0

Study groupLinked regioncM. SNPModelHLOD (α)
Colon5p15.31rs875347Dominant1.2 (0.8)
Colon6p21.1rs1537638Dominant1.4 (0.9)
Colon19q13.13rs241942Dominant1.0 (0.8)
Colon4p16.3rs935971Recessive1.5 (0.5)
Colon5p15.3rs413666Recessive1.8 (0.6)
Colon6p21.1rs722269Recessive2.5 (0.5)
Colon9q22.2rs7037744Recessive1.6 (0.5)
Colon10q22.1rs736594Recessive1.1 (0.4)
Colon12q23.1rs17290272Recessive1.4 (0.4)
Colon14q31.3rs981270Recessive1.6 (0.5)
Colon16q12.2rs1990637Recessive1.3 (0.4)
ColonXp22.31rs1159561Recessive1.7 (0.8)
Rectal2q23.3rs1441973Dominant1.3 (0.7)
Rectal6q24.3rs6570867Dominant1.3 (0.7)
Rectal9q22.33rs1167768Dominant1.7 (0.9)
Rectal10q26.2rs1926143Dominant1.2 (0.7)
Rectal18p11.2rs872906Recessive2.6 (0.6)
Rectal1q31.1rs1160832Recessive1.4 (0.5)
Rectal7q33rs2059367Recessive1.1 (0.3)
RectalXq26.3rs2254857Recessive1.0 (0.4)
LOD/HLOD score plots for colon and rectal cancer families. a LOD/HLOD plot for 32 colon cancer families. b LOD/HLOD plot for 56 rectal cancer families. LODs are represented in red and HLODs are represented in cyan Linked regions with maximum observed HLODs above 1.0

Exome sequencing revealed genetic variants segregating in cancer families

In order to identify variants that possibly affect gene function in the linked regions, we did exome sequencing on twelve individuals representing the families contributing to the LOD scores. Three colon cancer families were included to investigate the candidate region on chromosome 6p21.1, whereas three rectal cancer families were included for the region on chromosome 18p11.2. Non-synonymous coding variants with a MAF < 0.20 in the regions of interest were assessed in relevant family members. We report variants segregating in all individuals for at least one of the three families (Tables 3 and 4). Twenty-two variants in 18 genes and 25 variants in 10 genes were identified segregating in colon cancer and rectal cancer families, respectively. Among the 22 variants observed in the colon cancer patients, 20 were missense variants, one was a frameshift insertion and one was an in-frame deletion. The 25 variants in the rectal cancer families were all missense variants.
Table 3

Sequence variants segregated in the colon cancer families

Genomic DNA changecDNA changersIDVariant typeGeneAmino acid changeMAFSegregating families
NC_000006.11: g.38704943A>GNM_001206927.1: c.863A>Grs6935293Missense DNAH8 NP_001193856.1: p.(N288S)0.06110
NC_000006.11: g.38905957C>TNM_001206927.1: c.11771C>Trs61757218Missense DNAH8 NP_001193856.1: p.(T3924M)<0.01110
NC_000006.11: g.38980081A>GNM_001206927.1: c.13462A>Grs10484847Missense DNAH8 NP_001193856.1: p.(I4488V)0.11301
NC_000006.11: g.39034072G>ANM_002062.4: c.502G>Ars6923761Missense GLP1R NP_002053.3: p.(G168S)0.12110,301
NC_000006.11: g.39507889C>TNM_001289020.1: c.1535G>Trs2273063Missense KIF6 NP_659464.3: p.(R512L)0.08301
NC_000006.11: g.40400288C>TNM_020737.2: c.565G>Ars61731040Missense LRFN2 NP_065788.1: p.(A189T)0.07110
NC_000006.11: g.41029342T>CNM_006789.3: c.407T>Crs2076472Missense APOBEC2 NP_006780.1: p.(I136T)0.18301, 350
NC_000006.11: g.41166017C>TNM_024807.3: c.206G>Ars77093113Missense TREML2 NP_079083.2: p.(R69Q)<0.01110, 301
NC_000006.11: g.41766439_41766443dupNM_018561.4: c.1901_1905duprs201338884frameshift ins USP49 NP_061031.2: p.(D636fs)0.01301
NC_000006.11: g.41897919G>ANM_004053.4: c.481G>AMissense BYSL NP_004044.3: p.(E161K)<0.01301
NC_000006.11: g.41903798C>ANM_001760.4: c.759G>Trs33966734Missense CCND3 NP_001751.1: p.(E253D)0.01110
NC_000006.11: g.42932835C>TENST00000304611.8: c.2644G>Ars141238034Missense PEX6 ENSP00000303511.8: p.(V882I)0.02301
NC_000006.11: g.42981051C>GNM_014623.3: c.105G>Crs35628750Missense MEA1 NP_055438.1: p.(E35D)0.02301
NC_000006.11: g.43013368C>TNM_001168370.1: c.3071G>Ars201130952Missense CUL7 NP_001161842.1: p.(R1024H)<0.01110
NC_000006.11: g.43014022G>ANM_001168370.1: c.2864C>Trs61732148Missense CUL7 NP_001161842.1: p.(A955V)0.02301
NC_000006.11: g.43034855C>TNM_138343.3: c.913C>Trs146188256Missense KLC4 NP_612352.1: p.(R305C)0.01301
NC_000006.11: g.43112267C>TNM_001270398.1: c.2354C>Trs34764696Missense PTK7 NP_001257327.1: p.(A785V)0.03110, 301
NC_000006.11: g.43153787G>ANM_015089.3: c.845G>Ars61743561Missense CUL9 NP_055904.1: p.(G282E)0.01110
NC_000006.11: g.43166414_43166416delNM_015089.3: c.2871_2873delGGGrs141674093Inframe del CUL9 NP_055904.1: p.(G958del)0.04110
NC_000006.11: g.43535018C>TNM_020750.2: c.722G>Ars34324334Missense XPO5 NP_065801.1: p.(S241N)0.04301
NC_000006.11: g.43596814C>GNM_019096.4: c.86G>Crs112851070Missense GTPBP2 NP_061969.3: p.(G29A)0.02110
NC_000006.11: g.44147821A>GNM_007058.3: c.1561A>Grs34710081Missense CAPN11 NP_008989.2: p.(I521V)0.19110, 301

Genomic coordinates were based on GRCh37 (hg19). MAF, minor allele frequency (based on the 1000Genomes project and the ExAC database when 1000Genomes data were not available)

Table 4

Sequence variants segregated in the rectal cancer families

Genomic DNA changecDNA changersIDVariant typeGeneAmino acid changeMAFSegregating families
NC_000018.9:g.9944960A>GNM_003574.5:c.457A>Grs29132Missense VAPA NP_003565.4:p.(M153V)0.088
NC_000018.9:g.10471732G>ANM_153000.4:c.448G>Ars3748415Missense APCDD1 NP_694545.1:p.(V150I)0.18, 918
NC_000018.9:g.10681711C>TNM_022068.3:c.7387G>Ars3748428Missense PIEZO2 NP_071351.2:p.(V2463I)0.128
NC_000018.9:g.10699129G>TNM_022068.3:c.6149C>Ars113682091Missense PIEZO2 NP_071351.2:p.(A2050D)0.18
NC_000018.9:g.10731428C>TNM_022068.3:c.4832G>Ars2865121Missense PIEZO2 NP_071351.2:p.(R1611Q)0.16918, 1213
NC_000018.9:g.10759840C>ANM_022068.3:c.3443G>Trs35033671Missense PIEZO2 NP_071351.2:p.(C1148F)0.168
NC_000018.9:g.13008497G>CNM_032142.3:c.333G>Crs149216711Missense CEP192 NP_115518.3:p.(L111F)0.021213
NC_000018.9:g.13055915A>CNM_032142.3:c.3326A>Crs11080623Missense CEP192 NP_115518.3:p.(Q1109P)0.018
NC_000018.9:g.13056682G>ANM_032142.3:c.4093G>Ars2282542Missense CEP192 NP_115518.3:p.(V1365M)0.19918
NC_000018.9:g.13068109G>ANM_032142.3:c.4631G>Ars7228940Missense CEP192 NP_115518.3:p.(R1544H)0.19918
NC_000018.9:g.13095591G>ANM_032142.3:c.6344G>Ars56913743Missense CEP192 NP_115518.3:p.(R2115Q)0.19918
NC_000018.9:g.13100326G>ANM_032142.3:c.6686G>Ars74340616Missense CEP192 NP_115518.3:p.(R2229Q)0.021213
NC_000018.9:g.13826678C>TNM_005913.2:c.914C>Trs143262370Missense MC5R NP_005904.1:p.(T305I)0.00241213
NC_000018.9:g.14537970C>TNM_001137671.1:c.640G>Ars148283099Missense POTEC NP_001131143.1:p.(V214I)0.058, 918
NC_000018.9:g.14542648C>ANM_001137671.1:c.498G>Trs12454500Missense POTEC NP_001131143.1:p.(M166I)0.16918
NC_000018.9:g.14542931C>TNM_001137671.1:c.215G>Ars45554841Missense POTEC NP_001131143.1:p.(C72Y)0.171213
NC_000018.9:g.14542949T>CNM_001137671.1:c.197A>Grs9807555Missense POTEC NP_001131143.1:p.(H66R)0.181213
NC_000018.9:g.14543039T>CNM_001137671.1:c.107A>Grs45570841Missense POTEC NP_001131143.1:p.(K36R)0.0068
NC_000018.9:g.14543063A>CNM_001137671.1:c.83T>Grs45626231Missense POTEC NP_001131143.1:p.(F28C)0.01918
NC_000018.9:g.18534948G>CNM_005406.2:c.3649C>Grs201390233Missense ROCK1 NP_005397.1:p.(Q1217E)0.078
NC_000018.9:g.19995731T>CNM_172241.2:c.2044A>Grs9946136Missense CTAGE1 NP_758441.2:p.(I682V)0.191213
NC_000018.9:g.19996805T>CNM_172241.2:c.970A>Grs12961009Missense CTAGE1 NP_758441.2:p.(I324V)0.021213
NC_000018.9:g.21124945C>GNM_000271.4:c.1926G>Crs1788799Missense NPC1 NP_000262.2:p.(M642I)0.83a918, 1213
NC_000018.9:g.21424991C>ANM_198129.2:c.3622C>Ars17202961Missense LAMA3 NP_937762.2:p.(P1208T)0.051213
NC_000018.9:g.21511034C>ANM_198129.2:c.8445C>Ars1154232Missense LAMA3 NP_937762.2: p.(N2815K)0.168

Genomic coordinates were based on GRCh37 (hg19). MAF, minor allele frequency (based on the 1000Genomes project and the ExAC database when 1000Genomes data were not available)

aIt is the reference minor allele (NC_000018.9:g.21124945C) that segragates in the rectal cancer families

Sequence variants segregated in the colon cancer families Genomic coordinates were based on GRCh37 (hg19). MAF, minor allele frequency (based on the 1000Genomes project and the ExAC database when 1000Genomes data were not available) Sequence variants segregated in the rectal cancer families Genomic coordinates were based on GRCh37 (hg19). MAF, minor allele frequency (based on the 1000Genomes project and the ExAC database when 1000Genomes data were not available) aIt is the reference minor allele (NC_000018.9:g.21124945C) that segragates in the rectal cancer families

Haplotype association analysis identified candidate targets

To further pinpoint the genetic risk factors for colon and rectal cancers, we performed haplotype association analysis on the two regions of suggestive linkage (HLOD > 2). A total of 593 and 554 SNPs was successfully genotyped in the two regions on chromosomes 6 and 18, respectively. Association analysis between 477 familial CRC cases and 4780 controls using these genetic markers identified two candidate risk loci on chromosome 6 and two on chromosome 18. At least one candidate risk haplotype of each loci was associated with an elevated CRC risk (odds ratio 1.68–2.45) with a p-value lower than 1E−4. One of these four candidate risk haplotypes was relatively common (haplotype frequency of 15% in the control group), whereas the other three were infrequent (haplotype frequency 2–5% in the control group) (Fig. 2).
Fig. 2

Candidate risk haplotypes revealed by sliding-window association analysis within the linked regions on chromosome 6 (a) and chromosome 18 (b). Association was evaluated for haplotypes of sizes ranging from 1 to 25 markers between 477 familial CRC cases and 4 780 controls. All haplotypes with OR > 1 and p-value < 1E−4 were listed with p-value, odds ratio (OR), estimated frequency in controls (F_U) and cases (F_A). One haplotype of highest interest (lowest p-value and highest OR) for each of the four loci was indicated in orange and searched among 60 CRC families. Familial haplotypes of the most informative families potentially carrying these haplotypes were listed (question marks indicate undetermined markers of the haplotypes). Genomic regions covered by these risk haplotypes were illustrated showing co-localized genes, where exons and introns were indicated with dark and light gray, respectively

Candidate risk haplotypes revealed by sliding-window association analysis within the linked regions on chromosome 6 (a) and chromosome 18 (b). Association was evaluated for haplotypes of sizes ranging from 1 to 25 markers between 477 familial CRC cases and 4 780 controls. All haplotypes with OR > 1 and p-value < 1E−4 were listed with p-value, odds ratio (OR), estimated frequency in controls (F_U) and cases (F_A). One haplotype of highest interest (lowest p-value and highest OR) for each of the four loci was indicated in orange and searched among 60 CRC families. Familial haplotypes of the most informative families potentially carrying these haplotypes were listed (question marks indicate undetermined markers of the haplotypes). Genomic regions covered by these risk haplotypes were illustrated showing co-localized genes, where exons and introns were indicated with dark and light gray, respectively One risk haplotype identified on chromosome 6 stretched 14 kb in size, contained 4 markers and overlapped with gene KCNK5 (Fig. 2a). Haplotyping of 60 CRC families revealed at least one colon cancer family (family 110), one rectal cancer family (family 242) and one CRC family without tumor site predominance (family 869) that potentially have this haplotype (Fig. 2a). It is notable that two of the three colon cancer families that contributed to the LOD score, families 110 and 301 (data not shown), were identified as potential carriers of this haplotype. The other risk haplotype on chromosome 6 was 176 kb in size and overlapped with genes CDC5L, SPATS1 and part of TMEM151B. At least two colon cancer families (families 46 and 237) likely harbor this haplotype (Fig. 2a). One risk haplotype on chromosome 18 was 61 kb in size but did not overlap with any known gene. Two rectal cancer families (families 425 and 1252) and one colon cancer family (family 68) in our study clearly harbored this haplotype (Fig. 2b). One of the most linked rectal cancer families, family 918, is also a potential carrier of this haplotype (data not shown). The other risk haplotype on chromosome 18 overlapped with part of the gene PIEZO2, and at least one rectal cancer family (family 1425) is likely to have this 220-kb haplotype based on genotyping of the parent–child pair (Fig. 2b).

Discussion

CRC is a multifactorial disease. Previous studies have shown that tumor location differs among FAP compared to HNPCC patients and that different tumor sites would display diverse genetic alterations and allelic loss at 5q, 17p and 18 [19, 20]. Also, gene expressions and mutation rates vary among right and left colon and rectal tumors [21-23]. We hypothesized that, by subdividing the CRC families into colon and rectal cancer families, it would hopefully result in novel loci and predisposing genes for the two different cancer entities. Our linkage analysis provided us with some interesting regions with suggestive linkage HLOD = 2.5 for the colon cancer families on locus 6p21.1-p12.1 and HLOD = 2.6 for the rectal cancer families on locus 18p11.2 (Table 2). These regions have not yet, to our knowledge, been reported by other linkage studies, possibly because no previous study subdivided the CRC families into colon cancer and rectal cancer families. Exome sequencing was carried out on twelve individuals representing the families contributing to the LOD scores and identified within the linked regions 22 colon and 25 rectal cancer variants segregating in the cancer families, respectively (Tables 3, 4). Genes harboring these variants are involved in signal transduction (GLP1R, CCND3, CUL7, PTK7, VAPA, APCDD1, MC5R, ROCK1, NPC1), microtubule-based process (DNAH8, KIF6, CUL9, CEP192), RNA metabolism (APOBEC2, USP49, BYSL, XPO5), establishment of localization (PEX6, GTPBP2 and PIEZO2) and cell differentiation (LRFN2, MEA1, LAMA3) among others. Some of these genes have been implicated in colorectal tumorigenesis, for instance, CCND3 is a known oncogene in multiple cancer types including CRC [30]. PTK7, whose variants presented in two families, is reported to be expressed and actively involved in various malignancies including CRC, and its function in the Wnt signaling pathway has been demonstrated (reviewed in the ref. [31]). Moreover, overexpression of PTK7 has been implicated as a biomarker for adenoma and CRC, and is correlated with several clinicopathological features such as TNM stage, tumor differentiation, lymph node and distant metastasis [32, 33]. Similarly, GTPBP2 is also known as a positive regulator of the Wnt signaling pathway [34], which is involved in tumorigenesis of a wide variety of cancers including CRC. The variant in the gene APCDD1 is also shared among two families. APCDD1 is suggested to be regulated by the β-catenin/Tcf complex involved in colorectal tumorigenesis [35]. A previous methylation microarray-based scanning has revealed that hypermethylation of GLP1R is a biomarker for CRC and adenoma [36]. CUL7 has been identified as an oncogene, since it could directly bind to p53 and prevent cells from Myc-induced apoptosis [37]. Overexpression of CUL7 could distinguish metastatic CRC samples from the non-metastatic ones [38]. XPO5 is a key protein responsible for miRNA transportation and is upregulated both at mRNA and protein levels in CRC. Its overexpression is associated with worse clinicopathologic features and poor survival in CRC [39]. The POTEC gene had one variant shared in two families and other variants in single families. POTEC is a member of the highly homologous POTE family which are expressed in multiple cancer types including colon cancer [40, 41]. Gene ROCK1 is part of the Rho-kinase family and is overexpressed in CRC cell lines [42] and tissues [43]. Overexpression of ROCK1 has been shown to lead to increased CRC cell proliferation, transformation and invasion [42]. The gene CTAGE1 is described as a cancer antigen for T-cell lymphoma and other malignancies [44], and is expressed in 12–19% CRCs [45]. Previous studies have reported somatic frameshift variants of LAMA3 in CRC with high microsatellite instability [46] and deletions of the LAMA3 gene in CRC with high chromosomal instability [47]. Haplotype analysis has been proven valuable in identifying susceptibility genes in familial breast cancer [48] and cancer syndromes [49], especially in populations with a relatively homogenous genetic background. In particular, a candidate CRC locus on chromosome 9q [8, 9, 13, 14] was recently suggested to be explained by two different risk haplotypes in familial and sporadic bowel cancer [50]. In order to search for additional support of the two loci in the current study and to further pinpoint candidate risk variants, we performed haplotype association studies between familial CRC cases and controls for the two regions. The four candidate haplotypes harbor coding regions of several genes including CDC5L (cell division cycle 5 like), a positive regulator of cell cycle G2/M progression and key promoter of colorectal cancer cells [51]. The relationship between colorectal cancer and other genes located within these candidate haplotypes haven’t been well studied. But the fact that some of the families in the linkage analysis were demonstrated to be potential carriers of these risk haplotypes supports that these haplotypes may by associated with an increased risk. In conclusion, we propose two new linkage regions for colon cancer and rectal cancer. Haplotype analysis provides additional support and information regarding candidate variants that might affect function. We also report candidate variants within the linked regions that possibly predispose to CRC risk. Further studies on these genes of interest are needed to support or exclude them to be harboring disease causing variants.
  50 in total

1.  Tumor location and detection of k-ras mutations in stool from colorectal cancer patients.

Authors:  Milo Frattini; Debora Balestra; Silvana Pilotti; Lucio Bertario; Marco A Pierotti
Journal:  J Natl Cancer Inst       Date:  2003-01-01       Impact factor: 13.506

2.  PedCheck: a program for identification of genotype incompatibilities in linkage analysis.

Authors:  J R O'Connell; D E Weeks
Journal:  Am J Hum Genet       Date:  1998-07       Impact factor: 11.025

3.  Role of rho-kinase gene polymorphisms and protein expressions in colorectal cancer development.

Authors:  Ibrahim Sari; Betul Berberoglu; Esma Ozkara; Serdar Oztuzcu; Celaletdin Camci; Abdullah T Demiryurek
Journal:  Pathobiology       Date:  2013-01-17       Impact factor: 4.342

Review 4.  Colorectal cancer: evidence for distinct genetic categories based on proximal or distal tumor location.

Authors:  J A Bufill
Journal:  Ann Intern Med       Date:  1990-11-15       Impact factor: 25.391

5.  Exportin-5 Functions as an Oncogene and a Potential Therapeutic Target in Colorectal Cancer.

Authors:  Kunitoshi Shigeyasu; Yoshinaga Okugawa; Shusuke Toden; C Richard Boland; Ajay Goel
Journal:  Clin Cancer Res       Date:  2016-08-23       Impact factor: 12.531

6.  cTAGE: a cutaneous T cell lymphoma associated antigen family with tumor-specific splicing.

Authors:  Dirk Usener; Dirk Schadendorf; Joachim Koch; Stefan Dübel; Stefan Eichmüller
Journal:  J Invest Dermatol       Date:  2003-07       Impact factor: 8.551

7.  Evidence that c-myc expression defines two genetically distinct forms of colorectal adenocarcinoma.

Authors:  P G Rothberg; J M Spandorfer; M D Erisman; R N Staroscik; H F Sears; R O Petersen; S M Astrin
Journal:  Br J Cancer       Date:  1985-10       Impact factor: 7.640

8.  Gtpbp2 is a positive regulator of Wnt signaling and maintains low levels of the Wnt negative regulator Axin.

Authors:  William Q Gillis; Arif Kirmizitas; Yasuno Iwasaki; Dong-Hyuk Ki; Jonathan M Wyrick; Gerald H Thomsen
Journal:  Cell Commun Signal       Date:  2016-08-02       Impact factor: 5.712

Review 9.  Ptk7 and Mcc, Unfancied Components in Non-Canonical Wnt Signaling and Cancer.

Authors:  Norris Ray Dunn; Nicholas S Tolwinski
Journal:  Cancers (Basel)       Date:  2016-07-16       Impact factor: 6.639

10.  Cancer risk susceptibility loci in a Swedish population.

Authors:  Wen Liu; Xiang Jiao; Jessada Thutkawkorapin; Hovsep Mahdessian; Annika Lindblom
Journal:  Oncotarget       Date:  2017-11-25
View more
  1 in total

1.  Whole-Exome Sequencing Identifies a Novel Germline Variant in PTK7 Gene in Familial Colorectal Cancer.

Authors:  Beiping Miao; Diamanto Skopelitou; Aayushi Srivastava; Sara Giangiobbe; Dagmara Dymerska; Nagarajan Paramasivam; Abhishek Kumar; Magdalena Kuświk; Wojciech Kluźniak; Katarzyna Paszkowska-Szczur; Matthias Schlesner; Jan Lubinski; Kari Hemminki; Asta Försti; Obul Reddy Bandapalli
Journal:  Int J Mol Sci       Date:  2022-01-24       Impact factor: 5.923

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.