| Literature DB >> 34447411 |
Anna L W Huskey1,2, Isaac McNeely1, Nancy D Merner1.
Abstract
INTRODUCTION: Recent studies comparing canine mammary tumors (CMTs) and human breast cancers have revealed remarkable tumor similarities, identifying shared expression profiles and acquired mutations. CMTs can also provide a model of inherited breast cancer susceptibility in humans; thus, we investigated breed-specific whole genome sequencing (WGS) data in search for novel CMT risk factors that could subsequently explain inherited breast cancer risk in humans.Entities:
Keywords: CEACAM; breast cancer; canine mammary tumor; comparative oncology; inherited risk; rare protein truncating variants; splice mutations; whole genome sequencing
Year: 2021 PMID: 34447411 PMCID: PMC8383343 DOI: 10.3389/fgene.2021.702889
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1CEACAM24 (c.247dupG; p.(Val83Glyfs*48)) mutation summary. (A) Samtools tview image capture of the mutation in a WGS CMT-affected Golden Retriever. (B) Sanger sequencing results of validation in CMT-affected Golden Retriever cohort depicting wildtype (WT), heterozygous, and homozygous sequences at the mutation location. (C) Mutalyzer prediction of the change in protein sequence with frame-shifting mutation. (D) Depiction of the WT and mutated protein and lost regions and domains of the dog CEACAM24 protein with the frame-shift mutation.
CEACAM24 c.247dupG; p.(Val83Glyfs*48) genotypes and allele frequencies.
| Data set/Cohort | Dog breed | # of dogs | # of HOM | # of HET | Minor allele frequency | |
| CMT Affected | Golden Retriever | 18 | 6 | 9 | 66.7 | - |
| CHIC United States Breed Specific Controls | Golden Retriever | 87 | 42 | 34 | 67.8 | 0.3334 |
| CHIC United States Non-Golden Retriever Controls | Petit Basset Griffon Vendeen | 10 | 7 | 2 | 80.0 |
|
| Gordon Setter | 8 | 5 | 2 | 75.0 | ||
| Australian Cattle Dog | 10 | 4 | 2 | 50.0 | ||
| Siberian Husky | 10 | 4 | 1 | 45.0 | ||
| Dalmatian | 10 | 3 | 2 | 40.0 | ||
| Irish Setter | 9 | 0 | 1 | 5.6 | ||
| Welsh Pembroke Corgi | 10 | 0 | 0 | 0.0 | ||
| Standard Schnauzer | 10 | 0 | 0 | 0.0 | ||
| Newfoundland | 10 | 0 | 0 | 0.0 | ||
| Keeshond | 10 | 0 | 0 | 0.0 | ||
| Great Dane | 8 | 0 | 0 | 0.0 | ||
| Doberman Pinscher | 10 | 0 | 0 | 0.0 | ||
| Boxer | 10 | 0 | 0 | 0.0 | ||
| Totals and Avg MAF of CHIC Non-Golden Retriever Controls | 125 | 23 | 10 | 22.4 | ||
| European Variation Archive Controls | European General Dog Population | 196 | 12 | 44 | 17.3 |
|
Homology of dog and human ceacam proteins to dog CEACAM24 protein.
| Species | Gene name | Protein accession | % Identity | % Similarity |
| Dog | CEACAM1 |
| 52.2 | 58.4 |
| CEACAM16 |
| 22.5 | 37.7 | |
| CEACAM18 |
| 19.3 | 32.5 | |
| CEACAM20 |
| 21.2 | 31.9 | |
| CEACAM23 |
| 38.4 | 40.8 | |
| CEACAM24 |
| 100 | 100 | |
| CEACAM28 |
| 42.2 | 46.3 | |
| CEACAM30 |
| 53.6 | 58.3 | |
| Average of all Dog CEACAM proteins compared to Dog CEACAM24 (excluding CEACAM24 from analysis) | 35.6 | 43.7 | ||
| Human | CEACAM1 |
| 53.1 | 60.8 |
| CEACAM3 |
| 47 | 58.2 | |
| CEACAM4 |
| 50.4 | 63.4 | |
| CEACAM5 |
| 53.2 | 61 | |
| CEACAM6 |
| 37.8 | 48 | |
| CEACAM7 |
| 45.1 | 58.3 | |
| CEACAM8 |
| 53.8 | 63.6 | |
| CEACAM16 |
| 28 | 43.5 | |
| CEACAM18 |
| 26.9 | 46.2 | |
| CEACAM19 |
| 23.7 | 38.1 | |
| CEACAM20 |
| 25.7 | 39.9 | |
| CEACAM21 |
| 34.1 | 42.3 | |
| Average of all Human CEACAM proteins compared to Dog CEACAM24 | 39.9 | 51.9 | ||
FIGURE 2Dog and human CEACAM gene family protein domain analysis. (A) Dog CEACAM protein domain and binding site depictions with membrane regions. (B) Human CEACAM protein domain and binding site depictions with membrane regions.
Significant mutations in CEACAM gene family. Individual mutation p-values were calculated using Fisher’s Exact test.
| Gene name | Variant type | Genomic position on Chr 19 | mRNA variant name | Protein variant name | rs ID | EA | AA | ||||
|
|
| ||||||||||
| MAF (%) | Mutation specific | MAF (%) | Mutation specific | ||||||||
|
|
|
|
| ||||||||
| EVS EA | TCGA EA | TCGA EA | EVS AA | TCGA AA | TCGA AA | ||||||
| CEACAM4: NM_001817 | stopgain | 41625658 | c.367C > T | p.R123X | rs147663846 | − | − | − | 0.20 | 0.89 |
|
| stopgain | 41625601 | c.424C > T | p.Q142X | rs199937487 | − | − | − | 0.02 | 0.60 |
| |
| CEACAM6: NM_002483 | splicing | 41766301 | c.*40 + 2T > G | − | rs782698255 | 0.00 | 0.46 |
| 0.00 | 0.30 | 0.07636 |
| CEACAM7: NM_006890 | stopgain | 41687091 | c.195C > A | p.Y65X | rs782316651 | 0.00 | 10.79 |
| 0.00 | 4.46 |
|
| CEACAM8: NM_001816 | splicing | 42583204 | c.*40 + 2T > G | − | rs748512513 | 0.00 | 1.62 |
| − | − | − |
FIGURE 3Individual significant stop gain mutations. (A) CEACAM4 c.367C > T;p.(Arg123∗). (B) CEACAM4 c.424C > T;p.(Gln142∗). (C) CEACAM7 c.195C > A;p.(Tyr65∗).
FIGURE 4CEACAM6 and CEACAM8 significant splicing mutations. (A) Depiction of the change in genomic sequence with splice site mutation. (B) Depiction of the top five miRNA binding sites for CEACAM6 and CEACAM8 within the mature mRNA. Blue is coding and red is non-coding.
Top five miRNA binding sites for both CEACAM6 and CEACAM8 and previous cancer associations.
| Gene target name | miRNA name | Previous cancer association | Previous HBOC association |
| CEACAM6 | miR-3119 |
| No |
| miR-766-3p | |||
| miR-6512-3p |
|
| |
| miR-6720-5p |
| ||
| miR-5702 |
| ||
| CEACAM8 | miR-661 | ||
| miR-9903 |
|
| |
| miR-616-5p | |||
| miR-371b-5p | No | ||
| miR-4635 |
|
Aggregation analysis for rare (<1% MAF) PTVs in the CEACAM gene family.
| Gene name | Gene specific | |
|
| ||
| AA | EA | |
| CEACAM1: NM_001184815 | 1 | 0.8784262 |
| CEACAM3: NM_001815 | 1 | 0.3978745 |
| CEACAM4: NM_001817 | 0.148726 | 0.7479721 |
| CEACAM5: NM_004363 | 1 | 0.8516203 |
| CEACAM6: NM_002483 | 0.07636 |
|
| CEACAM7: NM_006890 |
|
|
| CEACAM8: NM_001816 | 0.2727805 |
|
| CEACAM16: NM_001039213 | 0.923479 | 0.9930833 |
| CEACAM18: NM_001278392 | 1 | 1 |
| CEACAM19: NM_020219 | 1 | 1 |
| CEACAM20: NM_001102597 | 1 | 0.9190567 |
| CEACAM21: NM_001098506 | 0.9604724 | 0.7104384 |
|
|
|
|