| Literature DB >> 21108794 |
Qi Zhao1, Ewen F Kirkness, Otavia L Caballero, Pedro A Galante, Raphael B Parmigiani, Lee Edsall, Samantha Kuan, Zhen Ye, Samuel Levy, Ana Tereza R Vasconcelos, Bing Ren, Sandro J de Souza, Anamaria A Camargo, Andrew J G Simpson, Robert L Strausberg.
Abstract
BACKGROUND: To identify potential tumor suppressor genes, genome-wide data from exome and transcriptome sequencing were combined to search for genes with loss of heterozygosity and allele-specific expression. The analysis was conducted on the breast cancer cell line HCC1954, and a lymphoblast cell line from the same individual, HCC1954BL.Entities:
Mesh:
Year: 2010 PMID: 21108794 PMCID: PMC3156953 DOI: 10.1186/gb-2010-11-11-r114
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Statistics of exome sequencing and reads mapping
| HCC1954 | HCC1954BL | |
|---|---|---|
| Number of 454 reads | 6,878,120 | 6,658,357 |
| Total bases pairs | 2,588,213,873 | 2,325,966,906 |
| Uniquely mapped reads | 6,645,304 (97%) | 6,385,651 (96%) |
| Reads uniquely mapped to primary targets | 4,806,828 (70%) | 4,310,274 (65%) |
| Target coverage | 94.8% | 96.0% |
| Mean target coverage | 19.4× | 18.1× |
| Median target coverage | 16× | 16× |
| Coverage enrichment by exome sequencing | 23× | 24× |
| Total high-confidence (HC) SNVs (known SNVs) | 13,102 (12,145) | 14,219 (13,309) |
| HC heterozygous SNVs (known SNVs) | 5,602 (4,954) | 8,203 (7,408) |
| HC heterozygous SNVs in CDS (known SNVs) | 5,329 (4,709) | 7,848 (7,082) |
CDS, coding sequence; SNV, single nucleotide variant.
Figure 1Exome-based loss of heterozygosity events detected in HCC1954. (a-c) Comparison between variant allele frequencies of the same locus between HCC1954 and HCC1954BL for chromosome 17 (a), chromosome 8 (b) and chromosome 9 (c). Blue shaded areas represent large LOH regions. (d) Distribution of LOH genes across the HCC1954 chromosomes.
Top categories of general molecular types
| General molecular function | Number of genes | Examples |
|---|---|---|
| LOH genes | ||
| Enzyme | 80 | |
| Kinase | 25 | |
| Transporter | 21 | |
| Transcription regulator | 20 | |
| G-protein coupled receptor | 19 | |
| Transmembrane receptor | 9 | |
| Cytokine | 7 | |
| Ion channel | 6 | |
| ASE genes | ||
| Enzyme | 18 | |
| Transcription regulator | 7 | |
| Kinase | 6 | |
| Transporter | 5 | |
| Transmembrane receptor | 3 | |
| G-protein coupled receptor | 2 |
Genes are binned exclusively into each category based on their primary molecular function terms. LOH, loss of heterozygosity; ASE, allele-specific expression.
Figure 2Schematic diagram of allele-specific expression events detected by combination of exome sequencing and transcriptome sequencing.
Selected list of allele-specific expression genes detected in the HCC1954 cell line
| Gene | Number of reads ratio major/minor | Gene product | Non-ASE | Known SNV ID | |
|---|---|---|---|---|---|
| 404/0 | 10 | C-terminal binding protein 2 | 0.000 | 3 novel SNVs | |
| 415/1 | |||||
| 395/0 | |||||
| 267/4 | 6 | Major histocompatibility complex, class I, A | 0.000 | rs2231114 | |
| 546/0 | 0.000 | rs1059517 | |||
| 223/16 | 17 | TAO kinase 1 | 0.000 | rs508706 | |
| 140/0 | 10 | Plasminogen activator, urokinase | 0.000 | rs2227568 | |
| 124/1 | 3 | Podocalyxin-like 2 | 0.000 | rs920232 | |
| 53/2 | 21 | Integrin, beta 2 | 0.000 | rs11088969 | |
| 45/1 | 3 | Latexin | 0.000 | rs8455 | |
| 48/2 | 15 | Snurportin 1 | 0.000 | rs11547316 | |
| 50/5 | 19 | RAB3A, member RAS oncogene family | 0.000 | rs1046565 | |
| 31/0 | 10 | KIN, antigenic determinant of recA protein homolog (mouse) | 0.000 | rs61752337 | |
| 26/0 | 11 | Galactosidase, beta 1-like 2 | 0.000 | rs3741097 | |
| 21/0 | 10 | O-6-methylguanine-DNA methyltransferase | 0.001 | rs2308327 | |
| 340/1 | 1 | Pleckstrin homology domain containing, family A member 6 | 0.000 | rs33911350 | |
| 36/0 | 2 | Threonine synthase-like 2 ( | 0.000 | rs35051888 | |
| 220/10 | 20 | Lipopolysaccharide binding protein | 0.000 | rs5744204 | |
| 123/6 | rs2232582 | ||||
| 238/10 | rs2232596 | ||||
| 267/27 | 5 | Fibroblast growth factor receptor 4 | 0.000 | rs1966265 | |
| 57/0 | 1 | Solute carrier family 44, member 5 | 0.000 | rs17096508 | |
| 40/0 | rs10493565 | ||||
| 31/0 | rs588098 | ||||
| 41/0 | Novel SNV | ||||
| 44/0 | 10 | Fibroblast growth factor receptor 2 | 0.000 | rs1047100 | |
| 48/2 | X | Synaptotagmin-like 5 | 0.000 | rs5918476 | |
| 30/3 | rs4827330 |
Genes under ASE regulation in HCC1954 but expressed from both alleles in HCC1954BL are shown in bold; genes under ASE regulation in HCC1954 but barely expressed in HCC1954BL are shown in regular font. ASE, allele-specific expression; SNV, single nucleotide variant.
Figure 3Distribution of genes carrying high-confidence heterozygous alleles and genes under allele-specific expression. (a) In HCC1954; (b) in HCC1954BL.
Figure 4Validation of allele-specific expression events in HCC1954. The top trace is from cDNA, and the bottom trace is from genomic DNA (gDNA). (a) FGFR2; (b) MAP9; (c) FANCB.
Selected list of LOH or ASE genes: known or putative tumor suppressor genes
| Gene product and functional properties | Reported functional studies in cancer | |
|---|---|---|
| LOH genes | ||
| | Breast cancer 1, a nuclear phosphoprotein involved in maintaining DNA stability | Tumor suppressor function [ |
| | MutS homolog 3, a subunit of MutS beta involved in DNA mismatch repair | Genetic instability caused by loss of |
| | Polycomb group ring finger 2, involved in protein-protein interaction and transcription repression | Tumor suppressor function [ |
| | Platelet-derived growth factor receptor-like, a cell surface tyrosine kinase receptor | Mutation and gene loss correlated with breast cancer progression [ |
| | Breakpoint cluster region | Putative tumor suppressor in meningiomas [ |
| ASE genes | ||
| | Desmocollin 3, a cell adhesion molecule in cadherin family | Epigenetic silencing of |
| | Fibroblast growth factor receptor 2, a transmembrane tyrosine kinase | Hypermethylation of |
| | Myeloma overexpressed, a putative transforming gene | Epigenetically inactivated in esophageal squamous cell carcinomas [ |
| | Tumor necrosis factor receptor superfamily, member 10 d, a member of TNF-receptor superfamily | Aberrant methylation in multiple tumor type and mapped to tumor suppressor region in prostate cancer [ |
| | O-6-methylguanine-DNA methyltransferase, a DNA repair gene | Methylation of MGMT in many types of cancers [ |
LOH, loss of heterozygosity; ASE, allele-specific expression.
Figure 5DNA recombination, replication and repair network. In HCC1954, genes that have somatic point mutations (blue), ASE (yellow), LOH (green), or translocations (purple) form a DNA repair network. Small circles represent protein complexes or protein families with components encoded by either ASE or LOH genes.