| Literature DB >> 18364049 |
Benjamin J Raphael1, Stanislav Volik, Peng Yu, Chunxiao Wu, Guiqing Huang, Elena V Linardopoulou, Barbara J Trask, Frederic Waldman, Joseph Costello, Kenneth J Pienta, Gordon B Mills, Krystyna Bajsarowicz, Yasuko Kobayashi, Shivaranjani Sridharan, Pamela L Paris, Quanzhou Tao, Sarah J Aerni, Raymond P Brown, Ali Bashir, Joe W Gray, Jan-Fang Cheng, Pieter de Jong, Mikhail Nefedov, Thomas Ried, Hesed M Padilla-Nash, Colin C Collins.
Abstract
BACKGROUND: The genomes of many epithelial tumors exhibit extensive chromosomal rearrangements. All classes of genome rearrangements can be identified using end sequencing profiling, which relies on paired-end sequencing of cloned tumor genomes.Entities:
Mesh:
Year: 2008 PMID: 18364049 PMCID: PMC2397511 DOI: 10.1186/gb-2008-9-3-r59
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1Schematic of ESP. End sequencing and mapping of tumor genome fragments to the human genome provides information about structural rearrangements in tumors. A bacterial artificial chromosome (BAC) end sequence (BES) pair is a valid pair if distance between ends mapped on the normal human genome sequence and the orientation of these ends and are consistent with those for a BAC clone insert; otherwise, the BES pair is invalid. bp, base pairs; ESP, end sequencing profiling.
Clinical characteristics of the brain, breast, ovary and prostate tumor samples, and three breast cancer cell lines used for BAC library construction
| Library name | ||||||||
| AA9 | B421 | CHORI-514 | MCF7 | PM-1 | CHORI-510 | CHORI-518 | CHORI-520 | |
| Clinical sample designation | AA9 | B421 | S104 | MCF-7 | 25-48 | 860-7 | BT-474 | SK-BR-3 |
| Organ site | Brain | Breast | Breast | Breast cancer adenocarcinoma (metastasis - pleural effusion) | Prostate metastasis | Ovarian carcinoma | Ductal carcinoma | Breast cancer adenocarcinoma (metastasis - pleural effusion) |
| Therapies applied | Radiotherapy | Chemotherapy 4 months before surgery (CMF) | No radiation therapy or chemotherapy before surgery | N/A | Hormone ablation, palliative radiotherapy | No therapy before surgery | N/A | N/A |
| Patient status | Deceased | Deceased, no recurrence | No recurrence for 10 years | N/A | Deceased | Tumor recurred within 13 months | N/A | N/A |
| Total amount of tumor material used for library construction (mg) | 100 | 150 (20 mg effective) | 100 | N/A | 50 | 200 | N/A | N/A |
| Average clone size (± standard deviation; kb) | 129.1 ± 38.3 | 136.4 ± 29.2 | 166.1 ± 53.2 | 148.0 ± 30 | N/D | 149.3 ± 28.8 | 179 ± 23 | 154 ± 25 |
Shown are the clinical characteristics of the recurrent glioblastoma AA9, primary breast tumors B421 and S104, ovarian tumor 860, prostate metastasis 25-48, and the breast cancer cell lines MCF7, BT474, and SKBR3 used for bacterial artificial chromosome (BAC) library construction. Average clone size was determined by pulsed field-gel electrophoresis of Not1-digested DNA from 30 to 100 clones. The presence of a large blood clot in the B421 sample reduced the effective amount of tumor tissue to an estimated 20 mg (out of about 150 mg received from the tumor bank). CMF, cyclophosphamide, methotrexate and fluorouracil; kb, kilobases; N/A, number is not applicable for cell lines that can be grown in any amount and whose clinical history is not available; N/D, number not determined.
Results of end sequencing and mapping of each library
| MCF7 | BT474 | SKBR3 | Breast | Breast.2 | Ovary | Prostate | Brain | Normal | |
| Library name | MCF7_1 | CHORI-518 | CHORI-520 | B421 | CHORI514 | CHORI510 | PM1 | IGBR | K0241 |
| Mapped clones ( | 12,143 | 8,044 | 7,363 | 6,972 | 5,678 | 3,946 | 3,499 | 3,238 | 609 |
| Unique mapped clones ( | 11,492 | 7,547 | 6,950 | 6,540 | 5,381 | 3,714 | 3,296 | 3,051 | 568 |
| Valid pairs ( | 11,001 | 7,361 | 6,763 | 6,376 | 5,268 | 3,627 | 3,200 | 2,984 | 560 |
| Contigs ( | 6,323 | 4,135 | 4,171 | 4,365 | 3,450 | 2,877 | 2,747 | 2,573 | 548 |
| Contig coverage | 0.324 | 0.327 | 0.274 | 0.233 | 0.243 | 0.155 | 0.104 | 0.103 | 0.019 |
| Invalid pairs ( | 491 | 186 | 187 | 164 | 113 | 87 | 96 | 67 | 8 |
| Fraction invalid | 0.043 | 0.025 | 0.027 | 0.025 | 0.021 | 0.023 | 0.029 | 0.022 | 0.014 |
| 4.10 × e-04 | 0.056 | 0.032 | 0.051 | 0.133 | 0.080 | 0.020 | 0.113 | NA | |
| Number clusters ( | 36 | 26 | 24 | 2 | 7 | 2 | 2 | 0 | 0 |
| Invalid pairs in clusters ( | 164 | 61 | 64 | 4 | 24 | 4 | 4 | 0 | 0 |
The fraction of invalid pairs is calculated relative to the number of uniquely mapped pairs. The P value is the probability that the fraction of invalid pairs is the same as observed in the normal library, using a sample proportion test with pooled variance.
Summary of BAC sequencing
| Sample | Clones with identified or sequenced breakpoints | Total number of identified/sequenced breakpoints | Intragenic rearrangements | Gene:intergenic fusions | Gene:gene fusions | Intergenic: intergenic fusions |
| MCF7 | 12 | 36/35 | 3 | 10 | 19 | 4 |
| BT474 | 6 | 15/6 | 3 | 2 | 10 | 0 |
| SKBR3 | 8 | 24/8 | 7 | 4 | 12 | 1 |
| Breast (2B421) | 3 | 3/3 | 0 | 0 | 3 | 0 |
| Breast (CH514) | 0 | - | - | - | - | - |
| Ovary | 4 | 4/4 | 0 | 0 | 4 | 0 |
| Prostate | 5 | 5/5 | 0 | 4 | 1 | 0 |
| Brain | 3 | 3/3 | 0 | 3 | 0 | 0 |
Breakpoints are indicated as sequenced if the nucleotide sequence was obtained, or identified if the breakpoint was localized to 3-kilobase subclones. BAC, bacterial artifical chromosome.
Figure 2PCR validation of breakpoints in MCF7. (a) MCF7 clone 69F1 was sequenced and contained a small piece of chromosome 1 (purple rectangle) to chromosome 17 (yellow rectangle). Arrows on each rectangle indicate whether the fragment is oriented as in the reference genome (pointing to right) or inverted (pointing to left). PCR primers were designed to amplify the breakpoint and these primers were used to assay the other clones in the BES cluster with 69F1. Each of the other clones in the cluster are indicated as lines below 69F1, with the end-points of the lines indicating the locations of the mapped ends relative to the ends of 69F1. The heterogeneous PCR results might result from heterogeneity of the MCF7 cells, or the existence of multiple versions of this breakpoint in MCF7 genome. (b) PCR results for the clones presented in panel a. The expected size of the PCR fragment is 600 base pairs. (c) PCR validation of breakpoints in sequenced clone 37E22 from MCF7 and three additional clones in bacterial artificial chromosome end sequence (BES) cluster all fusing nearby locations from chromosomes 1, 3, and 20. Two other clones have the same complex internal organization as 37E22 with four rearrangement breakpoints. However, clone 34J23 contains only one of these breakpoints, suggesting that the rearrangement history of this clone is different from that of the others in the cluster.
Figure 3Use of dual-color FISH to validate a BT474 genomic breakpoint. End sequences from clone CHORI518_014-E04 were mapped to chromosomes 1 and 4. Clones RP11-692N22 and RP11-1095F2 were selected from the human RPCI11 library because their sequences map to just outside of tumor bacterial artificial chromosome (BAC) end sequence (BES) locations. These BACs were labeled with fluorescein and Texas red, respectively. Top: two chromosomes containing a merged yellow signal indicating juxtaposition of both probes are indicated with white arrows (and labeled A and B). Bottom: each labeled chromosome is shown with corresponding inverted-DAPI banded chromosome, and red and green image layers. Black arrows identify the region where the red and green probes are juxtaposed to one another. FISH, fluorescence in situ hybridization.
Figure 4Recurrent rearrangement loci in the three breast cancer cell lines. (a,b) Four loci on 20q13.2-13.3 shared by MCF7 and BT474 and (c) a locus near to the ERBB2 amplicon shared by BT474 and SKBR3. Colored boxes indicate the breakpoint regions for different bacterial artificial chromosome (BAC) clones from MCF7 (blue), BT474 (red), and SKBR3 (green) as a custom track on the University of California, San Francisco (UCSC) genome browser. A breakpoint region is defined as the possible locations of a breakpoint that are consistent with all the BAC end sequence (BES) in the cluster; thus, shorter boxes indicate more precise breakpoint localization. Arrows give the strand of the mapped BES and thus point away from the fused region.
Figure 5RT-PCR assays of fusion transcripts on a panel of breast cancer cell lines and normal tissues. HMEC-P1 stands for normal human mammary epithelial cells (passage 1), and HMEC-P4 stands for HMEC passage 4 (higher passage). (a) RT-PCR reveals expression of DR00074 (HYDIN gene fusion) in 16 out of 21 tested breast cancer cell lines, normal cultured human breast epithelial cells, and a wide range of normal human tissues. (b) RT-PCR validation of CN272097 a cDNA produced by a complex rearrangement on chromosome 5 fusing the SLC12A2 gene and expressed sequence tag (EST) AK090949. The results provide evidence for expression of the fused transcript in 5 out of 21 breast cancer cell lines and in higher passage but not lower passage human mammary epithelial cells (HMECs). Note that MDAMB435 was recently demonstrated to be derivative of the M14 melanoma cell line and not from breast [62], and the absence of the SLC12A2 fusion is this cell line is consistent with its absence in other nonbreast tissues.
Figure 6Results of SNP identification in BAC end sequences. (a) The number of high quality isolated single nucleotide polymorphisms (SNPs) in uniquely mapped bacterial artificial chromosome (BAC) end sequences expressed per kilobase (blue). Each tumor sample has a significantly higher rate of SNPs compared with the normal library, whereas the ovarian library exhibits a rate significantly higher than the other tumor samples. Also shown is the fraction of SNPs not found in dbSNP124 (red). The ovarian library shows a significantly higher rate of these novel SNPs. (b) Mutational spectrum of SNPs for each of the samples. For C:G → T:A transitions and C:G → G:C transversions, the fraction at CpG dinucleotides is indicated in red and yellow, respectively.