| Literature DB >> 25288879 |
Rendong Yang1, Li Chen2, Scott Newman3, Khanjan Gandhi3, Gregory Doho4, Carlos S Moreno5, Paula M Vertino6, Leon Bernal-Mizarchi7, Sagar Lonial7, Lawrence H Boise7, Michael Rossi8, Jeanne Kowalski9, Zhaohui S Qin1.
Abstract
We present a pipeline to perform integrative analysis of mate-pair (MP) and paired-end (PE) genomic DNA sequencing data. Our pipeline detects structural variations (SVs) by taking aligned sequencing read pairs as input and classifying these reads into properly paired and discordantly paired categories based on their orientation and inferred insert sizes. Recurrent SV was identified from the discordant read pairs. Our pipeline takes into account genomic annotation and genome repetitive element information to increase detection specificity. Application of our pipeline to whole-genome MP and PE sequencing data from three multiple myeloma cell lines (KMS11, MM.1S, and RPMI8226) recovered known SVs, such as heterozygous TRAF3 deletion, as well as a novel experimentally validated SPI1 - ZNF287 inter-chromosomal rearrangement in the RPMI8226 cell line.Entities:
Keywords: multiple myeloma; structural variations; variant detection; whole-genome sequencing
Year: 2014 PMID: 25288879 PMCID: PMC4179644 DOI: 10.4137/CIN.S13783
Source DB: PubMed Journal: Cancer Inform ISSN: 1176-9351
Summary of sequencing data.
| TYPE | CELL LINE | READ LENGTH (BP), DEPTH, QUALITY SCORE | NO. OF TOTAL READS | NO. OF MAPPED READS | FRAGMENT LENGTH MEAN ± SD (BP) |
|---|---|---|---|---|---|
| Pair-end | KMS11 | 2 × 100, 12X, 35 | 374, 274, 810 | 347, 448, 960 | 301 ± 75 |
| MM.1S | 2 × 100, 14X, 34 | 429, 622, 446 | 394, 631, 072 | 296 ± 78 | |
| RPMI8226 | 2 × 100, 13X, 34 | 394, 471, 850 | 366, 210, 450 | 291 ± 81 | |
| Mate-pair | KMS11 | 2 × 100, 7X, 31 | 221, 885, 782 | 48, 782, 044 | 3,268 ± 496 |
| MM.1S | 2 × 100, 3X, 35 | 86, 871, 758 | 43, 464, 282 | 3,646 ± 244 | |
| RPMI8226 | 2 × 100, 4X, 36 | 114, 486, 206 | 59, 310, 148 | 3,501 ± 340 |
Figure 1An overview of the SVfinder pipeline. SVfinder takes a SAM file as input and divides the mapped reads into concordant and discordant group. The normal insert size range is estimated from reads in concordant group. The discordant reads are clustered and classified into different types of variation subgroup according to their insert size and read pairs orientation. Identified SVs are annotated and output in BED format, allowing for easy downstream analysis or viewing in a genome browser.
Known and novel SVs associated with genes identified by SVfinder.
| GENE A | GENE B | POSITION A | POSITION B | CHR A | CHR B | KMS11 | MM.1S | RPMI8226 | TYPE | REFERENCE |
|---|---|---|---|---|---|---|---|---|---|---|
| WHSC1 | IGH | intron | upstream | chr4 | chr14 | PE | Translocation | |||
| MAF | IGH | intron | downstream | chr16 | chr14 | PE | Translocation | |||
| CDKN2A | CDKN2B | coding | intron | chr9 | chr9 | PE | Deletion | |||
| TRAF3 | CDC42BPB | coding | intron | chr14 | chr14 | MP/PE | Deletion | |||
| SPI1 | ZNF287 | coding | coding | chr11 | chr17 | MP/PE | Translocation | Novel | ||
| KDM6A | KDM6A | intron | intron | chrX | chrX | MP/PE | Deletion | Novel | ||
| KCTD8 | KCTD8 | intron | intron | chr4 | chr4 | MP/PE | Deletion | Novel | ||
| ABHD17B | C9orf85 | coding | intron | chr9 | chr9 | MP/PE | Deletion | Novel | ||
Figure 2Discovery of novel SPI1 and ZNF287 t(11;17) inter-chromosomal rearrangement in RPMI8226 multiple myeloma cell line. (A) Mapped discordant read pairs in mate pair (upper panel) and paired end (lower panel) sequencing shown in integrative genome viewer (IGV). (B) Identification of t(11;17) translocation and breakpoint using Sanger sequencing.