| Literature DB >> 25887218 |
Min Wang1, Christine R Beck2, Adam C English3, Qingchang Meng4, Christian Buhay5, Yi Han6, Harsha V Doddapaneni7, Fuli Yu8,9, Eric Boerwinkle10,11, James R Lupski12,13, Donna M Muzny14, Richard A Gibbs15,16.
Abstract
BACKGROUND: Generation of long (>5 Kb) DNA sequencing reads provides an approach for interrogation of complex regions in the human genome. Currently, large-insert whole genome sequencing (WGS) technologies from Pacific Biosciences (PacBio) enable analysis of chromosomal structural variations (SVs), but the cost to achieve the required sequence coverage across the entire human genome is high.Entities:
Mesh:
Year: 2015 PMID: 25887218 PMCID: PMC4376517 DOI: 10.1186/s12864-015-1370-2
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Workflow of the PacBio-LITS method. A. sample intake QC; B. DNA fragmentation; C. target size selection using Pippin/BluePippin; D. construction of pre-capture library using size-selected DNA fragments; E. Roche/NimbleGen SeqCap EZ solution-based target enrichment using custom probes; F. construction of PacBio library using post-capture product and G. PacBio long read-length sequencing.
Optimized conditions for PacBio-LITS library preparation
|
|
| |||
|---|---|---|---|---|
|
|
|
|
| |
|
| 500 ng | 1 μg | 1.5 μg | 2 μg |
|
| Covaris | Covaris or g-TUBE | g-TUBE | g-TUBE |
|
| 1 Kb shearing with microTube | Covaris: 5 Kb shearing with miniTube o | 7000 rpm, 3 min. | 4800 rpm, 8 min. |
|
| Pippin | Blue Pippin | Blue Pippin | Blue Pippin |
|
| Range mode: 900 bp-1.3 Kb | Range mode: 4 Kb-8 Kb | Range mode: 5 Kb-9 Kb | Range mode: 8 Kb-12 Kb |
|
| Phusion HF Polymerase | TaKaRa LA | TaKaRa LA | TaKaRa LA |
|
| 6 | 8 | 8 | 10 |
|
| 1 μg | 1 μg | 2 μg | 2 μg |
|
| 14 | 16 | 16 | 18 |
|
| ~700 bp | ~1.8-2 Kb | ~2-2.5 Kb | ~2.5-3 Kb |
Pilot studies of PacBio-LITS libraries
|
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|
| BAB1123 | 1 Kb | SMS/PTLS (7 Mb) | Regular/ C2-P2/1 × 90 | 360,145 | 277 | 770 | 65 | 20 | 89 | 41 |
| BAB1123 | 4 Kb | SMS/PTLS (7 Mb) | MagBead/ C2-P2/1 × 120 | 162,600 | 353 | 2,169 | 69 | 29 | 90 | 54 |
| NA12878 | 4 Kb | MHC (5 Mb) | MagBead/ C2-XL/1 × 120 | 214,130 | 409 | 1,885 | 50 | 44 | 96 | 70 |
| HS1011 | 4 Kb | MHC (5 Mb) | MagBead/ C2-XL/1 × 120 | 329,251 | 698 | 2,101 | 49 | 73 | 97 | 87 |
| NA12878 | 6 Kb | SMS/PTLS (7 Mb) | MagBead/ C3-P5/1 × 180 | 395,342 | 933 | 2,358 | 73 | 91 | 95 | 89 |
*Detailed sequencing metrics showed in Additional file 3: Table S1.; SRB: subread base; SRL: subread length.
Figure 2PacBio sequencing of a 6 Kb insert capture library. A) Post-capture library QC. A 6Kb insert capture library was constructed using the custom SMS/PTLS probe set and quality of the post-capture LM-PCR product was analyzed by Agilent 2100 Bioanalyzer. B) Target capture performance. Critical capture sequencing metrics (alignment rate, on-target/buffer rate and base coverage) were calculated after data analysis. C) Read length performance. Sequencing the 6 Kb insert library by PacBio C3/P5 chemistry with a 180-min movie generated a set of read length metrics for post-filter reads and subreads.
Figure 3Delineation of CGRs in PTLS cases. A) CGRs revealed by aCGH. Human chromosome 17p11.2 is illustrated as a horizontal line on the top of the figure with coordinates (Mb) indicated below. Red blocks represent duplicated regions and blue segments indicate triplications. RAI1 is indicated by the vertical gray shadow. Yellow and blue shaded areas represent LCRs; purple arrows indicating the orientation [19]. Vertical black lines define the 7 Mb (14.9-21.9 Mb) targeted by the SMS/PTLS probe set. Individual array results are below the schematics, focused on of copy number alterations. Coordinates coordinates (in Mb) are indicated below arrays. Previously determined junctions are labeled with a “1” so that the rearrangement joins together the two number “1”s, and junctions identified by PacBio-LITS are labeled with a “2”. Data for 2714 [19] and 2695 [23] were published previously. B) Novel breakpoint junction sequences detected with PacBio-LITS. Breakpoint sequences for the three new junctions identified by PacBio-LITS are aligned to the reference sequence. Transitions between the sequences are indicated with different colors, with gray denoting regions of disagreement with the junction sequence. Chromosome 17 coordinates (hg19/GRCh37) are indicated. Red lettering denotes microhomology. The Alu-Alu mediated alignment in BAB2695 has asterisks (*) denoting regions where the two Alu elements do not align. C) Formation of CGRs. Case BAB2714 i) A map of the reference genome. Colored boxes represent sequence blocks. ii) Black arrows indicate the two template switches resulting in the rearrangement. The template switches could also have occurred in the opposite order. iii) The rearranged region, which has an inversion-duplication for the blue sequence block followed by a direct duplication of the red sequence block. Case BAB2695 i) A map of the reference genome. ii) The resultant rearrangement. Both junctions are mediated by Alu elements, and are in a head-to-tail tandem orientation (no inversion).