| Literature DB >> 28570594 |
Simon Haile1, Pawan Pandoh1, Helen McDonald1, Richard D Corbett1, Philip Tsao1, Heather Kirk1, Tina MacLeod1, Martin Jones1, Steve Bilobram1, Denise Brooks1, Duane Smailus1, Christian Steidl2, David W Scott2, Miruna Bala1, Martin Hirst1, Diane Miller1, Richard A Moore1, Andrew J Mungall1, Robin J Coope1, Yussanne Ma1, Yongjun Zhao1, Rob A Holt1, Steven J Jones1, Marco A Marra1,3.
Abstract
Curation and storage of formalin-fixed, paraffin-embedded (FFPE) samples are standard procedures in hospital pathology laboratories around the world. Many thousands of such samples exist and could be used for next generation sequencing analysis. Retrospective analyses of such samples are important for identifying molecular correlates of carcinogenesis, treatment history and disease outcomes. Two major hurdles in using FFPE material for sequencing are the damaged nature of the nucleic acids and the labor-intensive nature of nucleic acid purification. These limitations and a number of other issues that span multiple steps from nucleic acid purification to library construction are addressed here. We optimized and automated a 96-well magnetic bead-based extraction protocol that can be scaled to large cohorts and is compatible with automation. Using sets of 32 and 91 individual FFPE samples respectively, we generated libraries from 100 ng of total RNA and DNA starting amounts with 95-100% success rate. The use of the resulting RNA in micro-RNA sequencing was also demonstrated. In addition to offering the potential of scalability and rapid throughput, the yield obtained with lower input requirements makes these methods applicable to clinical samples where tissue abundance is limiting.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28570594 PMCID: PMC5453589 DOI: 10.1371/journal.pone.0178706
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Automated high throughput FormaPure-based extraction protocol.
(A) Work flow illustration of sample acquisition, upstream sample processing and extraction. Note that a separate high temperature incubation step is added to facilitate the reversal of remaining crosslinks. The upstream processes are manual in the original protocol whereas those steps are modified to be suitable for automation in the modified protocol. The in-house on-deck heating blocks were instrumental in rendering the lysis/deparaffinization steps automatable. Acquisition of samples in SBS format matrix tubes with their automated capping and decapping were also further measures that allowed the entire process to be amenable for automated liquid handling. (B) gDNA yield. Historical gDNA yield data from the Qiagen/High Pure protocol (Q; n = 142) using equivalent sizes of numerous FFPE samples of lymphoma origin was compared with that of the FormaPure protocol (F; n-91). (C) RNA yield. Comparison of the Qiagen-High Pure (Q-H), and FormaPure (F) protocols are shown. N = 142 for Q-H and N = 44 for F.
Fig 2Suitability of the FormaPure extracted DNA for next generation sequencing.
(A) Final library yield. Library yield (nM) for 100 ng of DNA extracted from each source using the FormaPure protocol (n = 91). (B) Alignment rate. 32 libraries that were generated using the FormaPure extraction were sequenced at PE125bp on the Illumina platform. The proportions of untrimmed reads that mapped to the human reference genome as well as alignment rates after trimming down to PE100 and PE75 are shown graphically. Data on other metrics is shown in S1 Table. (C) Pair-wise correlation matrix of various laboratory and bioinformatic metrics. Pearson’s correlation coefficients were calculated for the indicated pairs with the heat map representing the extent of positive (red) and negative (green) correlations.
Fig 3Suitability of the FormaPure extracted RNA for FFPE strand-specific RNA-seq.
(A) Strand-specific libraries were generated from four different FormaPure extracted human FFPE samples (FFPE A-D) and UHR fresh RNA. Two different total (DNase-treated) RNA input amounts were used (100 and 200 ng, respectively). Final library yield (nM) (left panel) and % duplicates (middle panel) as well as the distribution of aligned reads to various regions of the transcriptome (right panel) are shown graphically. These libraries were sequenced as a pool at PE75 bp. (B) Comparison of Qiagen and FormaPure extraction protocols using mouse FFPE scrolls. Final library yield (nM) (Left panel) and % duplicates, % aligned, and the distribution of aligned reads to various regions of the transcriptome (middle panel) as well as number of genes with 1x coverage (right panel) are shown graphically.
Fig 4Suitability of the FormaPure extracted RNA for FFPE miRNA-seq.
(A) miRNA library quality. Comparisons of adapter dimer content, alignment rate, and miRNA content between the FormaPure (F) protocol and the Qiagen-High Pure (Q-H) protocol for 250 ng and 1000 ng input amounts are shown. n = 6 for the 1000 ng F and Q-H conditions; n = 4 for the 250 ng F condition and n = 3 for the 250 ng Q-H condition. *P<0.05. Error bars = standard deviations. (B) miRNA library diversity. Comparison of the number of miRNA species with ≥10 reads between the two protocols. (C) Inter-protocol expression correlation. The strength of the expression correlation of miRNAs from samples that were extracted using the FormaPure protocol with that of matching samples from the Qiagen-High Pure protocol is shown in the heat map. In bold are correlation values between the two protocols from matched samples.