| Literature DB >> 26428292 |
Konstantin Okonechnikov1, Ana Conesa2, Fernando García-Alcalde1.
Abstract
MOTIVATION: Detection of random errors and systematic biases is a crucial step of a robust pipeline for processing high-throughput sequencing (HTS) data. Bioinformatics software tools capable of performing this task are available, either for general analysis of HTS data or targeted to a specific sequencing technology. However, most of the existing QC instruments only allow processing of one sample at a time.Entities:
Mesh:
Year: 2015 PMID: 26428292 PMCID: PMC4708105 DOI: 10.1093/bioinformatics/btv566
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Multi-sample BAM QC analysis of a γH2AX ChiP-seq experiment in human cells comparing four different conditions (Koeppel ). The sequencing was performed in three batches. A single batch included samples in all conditions. (A) The GC-content distribution indicates a problem with the samples from the second batch. (B) The PCA biplot also demonstrates the second batch grouped together, despite different biological treatments
Qualimap2—overview of novel features
| Mode | Novel features and improvements |
|---|---|
| BAM QC | Advanced statistics of coverage, insert size, mismatch rate, etc.; duplicates extraction; homopolymer size control; performance and output data adaption |
| Multi-sample BAM QC | Comparison of coverage, GC-content, insert size etc. from multiple samples along with PCA-based summary |
| RNA-seq QC | Transcript coverage, 5′–3′ bias, alignment distribution, junction, strand-specificity analysis; counts computation |
| Counts QC | Multi-sample analysis (expression level, biotype, etc.) and condition comparison (expression level, GC bias, etc.) |