| Literature DB >> 23887607 |
D Ulahannan1, M B Kovac, P J Mulholland, J-B Cazier, I Tomlinson.
Abstract
Next-generation sequencing (NGS) of cancer genomes promises to revolutionise oncology, with the ability to design and use targeted drugs, to predict outcome and response, and to classify tumours. It is continually becoming cheaper, faster and more reliable, with the capability to identify rare yet clinically important somatic mutations. Technical challenges include sequencing samples of low quality and/or quantity, reliable identification of structural and copy number variation, and assessment of intratumour heterogeneity. Once these problems are overcome, the use of the data to guide clinical decision making is not straightforward, and there is a risk of premature use of molecular changes to guide patient management in the absence of supporting evidence. Paradoxically, NGS may simply move the bottleneck of personalised medicine from data acquisition to the identification of reliable biomarkers. Standardised cancer NGS data collection on an international scale would be a significant step towards optimising patient care.Entities:
Mesh:
Year: 2013 PMID: 23887607 PMCID: PMC3749581 DOI: 10.1038/bjc.2013.416
Source DB: PubMed Journal: Br J Cancer ISSN: 0007-0920 Impact factor: 7.640
Figure 1Framework for cancer genome analysis using NGS.
Demonstrating the key software programs that could be used on NGS data on cancer genomes
| BWA | Burrows–Wheeler alignment tool. Consists of three algorithms. BWA-backtrack, BWA-SW and BWA-MEM. BWA-backtrack is designed for Illumina reads of up to 100 bp. The other two algorithms are designed for longer sequences ranging from 70 bp to 1 Mbp. | |
| Eland | Eland is a commercially based software program designed by Illumina to map Illumina reads to a reference genome as part of their genome analysis pipeline. | |
| MAQ | Maps short reads to a reference genome. Historically useful in work in cancer genomes designed originally for Illumina and SOLiD platforms but is becoming outdated because of speed and accuracy by newer software programs. Produces ungapped alignment of reads. | |
| Stampy | Maps short reads to a reference genome using Illumina reads. Particularly useful for sequences, which are divergent to the reference genome, containing insertions and deletions. Can be used in combination with BWA. | |
| GATK | Structured software library that has programs to analyse NGS data. Can be used for variant calling and identification of indels. | |
| JointSNVMix | Analyses tumour and normal genome pairs simultaneously so that germline and somatic mutations can be distinguished. | |
| MuTect | A variant caller to identify somatic point mutations from tumour normal paired sequencing data. Reportedly low false-positive rate. The program can determine from the depth of coverage in tumour and normal whether there is sufficient sensitivity to call a somatic mutation. | |
| Samtools | This is a software program that can align and manipulate NGS data, which is stored in the SAM format, a generic format for storing large nucleotide sequence data. It is not specific to cancer genomes but can be used to identify variant calls in the tumour distinct from the reference and can also be used to identify short range indels. | |
| Somatic Sniper | The program compares tumour and normal data to produce a Phred-based probability score to determine the likelihood of the tumour and normal genotypes being different. | |
| Varscan2 | Can be used to identify somatic and germline variants and LOH events in tumour normal pairs. Has been used to identify CNVs in tumour normal exome data. It is a platform independent tool working on data with most NGS platforms including Ion Torrent. | |
| BreakDancer | BreakDancer Max – can identify structural variants using paired-end sequencing reads by noting paired-end reads, which are mapped at unexpected distances or are incorrectly orientated. Detects large insertions, deletions, inversions, inter/intra chromosomal translocations. BreakDancer Mini – used to detect small indels 10–100 bp, which are not routinely identified by BreakDancer Max. | |
| Dindel | This can identify small indels in NGS data. However, it can only be used with Illumina sequence data. With deeper coverage the number of false positives can be reduced by filtering the data to ensure that each indel is present more than twice. | |
| Genome STRiP | Designed to detect structural variations shared by multiple individuals. Needs 20–30 genomes to achieve satisfactory results. Its current use is limited to uncovering and genotyping deletions relative to a reference sequence. | |
| Pindel | Can be used to identify simple deletions and insertions. Uses paired-end reads to identify large breakpoints and medium size insertions. Can detect inversions and tandem duplications. It uses BAM files generated from Illumina read data. | |
| CNAseg | Uses NGS data to estimate copy number states using the depth of coverage and variability in coverage in the cancer and normal to try and control false-positive rate. | |
| SegSeq | Uses NGS data to detect CNVs of a given size using tumour- normal pairs and can be used to map breakpoints. | |
Abbreviations: BWA=Burrows Wheeler Aligner; BWA-SW=Burrows Wheeler Aligner's Smith-Waterman Alignment; CNV=copy number variant; LOH=loss of heterozygosity; MAQ=mapping and assembly with quality; NGS=next-generation sequencing; SAM=sequence alignment/map.
Clinical and research applications of NGS
| Disease classification | NGS will increase accessibility for genetic testing. A larger number of patients can undergo genetic testing for familial cancer syndromes.
In future, NGS could be used for ‘molecular Staging of tumours' to improve classification by relating this to the behaviour of the tumour with regards to aggressiveness and propensity to metastasise. This may have therapeutic implications. | Current cost and labour constraints limit the number of patients who are eligible for genetic testing as they selected on stringent criteria based on personal and family history. However, this approach may miss a sizeable number of patients who are carriers of the mutation ( |
| Therapeutic options | NGS will facilitate the development of targeted therapy and personalised medicine.
It could be used potentially to try and detect relapse and monitoring residual disease burden by undertaking deep sequencing of blood to try and detect circulating tumour cells.
| |
| Potential research interests | Uncovering of driver mutations. Profiling genomic instability. Characterising tumour evolution. Epigenetics analysis of cancer genomes. Discovery of targets for therapy. | |
Abbreviation: NGS=next-generation sequencing.