| Literature DB >> 21155765 |
Aaron G Day-Williams1, Eleftheria Zeggini.
Abstract
BACKGROUND: Advances in the understanding of complex trait genetics have always been enabled by advances in genomic technology. Next-generation sequencing (NGS) is set to revolutionize the way complex trait genetics research is carried out.Entities:
Mesh:
Year: 2010 PMID: 21155765 PMCID: PMC3107422 DOI: 10.1111/j.1365-2362.2010.02437.x
Source DB: PubMed Journal: Eur J Clin Invest ISSN: 0014-2972 Impact factor: 4.686
Figure 1Consequences of Short Indel Misalignment on False Positive single nucleotide polymorphisms (SNP) Calling. Turquoise bars represent false positive SNPs caused by the misalignment of reads containing a two base-pair insertion relative to the reference. For the misaligned reads, since the inserted sequence occurs at the beginning/end of short reads, it is difficult for the alignment algorithms to recognize the insertion. Whereas for the correctly mapped insertion read, the inserted sequence occurs in the middle of a read with perfect matches on either side of the indel allowing the alignment algorithm correctly open a gap in the reference. Several SNPs in such a short window are hallmarks of indel misalignment.
Figure 2Graphical representation of allelic heterogeneity at a locus with common and low frequency/rare variant associations with a complex trait. Turquoise bars represent trait-related variants carried by different individuals at a locus of interest. Bar width represents variant frequency (wider bars denote variants with higher frequency). Single-point association analysis of the low frequency/rare variants has extremely low power, but composite analysis by considering all variants of interest locus-wide is a more powerful alternative.
Sensitivity and specificity of next-generation sequencing designs in complex disease studies
| Project type | Example setting | Sequencing design | Required sensitivity | Required specificity |
|---|---|---|---|---|
| Sequence variant discovery | Variant identification in candidate region before genotyping follow-up | Targeted (nonindexed pools can increase cost efficiency) | High | Low |
| Novel association discovery | Genome-wide sequence-based association study; promising variants followed-up with genotyping | Whole-genome/Whole-exome | High | Medium |
| Fine-mapping established association signal | Variant identification and association testing to narrow established association signal | Targeted (indexed pools can increase cost efficiency) | Medium | High |
| Cancer somatic mutation | Identifying interchromosomal rearrangements, duplications, amplifications, point mutations | Whole-genome (both normal and cancer samples sequenced) | High | High |
Sensitivity is defined as the ability to detect all variants present (e.g. high sensitivity ≥ high true positive rate). Specificity is defined as the correctness of called variants (e.g. high specificity ≥ high true negative rate).