| Literature DB >> 32623783 |
Matt A Field1,2.
Abstract
Sequencing the first human genome in 2003 took 15 years and cost $2.7 billion. Advances in sequencing technologies have since decreased costs to the point where it is now feasible to resequence a whole human genome for $1000 in a single day. These advances have allowed the generation of huge volumes of high-quality human sequence data used to construct increasingly large catalogs of both population-level and disease-causing variation. The existence of such databases, coupled with a high-quality human reference genome, means we are able to interrogate and annotate all types of genetic variation and identify pathogenic variants for many diseases. Increasingly, sequencing-based approaches are being used to elucidate the underlying genetic cause of autoimmune diseases, a group of roughly 80 polygenic diseases characterized by abnormal immune responses where healthy tissue is attacked. Although sequence data generation has become routine and affordable, significant challenges remain with no gold-standard methodology to identify pathogenic variants currently available. This review examines the latest methodologies used to identify pathogenic variants in autoimmune diseases and considers available sequencing options and subsequent bioinformatic methodologies and strategies. The development of reliable and robust sequencing and analytic workflows to detect pathogenic variants is critical to realize the potential of precision medicine programs where patient variant information is used to inform clinical practice.Entities:
Keywords: Autoimmune diseases; SNV; high-throughput sequencing; pathogenic variant; variant annotation; variant detection
Year: 2020 PMID: 32623783 PMCID: PMC7891608 DOI: 10.1111/imcb.12372
Source DB: PubMed Journal: Immunol Cell Biol ISSN: 0818-9641 Impact factor: 5.126
Sample of sequencing options available for variant detection in autoimmune diseases
| Sequencing type | Detectable variation | Advantages | Limitations |
|---|---|---|---|
| GWAS | Loci on GWAS chip | Cheap/large studies possible | Only common SNVs |
| Whole genome | All variant types: coding and noncoding | All variant types detectable | Expensive relative to targeted approaches |
| Exome | SNV and small indel in coding regions | Capture most coding regions | No noncoding or large variation |
| Gene panel | SNV and small indel in panel genes | High‐depth coverage for panel genes | Nothing novel is detectable |
| Molecular tagging | Somatic and cell subset–specific variants | Analyze individual input molecules | Additional library preparation / custom software |
| Single cell | Somatic and cell subset–specific variants | Analyze individual cells | Additional library preparation / custom software |
| HLA typing | HLA genotypes | High‐resolution phased HLA genotypes | Immune system specific |
| BCR | B‐cell clonotypes | Construct and observe changes in BCR | Immune system specific |
| TCR | T‐cell clonotypes | Construct and observe changes in TCR | Immune system specific |
| Transcriptome | Aberrant splicing/gene fusions/coding SNV | Observe effect of variants on genes | Miss rare transcripts/added expense |
| Long reads | All variant types: coding and noncoding | Resolve large variants/full gene transcripts | Higher error rate and higher per‐base cost |
BCR, B‐cell receptor; GWAS, genome‐wide association study; HLA, human leukocyte antigen; SNV, single‐nucleotide variant; TCR, T‐cell receptor.
Figure 1Variant detection workflow. SNV, single‐nucleotide variant.
Common software options for analysis steps in variant detection workflow
| Analysis step | Example software | URL |
|---|---|---|
| Data quality control/trimming | Trimmomatic |
|
| Read alignment | BWA |
|
| BAM preprocessing | Picard |
|
| Variant calling (SNV/indel) | GATK |
|
| Variant calling (UMI tags) | DeepSNVMiner |
|
| Variant calling (structural variation/copy‐number variation) | Manta |
|
| Variant calling (Pedigree) | VASP |
|
| Variant annotation | Variant Effect Predictor |
|
| Variant prioritization | PolyPhen‐2 |
|
SNV, single‐nucleotide variant; UMI, unique molecular identifier.
Figure 2Strategies to reduce the variant search space for pathogenic variants. HLA, human leukocyte antigen; TCR, T‐cell receptor.
Figure 3Personalized medicine workflow. SNV, single‐nucleotide variant.
Resources to reduce variant search space in autoimmune diseases
| Analysis type | Software/database | URL |
|---|---|---|
| HLA sequencing | HLAminer |
|
| HLA sequencing | seq2HLA |
|
| HLA sequencing | OptiType |
|
| HLA sequencing | PHLAT |
|
| TCR/BCR sequencing | MiXCR |
|
| TCR/BCR sequencing | VDJPuzzle |
|
| TCR/BCR sequencing | IMSEQ |
|
| BCR sequencing | IgDiscover |
|
| Annotations | ImmGen |
|
| Annotations | InnateDB |
|
| Annotations | Immuno Polymorphism Database |
|
| Annotations | Centre for Personalised Immunology |
|
| Annotations | Infevers |
|
| Annotations | LOVD 2.0 |
|
BCR, B‐cell receptor; HLA, human leukocyte antigen; TCR, T‐cell receptor.