| Literature DB >> 22206293 |
Karen K Mestan1, Leonard Ilkhanoff, Samdeep Mouli, Simon Lin.
Abstract
Human genome sequencing is the process by which the exact order of nucleic acid base pairs in the 24 human chromosomes is determined. Since the completion of the Human Genome Project in 2003, genomic sequencing is rapidly becoming a major part of our translational research efforts to understand and improve human health and disease. This article reviews the current and future directions of clinical research with respect to genomic sequencing, a technology that is just beginning to find its way into clinical trials both nationally and worldwide. We highlight the currently available types of genomic sequencing platforms, outline the advantages and disadvantages of each, and compare first- and next-generation techniques with respect to capabilities, quality, and cost. We describe the current geographical distributions and types of disease conditions in which these technologies are used, and how next-generation sequencing is strategically being incorporated into new and existing studies. Lastly, recent major breakthroughs and the ongoing challenges of using genomic sequencing in clinical research are discussed.Entities:
Mesh:
Year: 2011 PMID: 22206293 PMCID: PMC3269395 DOI: 10.1186/1479-5876-9-222
Source DB: PubMed Journal: J Transl Med ISSN: 1479-5876 Impact factor: 5.531
Summary of throughput, length, quality, and cost of current versions of genomic sequencing.
| Platform | Throughput | Length | Quality | Cost | Applications | Sources of error | Advantages | Disadvantages |
|---|---|---|---|---|---|---|---|---|
| 6 Mb/day | 1,000 nt | 10-4-10-5 | ~$500/Mb | Small sample sizes, genomes, SNPs, long haplotypes, low complexity regions, etc. | Polymerase/amplification, low intensities/missing termination variants, contaminant sequences | Longest reads, gold standard for validations | High cost, low throughput | |
| 750 Mb/day | 400 nt | 10-3-10-4 | ~$20/Mb | Complex genomes, SNPs, structural variation, indexed samples, small RNAs, mRNAs, etc. | Amplification, mixed beads, intensity thresholding, homopolymers, phasing, neighbor interference | Longer reads, easier to assemble | Medium throughput, expensive, indel errors more likely | |
| 5,000 Mb/day | 100 nt | 10-2-10-3 | ~$0.50/Mb | Complex genomes, counting (SAGE, CNV Chip, small RNA), mRNAs, structural variation, bisulfite data, indexing SNPs, etc. | Amplification, mixed clusters/neighbor interference, phasing, base labeling | Lower cost, widely adopted platform, most well-developed bioinformatics efforts | Higher base substitution error rate, shorter reads | |
| SOLiD | 5,000 Mb/day | 75 nt | 10-2-10-3 | ~$0.50/Mb | Complex small genomes, counting (SAGE, ChiP, small RNA, CNV), SNPs, mRNAs, structural variation, indexing, etc. | Amplification, mixed beads, phasing, signal decline, neighbor interference | Lower cost, 2-base encoding chemistry, higher per-base accuracy | Shortest read lengths, |
Adapted from Kircher, et al. Bioessays 201010
Comparison of first-, second-, and third-generation genomic sequencing.
| First generation | Second generation | Third generation | |
|---|---|---|---|
| Size-separation of specifically end-labeled DNA fragments | Wash-and-scan SBS | Single molecule real time sequencing | |
| Averaged across many copies of the DNA molecule | Averaged across many copies of the DNA molecule | Single DNA molecule | |
| High | High | Lower | |
| Moderate | Short (generally much shorter than Sanger sequencing) | > 1000 bp | |
| Low | High | High | |
| High cost per base, | Low cost per base, | Low cost per base, | |
| cDNA sequencing | cDNA sequencing | Direct RNA sequencing | |
| Hours | Days | < 1 day | |
| Moderately complex, PCR amplification is not required | Complex, PCR amplification is required | Various | |
| Routine | Complex | Complex | |
| Base calls with quality values | Base calls with quality values | Base calls with quality values | |
Adapted from Schadt, et al. Hum Mol Genet 201013
Figure 1Worldwide map of clinical trials registered in ClinicalTrials.gov. Thirty-five studies were found by query of: genomic sequencing, based upon a recent search in http://www.clinicaltrials.gov[37] (July, 2011).
Figure 2U.S. distribution of registered clinical trials that disclose the use of genomic sequencing. In July, 2011, twenty studies reported incorporation of NGS technology. The majority of these studies were being conducted in California (4) and Maryland (7).
Condition categories and diseases studied utilizing genomic sequencing technology.
| Condition | Diseases |
|---|---|
| Mycoses, Osteitis, Pelvic Infection, Pelvic Inflammatory Disease, Proteus Infections | |
| Anemia, Blood Coagulation, Burkitt Lymphoma, Hodgkin Disease, Hemorrhagic, Hemoglobinopathies, Leukemias, Lymphomas, Lymphoproliferative Disorders, Multiple Myeloma | |
| Adenocarcinoma, Neuroblastoma, Nevus, Osteosarcoma, Retinoblastoma, multiple neoplasms, carcinomas, etc. | |
| Digestive System Neoplasms, Duodenal Diseases, Gastroenteritis, Ileal, Jejunal, Stomach Diseases and Neoplasms | |
| Multiple congenital anomalies, Cardiovascular Anomalies/Congenital Heart Disease, Inborn Diseases, Hemoglobinopathies, Neurocutaneous Syndromes, Neurofibromatoses | |
| Deafness, Hearing Disorders, Hearing Loss | |
| Retinoblastoma | |
| Acromegaly, Endocrine Disorders, Dwarfism, Neoplasms, Hyperparathyroidism, Parathyroid and Pituitary Diseases | |
| Aortic Valve Stenosis, Arterial Occlusive Diseases, Cardiomyopathies, Coronary Artery Disease, MI | |
| AIDS, Lymphomas, Hodgkin Disease, Immunoproliferative Disorders, Leukemias, Myelomas, Mycosis, Macroglobulinemia | |
| Acromegaly, Bone Diseases, Dwarfism, Congenital Limb Anomalies, Musculoskeletal Abnormalities, Osteitis | |
| ALS, Aphasias, Brain Neoplasms, CNS Diseases, Coma, Communication Disorders, Deafness, Dementia/Delirium, Motor Neuron Diseases, Neurocutaneous Syndromes, Neurodegenerative Diseases, Neurofibromas/NF, Neuromuscular Diseases, Pain, Speech Disorders, Spinal Cord Diseases | |
| Breast Diseases/Neoplasms, Neurocutaneous Syndromes | |
| Coma, Communication, Deafness/Delirium, Hearing Disorders, Hemolysis, Inflammation, Ischemia, Neurobehavioral, Pain, Sclerosis, Sepsis/Shock | |
| Adnexal Diseases, Renal Cell Carcinoma, Endometritis, Kidney Diseases, Pelvic Inflammatory Disease, Prostatic Neoplasms, Urogenital Neoplasms, Uterine and Urologic Diseases, Wilm's Tumor | |
| AIDS, Burkitt Lymphoma, HIV Infections | |
Source: http://www.clinicaltrials.gov (July, 2011)
Active and completed studies using genomic sequencing results as the primary outcome measure (source: http://www.clinicaltrials.gov, July 2011)
| Study Title/ | NCT #/ | Condition | Description |
|---|---|---|---|
| Next Generation to Identify Genetic Causes of Disease in Patients Participating in NICHD Clinical Protocols | NCT01375543 | Genetic diseases (pediatric) | Use of DNA samples to conduct exome and genome sequencing |
| Feasibility Clinical Study of Targeted and Genome-Wide Sequencing | NCT01345513 | Solid Tumors | Targeted and genome-wide sequencing of DNA to enable molecular characterization of tumors. |
| Biomarkers in Tissue Samples from Patients with High-Risk Wilms Tumor | NCT01118078 | Kidney Cancer | Application of array-based methods and NGS to identify candidate molecular targets |
| Whole Genome Medical Sequencing for Genome Discovery | NCT01087320 | Congenital Syndromes/ | Using genomic sequencing to identify genetic causes of disorders that are difficult to identify with existing techniques |
| Studying DNA in Tumor Tissue Samples from Patients with Localized or Metastatic Osteosarcoma | NCT01062438 | Sarcoma | Genomic expression profile in osteosarcoma tumor samples using transcriptome sequencing |
| Genetics of Congenital Heart Disease | NCT01192048 | Congenital Heart Disease | Direct sequencing and/or microarray, whole-genome array comparative genomic hybridization (CGH) |
| Integrated Whole-Genome Analysis of Hematologic Disorders | NCT01108159 | Hematologic Diseases | Whole-genome analysis/high-throughput sequencing using blood, bone marrow and skin biopsy samples |
| Study of Tissue Samples from Patients with Lymphoma | NCT00952809 | Lymphoma; Small Intestinal Cancer | Generation of genome-wide maps of the distribution of nucleosomes and histone modifications as assessed by high throughput sequencing (ChIP-Seq) |
| Genetics of Endocrine Tumours | NCT00461188 | Acromegaly | Tumor samples studied using candidate gene sequencing |
| DNA Analysis of Tumor Tissue Samples from Patients with Diffuse Brain Stem Glioma | NCT00899834 | Brain & CNS Tumors | Genome-wide expression of RNA in tumor samples using gene expression profiling. Direct sequencing analysis of tumor DNA |
| ClinSeq: A Large-Scale Medical Sequencing Clinical Research Pilot Study | NCT00410241 | Cardiovascular Disease | Sequencing ~ 400 genes related to heart disease |
| Laboratory Study of Lymphoblasts in Young Patients with High-Risk ALL | NCT00896766 | Leukemia | Pilot application of array-based methods and gene re-sequencing to identify candidate molecular targets for ALL |
| Genome Expression in Lymphoma, Leukemia and Multiple Myeloma | NCT00339963 | Lymphoma, Leukemia | Participating centers send samples to the NCI for gene expression profiling, array-based comparative genomic hybridization and cancer gene re-sequencing. |