| Literature DB >> 30086710 |
K Blighe1,2,3, L DeDionisio4, K A Christie5, B Chawes6, S Shareef7, T Kakouli-Duarte8, C Chao-Shern5,4, V Harding9, R S Kelly10, L Castellano9,11, J Stebbing9, J A Lasky-Su10, M A Nesbit5, C B T Moore12,13.
Abstract
The reporting of the first draft of the human genome in 2000 brought with it much hope for the future in what was felt as a paradigm shift toward improved health outcomes. Indeed, we have now mapped the majority of variation across human populations with landmark projects such as 1000 Genomes; in cancer, we have catalogued mutations across the primary carcinomas; whilst, for other diseases, we have identified the genetic variants with strongest association. Despite this, we are still awaiting the genetic revolution in healthcare to materialise and translate itself into the health benefits for which we had hoped. A major problem we face relates to our underestimation of the complexity of the genome, and that of biological mechanisms, generally. Fixation on DNA sequence alone and a 'rigid' mode of thinking about the genome has meant that the folding and structure of the DNA molecule -and how these relate to regulation- have been underappreciated. Projects like ENCODE have additionally taught us that regulation at the level of RNA is just as important as that at the spatiotemporal level of chromatin.In this review, we chart the course of the major advances in the biomedical sciences in the era pre- and post the release of the first draft sequence of the human genome, taking a focus on technology and how its development has influenced these. We additionally focus on gene editing via CRISPR/Cas9 as a key technique, in particular its use in the context of complex biological mechanisms. Our aim is to shift the mode of thinking about the genome to that which encompasses a greater appreciation of the folding of the DNA molecule, DNA- RNA/protein interactions, and how these regulate expression and elaborate disease mechanisms.Through the composition of our work, we recognise that technological improvement is conducive to a greater understanding of biological processes and life within the cell. We believe we now have the technology at our disposal that permits a better understanding of disease mechanisms, achievable through integrative data analyses. Finally, only with greater understanding of disease mechanisms can techniques such as gene editing be faithfully conducted.Entities:
Keywords: CRISPR; Complex genetics; Epigenome; Gene editing; Genome; Genomic complexity; Integrated omics; Sequencing technology development; Transcriptome
Mesh:
Substances:
Year: 2018 PMID: 30086710 PMCID: PMC6081867 DOI: 10.1186/s12864-018-4963-8
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
breast cancer CCND1 locus. Status: unsolved
| In breast cancer, germline SNPs at 11q13 in the vicinity of |
A gambit of technological methods to interrogate the genome’s complexity in every possible way
| Broad area | Technique | Investigates | Description | Citation |
|---|---|---|---|---|
| RNA transcription, translation, and binding | ChIRP-seq | RNA-DNA binding | Chromatin Isolation by RNA purification sequencing (ChIRP-seq) is used to determine regions of the genome that are bound by a specific RNA species. | [ |
| CLASH | RNA-RNA binding | Crosslinking, Ligation, And Sequencing of Hybrids (CLASH) is capable of determining RNA-RNA binding interactions. | [ | |
| GRO-seq | Active RNA transcription | Global Run-On sequencing (GRO-seq) determines the sites in the genome at which active transcription is occurring by targeting transcriptionally-engaged RNA polymerases. | [ | |
| NET-seq | Active RNA transcription | Native elongating transcript sequencing (NET-seq) determines, at nucleotide resolution, the sites in the genome at which active transcription is occurring by targeting the 3’ends of nascent transcripts associated with RNA polymerases. | [ | |
| Ribo-seq | Active RNA translation | Ribosome sequencing (Ribo-seq) is capable of identifying ribosome-bound messenger RNAs (mRNAs), i.e., mRNAs that are under active translation. | [ | |
| TRAP-seq | Active RNA translation | Translating Ribosome Affinity Purification sequencing (TRAP-seq) quantifies all mRNAs that are associated with 80s ribosome. | [ | |
| RIP-seq | RNA–protein binding | RNA Immunoprecipitation sequencing (RIP-seq) is used to determine RNA species that are bound to a RNA binding protein (RBP) of interest. | [ | |
| HITS-CLIP | RNA-protein binding | High Throughput Sequencing Crosslinking and Immunoprecipitation (HITS-CLIP) is used to determine RNA species that are bound to a RBP of interest. | [ | |
| PAR-CLIP | RNA-protein binding | Photoactivatable Ribonucleoside-Enhanced Crosslinking and Immunoprecipitation (PAR-CLIP) determines RNA species that are bound to a RBP of interest. PAR-CLIP improves on HITS-CLIP and RIP-seq through the inclusion photoreactive ribonucleoside analogs, which further improves specificity at the RNA-protein boundary during crosslinking. | [ | |
| iCLIP | RNA-protein binding | Individual-nucleotide resolution UV cross-linking and immunoprecipitation (iCLIP) determines RNA species that are bound to a RBP of interest, and provides base-level specificity at the RNA-protein boundary. | [ | |
| PARE-seq | miRNA target RNA | Parallel Analysis of RNA Ends sequencing (PARE-seq) looks at the 5′ ends of polyadenylated products of miRNA-mediated mRNA decay to identify miRNA-target RNA pairs. | [ | |
| TIF-seq PEAT | RNA transcript isoforms | Transcript Isoform Sequencing (TIF-seq) allows for the identification of transcript isoforms by mapping their exact 5’ start and 3’end boundaries. | [ | |
| RNA form and structure | SHAPE-seq | RNA secondary and tertiary conformation | Selective 2’-Hydroxyl Acylation analyzed by Primer Extension sequencing (SHAPE-seq) utilizes SHAPE chemistry followed by multiplexed paired-end deep sequencing of primer extension products and bioinformatic analysis using a maximum likelihood model to infer secondary and tertiary RNA structure. | [ |
| PARS | RNA secondary structure | Parallel analysis of RNA structure (PARS) determines RNA secondary structure simultaneously for thousands of RNA molecules via enzymatic footprinting with different RNAses. | [ | |
| Frag-seq | RNA secondary structure | Fragmentation sequencing (Frag-seq) determines RNA secondary structure transcriptome-wide via P1 endonuclease, which cleaves single-stranded nucleic acids. | [ | |
| ICE | RNA inosines | Inosine Chemical Erasing (ICE) identifies inosines on RNA species in the context of adenosine-to-inosine (A-to-I) conversion, a post-transcriptional modification that diversifies the transcriptome in various pathways. | [ | |
| MeRIP-seq | RNA methylation of the N6 position of adenosine (m6A) | Methylated RNA Immunoprecipitation sequencing (MeRIP-Seq) identifies RNA species with methylation of the N6 position of adenosine (m6A), a post-transcriptional RNA modification. | [ | |
| Cap-seq / CIP-TAP | RNA 5′ capping | Cap sequencing (Cap-seq) and Calf Intestinal alkaline Phosphatase Tobacco Acid Pyrophosphatase (CIP-TAP) both enrich for the 5′ ends of Pol II RNA species and differ based on the following: Cap-seq is selective for long-capped RNAs; CIP-TAP is selective for capped small RNAs (csRNAs). Both therefore define Pol II transcription start sites (TSSs). | [ | |
| DNA-protein interactions | DNase-seq | Global mapping of active regulatory chromatin, i.e., nucleosome-depleted | DNase-seq identifies regulatory regions by targeting DNase I hypersensitive (HS) sites. | [ |
| FAIRE-seq | Global mapping of active regulatory chromatin, i.e., nucleosome-depleted | Formaldehyde-Assisted Isolation of Regulatory Elements sequencing (FAIRE-seq) identifies regions of active chromatin that coincide with DNase I HS sites and others. | [ | |
| MNase-seq (MAINE-seq) | Global mapping of histone-bound DNA, i.e., nucleosome positioning | MNase-Assisted Isolation of Nucleosomes Sequencing (MAINE-seq) identifies histone-bound DNA via digestion by micrococcal nuclease (MN). | [ | |
| ATAC-seq | Global mapping of both active regulatory chromatin and histone-bound DNA | Assay for Transposase Accessible Chromatin sequencing (ATAC-seq) identifies regions of DNA via hyperactive Tn5 transposase, which inserts adapters into accessible regions of chromatin. | [ | |
| ChIA-PET | Detects global chromatin interactions and infers 3-D structure | Chromatin Interaction Analysis by Paired-End Tag sequencing (ChIA-PET) isolates chromatin interactions by formaldehyde cross-linking, sonication, and then chromatin immunoprecipitation (ChIP). Paired chromatin DNA fragments are then connected with linkers. | [ | |
| 3-C, 4-C, 5-C, Hi-C | Captures interactions within and between chromosomes and infers 3-D structure | Chromosome conformation capture (3C), chromosome conformation capture on chip (4C), 3C-carbon copy (5C), and high-throughput chromosome conformation capture are methods used to identify chromatin interactions at short ranges between 2 loci (3C) or long ranges via multiple loci (Hi-C). | [ | |
| Sequence rearrangements | RC-seq | Retrotransposon insertions | Retrotransposon Capture sequencing (RC-seq) enriches for mobile the 5′ and 3′ termini of mobile genetic elements. | [ |
| TN-seq / INseq | Mariner transposon insertions | Transposon sequencing (TN-seq) and Insertion sequencing (INseq) study the Himar I Mariner transposon. | [ | |
| TC-seq | DNA double strand break-mediated rearrangements | Translocation Capture sequencing (TC-seq) identifies AID-dependent chromosomal rearrangements. | [ |
Fig. 1‘Surgery’ by CRISPR
Crisis ‘bee’. Status: imminent problem
| In recent years, domesticated honeybees ( |
Childhood asthma and the 17q21 locus. Status: partially solved
| Childhood asthma is the most common chronic childhood disorder with up to 50% of all children experiencing asthma-like symptoms before the age of 6 years, and 15% being diagnosed with persistent asthma during school-age [ |
Fig. 2Is there utility for CRISPR via circulating tumour DNA detection?
Cardiovascular disease and gene editing. Status: gene editing’s clinical utility in the cardiovascular realm
| Cardiovascular disease (CVD) consists of acute coronary syndrome (ACS), acute myocardial infarction (AMI), angina, arrhythmia, atherosclerosis, congestive heart failure (CHF), coronary artery disease (CAD), myocardial ischemia, etc. In the USA, per year, approximately 700,000 people suffer their first AMI and 500,000 experience a second or recurrent AMI, with 1.7 million being hospitalised annually due to ACS [ | |
| However, not all biochemical tests are accurate. For example, it is known that half of AMIs occur in individuals with normal lipid panels [ | |
| Advances in the management of patients with cardiovascular disease through improved pharmacologic therapy have lessened impact; however, various limitations including patient compliance, side effects, and the need for repeat procedures keep patients in symptomatic status [ | |
| Current gene therapy clinical trials have proven short-term safety; however, long term surveillance over a period of decades is still under investigation. Also, the cost-effectiveness of gene therapy has to be considered due to the laborious nature of the procedures. Current pharmacological approaches may still be more favourable in terms of cost benefit ratio [ |
T-cell acute lymphoblastic leukaemia. Status: solved
| In T-cell acute lymphoblastic leukaemia (T-ALL), 25% of cases exhibit high expression of the | |
| Conclusion: The Mansour study shows how data from DNA, RNA, and DNA-binding interactions can be used in combination to clearly define a disease mechanism. In this example, observing the intergenic upstream insertion variants (DNA), the heightened expression of |