| Literature DB >> 31890139 |
Alice Maria Giani1, Guido Roberto Gallo2, Luca Gianfranceschi2, Giulio Formenti3.
Abstract
Genomes represent the starting point of genetic studies. Since the discovery of DNA structure, scientists have devoted great efforts to determine their sequence in an exact way. In this review we provide a comprehensive historical background of the improvements in DNA sequencing technologies that have accompanied the major milestones in genome sequencing and assembly, ranging from early sequencing methods to Next-Generation Sequencing platforms. We then focus on the advantages and challenges of the current technologies and approaches, collectively known as Third Generation Sequencing. As these technical advancements have been accompanied by progress in analytical methods, we also review the bioinformatic tools currently employed in de novo genome assembly, as well as some applications of Third Generation Sequencing technologies and high-quality reference genomes.Entities:
Keywords: BAC, Bacterial Artificial Chromosome; Bioinformatics; Genome assembly; HGP, Human Genome Project; HMW, high molecular weight; HapMap, haplotype map; NGS, Next Generation Sequencing; Next-generation; OLC, Overlap-Layout-Consensus; QV, Quality Value (QV); Reference; SBS, Sequencing by Synthesis; SMRT, Single Molecule Real-Time; SNPs, Single Nucleotide Polymorphisms; SRA, Short Read Archive; SV, Structural Variant; Sequencing; TGS, Third Generation Sequencing; Third-generation; WGS, Whole Genome Sequencing; ZMW, Zero-Mode Waveguide; bp, base pair; dNTPs, deoxynucleoside triphosphates; ddNTP, 2,3-dideoxynucleoside triphosphate
Year: 2019 PMID: 31890139 PMCID: PMC6926122 DOI: 10.1016/j.csbj.2019.11.002
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Fig. 1Milestones in genome assembly. Timeline illustrating many of the major genome assembly achievements ranging from the beginning of the sequencing era to the large-scale genome projects currently ongoing. Each genome or genome project (GP) is placed under a color-coded background according to the sequencing approach adopted. Light red: early sequencing methods, Yellow: Sanger-based shotgun sequencing, Green: NGS, Light blue: TGS. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Fig. 2Structural variations. Schematic representation of the five major types of SVs.
Fig. 3Chromosome-level scaffolding of de novo genome assemblies. Schematic illustration of an hybrid de novo genome assembly approach where linkage information obtained from optical and Hi-C maps is used to properly position contigs along the chromosomes. First, sequencing reads are assembled together to form contigs. Then, contigs are aligned to consensus BioNano optical maps, a process that enables to accurately order and orient the contigs with respect to each others and assign each contig to a specific chromosome. Here, BioNano optical maps are represented by black lines and the green dots indicate the labeled sequence motifs. Further ordering of the resulting scaffolds is made possible by Hi-C maps, that take into account the interactions between contiguous genomic loci. In the resulting chromosome-level scaffold colored segments represent genomic regions of known sequence while the remaining gaps between the scaffolds of unknown sequence are depicted as gray lines. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)