| Literature DB >> 33319909 |
Dandan Lang1, Shilai Zhang2, Pingping Ren1, Fan Liang1, Zongyi Sun1, Guanliang Meng1, Yuntao Tan1, Xiaokang Li1, Qihua Lai1, Lingling Han1, Depeng Wang1, Fengyi Hu2, Wen Wang3,4, Shanlin Liu1,5.
Abstract
BACKGROUND: The availability of reference genomes has revolutionized the study of biology. Multiple competing technologies have been developed to improve the quality and robustness of genome assemblies during the past decade. The 2 widely used long-read sequencing providers-Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT)-have recently updated their platforms: PacBio enables high-throughput HiFi reads with base-level resolution of >99%, and ONT generated reads as long as 2 Mb. We applied the 2 up-to-date platforms to a single rice individual and then compared the 2 assemblies to investigate the advantages and limitations of each.Entities:
Keywords: CCS; ONT ultralong; PacBio HiFi; assembly comparison; contiguity; single-molecular sequencer
Mesh:
Year: 2020 PMID: 33319909 PMCID: PMC7736813 DOI: 10.1093/gigascience/giaa123
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Figure 1:Contiguity of the ONT and PacBio assemblies. (a) Treemaps for contig length difference between the ONT (left) and PacBio (right) assembly; (b) the 6 PacBio contigs mapped to 1 ONT contig corresponding to Chr. 6; (c) details of the 3 PacBio gaps. Red rectangles indicate repeat elements.
Figure 2:Assembly errors in which genes can be annotated. (a) An example shows gene gains caused by assembly redundancies, of which PB-R1 and PB-R2 had a similarity level of 99.67% and 99.51%, respectively, compared with the corresponding region on PB-L2. D: depth. (b) The gene redundancies caused by gaps that failed to be correctly connected by the PacBio assembly. (c) An example shows how a 1-base deletion led to a frameshift mistake for protein translation. (d) An example shows how a single-base error led to stop codon gain and truncated protein translation.
Figure 3:Assembly comparisons using the same methods. Left: number of contigs that were mapped onto Chr. 6; right: number of mismatches (including SNVs and InDels) per 100 kb.