| Literature DB >> 29688527 |
José Horacio Grau1, Thomas Hackl2, Klaus-Peter Koepfli3,4, Michael Hofreiter5.
Abstract
Background: Contiguous genome assemblies are a highly valued biological resource because of the higher number of completely annotated genes and genomic elements that are usable compared to fragmented draft genomes. Nonetheless, contiguity is difficult to obtain if only low coverage data and/or only distantly related reference genome assemblies are available. Findings: In order to improve genome contiguity, we have developed Cross-Species Scaffolding-a new pipeline that imports long-range distance information directly into the de novo assembly process by constructing mate-pair libraries in silico. Conclusions: We show how genome assembly metrics and gene prediction dramatically improve with our pipeline by assembling two primate genomes solely based on ∼30x coverage of shotgun sequencing data.Entities:
Mesh:
Year: 2018 PMID: 29688527 PMCID: PMC5967465 DOI: 10.1093/gigascience/giy029
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Figure 1:Chart demonstrating the workflow implemented in Cross-Species Scaffolding for generating mate-pair libraries in silico. The approach is composed of three steps. In the first step, reads from shotgun libraries are mapped onto a set of repeat-masked reference chromosomes or genome assembly. In the second step, a large consensus fastq file is obtained from every chromosome or contig, generated only from the mapped reads. Finally, Cross-mates is used to simulate the sequencing of mate-pair or paired-end scaffolding libraries from the consensus fastq chromosomes.
Figure 2:A) Plot of final contig size for the chimpanzee and aye-aye genome assemblies. Chimpanzee genome assembled with shotgun-only data (32x coverage) and with in silico mate-pairs generated from the human chromosomes using Cross-mates (see Methods section). Aye-aye genome assembled with shotgun only data (22x coverage) and with in silico mate-pairs generated from the human chromosomes and the gray mouse lemur. B) Summary table of the assembly statistics showing chimpanzee and aye-aye results.