| Literature DB >> 27129708 |
Diego César Batista Mariano1, Thiago de Jesus Sousa1, Felipe Luiz Pereira2, Flávia Aburjaile1, Debmalya Barh3, Flávia Rocha1, Anne Cybelle Pinto1, Syed Shah Hassan1, Tessália Diniz Luerce Saraiva1, Fernanda Alves Dorella2, Alex Fiorini de Carvalho2, Carlos Augusto Gomes Leal2, Henrique César Pereira Figueiredo2, Artur Silva4, Rommel Thiago Jucá Ramos4, Vasco Ariston Carvalho Azevedo5.
Abstract
BACKGROUND: Studies have detected mis-assemblies in genomes of the species Corynebacterium pseudotuberculosis. These new discover have been possible due to the evolution of the Next-Generation Sequencing platforms, which have provided sequencing with accuracy and reduced costs. In addition, the improving of techniques for construction of high accuracy genomic maps, for example, Whole-genome mapping (WGM) (OpGen Inc), have allow high-resolution assembly that can detect large rearrangements.Entities:
Keywords: Genomics; Mis-assembly; Optical mapping; Sequencing
Mesh:
Substances:
Year: 2016 PMID: 27129708 PMCID: PMC4851793 DOI: 10.1186/s12864-016-2673-7
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Statistics of the C. pseudotuberculosis 1002 new assembly
| Assembler | Mira 3.9.18 |
|---|---|
| Reads assembled | 731,481 |
| Contigs | 9 |
| Shortest contig | 4,133 |
| Largest contig | 542,891 |
| N50 | 402,955 |
| N90 | 218,254 |
| N95 | 147,989 |
| Total coverage | 58.63 |
Fig. 1Alignment between the restriction map of C. pseudotuberculosis 1002 (above) and the in silico map of the new assembly of C. pseudotuberculosis 1002 (below). Both restriction maps were generated using the restriction enzyme KpnI. The alignment shows a high similarity between the two restriction maps, indicating a high probability of a correct assembly
Fig. 2Alignment between the restriction map of C. pseudotuberculosis 1002 (above) and the in silico map of the complete genome of C. pseudotuberculosis 1002 (NC_017300) obtained from NCBI database (below). Both the restriction maps were generated using the restriction enzyme KpnI. The alignment shows a large inversion between the two restriction maps. A detailed analysis using CLC Genomics Workbench 7, BLAST and NR database shows that the inversion occurs between two rRNA regions
Fig. 3Syntenic comparison between the first assembly (Cp1002A) and the new assembly (Cp1002B). a The genome of Cp1002A is showed above, while the genome of Cp1002B in shown below. Red lines linking the line above and the line below indicate syntenic regions. The annotation of Cp1002A was used to insert color targets in the graph that detect repetitive regions: blue for rRNA operons, light blue for transposons, yellow for plasmids and green for phages. b The genomes are highly similar, except by a genetic inversion larger than 1 Mbp between two rRNA operons. c rRNA operon in the left side of the genetic inversion. It is possible to detect a change in the sense strand after the rRNA operon that indicates an inversion. d rRNA operon in the right side of the inversion sequence
Comparison between the assemblies of C. pseudotuberculosis 1002: Cp1002A (first assembly) and Cp1002B (new assembly)
| Cp1002A | Cp1002B | |
|---|---|---|
| Genome length | 2,335,113 bp | 2,335,107 bp |
| CDS | 2,095 | 2,071 |
| Hypothetical proteins | 592 | 551 |
| Pseudogenes | 47 | 43 |
| Depth coverage | 31x | 58x |
| GC % | 52.2 % | 52.2 % |
| rRNAs | 12 | 12 |
| tRNAs | 48 | 48 |