| Literature DB >> 15774024 |
Steven L Salzberg1, Julie C Dunning Hotopp, Arthur L Delcher, Mihai Pop, Douglas R Smith, Michael B Eisen, William C Nelson.
Abstract
BACKGROUND: The Trace Archive is a repository for the raw, unanalyzed data generated by large-scale genome sequencing projects. The existence of this data offers scientists the possibility of discovering additional genomic sequences beyond those originally sequenced. In particular, if the source DNA for a sequencing project came from a species that was colonized by another organism, then the project may yield substantial amounts of genomic DNA, including near-complete genomes, from the symbiotic or parasitic organism.Entities:
Mesh:
Year: 2005 PMID: 15774024 PMCID: PMC1088942 DOI: 10.1186/gb-2005-6-3-r23
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1Alignment of complete wMel genome (horizontal axis) to longest scaffold from the wAna genome assembly. Red points indicate sequences aligned in the forward orientation, green points indicate reverse orientation. The diagonals represent colinear regions, and breaks in the diagonals correspond to inversions and translocations between the two genomes.
Summary statistics for assemblies of the three new Wolbachia genomes
| wAna | wSim | wMoj | wMel | |
| Molecule length (bp) | 1,440,650 | 896,761 | 86,870 | 1,267,782 |
| Scaffolds | 329 | 84 | 114 | 1 |
| Genes | 1837 | 790 | 63 | 1271 |
| Contigs | 464 | 388 | 114 | 1 |
| GC content (%) | 35.4 | 35.0 | 34.5 | 35.2 |
| Average gene length (bp) | 608 | 916 | 633 | 855 |
The wSim genome was assembled using the comparative assembler, AMOS-Cmp, and scaffolded using Bambus. The wAna genome was assembled using the Celera Assembler, as described in Materials and methods. Note that the high gene count for wAna is likely due to fragmentation of individual genes across separate contigs.
Figure 2Circular map comparing the wMel genome with the wAna, wSim and wMoj assemblies. Ring 1 (outermost ring): forward strand genes; ring 2: reverse strand genes; ring 3: GC-skew plot; ring 4: X2 analysis of trinucleotide composition, with peaks indicating atypical regions; ring 5: wMel genes present in wAna assembly; ring 6: wMel genes present in the wSim assembly; ring 7: wMel genes present in wMoj assembly. Large regions on the wMel genome that were not recovered in the wAna or wSim assemblies are marked on the outside (regions A, B).
Percent identity between nucleotide sequences of the four sequenced strains of Wolbachia
| 97.2 | 97.1 | ||
| 97.2 | 99.8 | ||
| 97.1 | 99.8 | ||
| 94.9 | 97.5 | 97.3 |
Figure 3Relationship of ankyrin repeat domain proteins between wMel, wAna, wSim and wMoj. All the predicted ankyrin repeat proteins with greater than 120 amino acids were aligned and clustered using ClustalW. Nine predicted ankyrin repeat domain proteins (A-I) were found to be conserved among at least wMel and one other of these Wolbachia species with nucleotide sequence identity > 95% across the entire length of the gene.