| Literature DB >> 19495914 |
Jennifer Commins1, Christina Toft1, Mario A Fares1.
Abstract
Comparative genomics has become a real tantalizing challenge in the postgenomic era. This fact has been mostly magnified by the plethora of new genomes becoming available in a daily bases. The overwhelming list of new genomes to compare has pushed the field of bioinformatics and computational biology forward toward the design and development of methods capable of identifying patterns in a sea of swamping data noise. Despite many advances made in such endeavor, the ever-lasting annoying exceptions to the general patterns remain to pose difficulties in generalizing methods for comparative genomics. In this review, we discuss the different tools devised to undertake the challenge of comparative genomics and some of the exceptions that compromise the generality of such methods. We focus on endosymbiotic bacteria of insects because of their genomic dynamics peculiarities when compared to free-living organisms.Entities:
Year: 2009 PMID: 19495914 PMCID: PMC3055744 DOI: 10.1007/s12575-009-9004-1
Source DB: PubMed Journal: Biol Proced Online ISSN: 1480-9222 Impact factor: 3.244
Figure 1a Whole genome shotgun sequencing: Genome is sheared into small approximately equal sized fragments which are subsequently small enough to be sequenced in both directions followed by cloning. The cloned sequences (reads) are then fed to an assembler (illustrated in Figure 2). b To overcome some of the complexity of normal shotgun sequencing of large sequences such as genomes a hierarchical approach can be taken. The genome is broken into a series of large equal segments of known order which are then subject to shotgun sequencing. The assembly process here is simpler and less computationally expensive.
Figure 2Overlap–layout–consensus genome assembly algorithm: Reads are provided to the algorithm. Overlapping regions are identified. Each read is graphed as a node and the overlaps are represented as edges joining the two nodes involved. The algorithm determines the best path through the graph (Hamiltonian path). Redundant information (i.e., unused nodes and edges) is discarded. This process is carried out multiple times and resulting sequences are combined to give the final consensus sequence that represents the genome.
Figure 3Genome rearrangements plots comparing two genomes. Genome plots can provide information on the kind of rearrangements undergone. These plots represent the location of each gene in one axis for one of the genomes against the location of the found ortholog in the other axis for the second genome. a Comparative genomic plot when comparing two genomes showing no lineage-specific genome rearrangements. In this case, the plot was produced for the comparison of two primary symbiotic bacteria of insects (B. aphidicola strain A. pisum versus B. aphidicola strain Schizaphis graminum). Since no rearrangements have occurred in any of the two genomes, the comparison yields a straight diagonal line. b Comparative genomic plot for two genomes showing lineage-specific genome rearrangements. In this case, the plot was comparing the genome of other patterns that can be observed and are x-like patterns b (in this case, B. aphidicola, A. pisum, and E. coli k12) where the rearrangements have occurred over the replication axis E. coli K12 to the genome of B. aphidicola strain A. pisum. c This is the comparison between Chlamydophila pneumoniae CWL029 and Chlamydia trachomatis 434/Bu that show an even better example of rearrangements that have occurred over the replication axis (this example have also been shown in [102]). As shown, many rearrangements including inversions and translocations have occurred, and consequently, the orthologs are not located in the major diagonal of the plot but rather show an X-shape distribution. This is expected if an inversion has taken place near the centromer of the chromosome.