| Literature DB >> 25937951 |
Eran Elhaik1, Dan Graur2.
Abstract
Eukaryotic genomes, particularly animal genomes, have a complex, nonuniform, and nonrandom internal compositional organization. The compositional organization of animal genomes can be described as a mosaic of discrete genomic regions, called "compositional domains," each with a distinct GC content that significantly differs from those of its upstream and downstream neighboring domains. A typical animal genome consists of a mixture of compositionally homogeneous and nonhomogeneous domains of varying lengths and nucleotide compositions that are interspersed with one another. We have devised IsoPlotter, an unbiased segmentation algorithm for inferring the compositional organization of genomes. IsoPlotter has become an indispensable tool for describing genomic composition and has been used in the analysis of more than a dozen genomes. Applications include describing new genomes, correlating domain composition with gene composition and their density, studying the evolution of genomes, testing phylogenomic hypotheses, and detect regions of potential interbreeding between human and extinct hominines. To extend the use of IsoPlotter, we designed a completely automated pipeline, called IsoPlotter(+) to carry out all segmentation analyses, including graphical display, and built a repository for compositional domain maps of all fully sequenced vertebrate and invertebrate genomes. The IsoPlotter(+) pipeline and repository offer a comprehensive solution to the study of genome compositional architecture. Here, we demonstrate IsoPlotter(+) by applying it to human and insect genomes. The computational tools and data repository are available online.Entities:
Year: 2013 PMID: 25937951 PMCID: PMC4393066 DOI: 10.1155/2013/725434
Source DB: PubMed Journal: ISRN Bioinform ISSN: 2090-7338
Figure 1The frequency of compositional domains for five taxa.
Figure 2Compositional organization of eleven insect genomes including six newly sequenced ants (red). Compositional domains were classified as GC poor or GC rich based on the mean GC content of all insect genomes (37.62%) and further divided by size. The similarity in genome compositional organization can be compared to the estimated phylogenetic tree constructed based on recent publications [11–13].
Figure 3Ideogram of compositional domains in humans as inferred by IsoPlotter and mapped to chromosomes created using the PlotGenome.m script. The ideogram uncovers the compositional patterns of long compositionally homogeneous domains (“isochoric”), short compositionally homogeneous domains (<300 kb), and compositionally nonhomogeneous domains.