| Literature DB >> 24822029 |
Ren Zhang1, Hong-Yu Ou2, Feng Gao3, Hao Luo3.
Abstract
The nucleotide composition of genomes undergoes dramatic variations among all three kingdoms of life. GC content, an important characteristic for a genome, is related to many important functions, and therefore GC content and its distribution are routinely reported for sequenced genomes. Traditionally, GC content distribution is assessed by computing GC contents in windows that slide along the genome. Disadvantages of this routinely used window-based method include low resolution and low sensitivity. Additionally, different window sizes result in different GC content distribution patterns within the same genome. We proposed a windowless method, the GC profile, for displaying GC content variations across the genome. Compared to the window-based method, the GC profile has the following advantages: 1) higher sensitivity, because of variation-amplifying procedures; 2) higher resolution, because boundaries between domains can be determined at one single base pair; 3) uniqueness, because the GC profile is unique for a given genome and 4) the capacity to show both global and regional GC content distributions. These characteristics are useful in identifying horizontally-transferred genomic islands and homogenous GC-content domains. Here, we review the applications of the GC profile in identifying genomic islands and genome segmentation points, and in serving as a platform to integrate with other algorithms for genome analysis. A web server generating GC profiles and implementing relevant genome segmentation algorithms is available at: www.zcurve.net.Entities:
Keywords: GC profile; Genome segmentation.; Genomic island
Year: 2014 PMID: 24822029 PMCID: PMC4009839 DOI: 10.2174/1389202915999140328163125
Source DB: PubMed Journal: Curr Genomics ISSN: 1389-2029 Impact factor: 2.236
Comparison between the window-based method and the windowless method (GC profile) for displaying GC content dis-tributions.
| Window-based Method | Window-less Method (GC Profile) | Note | |
|---|---|---|---|
| Method | Overlapping or non-overlapping sliding windows | Cumulative number of G and C along the genome, the | The 2 methods are complementary |
| Sensitivity | Low, because GC content variation is averaged within a window | High, because of the special subtraction procedure that amplifies GC content variation. | The high sensitivity is useful especially for detecting horizontally-transferred genomic islands. |
| Resolution | Low | High, because no window is used, the boundary between domains can be determined at one single nucleotide. | The high resolution proves valuable in identifying genome segmentation points. |
| Uniqueness | Different window size leads to different results | Unique for a given genome | The uniqueness is a useful feature for comparative genomics. |
| Regional or global | Regional, GC content within a window | Global as well as regional, because cumulative GC content is calculated. | The global GC content distribution is a useful characteristic in identifying isochores. |
| Examining homogeneity | Not intuitive | Intuitive, because a straight line denotes a homogenous region, as opposed to GC content variations of the entire genome. Homogeneity can be examined statistically as well. | Examining homogeneity based on the GC profile is helpful in identifying isochores. |