| Literature DB >> 19087263 |
Carlos Quijano1, Pavel Tomancak, Jesus Lopez-Marti, Mikita Suyama, Peer Bork, Marco Milan, David Torrents, Miguel Manzanares.
Abstract
BACKGROUND: The physical organization and chromosomal localization of genes within genomes is known to play an important role in their function. Most genes arise by duplication and move along the genome by random shuffling of DNA segments. Higher order structuring of the genome occurs in eukaryotes, where groups of physically linked genes are co-expressed. However, the contribution of gene duplication to gene order has not been analyzed in detail, as it is believed that co-expression due to recent duplicates would obscure other domains of co-expression.Entities:
Mesh:
Substances:
Year: 2008 PMID: 19087263 PMCID: PMC2646280 DOI: 10.1186/gb-2008-9-12-r176
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1Genomic and temporal distribution of duplicated genes in . (a) The distance between duplicates does not increase sequentially with time, as estimated by dS values. The majority of gene pairs are either very near or far apart. The most frequent profiles for duplicated genes are (but not restricted to) consecutive (i-genes approximately 0) or recently (dS approximately 0) duplicated genes or both. Only pairs separated by up to 100 intervening genes and with dS < 5 are shown. (b) The proportion of pairs of duplicated genes that have separated increases over time, reaching a point where more than 90% of all duplicated genes are not physically linked. For example, there are 240 linked pairs in the dS 0-0.5 range, while there are only 19 for dS 4.5-5.0. The best fit exponential distribution that reaches a plateau at 92.51% is shown as a solid line.
Figure 2Evolutionarily conserved TDGs are enriched in developmental and transcription factor genes. The graph shows the ratio of the abundance of the listed GO categories in the different subsets of duplicated genes to the abundance among all duplicated genes. A value of 1 indicates that the abundance in a subset is comparable to that in the whole set. The conserved TDG subset is enriched in genes under the categories 'multicellular organismal developmental' and 'transcription factor activity'. p-values for individual children GO terms of these categories found to be overrepresented among conserved TDGs are all < 0.05 (Additional data file 5). Abbreviations: non TDGs, duplicated genes that are not arranged in tandem; TDGs, duplicated genes that are arranged in tandem; non cons TDGs, tandem duplicates that are not conserved in A. gambiae; cons TDGs, tandem duplicates that are conserved between D. melanogaster and A. gambiae.
Number of groups and genes that show co-expression in the D. melanogaster embryo
| Number of groups* (genes†) | Number of co-expressing groups‡ (genes†) | Percentage of co-expressing groups (genes†) | |
| Conserved TDGs | 52 (118) | 19 (43) | 36.5 (36.4) |
| Conserved non-TDGs§ | 179 (578) | 38 (89) | 21.3 (15.4) |
| Conserved neighbors¶ | 198 (716) | 37 (107) | 18.7 (14.9) |
*Total number of groups where two or more genes have a reported in situ analysis. †Number of genes that have been analyzed by in situ. ‡Number of groups where two or more genes show co-expression in at least one domain of the embryo. §Groups of duplicated genes that are not arranged in tandem and that have one-to-one orthologues in A. gambiae. ¶Groups of genes that are located in syntenic regions between D. melanogaster and A. gambiae, that have one-to-one orthologues in A. gambiae, and that are not tandem duplicates.
Figure 3Conserved TDGs show co-expression in the . The figure shows in situ hybridization data for TDGs whose expression has not been previously described. Four genes from the Osiris cluster are expressed in the esophagus and in the ventral ectoderm, while three genes encoding Elongation-of-very-long-fatty-acids synthases (ELOVL) are expressed in the large intestine. We also found two undescribed genes that encode proteins with collagen-like repeats that are both expressed in a discrete domain at the anterior end of the syncitial blastoderm stage embryo, and two Ras-family members that show expression in the procephalon and ventral ectoderm. We have also found that the scrt Snail-type zinc finger gene has a conserved linked duplicate and both are expressed in overlapping domains in the central nervous system.