| Literature DB >> 17653590 |
Hasan Khatib1, Ismail Zaitoun, Eui-Soo Kim.
Abstract
Genomic imprinting is an epigenetic mechanism that results in monoallelic expression of genes depending on parent-of-origin of the allele. Although the conservation of genomic imprinting among mammalian species has been widely reported for many genes, there is accumulating evidence that some genes escape this conservation. Most known imprinted genes have been identified in the mouse and human, with few imprinted genes reported in cattle. Comparative analysis of genomic imprinting across mammalian species would provide a powerful tool for elucidating the mechanisms regulating the unique expression of imprinted genes. In this study we analyzed the imprinting of 22 genes in human, mouse, and cattle and found that in only 11 was imprinting conserved across the three species. In addition, we analyzed the occurrence of the sequence elements CpG islands, C + G content, tandem repeats, and retrotransposable elements in imprinted and in nonimprinted (control) cattle genes. We found that imprinted genes have a higher G + C content and more CpG islands and tandem repeats. Short interspersed nuclear elements (SINEs) were notably fewer in number in imprinted cattle genes compared to control genes, which is in agreement with previous reports for human and mouse imprinted regions. Long interspersed nuclear elements (LINEs) and long terminal repeats (LTRs) were found to be significantly underrepresented in imprinted genes compared to control genes, contrary to reports on human and mouse. Of considerable significance was the finding of highly conserved tandem repeats in nine of the genes imprinted in all three species.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17653590 PMCID: PMC2000230 DOI: 10.1007/s00335-007-9039-z
Source DB: PubMed Journal: Mamm Genome ISSN: 0938-8990 Impact factor: 2.957
Comparative imprinting analysis of human, mouse, and cattle genes
| Gene | Humana | Mousea | Cattle | Expression | Reference for cattle |
|---|---|---|---|---|---|
|
| NIb | I | NI | M | Khatib |
|
| I | I | I | M | Khatib |
|
| I | I | I | M | Zaitoun and Khatib |
|
| I | I | I | P | Dindot et al. |
|
| PI | I | I | M | Killian et al. |
|
| I | I | I | P | Unpublished data |
|
| I | I | I | M | Dindot et al. |
|
| I | I | I | P | Ruddock et al. |
|
| Ic | I | I | P | Zaitoun and Khatib |
|
| I | I | I | P | Zaitoun and Khatib |
|
| I | I | I | P | Kim et al. |
|
| I | I | I | P | Unpublished data |
|
| CD | NR | NI | P | Khatib |
|
| NR | I | NI | P | Zaitoun and Khatib |
|
| NI | I | I | P | Dindot et al. |
|
| I | I | NI | M, P | Kim et al. |
|
| NR | I | NI | M | Unpublished data |
|
| NI | I | NI | M | Unpublished data |
|
| CD | I | NI | M | Unpublished data |
|
| I | I | NI | M | Unpublished data |
|
| I | I | I | P | Unpublished data |
|
| NI | I | I | M | Unpublished data |
NR = not reported; I = imprinted; PI = polymorphic imprinting; NI = not imprinted; M = maternal; P = paternal; CD = conflicting data
aData from http://www.otago.ac.nz/IGC
bMonk et al. 2006
cWood et al. 2007
The number of tandem repeats (TR) and CpG islands in coding sequences and in total sequences (coding and 50-kb up- and downstream flanking sequences) of imprinted and control genes
| Gene | Coding sequence (bp) | TR in coding sequences | TR in total sequences | CpG islands in coding sequences | CpG islands in total sequences |
|---|---|---|---|---|---|
|
| 13523 | 1 | 4 | 0 | 2 |
|
| 6070 | 0 | 22 | 2 | 19 |
|
| 102613 | 11 | 23 | 31 | 36 |
|
| 4000 | 4 | 5 | 1 | 4 |
|
| 13919 | 1 | 5 | 1 | 3 |
|
| 2354 | 0 | 1 | 1 | 1 |
|
| 1609 | 0 | 1 | 2 | 4 |
|
| 23509 | 11 | 18 | 19 | 30 |
|
| 3996 | 0 | 29 | 4 | 10 |
|
| 36421 | 6 | 19 | 0 | 0 |
|
| 6978 | 1 | 5 | 1 | 5 |
|
| 3661 | 0 | NA | 1 | NA |
|
| 1305 | 0 | 19 | 3 | 22 |
|
| 25865 | 2 | 22 | 4 | 15 |
| Total imprinted | 218653 | 37 | 173 | 70 | 151 |
|
| 18062 | 0 | 10 | 3 | 15 |
|
| 173709 | 5 | 12 | 0 | 0 |
|
| 6161 | 0 | 3 | 1 | 8 |
|
| 11306 | 0 | 4 | 0 | 0 |
|
| 15185 | 0 | 8 | 0 | 2 |
|
| 13498 | 0 | 1 | 0 | 5 |
|
| 22458 | 1 | 22 | 2 | 22 |
|
| 36670 | 1 | 3 | 0 | 1 |
|
| 22203 | 0 | 1 | 1 | 9 |
|
| 34676 | 1 | 7 | 0 | 2 |
|
| 26804 | 1 | 2 | 1 | 2 |
|
| 236368 | 1 | 12 | 5 | 5 |
|
| 8639 | 1 | 9 | 0 | 0 |
|
| 39226 | 1 | 9 | 0 | 0 |
|
| 49311 | 1 | 4 | 0 | 3 |
|
| 16039 | 0 | 3 | 1 | 7 |
|
| 78511 | 2 | 6 | 1 | 3 |
|
| 7071 | 0 | 3 | 0 | 10 |
|
| 61202 | 0 | 6 | 0 | 2 |
|
| 32344 | 3 | 22 | 6 | 28 |
| Total control | 909443 | 19 | 150 | 21 | 124 |
Fig. 1Boxplot graphs (median and interquartile) representing (a) SINE and(b) LINE-1 frequencies identified by the RepeatMasker program in 14 imprinted cattle genes and in the coding sequences of 20 control (nonimprinted) cattle genes. Wilcoxon-Mann-Whitney test revealed a statistically significant difference for the occurrence of SINEs and LINE-1s between imprinted and control genes
Sequence alignment of tandem repeats in imprinted genes across human, mouse, and cattle
| Gene | Number of tandem repeats | Aligned repeatsa | Score | Repeats in CpG islands | ||
|---|---|---|---|---|---|---|
| Human | Mouse | Cattle | ||||
|
| 7 | 13 | 1 | H3/M1 | 75 | H1, H2, H3, M1, M2 |
| H5/M7 | 75 | |||||
| H3/M2 | 72 | |||||
| H2/M9 | 64 | |||||
|
| 20 | 3 | 11 | H5/C1 | 76 | H5, C1, C3, C10, C11 |
| H5/M2 | 64 | |||||
| H5/C3 | 64 | |||||
| H5/C11 | 64 | |||||
| H20/C1 | 59 | |||||
| H20/C3 | 56 | |||||
|
| 2 | 3 | 4 | H2/C4 | 91 | C2 |
| H1/C4 | 88 | |||||
| M2/C1 | 76 | |||||
| H1/C3 | 61 | |||||
| H2/M1 | 51 | |||||
|
| 2 | 0 | 1 | H2/C1 | 89 | |
|
| 7 | 7 | 11 | H7/M6 | 90 | |
| H5/C9 | 80 | |||||
| H6/M5 | 80 | |||||
| M5/C9 | 76 | |||||
| H2/C3 | 72 | |||||
| H7/M2 | 66 | |||||
| H7/C1 | 66 | |||||
| M7/C10 | 66 | |||||
| H6/M3 | 60 | |||||
| H7/C9 | 57 | |||||
| M7/C9 | 57 | |||||
| H3/C2 | 53 | |||||
|
| 2 | 2 | 0 | H1/M1 | 80 | |
|
| 4 | 3 | 6 | H2/C2 | 71 | |
| H4/M1 | 47 | |||||
|
| 6 | 2 | 3 | M2/H1 | 83 | |
| M2/H4 | 83 | |||||
| M2/H5 | 83 | |||||
| M2/C3 | 83 | |||||
|
| 2 | 9 | 1 | H2/M3 | 89 | |
| H2/M4 | 81 | |||||
| M1/C1 | 55 | |||||
aH = human, M = mouse, C = cattle
Number of tandem repeats (TR) and CpG islands per kb genomic sequences in coding and flanking sequences of imprinted and biallelically expressed genes
| Element/gene type | 50-kb upstream | 50-kb downstream | Coding sequence | Total sequence |
|---|---|---|---|---|
| TR/kb imprinted | 0.120 | 0.100 | 0.151 | 0.113 |
| TR/kb biallelic | 0.071 | 0.060 | 0.021 | 0.052 |
| CpG/kb imprinted | 0.073 | 0.055 | 0.285 | 0.100 |
| CpG/kb biallelic | 0.054 | 0.048 | 0.023 | 0.043 |
Fig. 2Distribution of CpG islands in a subset of (a) imprinted and(b) biallelically expressed genes with 50-kb upstream and downstream flanking sequences. Vertical bars indicate position of each CpG island and the thick line on the x axis corresponds to the coding region of the gene. The position of each genomic region counts from the upstream flanking region of each gene (×105 bp). The distribution of CpG islands in other genes examined is presented in Supplementary Fig. 1
Fig. 3Boxplots of retrotransposable element densities in imprinted genes, in biallelically expressed control genes, and in genes from bovine chromosome 14 (n = 671), chromosome 5 (n = 1335), and chromosome 6 (n = 565). SINE and LINE-1 frequencies were calculated for the coding sequence and 50-kb upstream and downstream of the coding region. a SINE frequency in imprinted genes was 10.89% compared with 16.57%, 15.42%, 15.65%, and 16.46% in biallelically expressed control genes and genes from BTA14, BTA5, and BTA6, respectively. b LINE-1 frequencies were 10.71%, 9.30%, 9.80%, 9.34%, and 9.93% in imprinted, biallelically expressed control genes, and genes from BTA14, BTA5, and BTA6, respectively