| Literature DB >> 16857058 |
Leonardo Mariño-Ramírez1, I King Jordan.
Abstract
BACKGROUND: Transposable elements (TEs) are abundant genomic sequences that have been found to contribute to genome evolution in unexpected ways. Here, we characterize the evolutionary and functional characteristics of TE-derived human genome regulatory sequences uncovered by the high throughput mapping of DNaseI-hypersensitive (HS) sites.Entities:
Year: 2006 PMID: 16857058 PMCID: PMC1538576 DOI: 10.1186/1745-6150-1-20
Source DB: PubMed Journal: Biol Direct ISSN: 1745-6150 Impact factor: 4.540
Figure 1DNaseI-hypersensitive site conservation. a) The percent of phastCons (PC) denoted conserved positions for DNaseI-hypersensitive (HS) versus non DNaseI-hypersensitive (nonHS) sites in the human genome. b) 2 × 2 contingency table for evaluating the difference in evolutionary conservation between HS and nonHS sites.
Figure 2Relative percentages of four transposable element (TE) classes. Percentages of the four classes are shown for the human genome as a whole (dark grey) and for HS sites only (light grey).
Figure 3Evolutionary conservation of TE-derived versus non TE-derived HS sites. a) The percent of phastCons (PC) denoted conserved positions for TE-derived (HSTE) and non TE-derived (HSnotTE) sites in the human genome. b) 2 × 2 contingency table for evaluating the difference in evolutionary conservation between HSTE and HSnotTE sites.
Figure 4Comparison of CD4+ T cell expression levels for different classes of human genes. a) Cumulative frequency distributions of CD4+ T cell expression for genes do not have any HS sites (notHS) and genes with HS sites (allHS). b) Comparison of average CD4+ T cell expression levels for without (notHS) and with (allHS) HS sites. Genes with HS sites were further broken down into groups of genes with TE-derived (HSandTE) and not TE-derived (HSnotTE) HS sites.
Figure 5Gene expression patterns for a cluster of coexpressed genes with TE-derived HS sites. a) View of relative expression levels across conditions for a subset of cluster 2 genes, color coded according to over (red) and under (green) expression. b) Centroid view with cluster 2 expression averages.
Overrepresented GO terms from genes with TE-derived HS sites
| Immune response group | |||||
| 2 | GO:0009607 | 4 | response to biotic stimulus | 21 | 4.9 × 10-7 |
| 25 | 8 | 1.7 × 10-2 | |||
| 2 | GO:0006952 | 5 | defense response | 20 | 1.0 × 10-6 |
| 25 | 8 | 1.6 × 10-2 | |||
| 2 | GO:0006955 | 6 | immune response | 20 | 3.6 × 10-7 |
| 25 | 8 | 1.3 × 10-2 | |||
| 2 | GO:0051707 | 5 | response to other organism | 11 | 1.8 × 10-4 |
| 25 | 5 | 3.3 × 10-2 | |||
| 2 | GO:0009613 | 6 | response to pest, pathogen or parasite | 11 | 1.0 × 10-4 |
| 25 | 5 | 2.7 × 10-2 | |||
| 2 | GO:0030098 | 9 | lymphocyte differentiation | 3 | 6.2 × 10-3 |
| 19 | 3 | 1.6 × 10-2 | |||
| Regulation group | |||||
| 15 | GO:0050789 | 2 | regulation of biological process | 40 | 4.4 × 10-8 |
| 25 | 17 | 7.5 × 10-3 | |||
| 15 | GO:0050794 | 3 | regulation of cellular process | 39 | 2.2 × 10-8 |
| 25 | 17 | 4.1 × 10-3 | |||
| 15 | GO:0050791 | 3 | regulation of physiological process | 38 | 2.2 × 10-8 |
| 25 | 15 | 1.3 × 10-6 | |||
| 15 | GO:0051244 | 4 | regulation of cellular physiological process | 38 | 1.3 × 10-8 |
| 25 | 15 | 1.3 × 10-2 | |||
| Metabolism group | |||||
| 15 | GO:0050875 | 3 | cellular physiological process | 73 | 6.5 × 10-4 |
| 19 | 88 | 3.8 × 10-3 | |||
| 15 | GO:0044238 | 4 | primary metabolism | 63 | 1.7 × 10-6 |
| 19 | 67 | 4.8 × 10-3 | |||
| 15 | GO:0044237 | 4 | cellular metabolism | 63 | 9.0 × 10-6 |
| 19 | 71 | 1.3 × 10-3 | |||
| 9 | GO:0043170 | 4 | macromolecule metabolism | 26 | 2.8 × 10-2 |
| 15 | 37 | 2.0 × 10-2 | |||
| 19 | 50 | 9.1 × 10-4 | |||
| 9 | GO:0043283 | 5 | biopolymer metabolism | 18 | 2.8 × 10-2 |
| 15 | 30 | 8.0 × 10-5 | |||
| 19 | 35 | 5.1 × 10-4 | |||
| Cell death group | |||||
| 2 | GO:0043067 | 6 | regulation of programmed cell death | 7 | 7.8 × 10-3 |
| 19 | 7 | 3.7 × 10-2 | |||
| 2 | GO:0042981 | 7 | regulation of apoptosis | 7 | 7.8 × 10-3 |
| 19 | 7 | 3.7 × 10-2 | |||
1Gene coexpression, based on k-means analysis, cluster identifier.
2Biological process Gene Ontology (GO) annotation term.
3Hierarchical level (descending from 1) in the biological process GO directed acyclic graph.
4Name assigned to the GO term
5Number of genes in each cluster associated with the GO term.
6P-value associated with the over-representation of the GO term in each cluster
Figure 6Path on the GO biological process graph leading to overrepresented functions related to immune response. Hierarchical levels of the directed acyclic GO graph are shown on the left. Nodes in the graph represent individual GO annotation terms. Nodes are color coded according to the statistical significance level for their overrepresentation. Edges in the graph represent parent-child relationships between GO terms.
Immune response (GO:0006955) genes with TE-derived HS sites
| 2 | CD97 | leukocyte antigen; receptor involved in both cell adhesion and signalling processes early after leukocyte activation | 12460_5 | SINE/MIR/MIR | |
| 2 | IGLC2 | Ig light-chain, partial Ke-Oz- polypeptide, C-term; immunoglobulin lambda constant region 2 | 13851_3 | LTR/ERVL/LTR16A | |
| 2 | IGLC1 | constant region of lambda light chains | 13851_3 | LTR/ERVL/LTR16A | |
| 2 | MR1 | major histocompatibility complex, class I-related | 1311_2 | LINE/L2/L2 | |
| 2 | ADORA2A | adenosine receptor subtype A2a; G-protein coupled receptor; reduces the activation status of inflammatory cells | 13875_2 | SINE/MIR/MIR3 | |
| 2 | CKLF | chemokine-like factor (cytokine); essential role in the immune and inflammatory responses; potent chemoattractant for neutrophils, monocytes and lymphocytes | 10591_2 | SINE/Alu/AluSg | |
| 2 | KLF6 | Kruppel-like factor 6; core promoter guanine-rich element binding protein; transcriptional activator | 6794_2 | SINE/MIR/MIRb | |
| 2 | IL16 | interleukin 16; lymphocyte chemoattractant factor; cytokine; modulator of T cell activation; mediated by CD4 | 10023_2 | DNA/MER1/MER117 | |
| 2 | ST6GAL1 | sialyltransferase 1 (beta-galactoside alpha-2,6-sialytransferase); role in T-cell death; generation of cell-surface carbohydrate determinants and differentiation antigens | 3308_3 | SINE/MIR/MIR3 | |
| 2 | HLA-E | HLA class I histocompatibility lymphocyte antigen, E alpha chain; immunoregulatory role for cytotoxic T-lymphocytes | 4584_2 | DNA/MER1/Charlie1 | |
| 2 | IL21R | interleukin 21 receptor; type I cytokine receptor; transduces the growth promoting signal of IL21, and is important for the proliferation and differentiation of T cells, B cells, and natural killer (NK) cells | 10408_3 | SINE/MIR/MIRb | |
| 2 | ITGB2 | leukocyte cell surface adhesion glycoprotein; complement receptor C3 beta-subunit; integrin beta 2; macrophage antigen 1; facilitates inflammatory cell recruitment | 13756_2 | LINE/L1/LIME4a | |
| 2 | FYB | FYN-binding protein; adhesion and degranulation-promoting adaptor protein; adaptor helps form immunological synapse between T cell and antigen-presenting cell; mediates signaling from the T cell antigen receptor to integrins | 3896_2 | SINE/MIR/MIR3 | |
| 2 | IGHG1 | immunoglobulin heavy constant region gamma 1; involved in antigen binding and immune response | 9667_2 | LINE/L1/L1 | |
| 2 | PTPRC | protein tyrosine phosphatase receptor; leukocyte-common CD45 antigen; essential regulator of T- and B-cell antigen receptor signaling; regulator of cytokine receptor signaling; involved in hematopoiesis | 1347_2 | LINE/L1/L1ME | |
| 2 | LCP2 | lymphocyte cytosolic protein 2; adaptor or scaffold protein that promotes T cell development and activation as well as mast cell and platelet function. | 4315_2 | SINE/Alu/AluSx | |
| 2 | CD53 | glycoprotein leukocyte cell surface antigen; contributes to the transduction of CD2-generated signals in T cells and natural killer cells; role in T cell growth regulation | 964_2 | LINE/L2/L2 | |
| 2 | DOCK2 | dedicator of cytokinesis 2; hematopoietic cell-specific CDM family protein essential for lymphocyte chemotaxis; mediates T cell receptor-induced activation of Rac2 and IL-2 | 4313_2 | SINE/Alu/AluSx SINE/MIR/MIR | |
| 2 | CD74 | cell surface antigen; major histocompatibility complex, class II invariant chain; involved in NF-kappaB activation and interleukin-8 production | 4254_2 | DNA/MER1/Charlie8 | |
| 2 | MX1 | myxovirus resistance protein 1; interferon inducible; role in host defense against viruses | 13686_3 | DNA/MER1/MER5A | |
| 25 | CD6 | glycoprotein cell surface antigen; involved in lymphocyte activation and thymocyte development; role in maturation of the immunological synapse | DNA/MER1/MER113 | ||
| 25 | PLA2G4B | phospholipase enzyme secreted by neutrophils; produces arachidonic acid used for the biosynthesis of leukotriene in inflammatory response | 9758_2 | SINE/MIR/MIRb SINE/Alu/AluY | |
| 25 | BCL2 | B-cell lymphoma protein 2; blocks the apoptotic death of some cells such as lymphocytes | 12021_2 | LINE/L1/HAL1 | |
| 25 | TRA@ | T cell antigen receptor alpha locus; involved in thymocyte developement | 9169_2 9171_2 9174_4 | LINE/L1/L1ME2 SINE/MIR/MIR LINE/L2/L2 | |
| 25 | CCR9 | chemokine (C-C motif) G protein coupled receptor; regulator of thymocytes migration and maturation in normal and inflammation conditions; functional specialization of immune responses in different segments of the gastrointestinal tract | 2827_2 | LINE/L1/L1ME4a | |
| 25 | TCF7 | T cell specific transcription factor; regulates T cell development and peripheral T cell differentiation | 4139_4 | LINE/CR1/L3 | |
| 25 | CCL5 | T cell specific chemokine (C-C motif) ligand; chemoattractant for blood monocytes, memory T helper cells and eosinophils; causes release of histamine from basophils and activates eosinophils; | 11206_2 | LINE/L1/L1MB3 | |
| 25 | LAT | linker for activation of T cells; phosphorylated following activation of the T-cell antigen receptor signal transduction pathway; acts as a docking site and recruits multiple adaptor proteins and downstream signaling molecules into multimolecular signaling complexes | 10425_2 | SINE/Alu/AluJb SINE/MIR/MIR |
1Gene coexpression cluster from which the gene is taken.
2Official gene name symbol from the HUGO Gene nomenclature committee
3Representative mRNA Genbank accession taken from USC genome browser knownGene table.
4Brief description of the gene function
5NHGRI DNase-I hypersensitive site identifier taken from the UCSC genome browser nhgriDnaseHs table.
6Class/family/name of the transposable element (TE) sequence colocalized with the HS site