| Literature DB >> 25642351 |
James Lindesay1, Tshela E Mason2, William Hercules1, Georgia M Dunston3.
Abstract
Single nucleotide polymorphisms (SNPs) represent an important type of dynamic sites within the human genome. These common variants often locally correlate within more complex multi-SNP haploblocks that are maintained throughout generations in a stable population. Information encoded in the structure of SNPs and SNP haploblock variation can be characterized through a normalized information content metric. Genodynamics is being developed as the analogous "thermodynamics" characterizing the state variables for genomic populations that are stable under stochastic environmental stresses. Since living systems have not been found to develop in the absence of environmental influences, this paper describes the analogous genomic free energy metrics in a given environment. SNP haploblocks were constructed by Haploview v4.2 for five chromosomes from phase III HapMap data, and the genomic state variables for each chromosome were calculated. An in silico analysis was performed on SNP haploblocks with the lowest genomic energy measures. Highly favorable genomic energy measures were found to correlate with highly conserved SNP haploblocks. Moreover, the most conserved haploblocks were associated with an evolutionarily conserved regulatory element and domain.Entities:
Keywords: Information theory; biological information; entropy; genodynamics; genomic variation
Year: 2014 PMID: 25642351 PMCID: PMC4310014
Source DB: PubMed Journal: J Comput Biol Bioinform Res ISSN: 2141-2227
Figure 1Analysis of NIC values for chromosomes 1, 6, 11, 19 and 22 in the YRI and CEU populations.
Figure 2Analysis of the genomic energy measurements for chromosome 6 in the YRI and CEU populations.
Figure 3Comparison of the genomic energy spectra for chromosome 6 in the YRI and CEU populations.
Figure 4Comparison of the informatics of chromosome 6 in the YRI and CEU populations. The NIC value for the YRI population is 0.74±0.17 and 0.81±16 for the CEU population.
Figure 6Figures 6a and 6b. The ideogram on the left is of chromosome 6 illustrating the location of Block 3013 in the YRI population, while the ideogram on the right illustrates the location of Block 7016 in the CEU population on chromosome 6.
Protein domains associated with functional genes located in block 3013 (YRI) and block 7016 (CEU). The protein sequences were scanned using PROSITE, a database of protein domains, families and functional sites. CDART, a domain architecture retrieval tool, was used to identify evolutionarily conserved domains which are in lowercase; while those in boldface are common in both blocks.
| Protein domains | ZNRD1 | PPP1R11 | RNF39 | EPM2A | FBXO30 | SHPRH |
|---|---|---|---|---|---|---|
| carbohydrate binding module family 20 (CBM) | x | |||||
| Carbohydrate-binding-like fold | X | |||||
| Dual Specificity Phosphatase, Catalytic Domain | X | |||||
| protein-tyrosine/dual specificity phosphatase | x | |||||
| c-terminal helicase | x | |||||
| Helicase, superfamily ½, ATP-binding domain | X | |||||
| linker histone H1/H5, domain H15 | x | |||||
| P-loop containing nucleoside triphosphate hydrolase | X | |||||
| SNF2-related Domain | X | |||||
| WW Domain | X | |||||
| x | x | x | x | |||
| f-box domain, cyclin-like | x | |||||
| TRAF-like domain | X | |||||
| b30.2/spry | x | |||||
| Butyrophilin-like | X | |||||
| Concanavalin A-like | X | |||||
| SPRY-associated | X | |||||
| SPla/RY anodine receptor SPRY | X | |||||
| protein phosphatase inhibitor | x |
Molecular functions associated with the genes in block 3013 (YRI) and block 7016 (CEU). The molecular functions were determined by searching BioGPS, a gene annotation portal. Functions associated with one or more of the evolutionarily conserved protein domains are in lowercase, while those in boldface are common in both blocks.
| Molecular functions | ZNRD1 | PPP1R11 | RNF39 | EPM2A | FBXO30 | SHPRH |
|---|---|---|---|---|---|---|
| Protein Binding | X | |||||
| protein ser/thr phosphatase activity | x | |||||
| protein try phosphatase activity | x | |||||
| protein ser/thr/tyr phosphatase activity | x | |||||
| starch binding | x | |||||
| ATP Binding | X | |||||
| dna binding | x | |||||
| helicase activity | x | |||||
| ligase activity | x | |||||
| x | x | x | x | |||
| ubiquitin-protein ligase activity | x | |||||
| DNA-directed RNA Polymerase Activity | X | |||||
| Nucleic Acid Binding | X | |||||
| protein phosphatase inhibitor activity | x |
Biological processes associated with the genes in blocks 3013 (YRI) and 7016(CEU). The biological processes were determined using the web-based gene annotation tool, BioGPS. Those processes that are associated with one or more of the evolutionarily conserved protein domain are in lowercase.
| Biological processes | ZNRD1 | PPP1R11 | RNF39 | EPM2A | FBXO30 | SHPRH |
|---|---|---|---|---|---|---|
| Behavior | X | |||||
| glycogen metabolic process | x | |||||
| Nervous System Development | X | |||||
| peptidyl-tyrosine dephosphorylation | x | |||||
| protein dephosphorylation | x | |||||
| DNA Repair | X | |||||
| nucleosome assembly | x | |||||
| protein ubiquitination | x | |||||
| Nucleobase-Containing Compound Metabolic Process | X | |||||
| DNA-Dependent Transcription | X |
Figure 5Comparison of the major histocompatibility complex (MHC) region on chromosome 6 for the YRI and CEU populations.