| Literature DB >> 29358630 |
Ariel A Aptekmann1,2, Alejandro D Nadra3,4,5.
Abstract
The subtle mechanisms by which protein-DNA interactions remain functional across a wide range of temperatures are largely unknown. In this work, we manually curated available information relating fully sequenced archaeal genomes with organism growth temperatures. We built a motif that represents the core promoter of each species and calculated its information content. We then studied the relation between optimal growth temperature (OGT) and information content (IC) in the promoter region.We found a positive correlation between G + C content and OGT in tRNA regions and not in overall genome. Furthermore, we found that there is a positive correlation between information content and optimal growth temperatures in Archaea. This can't be explained by an increased C+G composition nor by other obvious mechanisms. These findings suggest that increased information content could produce a positive fitness in organisms living at high temperatures. We suggest that molecular information theory may need to be adapted for hyperthermophiles.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29358630 PMCID: PMC5777992 DOI: 10.1038/s41598-018-19495-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Summary of acheal species evaluated informing its optimal growth temperature (OGT), genomic G + C content (G + C), motif’s information content (IC) and motif’s relative entropy (RE).
| Index | Name | OGT (°C) | IC (bits) | RE (bits) | G + C (%) | |
|---|---|---|---|---|---|---|
| 0 |
| 35.5 | 11.3 | 13.2 | 0.65 | |
| 1 |
| 37 | 11 | 13.6 | 0.66 | |
| 2 |
| 37 | 10.8 | 13 | 0.64 | |
| 3 |
| 38.5 | 12.6 | 14 | 0.65 | |
| 4 |
| 38.5 | 11.4 | 11.7 | 0.36 | |
| 5 |
| 39 | 13.9 | 15.9 | 0.61 | |
| 6 |
| 41 | 12.9 | 14.6 | 0.61 | |
| 7 |
| 42.5 | 13.3 | 15.2 | 0.66 | |
| 8 |
| 45 | 13.3 | 16.8 | 0.67 | |
| 9 |
| 45 | 12.9 | 15.2 | 0.65 | |
| 10 |
| 45 | 10.7 | 11.9 | 0.63 | |
| 11 |
| 47 | 11.7 | 13.9 | 0.62 | |
| 12 |
| 49 | 13.8 | 15.6 | 0.61 | |
| 13 |
| 49 | 13.6 | 16.5 | 0.68 | |
| 14 |
| 49 | 11.7 | 13.4 | 0.62 | |
| 15 |
| 51 | 12.4 | 15.2 | 0.66 | |
| 16 |
| 51 | 13.4 | 16 | 0.65 | |
| 17 |
| 67.5 | 16.9 | 13.9 | 0.37 | |
| 18 |
| 75 | 12.6 | 12.1 | 0.43 | |
| 19 |
| 76 | 12.9 | 12.6 | 0.49 | |
| 20 |
| 77.5 | 14 | 13.4 | 0.47 | |
| 21 |
| 78 | 19.5 | 17.5 | 0.4 | |
| 22 |
| 79 | 18.8 | 17.1 | 0.42 | |
| 23 |
| 80 | 13.7 | 13.4 | 0.44 | |
| 24 |
| 81 | 15 | 14.3 | 0.45 | |
| 25 |
| 81.2 | 14.8 | 16.7 | 0.58 | |
| 26 |
| 82 | 13.8 | 12.4 | 0.42 | |
| 27 |
| 82.5 | 11.9 | 13.2 | 0.57 | |
| 28 |
| 85 | 18.9 | 17.3 | 0.42 | |
| 29 |
| 85 | 14.1 | 14.8 | 0.53 | |
| 30 |
| 85 | 13.1 | 13.6 | 0.57 | |
| 31 |
| 85 | 12.8 | 12.5 | 0.47 | |
| 32 |
| 87.5 | 18.8 | 17.7 | 0.45 | |
| 33 |
| 90 | 16.5 | 19.1 | 0.6 | |
| 34 |
| 92 | 17.5 | 19 | 0.55 | |
| 35 | NC_014471 |
| 93.5 | 13.1 | 12.3 | 0.36 |
| 36 |
| 98 | 17.9 | 18.3 | 0.52 | |
| 37 |
| 100 | 17.8 | 16.7 | 0.45 | |
| 38 |
| 100 | 17.5 | 15.7 | 0.41 |
Figure 1G + C content for tRNA coding and promoter regions. G + C content was analyzed either in the region coding for the tRNA itself (empty circles) or in a promoter region of the same length for each of the genomes in our dataset (filled circles). Coding regions show a clear increase with temperature (R = 0.91 p = 4.9 × 10−30), while promoters region do not (R = −0.3 p = 7 × 10−3).
Figure 2Best and worst TATA’s logos. Lowest (top) and highest (bottom) e-score motif of the TBP binding site (TBS) for 39 archaeal genomes. TBS are represented as sequence logos. 100 bp upstream regions of tRNAs were collected for each genome and were analyzed by MEME to identify TBS. For a full list of obtained logos refer to Figure S12.
Figure 3Information content correlates with optimal growth temperature. Information content of the predicted TBS motif on 100bp window, for 39 archaeal genomes is plotted against optimal growth temperature. Each dot represents one species average. Numbering for each dot refers to the species/motifs in Table 1