Chun-Hsi Chen, Hsuan-Yu Lin, Chia-Lin Pan, Feng-Chi Chen.
Abstract
BACKGROUND: The lengths of 5'UTRs of multicellular eukaryotes have been suggested to be subject to stochastic changes, with upstream start codons (uAUGs) as the major constraint that suppresses 5'UTR elongation. However, this stochastic model cannot fully explain the variations in 5'UTR length. We hypothesize that selection pressure on a combination of genomic features is also important for 5'UTR evolution. Ignoring these features may have limited the explanatory power of the stochastic model. Furthermore, different selective constraints between vertebrates and invertebrates may lead to differences in the determinants of 5'UTR length, which have not been systematically analyzed.
Year: 2011 PMID: 22152105 PMCID: PMC3283318 DOI: 10.1186/1471-2105-12-S9-S3
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Coefficients of linear regression models for 5’UTR length prediction
| Predictors | human | mouse | rat | chicken | frog | zebrafish | fruit fly | mosquito | sea squirt | nematode |
|---|---|---|---|---|---|---|---|---|---|---|
| G+C content | 0.53 | 0.56 | 0.58 | 0.65 | 0.58 | 0.55 | 1.65 | 1.23 | 1.62 | 0.89 |
| AUG OE | -0.01 | -0.03 | -0.01 | -0.04 | -0.04 | -0.03 | -0.02 | -0.08 | -0.04 | -0.07 |
| UGA OE | -0.06 | -0.07 | -0.08 | -0.05 | -0.09 | -0.08 | -0.15 | -0.10 | -0.12 | -0.10 |
| UAA OE | -0.05 | -0.06 | -0.08 | -0.05 | -0.10 | -0.11 | -0.12 | -0.12 | -0.17 | -0.22 |
| UAG OE | -0.13 | -0.13 | -0.15 | -0.14 | -0.13 | -0.12 | -0.20 | -0.15 | -0.12 | -0.17 |
| CG OE | -0.03 | 0.04 | nsᵃ | -0.08 | -0.10 | 0.02 | ns | -0.05 | ns | -0.02 |
| UG OE | -0.05 | -0.03 | -0.04 | -0.10 | ns | -0.04 | 0.12 | 0.07 | 0.10 | -0.04 |
| UU OE | 0.04 | 0.02 | ns | ns | 0.04 | ns | 0.09 | ns | 0.17 | ns |
| UA OE | -0.03 | -0.04 | -0.04 | ns | ns | ns | 0.18 | 0.11 | ns | ns |
| Adjusted R² | 0.14 | 0.15 | 0.16 | 0.21 | 0.19 | 0.14 | 0.29 | 0.23 | 0.33 | 0.30 |
ᵃ ns, not significant.
Figure 1. The relative contributions to variability explained (RCVE) of different genomic features in the analyzed species. The RCVE was calculated from the difference in R² between the full model (with all predictors) and the reduced model (with the predictor of interest removed). A large RCVE indicates a large contribution of that predictor.
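The RCVE calculation described in the caption can be sketched as follows. This is a minimal illustration rather than the authors' code: the caption only says that RCVE is based on the difference in R² between the full and reduced models, so dividing that difference by the full-model R² is an assumption made here, as are the function names `r_squared` and `rcve` and the use of ordinary least squares via NumPy.

```python
import numpy as np

def r_squared(X, y):
    """R^2 of an ordinary least-squares fit of y on the columns of X (intercept added)."""
    A = np.column_stack([np.ones(len(y)), X])
    beta, *_ = np.linalg.lstsq(A, y, rcond=None)
    resid = y - A @ beta
    ss_res = float(resid @ resid)
    ss_tot = float(((y - y.mean()) ** 2).sum())
    return 1.0 - ss_res / ss_tot

def rcve(X, y, j):
    """Drop in R^2 when predictor column j is removed, relative to the full model.

    Assumption: the difference is normalized by the full-model R^2;
    the caption does not state the normalization explicitly.
    """
    r2_full = r_squared(X, y)
    r2_reduced = r_squared(np.delete(X, j, axis=1), y)
    return (r2_full - r2_reduced) / r2_full
```

Here `X` would hold one column per predictor (for example G+C content and the OE ratios) and `y` the 5'UTR lengths; calling `rcve(X, y, j)` for each column index `j` gives one value per predictor.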
Partial correlation coefficients between 5’UTR length and each predictor, controlling for the other genomic features.
| Predictors | human | mouse | rat | chicken | frog | zebrafish | fruit fly | mosquito | sea squirt | nematode |
|---|---|---|---|---|---|---|---|---|---|---|
| G+C content | 0.160 | 0.148 | 0.143 | 0.211 | 0.170 | 0.146 | 0.226 | |||
| AUG OE | -0.023 | -0.059 | ns | -0.116 | -0.090 | -0.074 | -0.026 | -0.145 | -0.075 | -0.145 |
| UGA OE | -0.135 | -0.152 | -0.163 | -0.125 | -0.209 | -0.176 | -0.252 | -0.188 | -0.262 | -0.218 |
| UAA OE | -0.144 | -0.173 | -0.187 | -0.201 | -0.213 | -0.186 | -0.141 | -0.187 | -0.262 | -0.244 |
| UAG OE | -0.272 | -0.239 | -0.267 | |||||||
| CG OE | nsᵃ | 0.035 | ns | -0.069 | -0.122 | 0.028 | ns | -0.059 | ns | -0.043 |
| UG OE | -0.039 | -0.024 | ns | -0.105 | ns | -0.034 | 0.092 | 0.062 | ns | ns |
| UU OE | 0.036 | ns | ns | ns | 0.044 | ns | 0.061 | ns | 0.125 | ns |
| UA OE | ns | -0.023 | ns | ns | ns | ns | 0.093 | 0.069 | ns | ns |
ᵃ ns, not significant.
ᵇ The bold-faced values represent the highest correlation (in absolute value) in each species.
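A partial correlation of the kind tabulated above can be illustrated with the residual method: regress both 5'UTR length and the predictor of interest on the remaining predictors, then correlate the two sets of residuals. The sketch below is a hedged example, not the authors' procedure; the record does not say which correlation statistic was used, so Pearson correlation is an assumption, and the names `residuals` and `partial_corr` are hypothetical.

```python
import numpy as np

def residuals(target, controls):
    """Residuals of an OLS fit of target on the control columns (intercept added)."""
    A = np.column_stack([np.ones(len(target)), controls])
    beta, *_ = np.linalg.lstsq(A, target, rcond=None)
    return target - A @ beta

def partial_corr(y, X, j):
    """Correlation between y and column j of X after removing the linear
    effect of all other columns of X from both.

    Assumption: Pearson correlation of residuals; the source does not
    specify the statistic.
    """
    controls = np.delete(X, j, axis=1)
    ry = residuals(y, controls)
    rx = residuals(X[:, j], controls)
    return float(np.corrcoef(rx, ry)[0, 1])
```

A predictor that is correlated with 5'UTR length only through another feature should yield a partial correlation near zero under this scheme, which is why the values can differ in rank from the raw regression coefficients.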