| Literature DB >> 23193361 |
Qiao Zhong1, Weidong Xu, Yuanjian Wu, Hongxing Xu.
Abstract
Human metapneumovirus (HMPV) is an important agent of acute respiratory tract infection in children, while its pathogenicity and molecular evolution are lacking. Herein, we firstly report the synonymous codon usage patterns of HMPV genome. The relative synonymous codon usage (RSCU) values, effective number of codon (ENC) values, nucleotide contents, and correlation analysis were performed among 17 available whole genome of HMPV, including different genotypes. All preferred codons in HMPV are ended with A/U nucleotide and exhibited a great association with its high proportion of these two nucleotides in their genomes. Mutation pressure rather than natural selection is the main influence factor that determines the bias of synonymous codon usage in HMPV. The complementary pattern of codon usage bias between HMPV and human cell was observed, and this phenomenon suggests that host cells might be also act as an important factor to affect the codon usage bias. Moreover, the codon usage biases in each HMPV genotypes are separated into different clades, which suggest that phylogenetic distance might involve in codon usage bias formation as well. These analyses of synonymous codon usage bias in HMPV provide more information for better understanding its evolution and pathogenicity.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23193361 PMCID: PMC3502075 DOI: 10.1155/2012/460837
Source DB: PubMed Journal: J Biomed Biotechnol ISSN: 1110-7243
17 HMPV genomes sequences used in this study.
| SN | Genbank accession | Genotype | Source |
|---|---|---|---|
| 1 | AB503857.1 | A2 | Shiga, Japan |
| 2 | AF371337.2 | A1 | Amsterdam, The Netherlands |
| 3 | AY297748.1 | B2 | Quebec, Canada |
| 4 | AY297749.1 | A2 | Quebec, Canada |
| 5 | AY525843.1 | B1 | Amsterdam, The Netherlands |
| 6 | DQ843658.1 | B1 | Beijing, China |
| 7 | DQ843659.1 | A2 | Beijing, China |
| 8 | EF535506.1 | B2 | Taiwan, China |
| 9 | FJ168778.1 | B2 | Amsterdam, The Netherlands |
| 10 | FJ168779.1 | A2 | Amsterdam, The Netherlands |
| 11 | GQ153651.1 | A2 | Guangzhou, China |
| 12 | HM197719.1 | B2 | Rwanda |
| 13 | JN184399.1 | A1 | Nashville, TN, USA |
| 14 | JN184400.1 | A2 | Nashville, TN, USA |
| 15 | JN184401.1 | B2 | Nashville, TN, USA |
| 16 | JN184402.1 | B2 | Nashville, TN, USA |
| 17 | NC_004148.2 | A2 | Quebec, Canada |
Abbreviation: SN is serial number.
Synonymous codon usages of complete coding region of HMPV, human cell, AMPV, and bird cell.
| AA | Codon | aRSCU | bRSCU | cRSCU | dRSCU |
|---|---|---|---|---|---|
| Ala | GCU | 1.36 | 1.08 | 1.02 |
|
| GCC | 0.55 |
| 0.77 | 1.33 | |
| GCA |
| 0.92 |
| 0.98 | |
| GCG | 0.2 | 0.44 | 0.22 | 0.27 | |
|
| |||||
| Arg | CGU | 0.12 | 0.48 | 0.15 | 0.00 |
| CGC | 0.13 | 1.08 | 0.14 | 0.55 | |
| CGA | 0.31 | 0.66 | 0.34 | 0.27 | |
| CGG | 0.23 | 1.2 | 0.41 | 1.09 | |
| AGA |
|
|
|
| |
| AGG | 1.63 |
| 2.10 | 1.91 | |
|
| |||||
| Asn | AAU |
| 0.94 |
|
|
| AAC | 0.87 |
| 0.92 | 0.59 | |
|
| |||||
| Asp | GAU |
| 0.92 |
| 0.88 |
| GAC | 0.74 |
| 0.89 |
| |
|
| |||||
| Cys | UGU |
| 0.92 |
| 0.67 |
| UGC | 0.96 |
| 0.89 |
| |
|
| |||||
| Gln | CAA |
| 0.54 |
| 0.48 |
| CAG | 0.73 |
| 0.81 |
| |
|
| |||||
| Glu | GAA |
| 0.84 |
| 0.75 |
| GAG | 0.74 |
| 0.89 |
| |
|
| |||||
| Gly | GGU | 1.08 | 0.64 | 0.89 | 0.43 |
| GGC | 0.73 |
| 0.68 | 1.19 | |
| GGA |
| 1 |
| 1.08 | |
| GGG | 0.71 | 1 | 1.01 |
| |
|
| |||||
| His | CAU |
| 0.84 | 1.00 | 0.35 |
| CAC | 0.87 |
| 1.00 |
| |
|
| |||||
| Ile | AUU |
| 1.08 | 1.02 |
|
| AUC | 1 |
| 0.90 | 1.03 | |
| AUA | 0.8 | 0.51 |
| 0.19 | |
|
| |||||
| Leu | UUA |
| 0.48 | 1.19 | 0.42 |
| UUG | 1.08 | 0.78 |
| 0.85 | |
| CUU | 1.14 | 0.78 | 0.83 | 0.56 | |
| CUC | 0.64 | 1.2 | 0.57 | 0.99 | |
| CUA | 0.85 | 0.42 | 0.92 | 0.28 | |
| CUG | 0.84 |
| 1.13 |
| |
|
| |||||
| Lys | AAA |
| 0.86 |
| 0.93 |
| AAG | 0.79 |
| 0.88 |
| |
|
| |||||
| Phe | UUU |
| 0.92 |
|
|
| UUC | 0.68 |
| 0.73 | 0.67 | |
|
| |||||
| Pro | CCU | 1.15 | 1.16 | 1.12 | 1.22 |
| CCC | 0.82 |
| 0.74 | 1.22 | |
| CCA |
| 1.12 |
|
| |
| CCG | 0.23 | 0.44 | 0.36 | 0.08 | |
|
| |||||
| Ser | UCU | 0.97 | 1.14 | 0.93 | 0.71 |
| UCC | 0.63 | 1.32 | 0.72 | 1.15 | |
| UCA |
| 0.9 | 1.37 | 1.06 | |
| UCG | 0.13 | 0.3 | 0.18 | 0.26 | |
| AGU |
| 0.9 |
| 0.44 | |
| AGC | 1.36 |
| 1.30 |
| |
|
| |||||
| Thr | ACU | 1.17 | 1 | 0.98 | 0.16 |
| ACC | 0.92 |
| 0.83 | 1.28 | |
| ACA |
| 1.12 |
|
| |
| ACG | 0.16 | 0.44 | 0.27 | 0.80 | |
|
| |||||
| Tyr | UAU |
| 0.88 |
| 0.95 |
| UAC | 0.66 |
| 0.86 |
| |
|
| |||||
| Val | GUU |
| 0.72 | 1.17 | 0.57 |
| GUC | 0.73 | 0.96 | 0.72 | 0.91 | |
| GUA | 0.93 | 0.48 | 0.87 | 0.80 | |
| GUG | 0.88 |
|
|
| |
The number in this table is shown as relative synonymous codon usage values (RSCU) and aRSCU values represent values in HMPV, bRSCU values represent values in human host cell, cRSCU values represent values in AMPV, and dRSCU values represent values in bird cell. All preferred codons are shown in bold. AA is abbreviation of amino acid.
Nucleotide contents and effective number of codons in complete coding region in HMPV genomes.
| SN | A% | U% | G% | C% | GC% | A3% | U3% | C3% | G3% | GC3% | *ENC |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 37.51 | 25.18 | 19.52 | 17.8 | 37.3 | 41.76 | 27.7 | 17.68 | 11.6 | 29.28 | 45.741 |
| 2 | 37.49 | 25.58 | 19.55 | 17.4 | 36.93 | 41.67 | 28.6 | 16.83 | 11.52 | 28.35 | 45.148 |
| 3 | 37.78 | 25.27 | 19.63 | 17.3 | 36.95 | 36.09 | 21.5 | 15.18 | 17.51 | 32.69 | 47.608 |
| 4 | 37.4 | 25.5 | 19.61 | 17.5 | 37.09 | 41.55 | 27.9 | 17.44 | 11.76 | 29.2 | 45.667 |
| 5 | 37.58 | 25.41 | 19.76 | 17.3 | 37.02 | 41.32 | 28.1 | 16.71 | 12.47 | 29.18 | 45.529 |
| 6 | 37.65 | 25.64 | 19.75 | 17 | 36.71 | 41.54 | 28.5 | 16.25 | 12.35 | 28.6 | 45.662 |
| 7 | 37.33 | 25.66 | 19.59 | 17.4 | 37.01 | 36.36 | 23.1 | 16.45 | 15.7 | 32.16 | 48.28 |
| 8 | 37.53 | 26.27 | 19.88 | 16.3 | 36.2 | 41.52 | 29.3 | 15.5 | 12.34 | 27.84 | 45.127 |
| 9 | 37.61 | 25.69 | 19.72 | 17 | 36.7 | 41.48 | 28.8 | 16.05 | 12.36 | 28.41 | 45.468 |
| 10 | 37.46 | 25.58 | 19.51 | 17.5 | 36.96 | 41.62 | 28.2 | 17.26 | 11.58 | 28.85 | 45.707 |
| 11 | 37.22 | 25.83 | 19.76 | 17.2 | 36.95 | 41.16 | 28.9 | 16.56 | 12.07 | 28.63 | 45.379 |
| 12 | 37.73 | 25.68 | 19.7 | 16.9 | 36.59 | 41.55 | 28.9 | 15.85 | 12.42 | 28.27 | 45.28 |
| 13 | 37.42 | 25.72 | 19.59 | 17.3 | 36.86 | 41.47 | 28.9 | 16.61 | 11.67 | 28.28 | 45.274 |
| 14 | 37.4 | 25.47 | 19.61 | 17.5 | 37.13 | 41.38 | 28.1 | 17.46 | 11.77 | 29.24 | 45.58 |
| 15 | 37.47 | 25.51 | 19.87 | 17.2 | 37.02 | 41.16 | 28.4 | 16.52 | 12.64 | 29.16 | 45.442 |
| 16 | 37.61 | 25.43 | 19.78 | 17.2 | 36.96 | 41.56 | 28.4 | 16.32 | 12.45 | 28.77 | 45.778 |
| 17 | 37.4 | 25.5 | 19.61 | 17.5 | 37.09 | 41.55 | 27.9 | 17.44 | 11.76 | 29.2 | 45.667 |
| Mean | 37.51 | 25.58 | 19.67 | 17.2 | 36.91 | 40.87 | 27.7 | 16.6 | 12.59 | 29.18 | 45.785 |
| SD | 0.144 | 0.241 | 0.114 | 0.33 | 0.25 | 1.755 | 2.1 | 0.713 | 1.589 | 1.296 | 0.8458 |
Abbreviation: SN is serial number; *ENC is effective number of codons.
Summary of correlation analysis between A%, U%, C%, G%, and GC% and A3, U3, C3, G3, and GC3 in all HMPV samples.
| A3% | U3% | C3% | G3% | GC3% | |
|---|---|---|---|---|---|
| A% | 0.0959NS | 0.1144NS | −0.6199** | 0.3788NS | −0.2595NS |
| U% | −0.1596NS | 0.7821** | −0.4150NS | −0.0933NS | −0.7342** |
| C% | 0.2417NS | −0.7153** | 0.8331** | −0.4699NS | 0.7178** |
| G% | −0.4012NS | 0.4234NS | −0.5846* | 0.6129** | −0.3138NS |
| GC% | 0.0774NS | −0.7474** | 0.7880** | −0.1328NS | 0.8052** |
The number in the table represents as correlation coefficient r value, which is calculated from each correlation analysis. Abbreviation: NS represents as nonsignificant (P > 0.05), *represents P < 0.05, **represents P < 0.01.
Summary of correlation between the first two principle axes and nucleotide contents in samples.
| Base compositions |
|
|
|---|---|---|
| A3% |
|
|
| U3% |
|
|
| C3% |
|
|
| G3% |
|
|
| GC3% |
|
|
The number in the table represents as correlation coefficient r value, which is calculated from each correlation analysis. Abbreviation: NS represents as nonsignificant (P > 0.05), *represents P < 0.05, **represents P < 0.01.
Figure 1The plot of value of the first and second axis of each complete coding region in COA. The first axis (f1′) accounts for 43.27% of the total variation, and the second axis (f2′) accounts for 33.38% of the total variation. Each HMPV complete coding region was divided by genotype.
Figure 2Effective number of codons used in each complete coding region plotted against the GC3. The continuous curve plots the relationship between GC3 and ENC in the absence of selection. All of plots lie below the expected curve.