| Literature DB >> 24303050 |
Zhicheng Zhang1, Wei Dai, Dingzhen Dai.
Abstract
Two species of the DNA virus Torque teno sus virus (TTSuV), TTSuV1 and TTSuV2, have become widely distributed in pig-farming countries in recent years. In this study, we performed a comprehensive analysis of synonymous codon usage bias in 41 available TTSuV2 coding sequences (CDS), and compared the codon usage patterns of TTSuV2 and TTSuV1. TTSuV codon usage patterns were found to be phylogenetically conserved. Values for the effective number of codons (ENC) indicated that the overall extent of codon usage bias in both TTSuV2 and TTSuV1 was not significant, the most frequently occurring codons had an A or C at the third codon position. Correspondence analysis (COA) was performed and TTSuV2 and TTSuV1 sequences were located in different quadrants of the first two major axes. A plot of the ENC revealed that compositional constraint was the major factor determining the codon usage bias for TTSuV2. In addition, hierarchical cluster analysis of 41 TTSuV2 isolates based on relative synonymous codon usage (RSCU) values suggested that there was no association between geographic distribution and codon bias of TTSuV2 sequences. Finally, the comparison of RSCU for TTSuV2, TTSuV1 and the corresponding host sequence indicated that the codon usage pattern of TTSuV2 was similar to that of TTSuV1. However the similarity was low for each virus and its host. These conclusions provide important insight into the synonymous codon usage pattern of TTSuV2, as well as better understangding of the molecular evolution of TTSuV2 genomes.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24303050 PMCID: PMC3841265 DOI: 10.1371/journal.pone.0081469
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
41 complete TTSuV2 genes used in this study.
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| 1 | AY823991 | 2p | Brazil | 2005 | 1875 |
| 2 | GU188046 | 472142 | Germany | 2008 | 1878 |
| 3 | GU456385 | PTTV2b-VA | USA | 2008 | 1878 |
| 4 | GU456386 | PTTV2c-VA | USA | 2008 | 1878 |
| 5 | GU570197 | TTV2_GE9 | Spain | 2011 | 1884 |
| 6 | GU570203 | TTV2_1907 | Spain | 2011 | 1884 |
| 7 | GU570204 | TTV2_G31 | Spain | 2011 | 1884 |
| 8 | GU570205 | TTV2_G33 | Spain | 2011 | 1884 |
| 9 | GU570206 | TTV2_G43 | Spain | 2011 | 1875 |
| 10 | GU570207 | TTV2_G61 | Spain | 2011 | 1875 |
| 11 | GU570208 | TTV2_G64 | Spain | 2011 | 1884 |
| 12 | GU570209 | TTV2_GE1 | Spain | 2010 | 1884 |
| 13 | HM633214 | TTV2Bj7-2 | China | 2009 | 1863 |
| 14 | HM633215 | TTV2Bj2-3 | China | 2009 | 1884 |
| 15 | HM633216 | TTV2Bj4-3 | China | 2009 | 1884 |
| 16 | HM633217 | TTV2Bj6-2 | China | 2009 | 1875 |
| 17 | HM633218 | TTV2Bj6-3 | China | 2009 | 1875 |
| 18 | HM633219 | TTV2Bj7-3 | China | 2009 | 1884 |
| 19 | HM633220 | TTV2Fj2 | China | 2009 | 1884 |
| 20 | HM633221 | TTV2Jl1 | China | 2009 | 1884 |
| 21 | HM633222 | TTV2Jl2 | China | 2009 | 1884 |
| 22 | HM633223 | TTV2Jl27 | China | 2009 | 1863 |
| 23 | HM633224 | TTV2Bj1-2 | China | 2009 | 1878 |
| 24 | HM633225 | TTV2Hb1 | China | 2009 | 1884 |
| 25 | HM633226 | TTV2Bj8 | China | 2009 | 1863 |
| 26 | HM633227 | TTV2Bj11 | China | 2009 | 1878 |
| 27 | HM633228 | TTV2Bj12 | China | 2009 | 1875 |
| 28 | HM633229 | TTV2Gx1 | China | 2010 | 1872 |
| 29 | HM633230 | TTV2Gx2 | China | 2009 | 1878 |
| 30 | HM633231 | TTV2Gx3-2 | China | 2009 | 1884 |
| 31 | HM633232 | TTV2Gx4 | China | 2009 | 1872 |
| 32 | HM633233 | TTV2Jx1 | China | 2009 | 1884 |
| 33 | HM633234 | TTV2Jx2 | China | 2009 | 1875 |
| 34 | HM633235 | TTV2Ln13 | China | 2009 | 1863 |
| 35 | HM633236 | TTV2Ln14 | China | 2009 | 1872 |
| 36 | HM633237 | TTV2Ln21 | China | 2009 | 1863 |
| 37 | HM633238 | TTV2Ln22 | China | 2009 | 1863 |
| 38 | HM633239 | TTV2Ln23-2 | China | 2009 | 1875 |
| 39 | HM633240 | lung1 | China | 2009 | 1884 |
| 40 | HM633241 | lung3 | China | 2009 | 1878 |
| 41 | HQ204188 | SC | China | 2010 | 1878 |
Figure 1Recombination analysis of TTSuV2 and TTSuV1 sequences using the RAT.
The colour of the line on the graph is the same as the colour of its sequence name on the left.
Nucleotide content of 41 TTSuV2 genomes (%).
|
|
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 20.54 | 27.81 | 21.41 | 33.11 | 36.33 | 41.96 | 21.72 | 28.10 | 45.0 | 46.0 | 54.55 |
| 2 | 20.22 | 26.48 | 22.54 | 32.61 | 35.60 | 43.24 | 21.64 | 26.19 | 45.9 | 45.3 | 55.20 |
| 3 | 20.40 | 27.06 | 22.41 | 33.55 | 34.88 | 39.46 | 22.31 | 29.01 | 46.7 | 48.0 | 57.31 |
| 4 | 20.40 | 27.06 | 22.41 | 33.55 | 34.88 | 39.46 | 22.31 | 29.01 | 46.7 | 48.0 | 57.31 |
| 5 | 20.75 | 27.89 | 21.67 | 33.33 | 35.44 | 40.17 | 22.14 | 27.76 | 45.6 | 46.5 | 57.86 |
| 6 | 20.44 | 27.91 | 21.83 | 33.41 | 35.90 | 40.40 | 21.83 | 28.33 | 45.5 | 46.6 | 58.18 |
| 7 | 20.50 | 28.07 | 21.57 | 32.89 | 35.74 | 40.52 | 22.19 | 28.17 | 45.6 | 46.2 | 58.16 |
| 8 | 20.39 | 27.91 | 21.88 | 33.41 | 35.90 | 40.40 | 21.83 | 28.33 | 45.6 | 46.6 | 58.18 |
| 9 | 20.34 | 27.31 | 21.37 | 33.26 | 36.47 | 41.72 | 21.83 | 28.47 | 45.1 | 46.5 | 53.76 |
| 10 | 20.03 | 27.63 | 21.67 | 32.89 | 36.47 | 41.69 | 21.83 | 27.25 | 45.6 | 45.8 | 56.04 |
| 11 | 20.03 | 27.03 | 21.52 | 33.19 | 36.11 | 41.33 | 22.34 | 27.88 | 45.8 | 46.4 | 55.66 |
| 12 | 20.96 | 28.54 | 21.47 | 32.68 | 35.44 | 39.91 | 22.14 | 28.13 | 45.4 | 46.2 | 58.18 |
| 13 | 21.50 | 27.29 | 21.34 | 31.99 | 37.02 | 44.04 | 20.15 | 28.54 | 43.4 | 45.1 | 54.69 |
| 14 | 19.93 | 28.05 | 21.52 | 33.71 | 36.67 | 41.83 | 21.88 | 27.99 | 45.4 | 46.1 | 55.37 |
| 15 | 20.44 | 28.28 | 21.06 | 34.39 | 37.03 | 41.31 | 21.47 | 27.62 | 44.6 | 46.5 | 55.04 |
| 16 | 19.81 | 28.28 | 21.83 | 33.71 | 36.58 | 40.40 | 21.78 | 27.92 | 45.8 | 46.5 | 57.14 |
| 17 | 20.09 | 26.23 | 21.79 | 34.75 | 36.17 | 41.24 | 21.95 | 29.17 | 46.1 | 48.1 | 57.56 |
| 18 | 19.88 | 28.41 | 21.73 | 33.33 | 36.72 | 41.74 | 21.67 | 26.97 | 45.5 | 45.5 | 56.51 |
| 19 | 20.24 | 28.07 | 21.62 | 32.68 | 35.95 | 40.35 | 22.19 | 29.02 | 45.8 | 46.6 | 57.03 |
| 20 | 19.93 | 28.05 | 21.62 | 33.71 | 36.52 | 41.65 | 21.93 | 27.86 | 45.5 | 46.1 | 55.55 |
| 21 | 20.29 | 27.95 | 21.57 | 32.53 | 35.90 | 40.35 | 22.24 | 28.78 | 45.9 | 46.5 | 56.86 |
| 22 | 20.40 | 27.33 | 20.87 | 31.89 | 37.90 | 43.16 | 20.82 | 30.00 | 44.2 | 45.9 | 57.78 |
| 23 | 20.66 | 26.29 | 21.23 | 33.71 | 36.12 | 40.80 | 22.00 | 30.26 | 45.2 | 48.0 | 56.84 |
| 24 | 20.34 | 28.48 | 21.06 | 33.86 | 36.88 | 41.31 | 21.73 | 28.06 | 44.8 | 46.4 | 54.86 |
| 25 | 21.81 | 28.51 | 20.92 | 31.63 | 36.86 | 45.07 | 20.40 | 26.82 | 42.9 | 43.8 | 53.98 |
| 26 | 19.99 | 26.29 | 22.00 | 35.06 | 36.01 | 40.92 | 22.00 | 29.10 | 46.3 | 48.2 | 57.24 |
| 27 | 20.59 | 28.16 | 21.57 | 33.92 | 36.07 | 40.77 | 21.78 | 28.06 | 45.3 | 46.7 | 54.41 |
| 28 | 21.09 | 27.25 | 21.34 | 31.21 | 37.42 | 44.59 | 20.16 | 28.12 | 43.4 | 44.4 | 55.97 |
| 29 | 20.56 | 26.15 | 21.84 | 33.63 | 35.55 | 41.16 | 22.05 | 28.95 | 46.1 | 47.5 | 56.81 |
| 30 | 20.34 | 28.28 | 21.11 | 34.39 | 36.93 | 40.99 | 21.62 | 27.79 | 44.9 | 46.7 | 54.93 |
| 31 | 21.45 | 26.81 | 21.14 | 31.21 | 36.90 | 43.72 | 20.52 | 28.50 | 43.7 | 44.9 | 56.01 |
| 32 | 20.08 | 26.67 | 21.57 | 33.11 | 36.31 | 42.54 | 22.03 | 28.07 | 45.3 | 46.0 | 56.14 |
| 33 | 20.38 | 29.12 | 21.57 | 32.28 | 37.31 | 43.51 | 20.74 | 26.85 | 44.7 | 44.1 | 56.39 |
| 34 | 20.77 | 27.27 | 21.13 | 31.82 | 37.64 | 42.89 | 20.46 | 30.56 | 44.1 | 46.2 | 56.61 |
| 35 | 21.19 | 29.36 | 21.19 | 30.91 | 37.16 | 43.94 | 20.47 | 26.42 | 43.7 | 43.2 | 56.11 |
| 36 | 21.18 | 28.47 | 20.40 | 31.48 | 37.85 | 42.66 | 20.56 | 31.39 | 43.5 | 45.9 | 57.41 |
| 37 | 21.08 | 28.37 | 20.46 | 31.63 | 38.21 | 43.22 | 20.25 | 31.22 | 43.2 | 45.8 | 56.27 |
| 38 | 20.43 | 27.33 | 21.47 | 33.78 | 36.38 | 40.67 | 21.72 | 29.15 | 45.2 | 47.3 | 54.02 |
| 39 | 19.88 | 26.93 | 21.93 | 33.55 | 36.67 | 42.83 | 21.52 | 27.23 | 45.4 | 45.8 | 54.49 |
| 40 | 20.40 | 28.92 | 21.64 | 32.96 | 37.35 | 43.12 | 20.61 | 26.59 | 44.4 | 44.4 | 56.01 |
| 41 | 20.14 | 27.11 | 21.79 | 33.78 | 36.27 | 41.63 | 21.79 | 28.85 | 45.7 | 47.1 | 56.25 |
| mean | 20.48 | 27.67 | 21.52 | 33.04 | 36.46 | 41.77 | 21.53 | 28.35 | 45.09 | 46.18 | 56.21 |
RSCU values of codons in TTSuV2, TTSuV1 and swine.a
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|
| Phe | UUU | 1.00 | 0.76 |
| Tyr |
|
|
|
|
| UUC | 1.00 |
| 0.89 | UAC | 0.90 | 0.90 | 0.82 | ||
| Leu |
|
|
| 0.65 | Ala | GCU | 0.81 |
|
|
| UUG | 0.85 | 1.21 | 0.85 | GCC | 0.94 | 1.15 | 1.22 | ||
| CUU | 0.90 | 0.49 | 1.20 |
|
| 1.15 | 1.05 | ||
| CUC | 1.03 | 0.71 | 1.12 | GCG | 0.74 | 0.50 | 0.37 | ||
| CUA | 1.23 | 0.99 | 0.56 | His | CAU | 0.91 | 0.60 | 0.97 | |
| CUG | 0.44 | 1.22 |
|
|
|
|
| ||
| Ile | AUU | 0.53 | 0.57 | 1.06 | Gln |
|
| 0.90 | 0.85 |
| AUC | 0.95 | 1.02 |
| CAG | 0.99 |
|
| ||
|
|
|
| 0.83 | Asn | AAU | 0.97 |
|
| |
| Val | GUU | 0.71 | 0.62 | 1.11 |
|
| 0.99 | 0.98 | |
| GUC | 0.56 | 0.40 | 0.96 | Lys |
|
|
|
| |
|
|
| 1.28 | 0.64 | AAG | 0.74 | 0.78 | 0.79 | ||
| GUG | 0.99 |
|
| Asp | GAU | 0.67 | 0.74 | 0.95 | |
| Ser | UCU | 0.82 | 0.75 |
|
|
|
|
| |
| UCC | 1.35 | 0.72 | 1.20 | Glu |
|
|
|
| |
|
|
|
| 1.00 | GAG | 0.74 | 0.92 | 0.91 | ||
| UCG | 0.53 | 0.56 | 0.31 | Cys | UGU | 0.67 | 0.55 |
| |
| AGU | 0.55 | 1.39 | 0.93 |
|
|
| 0.94 | ||
| AGC | 1.10 | 1.10 | 1.22 | Arg | CGU | 0.58 | 0.51 | 0.55 | |
| Pro |
|
|
|
| CGC | 0.39 | 0.93 | 0.65 | |
| CCC | 0.25 | 0.72 | 1.08 | CGA | 0.50 | 0.63 | 0.54 | ||
| CCA | 1.48 | 1.11 | 1.23 | CGG | 0.64 | 0.55 | 0.74 | ||
| CCG | 0.69 | 0.96 | 0.43 |
|
| 1.34 |
| ||
| Thr |
|
| 1.01 | 1.19 | AGG | 1.73 |
| 1.67 | |
| ACC | 1.01 |
| 1.23 | Gly | GGU | 0.49 | 0.74 | 0.81 | |
| ACA | 1.06 | 1.21 |
| GGC | 0.66 | 0.99 | 1.08 | ||
| ACG | 0.63 | 0.55 | 0.34 |
|
|
|
| ||
| GGG | 0.78 | 0.95 | 0.94 |
The preferred codons for each amino acid is displayed in bold.
AA is the abbreviation of Amino Acid.
SUS is swine.
Figure 2Distribution of the ENC values and GC content at synonymous codon third position (GC3S).
The curve indicates the expected codon usage if compositional constraint alone account for codon usage bias.
Figure 3Correspondence analysis of codon usage patterns of TTSuV2 and TTSuV1.
The correlation analysis between A, T, G, C, GC contents and A3S, T3S, G3S, C3S, GC3S contents in TTSuV2 CDS.a
| A3S% | T(U)3S % | G3S % | C3S % | GC3S % | |
|---|---|---|---|---|---|
| A% | 0.761 | 0.378 | -0.086NS | -0.364 | -0.616 |
| U% | 0.238NS | 0.175NS | 0.234NS | -0.52 | -0.237NS |
| G% | -0.805 | -0.391 | 0.171NS | 0.434 | 0.664 |
| C% | -0.458 | -0.393 | -0.139NS | 0.392 | 0.378 |
| GC% | -0.710 | 0.434 | 0.078NS | 0.505 | 0.645 |
Value in this table is the P-value of correlation analysis.
NS, non-significant (p>0.05).
0.01
p<0.01.
The correlation analysis between the first two axes and nucleotide contents in TTSuV2 CDS.a
| A% | T(U)% | G% | C% | GC% | A3S% | T(U)3S% | G3S% | C3S% | GC3S% | |
|---|---|---|---|---|---|---|---|---|---|---|
|
| -0.631 | -0.367 | 0.614 | 0.493 | 0.552 | -0.619 | -0.260NS | 0.054NS | 0.664 | 0.608 |
|
| 0.071NS | -0.014NS | -0.023NS | -0.233NS | -0.236NS | 0.017NS | 0.270NS | -0.357 | -0.093NS | -0.286NS |
Value in this table is the P-value of correlation analysis.
NS, non-significant (p>0.05).
0.01
p<0.01
Figure 4Cluster tree result of 41 TTSuV2 genes based on hierarchical cluster method.
Figure 5Phylogenetic tree of 41 TTSuV2 sequences and 29 TTSuV1 sequences.
● represents TTSuV2 and ○ represents TTSuV1.