| Literature DB >> 21629696 |
Abstract
Sites in a gene showing the nonsynonymous/synonymous rate ratio (ω) >1 have been frequently identified to be under positive selection. To examine the performance of such analysis, sites of the ω ratio >1 in the HA1 gene of H3N2 subtype human influenza viruses were identified from seven overlapping sequence data sets in this study. Our results showed that the sites of the ω ratio >1 were of significant variation among the data sets even though they targeted similar clusters, indicating that the analysis is likely to be either of low sensitivity or of low specificity in identifying sites under positive selection. Most (43/45) of the sites showing ω >1 calculated from at least one data set are involved in B-cell epitopes which cover less than a half sites in the protein, suggesting that the analysis is likely to be of low sensitivity rather than of low specificity. It was further found that the analysis sensitivity could not be enhanced by including more sequences or covering longer time intervals. Previously some reports also likely identified only a portion of the sites under positive selection in the viral gene using the ω ratio. Low sensitivity of the analysis may result from that some sites under positive selection in the gene are also under negative (purifying) selection simultaneously for functional constrains, and so their ω ratios could be <1. Theoretically, the sites under the two opposite selection forces at the same time favor only certain nonsynonymous changes, e.g. those changing the antigenicity of the gene and maintaining the gene function. This study also suggested that sometimes we can identify more sites under positive selection using the ω ratio by integrating the positively selected sites estimated from multiple data sets.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21629696 PMCID: PMC3101217 DOI: 10.1371/journal.pone.0019996
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Phylogenetic tree of 86 H3N2 human influenza viruses based on the whole-length sequences of the viral HA1 gene.
The tree was calculated using the maximum likelihood method, and bootstrap percentage values out of 1000 replicates are shown at the relevant nodes.
Sites under positive selection calculated using models M2, M3 and M8 from seven data sets.
| Data sets | Sequence background | Sites under positive selection | ||
| Calculated by model M2 | Calculated by model M3 | Calculated by model M8 | ||
| 1 | One sequence for each of the 43 years from 1968–2010 | 124, 137, 144, 145, 155, 159, 189, 193, 226 | 124, 137, 144, 145, 155, 159, 189, 193, 226 | 121, 124, 135, 137, 144, 145, 155, 159, 172, 189, 193, 226 |
| 2 | Another sequence for each of the 43 years from 1968–2010 | 2, 3, 50, 124, 133, 137, 144, 145, 155–159, 186, 189, 193, 196, 226 |
| 2, 3, 50, 82, 83, 121, 124, 133, 135, 137, 138, |
| 3 | Data set 1+data set 2 | 137, 138, 144, | 50, 121, 124, | 50, 121, 124, 137, 138, 144, |
| 4 | The part of data set 1 covering the years 1983–1997 | 121, 124, 133, 135, 138, 145, 159, 193, 226 |
| 121, |
| 5 | The part of data set 2 covering the years 1983–1997 | 94, 121, 124, 133, 135, 156, 159, 186, 196, 219, 226, 262 | 3, 67, 82, 88, 91, 94, | 3, 67, 82, 88, 91, 94, 121, 124, 133, 135, 145, 156, 159, 172, 186, 189, 193, 196, 219, 226, 246, 262, 276, 278, 299 |
| 6 | The part of data set 3 covering the years 1996–2010 | 50, 144, 157, 189, 226 | 50, 144, 157, 189, 226 | 45, 50, 82, 137, 144, 155, 157, 173, 189, 220, 226 |
| 7 | The part of data set 3 covering the years 1983–1997 | 121, 124, 133, 135, 145, 156, 159, 186, 193, 219, 226 | 94, | 121, 124, 133, 135, 138, |
Sites under positive selection with P>95% are indicated in bold type.
Distribution of the sites under positive selection in the HA1 gene of H3N2 influenza virus calculated by different analyses.
| Analyses | Sites under positive selection | |
| At BCEs | Elsewhere | |
| By combining the results of seven data sets using the ω ratio (this study) | 45, 50, 67, 82, 83, 88, 91, 94, 121, 124, 133, 135, 137, 138, 144, 145, 155–159, 163, 172, 173, 186, 188, 189, 193, 194, 196, 197, 213, 219, 220, 226, 246, 261, 262, 273, 276–279, 299 | 2, 3 |
| By Shih et al. not using the ω ratio | 50, 53, 54, 57, 62, 63, 67, 75, 78, 82, 83, 94, 121, 122, 124, 126, 131, 133, 135, 137, 142, 143, 144, 145, 146, 155, 156, 157, 158, 159, 160, 163, 172, 173, 186, 188, 189, 190, 192, 193, 196, 197, 207, 213, 217, 226, 227, 242, 244, 248, 260, 262, 275, 276, 278, 299, 307 | 2, 3, 25, 202, 222, 225 |
| By Suzuki et al. using the ω ratio | 138, 196, 226 | |
| By Suzuki using the ω ratio | 220, 229 | |
| By Bush et al. using the ω ratio | 121, 124, 128, 133, 135, 137, 138, 142, 145, 156, 158, 159, 182, 186, 190, 193, 194, 196, 197, 201, 219, 220, 226, 246, 262, 275, 276, 310, 312, | |
| By Fitch et al. using the ω ratio | 121, 133, 135, 137, 138, 145, 156, 186, 190, 193, 194, 196, 226, 276 | 1, 2, 3, |
| By Yang et al. using the ω ratio | 133, 135, 137, 138, 145, 156, 157, 159, 186, 193, 219, 226 | |
| By Plotkin et al. not using the ω ratio | 62, 83, 121, 124, 131, 133, 135, 142, 144, 145, 156, 157, 158, 172, 189, 190, 193, 196, 197, 220, 226, 262, 275, 276, 278, 299 | 2, 3, 5, 31, 112, 271 |
Sites 44–48, 50, 51, 53, 54, 57, 59, 62, 63, 67, 75, 78, 80–83, 86–88, 91, 92, 94, 96, 102, 103, 109, 117, 121, 122, 124, 126, 128–135, 137, 138, 140, 142–146, 150, 152, 155–160, 163–165, 167, 168, 170–177, 179, 182, 186–190, 192–194, 196–198, 201, 203, 207–209, 212–219, 220, 226–230, 238, 240, 242, 244, 246–248, 260, 261, 262, 265, 273, 275, 276, 278–280, 294, 297, 299, 300, 304, 305, 307–312 are considered to be at BCEs, and the other 197 sites in the gene are considered to be out of BCEs from the structure of the gene of a virus isolated in 1968 [16], [25].