| Literature DB >> 17020602 |
Chi-Yu Zhang1, Ji-Fu Wei, Shao-Heng He.
Abstract
BACKGROUND: It is believed that animal-to-human transmission of severe acute respiratory syndrome (SARS) coronavirus (CoV) is the cause of the SARS outbreak worldwide. The spike (S) protein is one of the best characterized proteins of SARS-CoV, which plays a key role in SARS-CoV overcoming species barrier and accomplishing interspecies transmission from animals to humans, suggesting that it may be the major target of selective pressure. However, the process of adaptive evolution of S protein and the exact positively selected sites associated with this process remain unknown.Entities:
Mesh:
Substances:
Year: 2006 PMID: 17020602 PMCID: PMC1609170 DOI: 10.1186/1471-2180-6-88
Source DB: PubMed Journal: BMC Microbiol ISSN: 1471-2180 Impact factor: 3.605
Likelihood values and parameter estimates for 45 S gene sequences from the whole epidemic of SARS from 2002 to 2004.
| M0 (one-ratio) | -5818.14 | 0.64 | ω = 0.64 | 51.91 | None |
| M3 (discrete) | -5792.19 | 0.69 | (13.28) | ||
| M1a (NearlyNeutral) | -5807.78 | 0.36 | 31.25 | Not allowed | |
| M2a (PositiveSelection) | -5792.16 | 0.69 | (9.21) | ||
| M7 (beta) | -5807.92 | 0.40 | 31.51 | Not allowed | |
| M8 (beta&v) | -5792.16 | 0.69 | (9.21) |
The values in parentheses represent the significant level of 0.01 with a χ2distribution at d.f. = 4 (M0 vs. M3) or 2 (M1a vs. M2a and M7 vs. M8). The amino acid sequence of strain PC4-13 was used as the sequence reference, and positive selected sites were identified with posterior probability p ≥ 0.9. In boldface, p ≥ 0.95.
Figure 1Phylogenetic tree of 47 S gene sequences from human patients and animals. The evolutionary process of S proteins during whole epidemic was simplified into three epidemic groups: 02–04 interspecies, 03-early-mid, and 03-late epidemic groups. Each group includes 15 unique S gene sequences after deleting all duplicate sequences (Table S1, see additional file 2). Two sequences isolated from bats were used as the outgroup in phylogenetic tree construction. The tree was constructed by the maximum likelihood method with 1000 bootstrap replicates using PHYML v2.4.4. Only the branch bootstrap values ≥ 50% are shown. PC, palm civet; HP, human patient.
Phylogenetic analysis by ML estimation for SARS S gene sequences from different epidemic groups.
| M0 (one-ratio) | -5339.90 | 0.64 | ω = 0.64 | 40.23 | Yes | |
| M3 (discrete) | -5319.78 | 0.77 | (13.28) | |||
| M1a (NearlyNeutral) | -5334.74 | 0.35 | 29.92 | Yes | ||
| M2a (PositiveSelection) | -5319.78 | 0.77 | (9.21) | |||
| M7 (beta) | -5336.08 | 0.30 | 32.23 | Yes | ||
| M8 (beta&v) | -5319.97 | 0.80 | (9.21) | |||
| M0 (one-ratio) | -5194.47 | 0.94 | ω = 0.94 | 24.25 | Yes | |
| M3 (discrete) | -5182.34 | 1.10 | (13.28) | |||
| M1a (NearlyNeutral) | -5192.6 9 | 0.45 | 20.70 | Yes | ||
| M2a (PositiveSelection) | -5182.34 | 1.10 | (9.21) | |||
| M7 (beta) | -5192.73 | 0.40 | 20.78 | Yes | ||
| M8 (beta&v) | -5182.34 | 1.10 | (9.21) | |||
| M0 (one-ratio) | -5121.66 | 0.26 | ω = 0.26 | NA | No | |
| M3 (discrete) | -5121.66 | 0.26 | (13.28) | |||
| M1a (NearlyNeutral) | -5121.66 | 0.26 | NA | No | ||
| M2a (PositiveSelection) | -5121.66 | 0.26 | (9.21) | |||
| M7 (beta) | -5121.66 | 0.25 | NA | No | ||
| M8 (beta&v) | -5121.68 | 0.26 | (9.21) |
The values in parentheses represent the significant level of 0.01 with a χ2distribution at d.f. = 4 (M0 vs. M3) or 2 (M1a vs. M2a and M7 vs. M8). NA, not applicable.
Positively selected sites identified by Codeml program and REL in DataMonkey package.
| Domains | |||
| 78 113 139 147 227 239 261 | 49 | None | |
| 701 714 | None | ||
Positively selected sites identified by program Codeml at the level of posterior probability ≥ 0.95 are shown in boldface. The underlines represent the sites locating in receptor-binding domain (RBD) (residues: 318–510) of S1 domain or in heptad repeat (HR)1 (residues: 889–972)-HR2 (residues: 1142–1185) region of S2 domain.
The number of positively selected sites in different functional domains of S protein.
| 6 | 1 | ||
| 1 | 4 | 0.045455 | |
| 0 | 3 | 0.033333 |
The Fisher's exact test was performed for HR1–HR2 vs. RBD, and HR2 vs. RBD, respectively.
Figure 2Amino acid variations of 45 S proteins. The amino acid sequence of CUHK-SU10 strain isolated from 03-late epidemic group was used as the reference sequence. The shadows represent the sites or residues under positive selection with posterior probability ≥ 0.9. The red sashes highlight three special sites where amino acids were replaced and fixed in all 03 human strains, suggesting selective sweep acting on the S gene.