| Literature DB >> 11114436 |
.
Abstract
The past few years have seen the development of powerful statistical methods for detecting adaptive molecular evolution. These methods compare synonymous and nonsynonymous substitution rates in protein-coding genes, and regard a nonsynonymous rate elevated above the synonymous rate as evidence for darwinian selection. Numerous cases of molecular adaptation are being identified in various systems from viruses to humans. Although previous analyses averaging rates over sites and time have little power, recent methods designed to detect positive selection at individual sites and lineages have been successful. Here, we summarize recent statistical methods for detecting molecular adaptation, and discuss their limitations and possible improvements.Entities:
Year: 2000 PMID: 11114436 PMCID: PMC7134603 DOI: 10.1016/s0169-5347(00)01994-7
Source DB: PubMed Journal: Trends Ecol Evol ISSN: 0169-5347 Impact factor: 17.712
Selected examples of protein-coding genes in which positive selection was detected by using the dN/dS ratio
| Gene | Organism | Refs |
|---|---|---|
| Class I chitinase gene | ||
| Colicin genes | ||
| Defensin genes | Rodents | |
| Fv1 | ||
| Immunoglobulin VH genes | Mammals | |
| MHC genes | Mammals | |
| Polygalacturonase inhibitor genes | Legume and dicots | |
| RH blood group and RH50 genes | Primates and rodents | |
| Ribonuclease genes | Primates | |
| Transferrin gene | Salmonid fishes | |
| Type I interferon-ω gene | Mammals | |
| α1-Proteinase inhibitor genes | Rodents | |
| Capsid gene | FMD virus | |
| Delta-antigen coding region | Hepatitis D virus | |
| Phages G4, | ||
| Envelope gene | HIV | |
| Pseudorabies virus | ||
| Hemagglutinin gene | Human influenza A virus | |
| Invasion plasmid antigen genes | ||
| Merozoite surface antigen-1 gene | ||
| Msp 1α | ||
| HIV | ||
| Outer membrane protein gene | ||
| Polygalacturonase genes | Fungal pathogens | |
| Porin protein 1 gene | Neisseria | |
| Murine coronavirus | ||
| Reovirus | ||
| Virulence determinant gene | ||
| 18-kDa fertilization protein gene | Abalone ( | |
| Acp26Aa | ||
| Androgen-binding protein gene | Rodents | |
| Bindin gene | Echinometra | |
| Egg-laying hormone genes | ||
| Ods homeobox gene | ||
| Pem homeobox gene | Rodents | |
| Protamine P1 gene | Primates | |
| Sperm lysin gene | Abalone ( | |
| S-Rnase gene | Rosaceae | |
| Sry gene | Primates | |
| k-casein gene | Bovids | |
| Lysozyme gene | Primates | |
| Conotoxin genes | ||
| Phospholipase A2 gene | ||
| ATP synthase F0 subunit gene | ||
| COX7A isoform genes | Primates | |
| COX4 gene | Primates | |
| Granulocyte-macrophage SF gene | Rodents | |
| Interleukin-3 gene | Primates | |
| Interleukin-4 gene | Rodents | |
| CDC6 | ||
| Growth hormone gene | Vertebrates | |
| Hemoglobin β-chain gene | Antarctic fishes | |
| Prostatein peptide C3 gene | Rat | |
Estimation of dN and dS between the human and orangutan α2-globin genes (142 codons)a
| Method and/or model | l | Refs | ||||||
|---|---|---|---|---|---|---|---|---|
| Nei and Gojobori | 1.0 | 109.4 | 316.6 | 0.0095 | 0.0569 | 0.168 | – | |
| Li | – | NA | NA | 0.0104 | 0.0517 | 0.201 | – | |
| Ina | 2.1 | 119.3 | 299.9 | 0.0101 | 0.0523 | 0.193 | – | |
| Yang and Nielsen | 6.1 | 61.7 | 367.3 | 0.0083 | 0.1065 | 0.078 | – | |
| (1) Fequal, k=1 | 1.0 | 108.5 | 317.5 | 0.0093 | 0.0557 | 0.167 | −633.67 | |
| (2) Fequal, k estimated | 3.0 | 124.6 | 301.4 | 0.0099 | 0.0480 | 0.206 | −632.47 | |
| (3) F1×4, | 1.0 | 129.1 | 296.9 | 0.0092 | 0.0671 | 0.137 | −612.40 | |
| (4) F1×4, | 3.9 | 137.1 | 288.9 | 0.0093 | 0.0624 | 0.149 | −610.48 | |
| (5) F3×4, | 1.0 | 63.2 | 362.8 | 0.0084 | 0.0973 | 0.087 | −560.76 | |
| (6) F3×4, | 5.4 | 60.6 | 365.4 | 0.0084 | 0.1061 | 0.079 | −557.85 | |
| (7) F61, | 1.0 | 58.3 | 367.7 | 0.0082 | 0.1145 | 0.072 | −501.39 | |
| (8) F61, | 5.3 | 55.3 | 370.7 | 0.0082 | 0.1237 | 0.066 | −498.61 | |
GenBank accession numbers are V00516 (human) and M12158 (orangutan).
Fequal, equal codon frequencies (=1/61) are assumed; F1×4, four nucleotide frequencies are used to calculate codon frequencies (3 free parameters); F3×4, nucleotide frequencies at three codon positions are used to calculate codon frequencies (9 free parameters); F61, all codon frequencies are used as free parameters (60 free parameters).
l is the log-likelihood value.
Fig. 1The identification of sites under positive selection from the sperm lysin genes of 25 abalone species. (a) Posterior probabilities for site classes with different ω ratios along the sequence. The lysin sequence of the red abalone (Haliotis rufescens) is shown below the x-axis. ML estimates under Model M3 (discrete) suggest three site classes with the ω ratios at ω0=0.085 (grey), ω1=0.911 (green) and ω2=3.065 (red), and with proportions p0=0.329, ρ1=0.402 and ρ2=0.269. These proportions are the prior probabilities (Box 1) that any site belongs to the three classes. The data (codon configurations in different species) at a site alter the prior probabilities dramatically, and thus the posterior probabilities might be different from the prior probabilities. For example, the posterior probabilities for Site 1 are 0.944, 0.056 and 0.000, and thus this site is likely to be under strong purifying selection. The posterior probabilities for Site 4 are 0.000, 0.000 and 1.000, and thus this site is almost certainly under diversifying selection. (b) Lysin crystal structure from the red abalone (Protein Data Bank file 1LIS), with sites coloured according to their most likely class inferred in (a). The structure starts from amino acid four (His) at the N-terminus, because the first three amino acids are unresolved. The five α-helices are indicated: α1 from amino acids 13 to 38, α2 from 44 to 74, α3 from 82 to 95, α4 from 99 to 107 and α5 from 116 to 123. Note that sites potentially under positive selection (red) are scattered all over the primary sequence but tend to cluster around the top and bottom of the crystal structure. Reproduced, with permission, from.