| Literature DB >> 20920202 |
Ming-Shun Chen1, Xuming Liu, Ziheng Yang, Huixian Zhao, Richard H Shukle, Jeffrey J Stuart, Scot Hulbert.
Abstract
BACKGROUND: In most protein-coding genes, greater sequence variation is observed in noncoding regions (introns and untranslated regions) than in coding regions due to selective constraints. During characterization of genes and transcripts encoding small secreted salivary gland proteins (SSSGPs) from the Hessian fly, we found exactly the opposite pattern of conservation in several families of genes: the non-coding regions were highly conserved, but the coding regions were highly variable.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20920202 PMCID: PMC2955719 DOI: 10.1186/1471-2148-10-296
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Genomic organization and structural comparison of the Hessian fly . A: SSSGP-1 family members derived by sequencing a BAC clone made from biotype GP. B: Nucleotide sequence comparison of the seven SSSGP-1 genes. Comparisons were divided into the promoter region (Promoter), 5'-untranslated region (5'-UTR), signal peptide coding region (SPCR), an intron, mature protein coding region (MPCR), and 3'-untranslated region (3'-UTR). The numbers in boxes are average scores and score range (in parentheses) derived from pair-wise comparisons of all possible combinations of the genes (see Materials and Methods). Red color indicates conserved regions. Blue color indicates diversified regions. The lowest score for any pairwise comparison in the MPCR was 13. Unrelated random sequences can produce scores as high as 15. The actual alignments of these genes are shown in Additional file 1, Figure S1A. C: Sequence alignments of putative proteins. Dashes represent gaps in the sequence alignments. The first 18 amino acids constitute a putative signal peptide.
Similarity of different regions among cDNAs from different gene groups
| Gene group | No. of Seq. | 5'-Untranslated region (5'-UTR) | Signal peptide coding region (SPCR) | Mature protein coding region (MPCR) | 3'-untranslated region (3'-UTR) | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 6 | 58 - 59 | 92 | 87 - 98 | 57 | 94 | 92 - 98 | 252 - 484 | 49 | 13 - 100 | 186 - 258 | 88 | 79 - 94 | |
| 15 | 6 - 198 | 84 | 52 - 100 | 60 | 87 | 78 - 95 | 258 - 462 | 10 | 3 - 57 | 77 - 269 | 81 | 57 - 92 | |
| 4 | 31 - 36 | 87 | 80 - 100 | 54 | 80 | 70 - 96 | 189 - 312 | 22 | 6 - 71 | 64 - 210 | 73 | 49 - 98 | |
| 7 | 37 - 47 | 94 | 83 - 100 | 75 | 80 | 58 - 93 | 360 - 636 | 10 | 4 - 27 | 121 - 222 | 52 | 14 - 82 | |
| 3 | 81 - 87 | 94 | 93 - 95 | 69 | 96 | 95 - 97 | 4 - 279 | 15 | 11 - 19 | 71 - 124 | 65 | 48 - 85 | |
| 2 | 24 - 25 | 87 | 87 | 63 | 95 | 95 | 249 - 255 | 79 | 79 | 57 - 60 | 91 | 91 | |
| 4 | 52 - 65 | 93 | 88 - 97 | 39 | 97 | 94 - 100 | 213 - 330 | 16 | 8 - 28 | 90 - 122 | 92 | 85 - 96 | |
Score average (AVE) and score range were derived by pair-wise comparison (Materials and Methods). Except for the two members from group SSSGP-6, the score average for MPCR is at least 40% less than those for other regions. The two members from group SSSGP-6 are likely relatively recent duplicates since they share an overall 83% sequence identity. The score average for these two members is 13% less than those for other regions.
Evidence for positive selection on SSSGP group members
| Group | cDNA pair | Codon | dN/dS | dN ± SE | dS ± SE |
|---|---|---|---|---|---|
| 1C1/1C2 | 85 | 1.32 | 0.11 ± 0.03 | 0.08 ± 0.04 | |
| G19B4/S21C6 | 122 | 9.43 | 0.12 ± 0.02 | 0.01 ± 0.01 | |
| G10H7/G14E6 | 129 | 3.59 | 0.08 ± 0.02 | 0.02 ± 0.01 | |
| L4C4/G28G12 | 78 | 1.12 | 0.22 ± 0.04 | 0.20 ± 0.07 | |
| G13E6/G22C4 | 127 | 1.21 | 0.10 ± 0.02 | 0.08 ± 0.04 | |
| G2C8/L4H12 | 173 | 1.46 | 0.14 ± 0.02 | 0.10 ± 0.03 | |
| G16C10/G29D6 | 156 | 2.49 | 0.26 ± 0.03 | 0.11 ± 0.03 | |
| G9B3/L1C12 | 179 | 1.52 | 0.13 ± 0.02 | 0.09 ± 0.03 | |
| G7H5/G8C4 | 81 | 2.92 | 0.31 ± 0.05 | 0.11 ± 0.05 | |
| G10C11/G15G6 | 119 | 1.48 | 0.11 ± 0.02 | 0.07 ± 0.03 | |
| G8C1/G14G1 | 75 | 2.51 | 0.18 ± 0.04 | 0.07 ± 0.03 | |
| G4C3/S22E4 | 162 | 5.67 | 0.04 ± 0.01 | 0.00 ± 0.00 | |
| G7D10/S4A9 | 100 | 1.82 | 0.06 ± 0.02 | 0.03 ± 0.03 | |
| S5E9/G9G9 | 41 | 2.51 | 0.11 ± 0.04 | 0.04 ± 0.04 | |
| S14G9/S22D12 | 63 | 18.8 | 0.19 ± 0.04 | 0.01 ± 0.01 | |
| L7H8/S8C9 | 123 | 1.84 | 0.04 ± 0.01 | 0.03 ± 0.02 | |
dN/dS ratio were obtained by comparing MPCRs using PAML42 [53].
Evidence for positive selection on different alleles (Additional file 6, Figure S6)
| Gene | Total cDNAs sequenced | Unique | Unique | Nonsyn/Syn | ||||
|---|---|---|---|---|---|---|---|---|
| 60 | 43 | 45 | 148 | 71 | 27 | 0.83 (5/6) | 1.56 (28/18) | |
| 106 | 24 | 36 | 166 | 66 | 35 | 0.80 (4/5) | 1.50 (21/14) | |
| 27 | 5 | 12 | 44 | 20 | 14 | 0.60 (3/5) | 2.33 (21/9) | |
| 157 | 34 | 23 | 214 | 96 | 47 | N/A | 0.26 (50/195) | |
cDNAs were derived from three Hessian fly populations: biotype GP, biotype L, and a Syrian population (S). As a control, cDNAs coding for 26 different ribosomal proteins (RPs) that were isolated along with SSSGP cDNAs were included in this analysis. Sequence alignments for different SSSGP cDNAs are shown in Additional file 6, Figure S6 whereas alignments for RP cDNAs are shown in Additional file 7, Figure S7. "Nonsyn/Syn" represents the ratios of non-synonymous against synonymous substitutions in SPCR and MPCR, respectively.
Figure 2Phylogenetic tree for different regions of .
Figure 3Distribution and abundance of transcripts corresponding to specific . A: Transcript distribution among tissues was determined using 3-day old biotype GP larvae. The remains after removing salivary glands, gut, and Malphigian tubules were designated as carcass. B: Transcript abundance in 0.5 to 12-day old larvae on susceptible wheat plants (cultivar 'Newton'). C: Transcript abundance in 0.5 to 4-day old (dying) larvae on resistant wheat (cultivar 'Molly' containing H13 R-gene). Primer pairs and methods are shown in Additional file 8, Table S1.