| Literature DB >> 20035631 |
Dietmar Schwarz1, Hugh M Robertson, Jeffrey L Feder, Kranthi Varala, Matthew E Hudson, Gregory J Ragland, Daniel A Hahn, Stewart H Berlocher.
Abstract
BACKGROUND: The full power of modern genetics has been applied to the study of speciation in only a small handful of genetic model species--all of which speciated allopatrically. Here we report the first large expressed sequence tag (EST) study of a candidate for ecological sympatric speciation, the apple maggot Rhagoletis pomonella, using massively parallel pyrosequencing on the Roche 454-FLX platform. To maximize transcript diversity we created and sequenced separate libraries from larvae, pupae, adult heads, and headless adult bodies.Entities:
Mesh:
Substances:
Year: 2009 PMID: 20035631 PMCID: PMC2807884 DOI: 10.1186/1471-2164-10-633
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Host origin, treatment, and number of individuals used for construction of the four stage/tissue specific libraries.
| Stage/Tissue | Samples in each library | Host race | N |
|---|---|---|---|
| Larva | L2+L3, in fruit | Apple | 16 |
| L3, migrant | Apple | 6 | |
| Pupa | 3 days at 25°C | Hawthorn | 8 |
| 10 days at 25°C | Hawthorn | 8 | |
| 20 days at 25°C | Hawthorn | 8 | |
| 22 days at 25°C | Hawthorn | 8 | |
| 3 days at 4°C | Hawthorn | 6 | |
| 1 months at 4°C | Hawthorn | 6 | |
| 3 months at 4°C | Hawthorn | 8 | |
| 4 months at 4°C | Hawthorn | 8 | |
| 3 days at 25°C after diapause | Hawthorn | 8 | |
| 7 days at 25°C after diapause | Hawthorn | 8 | |
| 24 days at 25°C after diapause | Hawthorn | 8 | |
| 40 days at 25°C after diapause | Hawthorn | 8 | |
| Body (no head) | non-diapaused, 3 days after eclosion | Hawthorn | 20 |
| non-diapaused, 10 days after eclosion | Hawthorn | 16 | |
| Wild-caught | Apple | 20 | |
| Head | non-diapaused, 3 days after eclosion | Hawthorn | 98 |
| non-diapaused, 10 days after eclosion | Hawthorn | 91 | |
| wild caught | Apple | 51 |
Figure 1Annotated .
Figure 2A comparison of the distribution of ESTs across 14 major Biological Process GO sub-classes in our . The sub-categories are CS = cell communication (signalling), RP = Regulation of cellular physiological process, T = Transport, OB = Cell organization and biogenesis, M = Metabolism, RS = Response to stimulus, CA = Cell adhesion, CD = Cell death, R = Reproduction, CC = Cell cycle and division, H = Homeostasis, CM = Cell motility, D = Development, and GP = Cell growth, differentiation, and proliferation.
Contigs and single reads of Rhagoletis pomonella candidate chemoreceptors†.
| Match | CG | %I | bp | Read/Contig | Source | |
|---|---|---|---|---|---|---|
| OR 22c | 15377 | 33 | 51.0 | 116 | E7OMS0H04JIN70 | P |
| OR 43a | 1854 | 82 | 53.7 | 249 | E7OMS0H02EYJ4G | H |
| OR 49a | 13158 | 43 | 44.0 | 283 | EZ4BI6301E5Z3B* | B |
| OR 49b | 1758 | 76 | 65.8 | 231 | C11063 (2) | H |
| OR 83a | 10612 | 66 | 51.5 | 244 | E7OMS0H02EEYGV | H |
| OR 94a | 17241 | 34 | 52.0 | 134 | E7OMS0H01BOR2 M | B |
| OR 94b | 6679 | 58 | 43.1 | 214 | E7OMS0H04H6HYT | P |
| IR 25a | 15627 | 85 | 90.0 | 266 | EZ4BI6301FWMUT‡ | B |
| 78 | 77.0 | 280 | EZ4BI6301EI8 MK | B | ||
| IR 92a | 15685 | 38 | 76.0 | 247 | EZ4BI6301FTL6Z | B |
| GR 43a | 1712 | 80 | 60.0 | 242 | E7OMS0H02EBJ2S | H |
| 37 | 56.8 | 146 | EY1FUWY01BWJL1 | H | ||
| 19 | 85.0 | 290 | EZ4BI6301FFA7L | B | ||
| GR 64b | 32257 | 21 | 66.7 | 222 | E3CVG0K02EHCM3 | H |
†Contigs and reads matching the same D. melanogaster locus map to different regions of the Drosophila gene. Match is the D. melanogaster locus name for the closest match, CG is the Celera Genome number of the match, aa is the number of amino acids in the single read or contig, %I is the percent aa match between the R. pomonella and D. melanogaster homologous proteins, bp is the base pair length of the single read or contig, Read/Contig is the R. pomonella identifier in our data base, GenBank is the GenBank accession number, and So. is source (Larva, Pupa, Head, or Body). Number of reads assembled in contig is in parenthesis.
*Also matches an EST fragment of an OR from a congener, R. suavis (ABW80750.1) at 100% I; see also GenBank EU204908.1.
‡A second fragment, EZ4BI6301FZM98, was identical but shorter (contained entirely within EZ4BI6301FWMUT).
Contigs and single reads of Rhagoletis pomonella odorant binding proteins (OBPs) and other candidate transcripts for odor reception†.
| Match | ID | Aa | %I | bp | Read/Contig |
|---|---|---|---|---|---|
| OBP 19a | 11748 | 105 | 63.8 | 602 (15) | C10486 [EZ126705] |
| OBP 19b | 2297 | 120 | 41.6 | 578 (29) | C21814 [EZ138033] |
| OBP 44a | 2297 | 125 | 65. | 934 (159) | C21478 [EZ137697] |
| OBP 49a | 30052 | 40 | 50.0 | 213 (4) | C02098 [EZ118317 |
| OBP 50e | 13939 | 43 | 41.8 | 233 (3) | C15401 [EZ131620] |
| 51 | 49.0 | 264 | E7OMS0H01BS27Q | ||
| OBP 56a | 11797 | 94 | 27.6 | 436 (70) | C23516 [EZ139735] |
| OBP 56d | 11218 | 123 | 38.2 | 532 (35) | C00020 [EZ116239] |
| OBP 56 h | 13874 | 112 | 37.5 | 642 (73) | C22766 [EZ138985] |
| OBP 59a | 13517 | 47 | 63.8 | 215 | E7OMS0H02EEDPG |
| OBP 83cd | 15582 | 126 | 47.6 | 886 (20) | C20125 [EZ136344] |
| OBP 83ef | 31557 | 217 | 49.7 | 1395 (71) | C20870 [EZ137089] |
| OBP 83 g | 31558 | 59 | 57.6 | 474 (44) | C20023 [EZ136242] |
| OBP 99b | 7592 | 123 | 53.6 | 536 (244) | C19484 [EZ135703] |
| OBP 99c | 7584 | 139 | 57.5 | 759 (76) | C22673 [EZ138892] |
| OBP 99d | 15505 | 51 | 45.1 | 613 (10) | C02834 [EZ119053] |
| Pbprp 1* | 10436 | 42 | 50.0 | 286 (21) | C23956 [EZ140175] |
| 30 | 46.0 | 190 (11) | C22750 [EZ138969] | ||
| Pbprp 2 | 1668 | 150 | 25.0 | 800 (52) | C23271 [EZ139490] |
| Similar to Pbprp 2* | 1668 | 106 | 38.6 | 520 (73) | C14712 [EZ130931] |
| Pbprp 3* | 11421 | 18 | 72.0 | 241 (2) | C02940 [EZ119159] |
| 144 | 68.0 | 410 (4) | C08103 [EZ124322] | ||
| Pbprp 4* | 1176 | 124 | 54.8 | 737 (60) | C22809 [EZ139028] |
| Pbprp 5* | 6641 | 128 | 34.3 | 820 (136) | C22963 [EZ139182] |
| Similar to Pbprp 5* | 6641 | 63 | 44.4 | 351 (6) | C16946 [EZ133165] |
| Sensory neuron membrane protein 1 | 7000 | 81 | 75.0 | 257 | E7OMS0H01CAV8S |
| 92 | 73.0 | 276 (5) | C07451 [EZ123670] | ||
| G protein salpha 60A | 2835 | 274 | 93.0 | 1538 (30) | C08446 [124665] |
| Arrestin 2 | 5962 | 168 | 97.0 | 807 (91) | C15900 [EZ132119] |
| 56 | 89.0 | 265 | E7OMS0H02EWNVX | ||
| Arrestin 1 | 5711 | 260 | 92.0 | 1306 (79) | C00173 [EZ116392] |
| Pherokine 3 | 9358 | 113 | 66.0 | 442 (21) | C02839 [EZ119058] |
| Putative chemosensory protein CSP1 | 30172 | 93 | 75.0 | 485 (25) | C23468 [EZ139687] |
| Cytochrome P450 reductase | 11567 | 140 | 83.0 | 1253 (80) | C22056 [EZ138275] |
†Contigs and reads matching the same D. melanogaster locus map to different regions of the Drosophila gene. Match is the D. melanogaster locus name for the closest match, CG is the Celera Genome number of the match, aa is the number of amino acids in the single read or contig, %I is the percent aa match between the R. pomonella and D. melanogaster homologous proteins, bp is the base pair length of the single read or contig (number of sequences contributing to contig), Read/Contig is the R. pomonella ID in our data base and the GenBank TSA Accession number.
*Homologous sequence also found in the congener R. suavis.
Figure 3A comparison of the distribution of SNPs across 14 major Biological Process GO sub-classes in our . The sub-categories are CS = cell communication (signalling), RP = Regulation of cellular physiological process, T = Transport, OB = Cell organization and biogenesis, M = Metabolism, RS = Response to stimulus, CA = Cell adhesion, CD = Cell death, R = Reproduction, CC = Cell cycle and division, H = Homeostasis, CM = Cell motility, D = Development, and GP = Cell growth, differentiation, and proliferation. * = GO categories with slight, but statistically significant, under- or overrepresentation of SNPs (see text).
Sequences expressed primarily in larvae†.
| Contig | Total reads | Larval reads | % L reads | Match | Annotation |
|---|---|---|---|---|---|
| 17158 [EZ133377], 24176 [EZ140395], 9309 [125528] | 175 | 173 | 99 | CG32400 | Larval cuticular protein 65Ab1 |
| 17157 [EZ133376] | 38 | 37 | 97 | CG6956 | Larval cuticular protein 65Ac |
| 9308 [EZ125527], 9306 [EZ125525] | 177 | 171 | 97 | CG2044 | Larval cuticular protein 4 |
| 22477 [EZ138696] | 29 | 28 | 97 | CG15515 | Cuticle protein |
| 11180 [EZ127399] | 82 | 79 | 96 | CG9070 | Larval cuticular protein 2a |
| 21675 [EZ137894], 24313 [EZ140532] | 308 | 290 | 94 | CG8697 | Larval cuticle protein 2 |
| 17322 [EZ133541] | 34 | 32 | 94 | CG9077 | Cuticular protein 47Ec |
| 20182 [EZ136401] | 95 | 83 | 87 | CG8502 | Cuticular protein 49Ac |
| 23604 [EZ139823] | 36 | 36 | 100 | CG12385 | theta-Trypsin |
| 24013 [EZ140232] | 48 | 46 | 96 | CG12385 | theta-Trypsin |
| 10762 [EZ126981] | 89 | 85 | 96 | CG12385 | theta-Trypsin |
| 21083 [EZ137302] | 54 | 49 | 91 | CG17571 | trypsin-like serum protease |
| 9347 [EZ125566] | 120 | 108 | 90 | CG30028 | gammaTrypsin |
| 9544 [EZ125763] | 79 | 70 | 89 | CG12385 | theta Trypsin |
| 22598 [EZ138817] | 20 | 17 | 85 | CG12387 | zeta Trypsin |
†"Contig" is our assembly number followed by the GenBank TSA accession number, "total reads" is the total number of reads contributing to the contig, "larval reads" is the number of the total recovered from larvae, "%L" is the percent of reads from larvae, "Match" is the Celera Genome number of the best match with Drosophila melanogaster, and "Annotation" is a brief description of the match.