| Literature DB >> 19159479 |
Francesca D Frentiu1, Marcin Adamski, Elizabeth A McGraw, Mark W Blows, Stephen F Chenoweth.
Abstract
BACKGROUND: The native Australian fly Drosophila serrata belongs to the highly speciose montium subgroup of the melanogaster species group. It has recently emerged as an excellent model system with which to address a number of important questions, including the evolution of traits under sexual selection and traits involved in climatic adaptation along latitudinal gradients. Understanding the molecular genetic basis of such traits has been limited by a lack of genomic resources for this species. Here, we present the first expressed sequence tag (EST) collection for D. serrata that will enable the identification of genes underlying sexually-selected phenotypes and physiological responses to environmental change and may help resolve controversial phylogenetic relationships within the montium subgroup.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19159479 PMCID: PMC2653539 DOI: 10.1186/1471-2164-10-40
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Map of the east coast of Australia. Populations of D. serrata sampled in this study are indicated by filled in circles.
EST statistics
| Total number | Length (bases) | |||
| Min | Max | Average | ||
| Contigs (≥ 2 ESTs) | 1188 | 113 | 1502 | 795 |
| Contigs (singletons) | 5419* | 30 | 1054 | 575 |
| Contigs – peptides | 6009 | 60 | 1502 | 630 |
| Contigs – other# | 585 | 47 | 1411 | 467 |
The number and average length (bp) of contigs obtained from the D. serrata EST library.
* Includes singletons of high (CL0; N = 3639) and poor (CLx; N = 1780) quality. # Includes transposable elements, microRNAs (miRNA), noncoding RNAs (ncRNA), pseudogenes and transfer RNAs (tRNAs).
EST contigs annotated to genes of interest
| Gene | Contig ID | Length | Reference |
| Sexual selection and CHCs | |||
| Acp70A-1 | CL0Est000004995273P08 | 157 | [ |
| CL336Contig1 | 788 | [ | |
| CL36Contig1 | 1222 | [ | |
| CL1138Contig1 | 779 | [ | |
| CL67Contig1 | 774 | [ | |
| Protein, ejaculatory bulb (CG2668-PA) | CL833Contig1 | 745 | [ |
| Sperm protein | CL448Contig1 | 718 | - |
| Yolk protein 2 (CG2979-PA) | CL2Contig1 | 803 | [ |
| Yolk protein 3 | CL9Contig1 | 848 | [ |
| Climatic adaptation | |||
| CL134Contig1 | 730 | [ | |
| CL121Contig1 | 787 | [ | |
| CL0EST000004965173F20 | 503 | [ | |
| CL14Contig2 | 839 | [ | |
| CL106Contig1 | 783 | [ | |
| CL106Contig1 | 783 | [ | |
| CL753Contig1 | 727 | [ | |
| Phylogenetic | |||
| CL158Contig1 | 899 | [ | |
| CL110Contig1 | 768 | [ | |
| CL37Contig1 | 795 | [ | |
| Cytochrome oxidase subunit II (COII) | CL0Est000004981673F03 | 574 | [ |
| CL217Contig1 | 1010 | [ | |
| Rhodopsin (CG5638-PA) | CL0Est000004965473E08 | 434 | [ |
D. serrata contigs annotated to Drosophila genes of interest from sexual selection, climatic adaptation and phylogenetic studies. Contig identification codes are those used on the D. serrata web database .
Figure 2Distribution of GO terms. The five most frequently represented GO terms for each of three major gene functions in the D. serrata EST library, as indicated by the number of contigs in each category.
EST-derived microsatellite statistics
| Perfect repeats | Imperfect repeats (< 2 mismatches) | |||
| Repeat size | Number | Average length | Number | Average length |
| 2 | 295 | 18.87 (± 6.43) | 356 | 23.31 (± 9.22) |
| 3 | 95 | 20.58 (± 3.40) | 292 | 22.43 (± 11.61) |
| 4 | 3 | 29.00 (± 4.97) | 116 | 19.81 (± 8.42) |
| 5 | - | - | 129 | 17.99 (± 4.08) |
| 6 | 1 | 44.00 (± 0) | 123 | 23.98 (± 8.31) |
The number and average length of microsatellites (SSRs) found in the D. serrata EST library, based on 6,607 contigs. The minimum-repeat number for both analyses was six.
Figure 3SNP discovery versus alignment depth. Proportion of SNPs identified in the D. serrata EST database versus contig alignment depth, restricted to the dataset of 1,254 'double hit' high quality SNPs. The total number of contigs for each alignment depth class (number of ESTs/contig) is represented by the height of the column. The different shading patterns within a column indicate the number of SNP classes (e.g. 1–5 SNPs/contig). Most contigs were shallow in depth (4–8 ESTs/contig) and contained relatively few (1–5 SNPs) sequence variants per contig. Note that here alignment depth is not constant across all sites along the contig and instead denotes the total number of ESTs per contig.