| Literature DB >> 24832518 |
Michael A Gonzalez1, Derek Van Booven2, William Hulme3, Rick H Ulloa4, Rafael F Acosta Lebrigio5, Jeannette Osterloh6, Mary Logan7, Marc Freeman8, Stephan Zuchner9.
Abstract
Forward genetic screens in Drosophila melanogaster using ethyl methanesulfonate (EMS) mutagenesis are a powerful approach for identifying genes that modulate specific biological processes in an in vivo setting. The mapping of genes that contain randomly-induced point mutations has become more efficient in Drosophila thanks to the maturation and availability of many types of genetic tools. However, classic approaches to gene mapping are relatively slow and ultimately require extensive Sanger sequencing of candidate chromosomal loci. With the advent of new high-throughput sequencing techniques, it is increasingly efficient to directly re-sequence the whole genome of model organisms. This approach, in combination with traditional chromosomal mapping, has the potential to greatly simplify and accelerate mutation identification in mutants generated in EMS screens. Here we show that next-generation sequencing (NGS) is an accurate and efficient tool for high-throughput sequencing and mutation discovery in Drosophila melanogaster. As a test case, mutant strains of Drosophila that exhibited long-term survival of severed peripheral axons were identified in a forward EMS mutagenesis. All mutants were recessive and fell into a single lethal complementation group, which suggested that a single gene was responsible for the protective axon degenerative phenotype. Whole genome sequencing of these genomes identified the underlying gene ect4. To improve the process of genome wide mutation identification, we developed Genomes Management Application (GEM.app, https://genomics.med.miami.edu), a graphical online user interface to a custom query framework. Using a custom GEM.app query, we were able to identify that each mutant carried a unique non-sense mutation in the gene ect4 (dSarm), which was recently shown by Osterloh et al. to be essential for the activation of axonal degeneration. Our results demonstrate the current advantages and limitations of NGS in Drosophila and we introduce GEM.app as a simple yet powerful genomics analysis tool for the Drosophila community. At a current cost of <$1,000 per genome, NGS should thus become a standard gene discovery tool in EMS induced genetic forward screens.Entities:
Year: 2012 PMID: 24832518 PMCID: PMC4009818 DOI: 10.3390/biology1030766
Source DB: PubMed Journal: Biology (Basel) ISSN: 2079-7737
Whole genome sequencing metrics per sample.
| Sample | # of reads | # of reads aligned | % reads aligned | Avg depth | # of SNVs | # of high quality SNVs | # of NS/SS cSNV | # of NS/SS cSNV on 3L | Unique NS/SS cSNV on 3L |
|---|---|---|---|---|---|---|---|---|---|
| Background | 42,278,410 | 37,650,186 | 88.1 | 27.1 | 412,362 | 88,593 | 21,745 | 2,851 | 25 |
| Mutant 1 | 83,425,870 | 72,784,178 | 87.2 | 44.6 | 849,658 | 456,778 | 35,172 | 6,544 | 863 |
| Mutant 2 | 43,972,798 | 38,172,143 | 86.81 | 23.4 | 647,655 | 427,332 | 25,054 | 5,252 | 533 |
| Mutant 3 | 125,284,692 | 111,178,422 | 88.7 | 68.1 | 657,987 | 231,449 | 32,114 | 4,913 | 48 |
| Total in Mutants | 2,155,300 | 1,115,559 | 92,340 | 16,709 | 1,444 |
# - number; SNV – single nucleotide variant; cSNV – coding SNV; NS/SS – nonsynonymous/splice site; 3L – chromosome 3L.
Observed number of SNVs per million base pair (Mbp) in each chromosome.
| Chromsome | Variants per Mbp |
|---|---|
| 2L | 11023 |
| 2R | 9848 |
| 3L | 11043 |
| 3R | 8554 |
| 4 | 3331 |
| X | 8108 |
| Y | 72 |
Figure 1Distribution of detected SNVs not present in background on each chromosome.Purple bars represent the total number of SNVs that were observed in mutant genomes but not in the background genome. Green bars represent the number of SNVs that were G>A or C>T transitions. Red bars represent the number of nonsynonymous or splice-site SNVs that were not present in the wild-type genome and detected in the mutant genomes. Blue bars represent the number of nonsynonymous and splice-site SNVs that were G>A or C>T transitions.
Figure 2Filtering strategies used for gene identification by whole genome sequencing. Each box lists the number of genes with one or more protein-coding SNV (cSNV). Columns show the effect of requiring that one or more cSNV be observed in each of one to three mutant flies. Rows show the effect of filtering variants that meet certain analysis criteria. Row 6 displays the effect of including 8 samples from different screens in analysis.
Figure 3Mutation density on chromosome 3L in screen (N = 3). (A) Screenshot of chromosome 3L with custom tracks in the UCSC genome browser. The pink line displays all SNVs on chromosome 3L that were detected in this screen. The navy line displays all heterozygous non-synonymous and splice-site SNVs on chromosome 3L not present in the wild-type fly. (B) Screen shot of ect4/dSarm with custom track displaying the location and conservation of the three ect4 mutations.
Figure 4GEM.app bioinformatics pipeline and online graphical interface. (A) The systematic processes of the automated GEM.app bioinformatics pipeline. (B) Screenshot of the GEM.app form with some of the filtering options displayed. (C) The output of GEM.app query used to identify ect4 mutations. A subset of 34 total columns are shown. Links within the table (U, N, S, e) provide direct access to common resources, such as UCSC genome browser, NCBI, String gene networks, and Ensembl. Up to three tabs allow for parallel queries. Additional options include modifying filter settings, managing columns, and downloading results in an excel sheet.