| Literature DB >> 15461800 |
John Fernandes1, Qunfeng Dong, Bret Schneider, Darren J Morrow, Guo-Ling Nan, Volker Brendel, Virginia Walbot.
Abstract
Derived from the maize Mu1 transposon, RescueMu provides strategies for maize gene discovery and mutant phenotypic analysis. 9.92 Mb of gene-enriched sequences next to RescueMu insertion sites were co-assembled with expressed sequence tags and analyzed. Multiple plasmid recoveries identified probable germinal insertions and screening of RescueMu plasmid libraries identified plants containing probable germinal insertions. Although frequently recovered parental insertions and insertion hotspots reduce the efficiency of gene discovery per plasmid, RescueMu targets a large variety of genes and produces knockout mutants.Entities:
Mesh:
Substances:
Year: 2004 PMID: 15461800 PMCID: PMC545602 DOI: 10.1186/gb-2004-5-10-r82
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1Schematic diagram of RescueMu grid tagging and sequencing (RescueMu not to scale). Step 1: RescueMu is introduced into embryogenic callus followed by crossing of regenerated plants to active Mutator lines. Lines are screened for transposed RescueMu elements in plants lacking the original transgene array. Pollen from one RescueMu donor plant is crossed to multiple ears of a non-RescueMu line to generate tagging grids of up to 48 rows × 48 columns of trRescueMu plants in the field. Step 2: plant DNA prepared from pools of row or column leaves is used to generates transformed bacterial libraries of RescueMu plasmids. These are used as sequencing templates and for construction of a library plate representing the diverse insertion sites in grid plants. Step 3: genomic DNA is digested using two restriction enzymes (BamHI, BglII), religated into plasmids and transformed into E. coli. Step 4: after transformation, RescueMu plasmid-containing E. coli colonies are selected by plating onto carbenicillin agar plates and picked into 384-well plates with growth/freezing media. Overnight incubation is followed by a PCR reaction designed to amplify longer inserts with lengths up to 16 kb. Using the PCR product, eight 96-well sequencing plates (four for sequence from the left TSD and four from the right TSD) are created. Step 5: priming strategy and relative locations of PCR and sequencing primers within the RescueMu element. The sequencing reactions are read out from the TSDs to recover the germinal insert sequence. Although a BamHI and BglII double-restriction digest produces a shorter, easier-to-sequence insert length, it also increases the ambiguity in interpreting the sequence during analysis. Given successful sequencing in both directions, two GSS sequences may be submitted for every plasmid (sequence flanking the left and right TIRs). Two additional GSS sequences may be submitted for a plasmid when a BamHI, BglII or BamHI-BglII ligation site is encountered. Each of these occurrences yields sequence that was not necessarily contiguous in vivo. Dubious GSS sequences are designated with the suffix .1EL (re-created enzyme ligation site) or .2EL (re-created enzyme ligation of two restriction sites not encountered in vivo). Sequence flanking TIRs in vivo is submitted as GSS sequences with no suffix except the .x or .y (right or left) direction designation.
Grid organization and analysis of mutant phenotypes segregating among selfed progeny of grid plants
| Grid* | Year† | Grid size‡ (row × col) | Plasmid rescued | Libraries sequenced§ | Transposition frequency¶ | Independent mutations (% of families)¥ | |
| Seed | Seedling | ||||||
| A | 1999H | 34 × 48 | No | No | 0.07 | 7.2 | 4.5 |
| B | 1999SD | 52 × 48 | No | No | 0.10 | 8.6 | 10.1 |
| C | 1999B | 40 × 40 | No | No | 0.13 | 8.3 | 28.3 |
| D | 1999S | 48 × 48 | No | No | 0.26 | 8.7 | 15.1 |
| E | 2000H | 40 × 48 | No | No | 0.25 | 8.6 | 27.0 |
| F | 2000AZ | 41 × 41 | No | No | 0.57 | 6.6 | 19.5 |
| G | 2000S | 46 × 48 | Yes | Yes | 0.68 | 5.0 | 11.9 |
| H | 2000B | 38 × 36 | Yes | Yes | 0.62 | 7.5 | 6.9 |
| I | 2000B | 38 × 34 | Yes | Yes | 0.62 | 9.5 | 9.8 |
| J | 2000SD | 38 × 45 | Yes | Survey | 0.38 | 9.8 | 11.1 |
| K | 2001H | 30 × 30 | Yes | Yes | 0.66 | 8.0 | 20.3 |
| L | 2001H | 36 × 20 | Yes | Yes | 0.66 | 12.8 | 17.4 |
| M | 2001AZ | 40 × 40 | Yes | Partial | 1.30 | 8.2 | ND |
| N | 2001B | 32 × 44 | In progress | No | 0.20 | 6.3 | ND |
| O | 2001S | 47 × 48 | Yes | Survey | 0.50 | 5.2 | ND |
| P | 2002H | 48 × 48 | Yes | Yes | 1.40 | 5.9 | ND |
| Q | 2002H | 48 × 24 | Yes | Yes | 1.00 | 2.7 | ND |
| R | 2002AZ | 36 × 36 | Yes | Survey | 0.72 | 3.7 | ND |
| S | 2002SD | 48 × 48 | Yes | Survey | 1.00 | 12.7 | ND |
| T | 2002H | 48 × 46 | Yes | Survey | 1.00 | ND | ND |
| U | 2002H | 48 × 48 | Yes | Partial | >1.30 | ND | ND |
| AA | 2002S | 48 × 48 | Yes | Yes | 0.60 | ND | ND |
| BB | 2001B | 34 × 48 | Yes | Survey | 0.60 | 6.2 | ND |
| V | 2003AZ | 45 × 45 | In progress | Survey | 1.00 | ND | ND |
| X | 2003SD | 44 × 44 | In progress | Survey | 1.00 | ND | ND |
*Grids with a single letter contain mainly plants with a RescueMu pollen parent plus the seed from the ear of the founder male crossed by a non-Mutator line. In grids with a double letter, both parents contained RescueMu. †Summer nurseries are designated by year and location: A, Tucson, AZ; B, Berkeley, CA; SD, San Diego, CA; S, Stanford, CA. H indicates the winter Hawaii nursery. ‡Vandalism, animals, and environmental damage in the field resulted in some losses compared to expectation of the ear harvest. Ears with fewer than 100 kernels and those from outcross pollinations of male or female sterile grid plants were not assessed for mutation frequency; these lines are being propagated at the Maize Coop by sib pollination to establish a permanent line for later evaluation and distribution. §Yes, indicates that all rows plus four columns were sequenced with the goal of coverage to a depth such that there was a 80-95% probability that plasmids representing germinal insertions would be identified at least once. Grids listed as partial have limited (40%-80%) depth from some rows. Survey sequencing was performed on several rows and columns on the indicated grids to verify that plasmids organized into library plates contained authentic trRescueMu. Library plates will be available from all grids, including V and X, during 2004 as listed at [31]. ¶Frequency of newly transposed RescueMu per plant based on DNA blot hybridization, sampling 30-200 plants per grid. For grid A only, the data are from plants sibling to those in the grid. ¥Progeny families generated by self-pollination of grid plants were examined for kernel defects before shelling, and seedling traits were scored on up to 30 surviving individuals grown from each family. A minimum of 200 families were scored for the seedling forward-mutation frequency, and all selfed ears were scored for the seed defects. Mutations were scored as independent if they were not segregating in multiple families from the same founder. Phenotypic descriptions are available at [31], and it is expected that the grids not yet analyzed (ND) and the summer 2003 grids V, W, and X will be scored during 2004 and 2005 for reporting through the project database. Most mutations are caused by standard Mu elements.
Figure 2DNA blot hybridization analysis of trRescueMu elements in grid G. Total DNA was prepared from individual grid G plants in rows 1 and 5, as listed at the top of the lanes; these rows represent two ears crossed by the same founder RescueMu pollen source. DNA samples were digested with HindIII, a unique site 0.5 kb from the internal end of the left TIR of the RescueMu element, and the resulting gel blot was hybridized with an ampicillin-resistance gene fragment to visualize RescueMu. The two parental trRescueMu had been identified in the founder plant, and these size classes are marked along the right side of the autoradiogram. Hybridizing bands corresponding to new trRescueMu are indicated with a black square; the hybridizing band too small to be a full-length trRescueMu is marked with a white arrow. GP, grid G parental insertion sites 1 and 2 shown to be segregating in the progeny.
Probable germinal insertions (PGI) based on row and column matches
| Grid | Rows (r) | Columns (c) | Transposition frequency (τ)* | Expected PGI (e)† | Row + column matches (m) | Percentage of expected‡ | Transposition frequency (using row + column)§ |
| G | 46 | 4 | 0.68 | 125.1 | 149 | 119% | 0.81 |
| H¶ | 36 | 4 | 0.62 | 89.3 | 115 | 129% | 0.80 |
| I | 38 | 5 | 0.62 | 117.8 | 128 | 109% | 0.67 |
| K | 30 | 3 | 0.66 | 59.4 | 32 | 54% | 0.36 |
| M¶ | 40 | 3 | 1.30 | 156.0 | 33 | 21% | 0.28 |
| P | 37 | 4 | 1.40 | 207.2 | 71 | 34% | 0.48 |
| Total | 754.8 | 528 | 70% |
*Expected frequency of PGI was determined from DNA gel blot analysis of frequency of newly transposed RescueMu per plant as stated in Table 1; †expected = r × c × τ; ‡percentage of expected = 100 × m/e; §transposition frequency = m/(r × c); ¶for grids H and M, rows were considered columns and vice versa to simplify calculations.
Probable germinal insertions (PGI) based on multiply recovered plasmids
| Grid | Multiple recovery (m) | Single recovery (s) | Percentage PGI* | Expected frequency (τ)† | Expected PGI (e)‡ | Percentage of expected§ | Plasmids in multiple recoveries |
| G | 1,091 | 3,801 | 22% | 0.68 | 1,501 | 73% | 5,535 |
| H | 535 | 2,142 | 20% | 0.62 | 848 | 63% | 2,945 |
| I | 544 | 2,000 | 21% | 0.62 | 801 | 68% | 3,162 |
| K | 228 | 1,000 | 19% | 0.66 | 594 | 38% | 1,202 |
| M | 330 | 1,075 | 23% | 1.3 | 2,080 | 16% | 2,053 |
| P¶ | 410 | 1,731 | 19% | 1.4 | 2,486¶ | 16% | 2,342 |
| Total | 3,138 | 11,749 | 21% | 8,311 | 38% | 17,239 |
Single recoveries are also shown. *Percentage of PGI = m/(m + s); †expected frequency of PGI was taken from Table 1; ‡expected PGI = τ × rows × columns (see Table 2); §percentage of expected = 100 × m/e; ¶based on 37 rows only.
Figure 3RescueMu plasmid library plate screening for a gene with multiple insertion sites. (a) Schematic diagrams of a RescueMu insertion hotspot: demonstration of the assembly of flanking genomic sequences; locations and directions of all primers used in this study; EST alignment to genomic sequence assembly showing introns. (b) An ethidium bromide-stained agarose gel of the PCR products from columns 1 and 22 and row 42 plasmid libraries, using primer pair 2 + L. (c) An ethidium bromide-stained agarose gel of the PCR products with leaf DNA extracted from G42-22(x) progeny 1 to 8, using primer pair 3 + 6. (d) An ethidium bromide-stained agarose gel of the PCR products with the same DNA used in (c), except using primer pair 3 + L (column B is blank). (e) NcoI-digested DNA blot from plants 1 and 3 to 8 probed with a fragment spanning a 0.6-kb PCR product amplified with primer pair 1 + 5. (f) Phenotypes at several developmental stages (from left to right): 10-day-old seedlings (1 to 10 from left to right) of the G42-22(x) progeny; a side-by-side comparison of plants 5 and 6 at 10 days, including their root mass; adult plants at 1 month showing plant 5 in the foreground of the picture with two siblings on either side; a close-up of the plant 5 adult leaf phenotype.
Figure 4Functional spectrum of genes targeted by trRescueMu. Functional spectrum of probable proteins, identified by BLASTX of GSS contigs against the SPTR database, for trRescueMu targeted genes. Functional categories were derived from the Gene Ontology (GO) database.
Detailed analysis of insertion sites recovered multiple times
| Number of same-base insertions that occurred in the indicated number of grids | Number of contigs with the indicated number of different insertion sites | |||
| Grids | Insertions (N) | Percentage of total | Sites per contig | Contigs (N) |
| 1 | 572 | 90% | 1 | 48 |
| 2 | 60 | 9% | 2 | 71 |
| 3 | 6 | 1% | 3 | 89 |
| 4 | 1 | 0% | 4 | 32 |
| 5 | 7 | |||
| 6 | 2 | |||
| 7 | 1 | |||
| Total | 639 | 100% | Total | 250 |
Matching of RescueMu genomic loci to other available databases to determine percentage of genic and repeat loci
| Category | Number of genomic loci |
| Total maize genomic loci discovered | 14,265* |
| Number of genic loci identified by: | |
| Maize EST/cDNA | 7,555 |
| Plant EST/cDNA | 1,253 |
| Protein database | 84 (62%) |
| GENSCAN prediction | 708 |
| Number of genic loci† (percentage of total) | 9,600 (67.3%) |
| Number of loci matching repeats: | |
| Retrotransposon | 1,074 |
| DNA transposon | 212 |
| MITEs | 193 |
| Centromere-related repeats | 57 |
| Telomere-related repeats | 3 |
| Unknown repeats | 221 |
| Other repeats | 8 |
| 45S ribosomal DNA (18S + 28S) | 23 |
| 5S ribosomal DNA | 10 |
| Transfer DNA | 25 |
| Number of repeat loci‡ (percentage of total) | 1,113 (8%) |
*The 14,887 unique loci were collapsed into 14,265 unique loci by linking forward/reverse sequence pairs. This provides a more conservative estimate, but in some cases may have incorrectly combined separate loci. †Numbers are cumulative: that is, GSSs were first matched to maize EST/cDNAs, then the unmatched GSSs were screened against other plant EST/cDNAs, and so on. ‡Numbers are not cumulative: that is, some loci could match to both retrotransposon and DNA transposon sequences.
Single and multiple recovery of specific RescueMu insertion sites within the sequenced rows of grids G, H, I, K, M, and P
| Grid | Single recovery | Multiple recoveries | ||||||||
| 0* | 1† | 2 | 3 | 4 | 5-9 | 10-19 | 20+ | Total | ||
| G | 3,801 | 22 | 640 | 326 | 62 | 20 | 17 | 2 | 2 | 4,892 |
| H | 2,142 | 13 | 331 | 136 | 31 | 9 | 6 | 3 | 6 | 2,677 |
| I | 2,000 | 10 | 348 | 124 | 39 | 3 | 9 | 6 | 6 | 2,544 |
| K | 1,000 | 6 | 155 | 50 | 11 | 2 | 1 | 2 | 1 | 1,228 |
| M | 1,075 | 3 | 225 | 74 | 13 | 5 | 5 | 3 | 2 | 1,405 |
| P | 1,731 | 8 | 246 | 100 | 27 | 5 | 22 | 1 | 1 | 2,141 |
| All | 11,749 | 62 | 1,945 | 810 | 183 | 44 | 60 | 17 | 17 | 14,887 |
Counts are of contigs containing sequences from the indicated number of rows. *The zero class represents plasmids identified in column sequencing that were not identified in any row. † The 1 column data include singlet plasmids as well as plasmids recovered two or more times but within a single row.
Insertions found in at least two rows or columns among plants with both row and column sequence data
| Grid | Plants from sequenced rows + columns | Insertions found in 2+ rows or 2+ columns |
| G | 184 | 65 |
| H | 144 | 35 |
| I | 190 | 35 |
| K | 90 | 6 |
| M | 120 | 6 |
| P | 148 | 23 |
| Total | 876 | 177 |