| Literature DB >> 17156445 |
Kay L Shopinski1,2, Muhammad J Iqbal3, Jeffry L Shultz1,4, Dheepakkumaran Jayaraman1, David A Lightfoot1.
Abstract
BACKGROUND: Genome analysis of soybean (Glycine max L.) has been complicated by its paleo-autopolyploid nature and conserved homeologous regions. Landmarks of expressed sequence tags (ESTs) located within a minimum tile path (MTP) of contiguous (contig) bacterial artificial chromosome (BAC) clones or radiation hybrid set can identify stress and defense related gene rich regions in the genome. A physical map of about 2,800 contigs and MTPs of 8,064 BAC clones encompass the soybean genome. That genome is being sequenced by whole genome shotgun methods so that reliable estimates of gene family size and gene locations will provide a useful tool for finishing. The aims here were to develop methods to anchor plant defense- and stress-related gene paralogues on the MTP derived from the soybean physical map, to identify gene rich regions and to correlate those with QTL for disease resistance.Entities:
Year: 2006 PMID: 17156445 PMCID: PMC1716159 DOI: 10.1186/1746-4811-2-20
Source DB: PubMed Journal: Plant Methods ISSN: 1746-4811 Impact factor: 4.993
Identification of the number of hybridizing colonies (BAC clones) and inferred significantly conserved paralogues (G) identified by probes (P) to 201 small gene families in soybean. MTP BAC DNA was hybridized to 166 EST probes from cv. Williams (Gm library) and 143 EST probes from cv. Forrest (Fi library). Hybridization stringency was about 25 C below Tm so paralogue detected were expected to share more than 75% nucleotide identity.
| Probe | Number of paralogues per gene family | ||||||||||||
| Library | total | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 13 | 15 |
| Fi-G | 334 | 34 | 40 | 42 | 36 | 25 | 30 | 35 | 16 | 18 | 30 | 13 | 15 |
| Gm-G | 279 | 36 | 58 | 21 | 48 | 20 | 30 | 14 | 16 | 0 | 10 | 26 | 0 |
| Total-G | 613 | 70 | 98 | 63 | 84 | 45 | 60 | 49 | 32 | 18 | 40 | 39 | 15 |
Identification of the number of EST clusters within individual BACs(G) identified by probes to 201 small gene families in soybean MTP BAC DNA using soybean 166 cv. Williams ESTs (Gm library) and 143 cv. Forrest ESTs (Fi library).
| Probe | Number of different gene families per BAC | ||||||||||||||
| Library | total | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 12 | 14 | 15 | 21 | 39 |
| Fi | 216 | 158 | 36 | 6 | 12 | 0 | 1 | 1 | 0 | 1 | 1 | 0 | 0 | 0 | 0 |
| Gm | 130 | 100 | 15 | 2 | 5 | 1 | 1 | 0 | 1 | 0 | 0 | 1 | 2 | 1 | 1 |
* 12 BACs hybridized to 2 different probes from 2 different libraries
Proportion of genes in different organisms present as either singletons or in paralogous families.
| Species | No of genes* | Unique gene families containing | |||||
| 1 | 2 | 3 | 4 | 5 | > 5 member | ||
| 1,587 | 88.8% | 6.8% | 2.3% | 0.7% | 0.0% | 1.4% | |
| 5,105 | 71.4% | 13.8% | 3.5% | 2.2% | 0.7% | 8.4% | |
| 10,736 | 72.5% | 8.5% | 3.4% | 1.9% | 1.6% | 12.1% | |
| 14,177 | 55.2% | 12.0% | 4.5% | 2.7% | 1.6% | 24.0% | |
| 11,601 | 12.5% | 7.0% | 4.4% | 3.6% | |||
| 201 | 10.4% | 4.5% | 15% | ||||
* The number of genes in the genomes of Haemophilus influenzae, S. cerevisiae, Drosophila, C. elegans, Arabidopsis and Glycine max that were present either as singletons or in gene families with two or more members were listed. To be grouped in a gene family, two genes had to show similarity exceeding a BLASTP value E,10 -20 and a FASTA alignment over at least 80%of the protein length or hybridize at 65 C and 0.1 M ionic strength (Tm -25). In column 1, the number of genes that were unique plus the number of gene families were listed. Columns 2 to 6 give the percentage of genes present as singletons or in gene families of n members (from TAIR 2000).
High copy number gene families detected in the MTP with EST probes. When an EST hybridize to a BAC clone on the MTP membrane, it is considered as positive for that EST.
| GenBank accession | Homology to the known genes | No. of BACs + ve for EST | EST source |
| Threonine synthase | 7 | Gm | |
| Calmodulin like protein | 7 | Gm | |
| Calcium dependent protein kinase | 8 | Gm | |
| Calmodulin-stimulated calcium ATPase | 8 | Gm | |
| Ascorbate peroxidase type 1 | 8 | Fi | |
| Ascorbate peroxidase type 2 | 8 | Fi | |
| Putative elongation factor 1B alpha-subunit | 9 | Fi | |
| EST | 9 | Fi | |
| 5.8S, 18S and 25S ribosomal RNA | 10 | Fi | |
| EST | 10 | Fi | |
| Putative water channel protein | 10 | Fi | |
| MAP kinase kinase alpha protein kinase | 10 | Gm | |
| Kinesin like protein A | 13 | Gm | |
| β-galactosidase. | 13 | Gm | |
| Ascorbate peroxidase type 3 | 13 | Fi | |
| Lipoxygenase | 15 | Fi |
Figure 1Methods for identifying gene families in the minimum tile path of BAC clones. Panel A): The strategy for probe pool designs. Panel B): An example of three positive clones from colony hybridizations to double spotted colonies from four 384 plates per location. Panel C): DNA from BAC clones identified from the MTP hybridizations were restriction digested with the restriction enzyme used for insertion, transferred to a membrane and hybridized to the EST probe. (a): The BAC clones B48B23, H59N09, and H76L07 (Lane 1, 2, and 3) were positive for the identified EST FiS1H9 (BI273631 a histone H2A orthologue) from the initial MTP hybridization. (b): DNA of BAC clones B23A05, B23C13, B38M08, B48M07, H15A05, H15A06, H20G14, H36D08, and H42K03 (Lane 1, 2, 3, 4, 5, 6, 7, 8, and 9 respectively) were hybridized to the EST Fi36H18 (BI347330 a translational elongation factor 1B-alpha 1 orthologue) identified from the initial MTP hybridization for second confirmation. The hybridization proved that 6 out of the 9 (Lane 2, 3, 4, 5, 7, and 8) initial positives were positive with a second Southern hybridization.
Correspondence between paralogue copy number A: Estimated from published Southern hybridizations and the colony hybridizations to the MTP. B: Comparison of paralogue number in MTP colony hybridization and by Southern hybridization.
| Probe/gene family | Southern estimate of copy number | Reference | MTP estimate of copy number |
| A: | |||
| G-box factor | 5–7 | Hong, et al. (1995) | 1 |
| Epoxide hydrolase | 5 | Arahira et al. (2000) | 1 |
| Chalcone synthase | 3–7 | Estabrook et al., (1991) | 1 |
| Phenylalanine ammonia lyase | 2–3 | Estabrook et al., (1991) | 1 |
| ATP synthase | 2–3 | Smith et al. (1994) | 3 |
| Aspartate aminotransferase 1 | 1–2 | Gebhardt et al. (1998) | 3 |
| Leghemoglobin. | 2 | Ji et al. (1994) | 3 |
| 4-coumarate CoA ligase 1. | 1–3 | Lindermayr et al. (2002) | 5 |
| Calmodulin | 4 | Lee et al. (1995) | 5 |
| Nodulin 22 | 4–5 | Sandal et al. (1987) | 5 |
| B: | |||
| EST BM499228 | 1 | This work | 1 |
| EST BI347333 | 1 | This work | 3 |
| EST BI273631 | 3 | This work | 3 |
| EST BI347330 | 6 | This work | 9 |
Figure 2SoyGD images showing the locations of ESTs in build 3. Panel A): A 'Forrest' physical map of Build 3 contig176 showing BAC clones, QTL and markers. The contig is located on linkage group G and contains SSR marker Sat_163. The BAC clone was positive for ESTs encoding a QM-family orthologue, two types of ascorbate oxidase (laccase, or diphenol oxidase paralogues) and an un-annotated EST. Panel B): A 'Forrest' physical map contig 1120 representing BAC clone H35I01. This contig was located on MLG H and contained SSR marker Sat_122. The BAC clone was positive for ESTs BI119568 and AI441021 homologous to calcium binding protein isolog and calcium dependent protein kinase.