| Literature DB >> 16381975 |
Jeffry L Shultz1, Deepak Kurunam, Kay Shopinski, M Javed Iqbal, Samreen Kazi, Kimberley Zobrist, Rabia Bashir, Satsuki Yaegashi, Nagajyothi Lavu, Ahmed J Afzal, Charles R Yesudas, M Abdelmajid Kassem, Chengcang Wu, Hong Bin Zhang, Christopher D Town, Khalid Meksem, David A Lightfoot.
Abstract
Genomes that have been highly conserved following increases in ploidy (by duplication or hybridization) like Glycine max (soybean) present challenges during genome analysis. At http://soybeangenome.siu.edu the Soybean Genome Database (SoyGD) genome browser has, since 2002, integrated and served the publicly available soybean physical map, bacterial artificial chromosome (BAC) fingerprint database and genetic map associated genomic data. The browser shows both build 3 and build 4 contiguous sets of clones (contigs) of the soybean physical map. Build 4 consisted of 2854 contigs that encompassed 1.05 Gb and 404 high-quality DNA markers that anchored 742 contigs. Many DNA markers anchored sets of 2-8 different contigs. Each contig in the set represented a homologous region of related sequences. GBrowse was adapted to show sets of homologous contigs at all potential anchor points, spread laterally and prevented from overlapping. About 8064 minimum tiling path (MTP2) clones provided 13,473 BAC end sequences (BES) to decorate the physical map. Analyses of BES placed 2111 gene models, 40 marker anchors and 1053 new microsatellite markers on the map. Estimated sequence tag probes from 201 low-copy gene families located 613 paralogs. The genome browser portal showed each data type as a separate track. Tetraploid, octoploid, diploid and homologous regions are shown clearly in relation to an integrated genetic and physical map.Entities:
Mesh:
Year: 2006 PMID: 16381975 PMCID: PMC1347413 DOI: 10.1093/nar/gkj050
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1Representation in SoyGD by GBrowse of a portion of the soybean genome from linkage group D1A+Q showing duplicated regions, multiple markers and multiple contigs per marker. (A) An 1 Mb region of build 4 SoyGD image with contigs, clones, markers sequences and related genes is shown. Contigs are shown as blue bars. Some of the BAC clones in the contigs are shown as green bars, some were cut off for brevity in this figure. Markers are shown as red arrowheads. BES and marker sequences are shown as magenta bars. Gene annotations are shown as purple lines (the five most probable at P < 10−5 are listed; only one is shown here). (B) Part of a 4 Mb view encompassing the same region, note that clones coalesce to single bars at this resolution to aid clone density estimates. Sequences and related genes are de-selected and thus hidden but ESTs, MTP4BH and BES-SSRs are selected in this view. EST hybridizing clones are shown as golden bars. MTP4BH BAC clones are shown as blue bars. BES derived microsatellite markers are shown as green arrowheads (BES-SSR). (C) A part of a 10 Mb view encompassing the same region. QTL are shown as red bars and can be viewed in relationship to other resources. QTL views in a 4–10 Mb window are most useful because the map accuracy for QTL of ∼10–25 cM is common. All sequences and features with entries at GenBank are linked to their gene index numbers by mouse over. Views ∼10 Mb can be arranged by contacting the administrator. In views above 10 Mb, sequence and related genes coalesce to single bars and are useful for gene density estimates and correlations with chromosome karyotpe.
Microsatellite markers were used to anchor BAC clones of three genotypes comprising the physical map builds 3 and 4 in SoyGD (the 86 RFLP markers used are not listed in the table)
| Genotype(s) | Marker numbers |
|---|---|
| Williams only | 67 |
| Faribault only | 45 |
| Forrest only | 66 |
| Faribault and Williams 82 | 80 |
| Faribault and Forrest | 11 |
| Williams 82 and Forrest | 12 |
| Faribault, Williams 82 and Forrest | 32 |
| Total integrated | 318 |
Progress in the soybean physical map (builds 2–4) shown in SoyGD
| Automated | Manual edit | Manual edit | |
|---|---|---|---|
| Build 2 | Build 3 | Build 4 | |
| September 2001 | August 2002 | October 2003 | |
| Total (Repeat) | |||
| BAC clones in FPC database | 81 024 | 78 001 | 78 001 |
| BACs used in contig assembly | 75 568 | 76 749 | 72 942 |
| Number of singletons | 5884 | 3702 | 27 810 |
| Marker anchored singletons | 0 | 0 | 120 |
| Clones in contigs (fold genome) | 69 684 | 73 047 | 45 130 |
| Fold genome in contigs | 8.7 | 9.1 | 5.6 |
| Number of contigs | 5597 | 2905 | 2854 (646) |
| Anchoring markers | 0 | 385 | 404 |
| Anchored contigs | 0 | 781 | 742 (181) |
| Q contigs | n/a | 1040 | 0 |
| Contigs that contained | |||
| >25 clones | 220 | 921 | 477 (268) |
| 10–25 clones | 3038 | 920 | 1458 (433) |
| 3–9 clones | 1845 | 850 | 820 (0) |
| 2 clones | 385 | 216 | 99 (0) |
| Unique bands within contigs | 396 843 | 345 457 | 258 240 (54 560) |
| Length of the contigs (Gb) | 1.667 | 1.451 | 1.037 (0.258) |
aAdjusted for 1252 ‘NoFp’ 0 band clones only (includes all clones with ≥1 band).
bAdjusted for all 0–4 band clones, contamination-suspect clones and most high band number (>65) clones.
cBased on 4.5 kb per unique band, for 2854 contigs containing ∼68 unique bands in 15 clones, 264 duplicated region contigs containing ∼68 unique bands in 30 clones, 15 840 unique bands, and 406 highly repeated region contigs containing ∼68 unique bands in 60 clones, 48 720 unique bands.
dBased on 4.0 kb per unique band.
Frequency of duplicated regions detected by anchors in SoyGD
| State | Anchors | Clones |
|---|---|---|
| One anchor 1 contig | 124 | 124 |
| One anchor 2 contigs | 110 | 220 |
| One anchor 3 contigs | 45 | 135 |
| One anchor 4 contigs | 31 | 124 |
| One anchor 5 contigs | 17 | 85 |
| One anchor 6 contigs | 4 | 24 |
| One anchor 7 contigs | 2 | 14 |
| One anchor 8 contigs | 2 | 16 |
| Total | 363 | 762 |
aIncludes 318 replicated high quality microsatellites and 45 potentially contaminated anchor markers.