| Literature DB >> 22802719 |
Luis C Belarmino1, Ana R da S Oliveira, Ana C Brasileiro-Vidal, Kyria C de A Bortoleti, João Pacífico Bezerra-Neto, Ricardo V Abdelnoor, Ana M Benko-Iseppon.
Abstract
Physical maps are important tools to uncover general chromosome structure as well as to compare different plant lineages and species, helping to elucidate genome structure, evolution and possibilities regarding synteny and colinearity. The increasing production of sequence data has opened an opportunity to link information from mapping studies to the underlying sequences. Genome browsers are invaluable platforms that provide access to these sequences, including tools for genome analysis, allowing the integration of multivariate information, and thus aiding to explain the emergence of complex genomes. The present work presents a tutorial regarding the use of genome browsers to develop targeted physical mapping, providing also a general overview and examples about the possibilities regarding the use of Fluorescent In Situ Hybridization (FISH) using bacterial artificial chromosomes (BAC), simple sequence repeats (SSR) and rDNA probes, highlighting the potential of such studies for map integration and comparative genetics. As a case study, the available genome of soybean was accessed to show how the physical and in silico distribution of such sequences may be compared at different levels. Such evaluations may also be complemented by the identification of sequences beyond the detection level of cytological methods, here using members of the aquaporin gene family as an example. The proposed approach highlights the complementation power of the combination of molecular cytogenetics and computational approaches for the anchoring of coding or repetitive sequences in plant genomes using available genome browsers, helping in the determination of sequence location, arrangement and number of repeats, and also filling gaps found in computational pseudochromosome assemblies.Entities:
Keywords: BAC; FISH; SSR; aquaporin; bioinformatics; gene families
Year: 2012 PMID: 22802719 PMCID: PMC3392886 DOI: 10.1590/S1415-47572012000200015
Source DB: PubMed Journal: Genet Mol Biol ISSN: 1415-4757 Impact factor: 1.771
Figure 1Distribution and microsyntenic relationships of 100 aquaporin coding genes in the soybean genome. Chromosomes are depicted with the centromere (in orange) in their expected position. Scale = 1 Mb. Tracks outside show the subfamily-based gene name. Not all gene names are written for image clarity. Microsyntenic relationships are shown as links between chromosome regions. Pink line = TIP genes; Purple line = PIP genes; Green line = NIP genes; Orange line = SIP genes.
Figure 2BAC selection scheme, using the SoyBase browser (a’). BACs were selected from a high exon density region (a”), considering their associated mapped markers and physical map locations (b). In the BAC selection, some aspects could be observed: (c) its position in the linkage map (cM), with its associated molecular markers and QTLs; (d) synteny regions with other species, and (e) recent and old duplicated regions in the soybean genome. Information sourced at http://soybase.org/gbrowse/cgi-bin/gbrowse/gmax1.01/.
Bacterial Artificial Chromosomes (BACs) containing the selected individual information on the exact position in the genome, repeat size in bp, exon density and the molecular markers linked to QTLs, species that are aligned with soybean [Tentative Consensus (TC) sequence, transcript, gene, transcript assembly and EST], synteny and duplications in the soybean genome.
| BAC | Localization | Size (bp) | Exon density | Markers (SSR) mapped/QTL associated | Aligned | Duplication | |
|---|---|---|---|---|---|---|---|
| Gm_WBc:0088G15 | Gm16: 27610047..27758800 | −149 | High | Satt622/ Yield increasing | Recent: | ||
| Gm_WBc:0102N16 | Gm16: 1231125..1382294 | −151 | High | Satt405/ Drought tolerance or plant height/ | Recent: |
BACs were ordered according to the University of Arizona (www.genome.arizona.edu/orders). Information sourced at (http://soybase.org/gbrowse/cgi-bin/gbrowse/gmax1.01/) in January 2011.
Gene models located in the BACs GM_WBc0088G15 and GM_WBc0102N16 through navigation in the soybean genome browser (http://soybase.org/gbrowse/cgi-bin/gbrowse/gmax1.01/) in January 2011.
| Gene models BAC | Protein of interest and/or means of action |
|---|---|
| GM_WBc0088G15 | |
| Glyma16g23750 | Heat shock protein binding |
| Glyma16g23760 | No information |
| Glyma16g23770 | No information |
| Glyma16g23780 | No information |
| Glyma16g23790 | Leucine Rich repeat containing protein (nucleic acid binding) |
| Glyma16g23800 | Leucine Rich repeat containing protein (protein binding) |
| Glyma16g23810 | No information |
| Glyma16g23820 | Alcohol Dehydrogenase related (oxidoreductase activity) |
| Glyma16g23830 | Uncharacterized (Putative methyltransferase) |
| Glyma16g23840 | Myb-like DNA-binding domain |
| Glyma16g23850 | AP endonuclease (zinc ion binding) |
| Glyma16g23870 | Calcium/Calmodulin dependent protein kinase related (EF hand) |
| Glyma16g23880 | FE(II)/ Ascorbate oxidase (oxidoreductase activity) |
| Glyma16g23890 | No information [gb def: MKIAA0431 protein (Fragment)] |
| Glyma16g23900 | No information |
| GM_WBc0102N16 | |
| FJ014811.1 | Clone cw129 leucine-rich repeat transmembrane protein kinase mRNA |
| FJ014812.1 | Clone cw130 leucine-rich repeat receptor-like kinase mRNA |
| Glyma16g01660 | Exocyst complex component Sec10 (vesicle docking) |
| Glyma16g01670 | No information |
| Glyma16g01680 | (ubiquitin-like-protein ligase activity) |
| Glyma16g01690 | No information |
| Glyma16g01700 | Zinc finger, C3HC4 type (RING finger) |
| Glyma16g01710 | Ring finger protein 11(NEDD4 WW DOMAIN-BINDING PROTEIN 2) |
| Glyma16g01730 | No information |
| Glyma16g01740 | No information |
| Glyma16g01750 | Protein-tyrosine kinase activity |
| Glyma16g01760 | Calcineurin-like phosphoesterase |
| Glyma16g01770 | DVL family |
| Glyma16g01780 | RNA binding protein (nucleic acid binding) |
| Glyma16g01790 | Serine threonine protein kinase, plant type (protein-tyrosine kinase activity) |
| Glyma16g01800 | NADH Dehydrogenase |
| Glyma16g01810 | 26S Protease Regulatory Subunit (ATP binding) |
| Glyma16g01820 | Uncharacterized conserved protein (DUF2343) |
| Glyma16g01830 | Integral to membrane |
| Glyma16g01840 | Protein of unknown function, DUF654 |
Figure 3In silico of (AAC)5 SSR oligonucleotide. (a) Anchoring by using the BLASTn algorithm at http://www.phytozome.com/search.php; (b) screen print of the UltraEdit text editor for organizing data sheets; (c) Microsoft Office Excel sheet for data handling; (d) sequence location in the soybean genome at http://soybase.org/gbrowse/cgi-bin/gbrowse/gmax1.01/.
Identified (AAC)5 repeats containing individual information on the exact position in the genome, repeat size in bp, associated genes, synteny and duplications in the soybean genome. “-” represents no identified or associated gene, function of gene, synteny and/or duplication site (information sourced at http://soybase.org/gbrowse/cgi-bin/gbrowse/gmax1.01/ in January 2011).
| Code | Localization | Size (bp) | Exon density | Gene | Function | Synteny | Duplication |
|---|---|---|---|---|---|---|---|
| (AAC)5- 1 | Gm01:5803004..5802969 | 35 | Moderate | Glyma01g05980 | Transcription factor | ||
| (AAC)5- 2 | Gm01:1783436..1783399 | 37 | High | Glyma01g02270 | - | - | |
| (AAC)5- 3 | Gm04:44935690..44935640 | 50 | High | Glyma04g38560 | No apical meristem (NAM) protein | ||
| (AAC)5- 4 | Gm04:46981082..46981048 | 34 | High | Glyma04g41120 | Secretory pathway protein | - | |
| (AAC)5- 5 | Gm05:29695067..29695108 | 41 | Moderate | Glyma05g23860 | Pentatricopeptide repeat -Containing protein | ||
| (AAC)5- 6 | Gm05:1164061..1163995 | 66 | High | - | - | - | |
| (AAC)5- 7 | Gm06:3929642..3929593 | 49 | High | - | - | - | |
| (AAC)5- 8 | Gm06:12312453..12312414 | 39 | High | Glyma06g15640 | WD domain, G-beta repeat | - | |
| (AAC)5- 9 | Gm06:3929642..3929608 | 34 | High | - | - | - | |
| (AAC)5- 10 | Gm06:13222198..13222239 | 41 | High | DQ822886.1 Glyma06g16820 | MYB transcription factor MYB56 (MYB56) mRNA | - | |
| (AAC)5- 11 | Gm07:4085718..4085773 | 55 | High | - | - | ||
| (AAC)5- 12 | Gm07:6265279..6265248 | 31 | High | - | - | - | - |
| (AAC)5- 13 | Gm07:8257096..8257142 | 46 | High | Glyma07g09800 | Fibronectin type III domain |
Figure 4Representation of the (AAC)5 SSR oligonucleotide in silico distribution, centromeric and pericentromeric regions of each Glycine max pseudochromosome, as well the in silico localization of BACs 88G15 and 102N16 (both on Gm16) and 5S and 45S rDNA (Gm19 and Gm13, respectively). Information sourced at http://soybase.org/gbrowse/cgi-bin/gbrowse/gmax1.01/.
Figure 5FISH analysis of Glycine max metaphase cells. (a) in situ hybridization with BACs 88G15 and 102N16 (both belonging to Gm16) stained with Cy3 and pseudocolored in yellow and red, respectively; (b) the (AAC)5 synthetic oligonucleotide as probe, colored in green. The chromosomes were counterstained with DAPI and pseudocolored in gray. Bar in b (for both pictures) corresponds to 10 μm.
Figure 6Comparative analysis of the genetic (cM*), cytogenetic (μm) and in silico (Mbp*) maps of Glycine max. (a) Ideogram showing the in situ mapping of BACs 88G15 (yellow) and 102N16 (orange) on chromosome 16, compared to the in silico mapping of pseudochromosome Gm16 and its corresponding linkage group J. The same chromosome Gm16 was used as standard for chromosome size and positioning of the linkage group. Lines indicate the distortion between cytogenetic and genetic distances as well as cytogenetic and in silico distances. (b) Complementary analysis of the (AAC)5 oligonucleotide distribution sites (green) and 45S rDNA site (red) by in silico and in situ analysis on the pseudochromosome and chromosome Gm13, respectively. *Information sourced at http://soybase.org/gbrowse/cgi-bin/gbrowse/gmax1.01/ in January 2011.