| Literature DB >> 19930676 |
Sithichoke Tangphatsornruang1, Prakit Somta, Pichahpuk Uthaipaisanwong, Juntima Chanprasert, Duangjai Sangsrakru, Worapa Seehalak, Warunee Sommanas, Somvong Tragoonrung, Peerasak Srinives.
Abstract
BACKGROUND: Mungbean is an important economical crop in Asia. However, genomic research has lagged behind other crop species due to the lack of polymorphic DNA markers found in this crop. The objective of this work is to develop and characterize microsatellite or simple sequence repeat (SSR) markers from genome shotgun sequencing of mungbean. RESULT: We have generated and characterized a total of 470,024 genome shotgun sequences covering 100.5 Mb of the mungbean (Vigna radiata (L.) Wilczek) genome using 454 sequencing technology. We identified 1,493 SSR motifs that could be used as potential molecular markers. Among 192 tested primer pairs in 17 mungbean accessions, 60 loci revealed polymorphism with polymorphic information content (PIC) values ranging from 0.0555 to 0.6907 with an average of 0.2594. Majority of microsatellite markers were transferable in Vigna species, whereas transferability rates were only 22.90% and 24.43% in Phaseolus vulgaris and Glycine max, respectively. We also used 16 SSR loci to evaluate phylogenetic relationship of 35 genotypes of the Asian Vigna group. The genome survey sequences were further analyzed to search for gene content. The evidence suggested 1,542 gene fragments have been sequence tagged, that fell within intersected existing gene models and shared sequence homology with other proteins in the database. Furthermore, potential microRNAs that could regulate developmental stages and environmental responses were discovered from this dataset.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19930676 PMCID: PMC2788553 DOI: 10.1186/1471-2229-9-137
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
Distribution of identified SSRs using the Troll software according to SSR motif type and repeat number.
| Number of motif repeat | Di | Tri | Tetra | Penta | Hexa | Hepta | Octa |
|---|---|---|---|---|---|---|---|
| N/A | N/A | N/A | 97 | 45 | 14 | 2 | |
| N/A | N/A | 89 | 20 | 4 | 2 | 0 | |
| N/A | N/A | 23 | 4 | 3 | 1 | 0 | |
| N/A | 142 | 9 | 2 | 0 | 0 | 0 | |
| N/A | 75 | 1 | 0 | 0 | 0 | 0 | |
| N/A | 36 | 0 | 0 | 0 | 1 | 1 | |
| 137 | 17 | 1 | 1 | 1 | 0 | 1 | |
| 115 | 4 | 0 | 0 | 0 | 0 | 0 | |
| 80 | 3 | 0 | 0 | 0 | 0 | 0 | |
| 50 | 1 | 0 | 0 | 0 | 0 | 0 | |
| 59 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 50 | 2 | 0 | 0 | 0 | 0 | 0 | |
| 46 | 1 | 0 | 0 | 0 | 0 | 0 | |
| 45 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 49 | 1 | 0 | 0 | 0 | 0 | 0 | |
| 43 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 48 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 32 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 25 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 30 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 34 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 10 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 19 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 10 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 3 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 3 | 0 | 0 | 0 | 0 | 0 | 0 | |
| 1 | 0 | 0 | 0 | 0 | 0 | 0 | |
The table lists number of contigs containing SSRs, observed number of contigs from 454 data set, predicted number of contigs according to the Lander-Waterman model for sampling a completely non-repetitive genome and the repetitive sequences calculated using the differences between the observed number of contigs and the predictions.
| Number of reads in contigs | Predicted number of contigs by model | Observed number of contigs | repetitive reads (observed-predicted) | Observed number of contigs containing SSR |
|---|---|---|---|---|
| 2 | 59398 | 19622 | n/a | 87 |
| 3 | 10350 | 13576 | 9678 | 48 |
| 4 | 1803 | 6020 | 16868 | 16 |
| 5 | 314 | 2491 | 10885 | 6 |
| 6 | 55 | 1270 | 7290 | 10 |
| 7 | 10 | 765 | 5285 | 2 |
| 8 | 2 | 518 | 4128 | 3 |
| 9 | 1 | 363 | 3258 | 4 |
| 10 | 0 | 262 | 2620 | 1 |
| 11 | 0 | 216 | 2376 | 2 |
| 12 | 0 | 169 | 2028 | 1 |
| 13 | 0 | 149 | 1937 | 1 |
| 14 | 0 | 116 | 1624 | 1 |
| 15 | 0 | 115 | 1725 | 3 |
| ≥ 16 | 0 | 994 | 171708 | 7 |
| total | 241410 | 192 | ||
Figure 1Cross-species amplification of 127 mungbean microsatellite markers in various species from genus . Abbreviations are as followed: Vac = V. aconitifolia, Van = V. angularis var. angularis, Van (wild) = V. angularis var. nipponensis, Var = V. aridicola, Vex = V. exilis, Vgr = V. grandiflora, Vhi = V. hirtella, Vmi = V. minima, Vmu = V. mungo var. mungo, Vum(wild) = V. mungo var. sylvestris, Vna = V. nakashimae, Vne = V. nepalensis, Vra = V. radiata var. radiate, Vra(wild) = V. radiata var. sublobata, Vst = V. stipulacea, Vsu = V. subramaniana, Vte = V. tenuicaulis, Vtr = V. trilobata, Vum = V. umbellate, Vsn = V. subterranean, Vun-Ung = V. unguiculata cv-gr. Unguiculata, Vun-Ses = V. unguiculata cv-gr. Sesquipedalis, Pha = P. vulgaris and Gly = G. max.
Figure 2A dendrogram depicting genetic diversity and relationships among 35 genotypes from 20 taxa of Asian . Accession codes from the AVRDC-The world vegetable center and the National Institute of Agrobiological Sciences (Japan) are provided in brackets.
Figure 3Gene Ontology classification of the predicted mungbean ORFs according to molecular function (a) and biological process (b) using BLAST2GO [51]with E-6 cutoff.
Figure 4Comparison of mungbean ORFs with 8 other plant gene indices by tBLASTX (e-value cutoff = E-6). Blue bars represent mungbean contigs with similar homology search against other plant gene index databases including soybean (GMGI, 13.0), Arabidopsis (AGI, 13.0), rice (OGI, 17.0), M. truncatula (MTGI, 9.0) and Vitis vinifera (VVGI, 6.0).
Results from homology search of the mungbean microRNA candidates against the microRNA database.
| Read name | Contig | miRNA family | MFE (kcal/mol) | Target | Ref |
|---|---|---|---|---|---|
| E4UUDJH02I4UG8 | contig25352 | miR171 | -37 | SCARECROW-like protein | Reinhart et al., 2002 |
| E4UUDJH02HINHM | contig16040 | miR408 | -29 | plantacyanin | Sunkar and Zhu, 2004 |
| E4UUDJH01ECGTX | contig11544 | miR1171 | -25 | putative copper chaperone3 | Molnar et al., 2007 |
| E4UUDJH01AVSEX | contig25342 | miR414 | -26 | TIF3H1 | Fattash et al., 2007 |
microRNA families and their target genes are also present in the table.