| Literature DB >> 23761803 |
Puji Lestari1, Kyujung Van, Jayern Lee, Yang Jae Kang, Suk-Ha Lee.
Abstract
Understanding several modes of duplication contributing on the present genome structure is getting an attention because it could be related to numerous agronomically important traits. Since soybean serves as a rich protein source for animal feeds and human consumption, breeding efforts in soybean have been directed toward enhancing seed protein content. The publicly available soybean sequences and its genomically featured elements facilitate comprehending of quantitative trait loci (QTL) for seed protein content in concordance with homeologous regions in soybean genome. Although parts of chromosome (Chr) 20 and Chr 10 showed synteny, QTLs for seed protein content present only on Chr 20. Using comparative analysis of gene contents in recently duplicated genomic regions harboring QTL for protein/oil content on Chrs 20 and 10, a total of 27 genes are present in duplicated regions of both Chrs. Notably, 4 tandem duplicates of the putative homeobox protein 22 (HB22) are present only on Chr 20 and this Medicago truncatula homolog expressed in endosperm at seed filling stage. These tandem duplicates could contribute on the protein/oil QTL of Chr 20. Our study suggests that non-shared gene contents within the duplicated genomic regions might lead to absence/presence of QTL related to protein/oil content.Entities:
Keywords: QTL; genome duplication; seed protein content; sequence divergence; soybean
Year: 2013 PMID: 23761803 PMCID: PMC3672674 DOI: 10.3389/fpls.2013.00176
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Figure 1Circular map showing homeologous relationships among 20 soybean chromosomes based on QTLs associated with protein and oil content in recently duplicated segments. Chr01–Chr20 denote chromosome number of the soybean genome. The scale of 0–50 kb represents physical locations. Gray ribbons represent as duplicated positions of the G. max genome after similar duplicated regions are grouped as bundles. Circular lines from the inner to the outer: 1, LOD of protein QTL (red); 2, R2 of protein QTL (blue); 3, LOD of oil QTL (red); 4, R2 of oil QTL (blue).
Gene divergence of duplicated regions between Chr 20 (24,867,385..28,878,629) and Chr 10 (30,286,648..34,294,718).
| − | Glyma10g23750 | Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein |
| − | Glyma10g23790 | Uricase/urate oxidase/nodulin 35, putative |
| − | Glyma10g23800 | Concanavalin A-like lectin protein kinase family protein |
| − | Glyma10g23810 | Exocyst subunit exo70 family protein A1 |
| − | Glyma10g23840 | Double Clp-N motif-containing P-loop nucleoside triphosphate hydrolases superfamily protein |
| − | Glyma10g23910 | Polynucleotidyl transferase, ribonuclease H-like superfamily protein |
| − | Glyma10g24030 | Glycosyl hydrolase superfamily protein |
| Glyma20g17960 | Glyma10g24060 | GTP-binding family protein |
| Glyma20g17990 | − | Urease accessory protein D |
| Glyma20g18010 | − | Pentatricopeptide (PPR) repeat-containing protein |
| Glyma20g18280 | − | Terpene synthase 03 |
| − | Glyma10g24080 | Expansin B2 |
| − | Glyma10g24100 | Double Clp-N motif-containing P-loop nucleoside triphosphate hydrolases superfamily protein |
| − | Glyma10g24120 | Expansin B2 |
| − | Glyma10g24190 | Leucine-rich repeat transmembrane protein kinase family protein |
| − | Glyma10g24200 | AT1G21280.1 |
| − | Glyma10g24270 | Gibberellin 2-oxidase |
| Glyma20g18290 | Glyma10g24340 | AMMECR1 family |
| Glyma20g18420 | − | F-box and associated interaction domains-containing protein |
| Glyma20g18440 | Glyma10g24350 | U2 small nuclear ribonucleoprotein A |
| Glyma20g18450 | − | Homeobox protein 22 |
| Glyma20g18460 | − | Homeobox protein 22 |
| Glyma20g18520 | − | Homeobox protein 22 |
| Glyma20g18540 | − | Homeobox protein 22 |
| Glyma20g18550 | Glyma10g24360 | P-loop containing nucleoside triphosphate hydrolases superfamily protein |
| − | Glyma10g24400 | P-loop containing nucleoside triphosphate hydrolases superfamily protein |
| Glyma20g18620 | Glyma10g24420 | P-loop containing nucleoside triphosphate hydrolases superfamily protein |
| Glyma20g18860 | Glyma10g24420 | P-loop containing nucleoside triphosphate hydrolases superfamily protein |
| − | Glyma10g24430 | Vesicle-associated membrane protein 726 |
| Glyma20g18870 | Glyma10g24540 | Protein kinase superfamily protein |
| Glyma20g18890 | Glyma10g24550 | Ankyrin repeat family protein |
| − | Glyma10g24570 | Ankyrin repeat family protein |
| Glyma20g18900 | Glyma10g24580 | RING/U-box superfamily protein |
| Glyma20g18970 | Glyma10g24580 | RING/U-box superfamily protein |
| Glyma20g18980 | Glyma10g24590 | Peroxisomal 3-ketoacyl-CoA thiolase 3 |
| Glyma20g18990 | − | hAT transposon superfamily |
| − | Glyma10g24600 | DZC (Disease resistance/zinc finger/chromosome condensation-like region) domain containing protein |
| Glyma20g19000 | Glyma10g24620 | Potassium channel beta subunit 1 |
| Glyma20g19200 | Glyma10g24630 | Pectin lyase-like superfamily protein |
| Glyma20g19210 | Glyma10g24650 | Inosine triphosphate pyrophosphatase family protein |
| Glyma20g19250 | − | pfkB-like carbohydrate kinase family protein |
| Glyma20g19470 | − | Modifier of rudimentary [Mod(r)] protein |
| − | Glyma10g24670 | P-loop containing nucleoside triphosphate hydrolases superfamily protein |
| − | Glyma10g24740 | Glyma10g24740.1 |
| − | Glyma10g25070 | Ferritin 4 |
| − | Glyma10g25340 | Plant U-Box 15 |
| Glyma20g19550 | Glyma10g25420 | Protein of unknown function (DUF3741) |
| Glyma20g19580 | − | Oligosaccharyltransferase complex/magnesium transporter family protein |
| Glyma20g19600 | − | Glyma20g19600.1 |
| Glyma20g19620 | − | Glyma20g19620.1 |
| − | Glyma10g25440 | Leucine-rich repeat receptor-like protein kinase family protein |
| Glyma20g19640 | Glyma10g25480 | GATA transcription factor 1 |
| − | Glyma10g25490 | Uncharacterized protein family (UPF0016) |
| − | Glyma10g25500 | Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein |
| Glyma20g19670 | Glyma10g25510 | AT3G22520.1 |
| Glyma20g19680 | − | HSP20-like chaperones superfamily protein |
| Glyma20g19710 | − | Hydroxyethylthiazole kinase family protein |
| − | Glyma10g25550 | Hydroxyethylthiazole kinase family protein |
| Glyma20g19720 | Glyma10g25560 | Disease resistance-responsive (dirigent-like protein) family protein |
| Glyma20g19920 | − | Disease resistance-responsive (dirigent-like protein) family protein |
| Glyma20g19930 | Glyma10g25560 | Disease resistance-responsive (dirigent-like protein) family protein |
| Glyma20g19940 | − | Glyma20g19940.1 |
| − | Glyma10g25570 | Disease resistance-responsive (dirigent-like protein) family protein |
| Glyma20g19970 | Glyma10g25620 | RING/FYVE/PHD zinc finger superfamily protein |
| Glyma20g19980 | Glyma10g25630 | Heat shock protein 60 |
| − | Glyma10g25640 | AT5G41980.1 |
| Glyma20g20010 | Glyma10g25670 | GRIM-19 protein |
| − | Glyma10g25680 | HVA22 homologue E |
| Glyma20g20040 | Glyma10g25690 | BLISTER |
| Glyma20g20050 | Glyma10g25700 | BLISTER |
| Glyma20g20070 | Glyma10g25710 | Coenzyme F420 hydrogenase family/dehydrogenase, beta subunit family |
| − | Glyma10g25750 | AT2G29880.1 |
| Glyma20g20180 | Glyma10g25760 | YUCCA 3 |
| Glyma20g20280 | Glyma10g25760 | YUCCA 3 |
| Glyma20g20300 | − | Protein kinase superfamily protein |
| − | Glyma10g25790 | Beta-1,2-N-acetylglucosaminyltransferase II |
| − | Glyma10g25800 | Disease resistance family protein / LRR family protein |
| − | Glyma10g26100 | Glyma10g26100.1 |
| − | Glyma10g26120 | ROTUNDIFOLIA like 8 |
| − | Glyma10g26150 | Ubiquitin C-terminal hydrolase 3 |
| − | Glyma10g26160 | Disease resistance family protein/LRR family protein |
Indicates no G. max gene presents on the chromosome.
Figure 2Comparison of two soybean chromosomes. (A) Syntenic dot plot of soybean Chr 10 and Chr 20 [adapted from Cannon and Shoemaker (2012)]. Each dot represents homology of the predicted synteny between two chromosomes. A large inversion of synteny in the upper right quadrant is indicated by a line of homology dots that slope down to the right. A positive correlation based on synteny between two chromosomes is denoted by a discontinuous line with a slope moving up to the right. Red square denotes a determined region for alignment algorithm analysis. (B) The conserved blocks in the recently duplicated regions of Chr 10 and Chr 20 are based on the alignment of large genomic regions using LastZ algorithm, which is available at http://genomevolution.org/r/56zj. (C) Conserved genes in the recently duplicated regions of Chr 10 and Chr 20 based on predicted transcripts using Genome Threader algorithm (spliced alignment of peptides) at http://genomevolution.org/r/56zh.