| Literature DB >> 23986765 |
Dong Hyun Kim1, Swathi Parupalli, Sarwar Azam, Suk-Ha Lee, Rajeev K Varshney.
Abstract
Legumes play an important role as food and forage crops in international agriculture especially in developing countries. Legumes have a unique biological process called nitrogen fixation (NF) by which they convert atmospheric nitrogen to ammonia. Although legume genomes have undergone polyploidization, duplication and divergence, NF-related genes, because of their essential functional role for legumes, might have remained conserved. To understand the relationship of divergence and evolutionary processes in legumes, this study analyzes orthologs and paralogs for selected 20 NF-related genes by using comparative genomic approaches in six legumes i.e., Medicago truncatula (Mt), Cicer arietinum, Lotus japonicus, Cajanus cajan (Cc), Phaseolus vulgaris (Pv), and Glycine max (Gm). Subsequently, sequence distances, numbers of synonymous substitutions per synonymous site (Ks) and non-synonymous substitutions per non-synonymous site (Ka) between orthologs and paralogs were calculated and compared across legumes. These analyses suggest the closest relationship between Gm and Cc and the highest distance between Mt and Pv in six legumes. Ks proportional plots clearly showed ancient genome duplication in all legumes, whole genome duplication event in Gm and also speciation pattern in different legumes. This study also reports some interesting observations e.g., no peak at Ks 0.4 in Gm-Gm, location of two independent genes next to each other in Mt and low Ks values for outparalogs for three genes as compared to other 12 genes. In summary, this study underlines the importance of NF-related genes and provides important insights in genome organization and evolutionary aspects of six legume species analyzed.Entities:
Keywords: Ks; comparative analysis; evolution; legume; nitrogen fixation
Year: 2013 PMID: 23986765 PMCID: PMC3749373 DOI: 10.3389/fpls.2013.00300
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
List of 20 NF-related genes analyzed in six legume species.
| Medtr2g005620 | Ca_00033 | Lj6.CM0508.260.r2.m | C.cajan_17266 | Phvulv091019046m | ||
| C.cajan_11017 | ||||||
| Medtr5g032400 | Ca_11537 | Lj2.CM0177.340.r2.m | C.cajan_12295 | Phvulv091027352m | ||
| Ca_17066 | ||||||
| Medtr8g047760 | Ca_15707 | Lj3.LjT02O17.60.r2.m | C.cajan_46131 | Phvulv091013422m | Glyma15g35070 | |
| Medtr5g009940 | Glyma08g24360 | |||||
| Glyma10g11020 | ||||||
| Medtr7g102550 | Ca_08232 | Lj1.CM0104.2670.r2.m | C.cajan_08385 | Phvulv091004951m | ||
| Medtr6g031080 | C.cajan_16144 | |||||
| Medtr6g015110 | Ca_08582 | Lj4.CM0046.750.r2.a | C.cajan_23330 | Phvulv091030938m | ||
| Medtr4g134350 | ||||||
| Medtr3g137870 | C.cajan_09162 | Phvulv091009868m | Glyma06g06930 | |||
| Medtr1g099720 | Glyma04g06830 | |||||
| Medtr5g027010 | Ca_10616 | Lj2.CM0803.150.r2.m | C.cajan_12408 | Phvulv091016359m | ||
| Medtr1g112060 | Ca_08341 | Lj5.CM0909.400.r2.m | C.cajan_22455 | Phvulv091027173m | Glyma10g33851 | |
| Medtr5g093450 | Ca_10278 | Lj2.CM0545.250.r2.m | C.cajan_09999 | Phvulv091021871m | ||
| Medtr5g093440 | Lj6.CM0041.460.r2.a | C.cajan_15801 | ||||
| Medtr5g093730 | ||||||
| Medtr5g093410 | ||||||
| Medtr5g019000 | Ca_02085 | Lj2.CM0323.420.r2.d | C.cajan_12623 | Phvulv091008254m | ||
| Medtr5g018990 | Ca_02086 | Lj2.CM0323.400.r2.d | C.cajan_12621 | Phvulv091008306m | ||
| Medtr8g093910 | Ca_16029 | |||||
| Medtr5g106690 | Ca_09832 | Lj2.CM0102.250.r2.m | C.cajan_33924 | Phvulv091031090m | ||
| C.cajan_37712 | Phvulv091004689m | |||||
| Medtr5g093170 | Ca_10291 | Lj2.CM0826.350.r2.m | C.cajan_09986 | Phvulv091021785m | ||
| Lj2.CM0826.370.r2.m | ||||||
| Lj2.CM0545.330.r2.m | ||||||
| Medtr8g025000 | Ca_10004 | Lj3.CM0416.1260.r2.d | C.cajan_27701 | Phvulv091018505m | ||
| Medtr5g015580 | Phvulv091030806m | |||||
| Medtr8g101580 | Phvulv091007340m | |||||
| Medtr3g097800 | Ca_26279 | Lj1.CM1976.90.r2.m | C.cajan_01355 | Phvulv091012665m | ||
| Medtr5g065380 | Ca_23494 | C.cajan_32376 | ||||
| Medtr1g074280 | Ca_26056 | Lj5.CM1077.650.r2.m | C.cajan_33337 | Phvulv091005582m | ||
| Ca_19055 | ||||||
| Medtr7g121800 | Ca_12043 | Lj1.CM0012.1100.r2.m | C.cajan_45110 | Phvulv091008769m | Glyma03g33850 | |
| Medtr4g096420 | Ca_15399 | Lj3.CM0091.1690.r2.m | C.cajan_21258 | Phvulv091015304m | ||
| Medtr4g096400 | Ca_09375 | C.cajan_24880 | ||||
| C.cajan_39327 | ||||||
| Medtr1g146810 | Ca_13985 | Lj5.CM0492.390.r2.m | C.cajan_07899 | Phvulv091005854m | ||
| Ca_26114 | Lj1.CM0001.650.r2.m | C.cajan_37827 | ||||
| Lj1.CM0001.690.r2.m | C.cajan_46126 | |||||
| Lj1.CM0001.710.r2.m | ||||||
| Medtr8g119590 | Ca_06646 | C.cajan_46055 | Phvulv091017136m | Glyma06g24760 | ||
| C.cajan_26197 | ||||||
| C.cajan_26199 | ||||||
Orthologs by bidirectional best hit and OrthoMCL were mentioned on the top of the genes listed for each species except GmENOD93 (Medtr8g119590 and Ca_06646).
Gene names as per the research articles in which these genes were cloned and published.
Gene name of Lj were changed from chr to Lj for convenience.
Underlined genes of Gm were present in syntenic regions.
(Ané et al., 2004),
(Endre et al., 2002),
(Lévy et al., 2004),
(Middleton et al., 2007),
(Andriankaja et al., 2007),
(Haney and Long, 2010),
(Messinese et al., 2007),
(Kiss et al., 2009),
(Smit et al., 2007),
(Arrighi et al., 2006),
(Marsh et al., 2007),
(Morère-Le Paven et al., 2011),
(Hirsch et al., 2009),
(Arrighi et al., 2008),
(Penmetsa et al., 2008),
(Elise et al., 2005),
(Kouchi and Hata, 1995),
(Kouchi and Hata, 1993).
Figure 1Comparative genome location of 16 genes in six legume species. Chromosome/pseudomolecule/linkage group have been shown by arc. Legume species have shown in different colors: Mt-red, Ca-orange, Lj-yellow, Cc-green, Pv-blue, and Gm-dark blue, purple. Gap indicates that the gene is on unmapped scaffold.
Figure 2Phylogenetic trees based on sequence data using the neighbor-joining method. The percentage of replicate trees in which the associated taxa clustered together in the bootstrap test (1000 replicates) are shown above the branches (A). ERN3 has one cluster. (B). NSP1 has two clusters.
Figure 3Ks proportional plot based on sequence data for 20 NF-related genes. This plot shows peaks based on Ks values between Mt and other five legume species and one peak based on Ks values of inparalogs of Gm (Gm-Gm). The plots shows three types of peaks: (i) the first type of peak observed at Ks 0.1 indicates recent genome duplication in Gm, (ii) the second type of peaks observed at or near Ks 0.4 indicate speciation, and (iii) the third type of peaks observed at Ks 0.6 or Ks 0.7 correspond to ancient genome duplication.