| Literature DB >> 28973974 |
Madhav P Nepal1, Ethan J Andersen2, Surendra Neupane3, Benjamin V Benson4.
Abstract
Disease resistance genes (R genes), as part of the plant defense system, have coevolved with corresponding pathogen molecules. The main objectives of this project were to identify non-Toll interleukin receptor, nucleotide-binding site, leucine-rich repeat (nTNL) genes and elucidate their evolutionary divergence across six plant genomes. Using reference sequences from Arabidopsis, we investigated nTNL orthologs in the genomes of common bean, Medicago, soybean, poplar, and rice. We used Hidden Markov Models for sequence identification, performed model-based phylogenetic analyses, visualized chromosomal positioning, inferred gene clustering, and assessed gene expression profiles. We analyzed 908 nTNL R genes in the genomes of the six plant species, and classified them into 12 subgroups based on the presence of coiled-coil (CC), nucleotide binding site (NBS), leucine rich repeat (LRR), resistance to Powdery mildew 8 (RPW8), and BED type zinc finger domains. Traditionally classified CC-NBS-LRR (CNL) genes were nested into four clades (CNL A-D) often with abundant, well-supported homogeneous subclades of Type-II R genes. CNL-D members were absent in rice, indicating a unique R gene retention pattern in the rice genome. Genomes from Arabidopsis, common bean, poplar and soybean had one chromosome without any CNL R genes. Medicago and Arabidopsis had the highest and lowest number of gene clusters, respectively. Gene expression analyses suggested unique patterns of expression for each of the CNL clades. Differential gene expression patterns of the nTNL genes were often found to correlate with number of introns and GC content, suggesting structural and functional divergence.Entities:
Keywords: NBS-LRR; R genes; evolutionary divergence; gene clustering; gene duplication; nucleotide-binding site, leucine-rich repeat; plant defense
Year: 2017 PMID: 28973974 PMCID: PMC5664099 DOI: 10.3390/genes8100249
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Domestication history and genomic information of Arabidopsis [32], Soybean [33], Rice [34], Medicago [35], Common bean [36] and Poplar [37].
| Plant Species | Domestication (Years Ago) | Native Range | Chromosome Number ( | Genome Size |
|---|---|---|---|---|
|
| None | Eurasia (adaptive elsewhere) | 5 | 135 MB |
|
| 6000–9000 | Central or Northern China | 20 | 1100 MB |
|
| 11,000–12,000 | Southern China | 12 | 383 MB |
|
| None | Mediterranean basin | 8 | 465 MB |
|
| 4400 | Americas (Argentina, Guatemala) | 11 | 650 MB |
|
| Literature unclear before 1977 | North Western United States ranging from Alaska to California and as far east as Western North Dakota | 19 | 423 MB |
Distribution and diversity of nTNL (non-Toll interleukin receptor, nucleotide-binding site, eucine-rich repeat) genes in six different plant species (At = Arabidospis thaliana, Gm = Glycine max, Mt = Medicago truncatula, Os = Oryza sativa, Pv = Phaseolus vulgaris and Pt = Populus trichocarpa).
| Clades | At | Gm | Mt | Os | Pv | Pt | Total |
|---|---|---|---|---|---|---|---|
| CNL-A | 8 | 14 | 19 | 0 | 1 | 7 | 49 |
| CNL-B | 25 | 37 | 22 | 3 | 17 | 61 | 165 |
| CNL-C | 7 | 134 | 191 | 145 | 75 | 124 | 676 |
| CNL-D | 12 | 2 | 1 | 0 | 1 | 1 | 17 |
| Unnested | 0 | 0 | 0 | 1 | 0 | 0 | 1 |
| Total | 52 a | 187 | 233 | 149 b | 94 | 193 | 908 |
a Three sequences AT4G19050 (NL-A), AT5G66630 (CN-A), and AT3g15700 (CN-B) [19] did not contain either a Leucine-rich region or the coiled-coil region. b Previously reported CNL member LOC_Os12g10410 in rice was highly divergent and did not nest inside of the four CNL clades.
Figure 1Maximum-likelihood tree of the six plant species (A. thaliana O. sativa, M. truncatula, P. vulgaris, P. trichocarpa, and G. max) based on rbcL gene sequences: the values on the branches represent bootstrap support of 1000 replicates (upper left). Diversity of nTNL genes across the genomes from six species: classification of the 12 subgroups was based on the presence of coil, NBS, LRR (Lxx), RPW8, and BED zinc finger domain (upper right). Maximum-likelihood tree of the identified nTNL NBS-LRR proteins from six plant species; on the branches are the bootstrap support of 100 replicates (lower middle). Altogether 908 protein sequences were included, using the Streptomyces outgroup P25941 (GI: 19857619). Tree branches are color-coded: blue, pink, red, and green for CNL-A, CNL-B, CNL-C, and CNL-D, respectively. The phylogenetic tree was visualized using Interactive tree of life (iTOL) v3. The nTNL gene accessions are color-coded creating a band of 12 colors, exterior to the gene accession names. Branch lengths of 11 sequences (LOC_Os11g42090, LOC_Os11g42070, LOC_Os04g41370, Medtr5g028420, Medtr5g028750, Medtr5g028290, Medtr5g028340, Glyma20g33531, Glyma20g33740, AT5G66910, and AT5G66900) were collapsed to their respective clades to accommodate band size around the tree.
Figure 2G+C content of the nTNL genes in non-legumes appeared higher than legume species. Rice nTNl genes had highest G+C content among the six species. Species are color-coded: legumes are in red-themed colors (Mt = red, Gm = pink and Pv = dark red) and non-legumes are in non-red colors (At = Black, Pt = Blue, Os = green).
Intron-exon structure of the identified nTNL genes in six plant species (Arabidopsis, soybean, Medicago, rice, common bean and poplar). Max = maximum number of exons, Min = minimum number of exons and Avg. = average number of exons.
| Species | CNL-A | CNL-B | CNL-C | CNL-D | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Max | Min | Avg. | Max | Min | Avg. | Max | Min | Avg. | Max | Min | Avg. | |
| At | 10 | 3 | 5.6 | 4 | 1 | 1.5 | 2 | 1 | 1.1 | 6 | 3 | 4.3 |
| Gm | 7 | 2 | 5.2 | 15 | 1 | 7.1 | 9 | 1 | 2.7 | 4 | 2 | 3.0 |
| Mt | 7 | 4 | 5.4 | 14 | 1 | 6.3 | 17 | 1 | 2.7 | 2 | 2 | 2.0 |
| Os | 0 | 0 | 0.0 | 2 | 1 | 1.7 | 8 | 1 | 2.6 | 0 | 0 | 0.0 |
| Pt | 8 | 2 | 4.7 | 11 | 1 | 3.7 | 10 | 1 | 2.4 | 2 | 2 | 2.0 |
| Pv | 5 | 5 | 5.0 | 16 | 2 | 8 | 7 | 1 | 2.3 | 2 | 2 | 2.0 |
Gene clustering in six plant genomes as predicted by 200 kb sliding windows. (At = A. thaliana, Gm = G. max, Mt = M. truncatula, Os = O. sativa, Pv = P. vulgaris, and Pt = P. trichocarpa).
| Species | Count of Clusters Observed | Genes in Clusters | Total nTNL Genes in Clusters (%) | Genes in Largest Cluster | Average Genes Per Cluster |
|---|---|---|---|---|---|
| At | 10 | 29 | 58% | 5 | 2.90 |
| Gm | 33 | 134 | 71% | 11 | 4.06 |
| Mt | 35 | 190 | 81% | 15 | 5.43 |
| Os | 29 | 74 | 50% | 7 | 2.64 |
| Pv | 15 | 82 | 87% | 26 | 5.47 |
| Pt | 34 | 156 | 80% | 18 | 4.59 |
Summary of Ka/Ks ratios for the nTNL gene members from CNL-A2, CNL-B3, CNL-C13 and CNL-D groups.
| Sub Clade (Number of Gene Members) | Minimum Ka/Ks | Maximum Ka/Ks | Average Ka/Ks |
|---|---|---|---|
| CNL-A2 (14) | 0.09 | 0.43 | 0.20 |
| CNL-B3 (32) | 0.10 | 0.81 | 0.27 |
| CNL-C13 (23) | 0.12 | 0.55 | 0.30 |
| CNL-D (17) | 0.76 | 2.29 | 1.55 |
CNL-C genes from the genomes of five different species showed differential gene expression (see Figures S20–S24 for more information).
| Highly expressed Genes | AT3G46710, AT3G14470, Glyma20g33740, Glyma09g02401, Glyma12g14700, Potri.001G025400, LOC_Os01g23380, LOC_Os01g25720, LOC_Os01g33684, LOC_Os01g57270, LOC_Os01g57280, LOC_Os02g16270, LOC_Os02g17304, LOC_Os02g25900, LOC_Os03g40194, LOC_Os03g63150, LOC_Os04g02110, LOC_Os05g30220, LOC_Os05g31570, LOC_Os05g34220, LOC_Os05g34230, LOC_Os07g17250, LOC_Os07g29820, LOC_Os08g07330, LOC_Os08g10260, LOC_Os08g31800, LOC_Os08g43010, LOC_Os10g10360, LOC_Os11g12000, LOC_Os11g12340, LOC_Os11g37740, LOC_Os11g37759, LOC_Os11g42070, LOC_Os11g44580, LOC_Os11g44960, LOC_Os11g45790, LOC_Os12g32660, LOC_Os12g33160. LOC_Os12g37760. Medtr1g016210, Medtr2g014820, Medtr3g014080, Medtr3g015260, Medtr3g027420, Medtr3g032150, Medtr3g055720, Medtr3g056190, Medtr3g056300, Medtr3g070590, Medtr5g021080, Medtr6g046930, Medtr6g047210, Medtr6g052390, Medtr7g089080, Medtr7g091110, Medtr8g011280, Medtr8g038590Potri.001G134700, Potri.001G261300, Potri.003G149800, Potri.003G201800, Potri.005G119800, Potri.006G014400, Potri.006G271800, Potri.007G137100, Potri.008G212200, Potri.011G040800, Potri.012G121900, Potri.017G121500, and Potri.017G136400 |
| Genes expressed at the basal level | Glyma01g35120, Glyma02g03010, Glyma02g03520, Glyma03g05772, Glyma05g08621, Glyma13g26000, Glyma13g26141, Glyma13g26380, LOC_Os01g25630, LOC_Os03g36920, LOC_Os03g50150, LOC_Os05g12770, LOC_Os05g31550, LOC_Os05g40150, LOC_Os09g13820, LOC_Os10g03570, LOC_Os10g07400, LOC_Os11g45160, LOC_Os11g45970, LOC_Os12g29280 and LOC_Os12g31620, Medtr1g023600, Medtr2g039010, Medtr3g035480, Medtr3g086070, Medtr5g035240, Medtr5g070470, Medtr6g046440, Medtr8g011590, Medtr8g011600, Potri.003G099000, Potri.004G195200, Potri.012G122200, Potri.017G127000, Potri.017G143400, Potri.017G143500, and Potri.018G017900 |