
Species-specific duplications of NBS-encoding genes in Chinese chestnut (Castanea mollissima).

Yan Zhong1, Yingjun Li1, Kaihui Huang1, Zong-Ming Cheng1.   

Abstract

Disease resistance (R) genes play an important role in protecting plants from infection by diverse pathogens in the environment. The nucleotide-binding site (NBS)-leucine-rich repeat (LRR) class of genes is one of the largest R gene families. Chinese chestnut (Castanea mollissima) is resistant to chestnut blight, but relatively little is known about the resistance mechanism. We identified 519 NBS-encoding genes, including 374 NBS-LRR genes and 145 NBS-only genes. The majority of Ka/Ks values were less than 1, suggesting that purifying selection operated during the evolutionary history of NBS-encoding genes. A minority (4/34) of Ka/Ks values in non-TIR gene families were greater than 1, showing that some genes were under positive selection. Furthermore, Ks peaked in the range of 0.4 to 0.5, indicating that ancient duplications occurred during evolution. The relationship between Ka/Ks and Ks indicated greater selective pressure on the newer and older genes, with a critical value of Ks = 0.4-0.5. Notably, species-specific duplications were detected in NBS-encoding genes. In addition, the group of RPW8-NBS-encoding genes clustered together as an independent clade located at a relatively basal position in the phylogenetic tree. Many cis-acting elements related to plant defense responses were detected in the promoters of NBS-encoding genes.

Year:  2015        PMID: 26559332      PMCID: PMC4642323          DOI: 10.1038/srep16638

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Plants have a large number of disease resistance (R) genes for defense against numerous and various pathogens, including bacteria, fungi, oomycetes, viruses, and nematodes123. R genes encode proteins that allow plants to systematically recognize and respond to pathogen infection456. NBS-LRR genes are one of the largest gene families in plant genomes and the largest class of known disease resistance genes. Among the over 140 R genes characterized from different flowering plants, approximately 80% belong to the NBS-LRR (nucleotide-binding site and leucine-rich repeat) gene families, members of which can directly or indirectly identify the pathogen278. The NBS domain is thought to be required for binding and hydrolyzing ATP and GTP1. The LRR motif in NBS-encoding genes regulates direct or indirect interactions with pathogens9. Based on the presence or absence of an N-terminal Toll/Interleukin-1 receptor (TIR) domain, NBS-encoding genes are divided into TIR-NBS-LRR/TIR-NBS (TNL/TN) genes and non-TIR-NBS-LRR/non-TIR-NBS (non-TNL/non-TN) genes10. The latter usually have a coiled-coil (CC) or other domain at the N-terminus; this category can therefore be further divided into CC-NBS-LRR/CC-NBS genes (CNL/CN) and X-NBS-LRR/X-NBS genes (XNL/XN)1. CC and LRR domains co-regulate the signaling capacity of the NBS domain in a recognition-specific manner11. Further, some NBS-encoding genes carry an additional domain, RPW8, which confers resistance to powdery mildew and has a transmembrane region preceding the CC structure at the N-terminus12. The RPW8-NBS-LRR (RNL) group is generally regarded as a small but special subclass of non-TNL1314. However, recent studies indicate that the RNL group is an independent sister group to the non-TNL group151617. Gene duplications have contributed to the high numbers and proportions of NBS-LRR genes in plant families18.
A total of 174, 519, and 416 NBS-encoding genes have been identified in Arabidopsis, rice, and poplar, the model systems for dicots, monocots, and woody plants, respectively2192021. Chestnut serves as a model for the Fagaceae, which dominate the hardwood forests of the northern hemisphere22 and have significant economic and ecological value. Chestnut can be infected by pathogens that cause diseases such as chestnut blight2324, ink disease25, or bark disease2627. Furthermore, many disease resistance genes belonging to the NBS-LRR gene families have been identified92829303132333435. Chestnuts (Castanea spp.) are important nut and forest trees that play important roles in ecosystems, providing nuts for wildlife and specialty nuts for human consumption. Chinese chestnut is resistant to chestnut blight, caused by the fungal pathogen Cryphonectria parasitica (formerly Endothia parasitica). The pathogen was accidentally introduced to North America from Japan with nursery stock around 1904 and, in the early 1900s, almost wiped out the American chestnut (Castanea dentata), once a plentiful tree in the eastern United States. However, the mechanism of resistance to chestnut blight in Chinese chestnut is not clear. The recently released whole-genome sequence2236 provides the opportunity to undertake a whole-genome analysis of NBS-encoding R genes in Chinese chestnut and to gain insight into the evolutionary development of this gene family. We used poplar (Populus trichocarpa)2137 as a reference because it was the sequenced species most closely related to chestnut, which allowed us to identify chestnut-specific duplications that arose after the divergence of Chinese chestnut and poplar. The results of this genome-wide analysis suggest that ancient and species-specific duplications have contributed to the expansion of NBS-encoding genes in Chinese chestnut. This research lays a foundation for further characterizing these R genes and helps identify R genes that may be involved in resistance to chestnut blight.

Results

Total number of NBS-encoding genes in Chinese chestnut

A total of 519 NBS-encoding genes were identified in the C. mollissima genome, including 374 NBS-LRR genes and 145 NBS-only genes (Table 1). The NBS-encoding genes comprised 1.36% of expressed genes in Chinese chestnut compared with 0.91% in poplar. Accordingly, the proportion of NBS-LRR genes in Chinese chestnut (0.98%) was also higher than that in poplar (0.72%). Both NBS-LRR and NBS-only genes could be subclassified into two types based on their N-terminal structures: TIR and non-TIR. Among the NBS-encoding genes, 27 TIR genes (22 TNLs and 5 TNs) and 492 non-TIR genes (352 non-TNLs and 140 non-TNs) were found. Non-TIR genes therefore outnumbered TIR genes, as they do in poplar. Furthermore, the proportion of non-TNLs was greater than that of TNLs in both the Chinese chestnut and poplar genomes. In the Chinese chestnut genome, 0.06% and 0.92% of genes were TNLs and non-TNLs, respectively, compared with 0.17% and 0.55% in poplar. Thus, although Chinese chestnut had more NBS-encoding genes than poplar, it had fewer TIR genes. The non-TIR genes could be further divided into 32 CNLs, 96 CNs, 320 XNLs, and 44 XNs (Table 1).
Table 1

The number of NBS-encoding genes in the Chinese chestnut genome.

Predicted protein domains | Letter code | Castanea mollissima | Populus trichocarpa (a)
NBS-encoding genes | | 519 | 416
NBS-LRR type | | 374 | 330
TIR-NBS-LRR | TNL | 22 | 78
non-TIR-NBS-LRR | non-TNL | 352 | 252
CC-NBS-LRR | CNL | 32 | 120
X-NBS-LRR | XNL | 320 | 132
NBS | | 145 | 86
TIR-NBS | TN | 5 | 10
non-TIR-NBS | non-TN | 140 | 76
CC-NBS | CN | 96 | 14
X-NBS | XN | 44 | 62
Genes from the entire genome | | 38081 | 45555
Proportion of NBS-encoding genes | | 1.36% | 0.91%
Proportion of NBS-LRR genes | | 0.98% | 0.72%
Proportion of TIR-NBS-LRR genes | | 0.06% | 0.17%
Proportion of non-TIR-NBS-LRR genes | | 0.92% | 0.55%
Average exon number of all genes | | 4.37 | 2.35
Average exon number of TIR-NBS-LRR | | 2.50 | 3.5
Average exon number of non-TIR-NBS-LRR | | 2.93 | 2.23
Average exon number of CC-NBS-LRR | | 3.32 | (b)
Average exon number of NBS-encoding genes | | 2.66 |
Average exon number of NBS-LRR genes | | 2.90 |

(a) Data from Yang et al. (2008).

(b) Not given in Yang et al. (2008).

The average number of exons across all predicted genes in Chinese chestnut was 4.37, compared with 2.35 in poplar (Table 1). In addition, the average number of exons in TNL genes was lower than that in non-TNL genes in Chinese chestnut, which differed from the pattern in grape and poplar21. The average exon numbers of TNL, non-TNL, CNL, NBS-encoding, and NBS-LRR genes were 2.50, 2.93, 3.32, 2.66, and 2.90, respectively, all lower than the average for all genes predicted in the Chinese chestnut genome.
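The genome-wide proportions reported in Table 1 follow directly from the gene counts. The short sketch below recomputes them; the dictionary layout and the function name are illustrative choices, not part of the original analysis, and only the counts themselves come from Table 1.

```python
# Counts copied from Table 1 (poplar values are those cited from Yang et al. 2008).
COUNTS = {
    "chestnut": {"total_genes": 38081, "NBS": 519, "NBS-LRR": 374, "TNL": 22, "non-TNL": 352},
    "poplar": {"total_genes": 45555, "NBS": 416, "NBS-LRR": 330, "TNL": 78, "non-TNL": 252},
}


def genome_proportions(counts):
    """Return each gene class as a percentage of all genes in the genome."""
    total = counts["total_genes"]
    return {
        name: round(100 * n / total, 2)
        for name, n in counts.items()
        if name != "total_genes"
    }


for species, counts in COUNTS.items():
    print(species, genome_proportions(counts))
```

The same counts give the share of TNLs among NBS-LRR genes (22/374 in chestnut, 78/330 in poplar).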

Duplication of NBS-encoding genes in the Chinese chestnut genome

Using the criteria of coverage ≥70% and identity ≥70%, 273 genes were assigned to 64 NBS gene families in Chinese chestnut; the remaining 246 genes were singletons (Table 2). The percentage of NBS-encoding genes belonging to multi-gene families in Chinese chestnut (52.60%) was much lower than that in poplar (78.13%). The average number of members per family was 4.27 in chestnut and 5.33 in poplar, and the maximum number of members within a family in Chinese chestnut (19) was less than that in poplar (23), which indicated fewer NBS gene duplications and multi-gene families in Chinese chestnut than in poplar.
Table 2

Classification of NBS-encoding genes from Chinese chestnut.

Gene family | C. mollissima 70% | C. mollissima 80% | C. mollissima 90% | P. trichocarpa 70% | P. trichocarpa 80%
Multi-gene | 273 | 215 | 101 | 325 | 310
Single gene | 246 | 304 | 418 | 91 | 106
Proportion of multiple genes | 52.60% | 41.43% | 19.46% | 78.13% | 74.52%
Gene family No. | 64 | 61 | 41 | 61 | 64
Average number of members/family | 4.27 | 3.52 | 2.46 | 5.33 | 4.84
Maximal members of a family | 19 | 17 | 5 | 23 | 17
TIR multiple genes | 4 | 2 | 0 | (a) |
TIR multi-gene family No. | 2 | 1 | 0 | |
Proportion of TIR multiple genes | 14.81% | 7.41% | 0.00% | |
non-TIR multiple genes | 269 | 213 | 101 | |
non-TIR multi-gene family No. | 62 | 60 | 41 | |
Proportion of non-TIR multiple genes | 54.67% | 43.29% | 20.53% | |

(a) Not given in Yang et al. (2008).

When the coverage and identity criteria were raised to 80%, the proportion of multiple genes among all NBS-encoding genes decreased in both Chinese chestnut and poplar. The proportions of multiple NBS-encoding genes were 41.43% and 74.52% in Chinese chestnut and poplar, respectively, suggesting that more relatively recent duplications have occurred in the poplar genome. When the coverage and identity criteria were increased to 90%, 19.46% of the genes were still identified as multiple genes in Chinese chestnut, indicating that recent duplication events partly contributed to the expansion of NBS-encoding genes. The numbers of multi-gene families and multiple genes differed between TNLs and non-TNLs in Chinese chestnut. Both were greater in non-TNLs than in TNLs, indicating that duplication of NBS-encoding genes occurred primarily among non-TNL genes (Table 2).
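To illustrate why the number of multi-gene family members shrinks as the cutoff rises, the following sketch groups genes by single-linkage clustering. It assumes that two genes are linked whenever their pairwise alignment meets both the coverage and identity cutoffs and that families are the resulting connected components; the paper does not state the grouping algorithm, so this is only one plausible reading. The gene names and pair values are hypothetical toy data.

```python
def group_families(genes, pairs, cutoff):
    """Single-linkage grouping of genes into families.

    pairs: iterable of (gene_a, gene_b, coverage, identity), values as fractions.
    Two genes are linked when coverage and identity both reach the cutoff.
    Returns the families (sets of gene names) that have at least two members.
    """
    parent = {g: g for g in genes}

    def find(g):
        # Walk up to the root, shortening the path along the way.
        while parent[g] != g:
            parent[g] = parent[parent[g]]
            g = parent[g]
        return g

    for a, b, coverage, identity in pairs:
        if coverage >= cutoff and identity >= cutoff:
            parent[find(a)] = find(b)

    families = {}
    for g in genes:
        families.setdefault(find(g), set()).add(g)
    return [members for members in families.values() if len(members) > 1]


# Hypothetical toy input, not data from the study.
genes = ["g1", "g2", "g3", "g4", "g5"]
pairs = [
    ("g1", "g2", 0.95, 0.93),
    ("g2", "g3", 0.85, 0.82),
    ("g4", "g5", 0.75, 0.71),
]

for cutoff in (0.7, 0.8, 0.9):
    fams = group_families(genes, pairs, cutoff)
    n_multi = sum(len(f) for f in fams)
    print(cutoff, len(fams), n_multi, f"{100 * n_multi / len(genes):.1f}%")
```

In the toy data all five genes fall into families at the 70% cutoff, three at 80%, and two at 90%, mirroring the direction of the trend in Table 2.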

Duplication time of NBS-LRR genes in chestnut

Ks serves as a time indicator for duplication events, and the frequency distribution of Ks values reflects the relative timing of genome duplications21. First, we calculated the rate of synonymous substitutions and obtained the frequency distribution of Ks values in non-TIR-NBS-encoding genes (Fig. 1), which showed a pronounced peak in the Ks range from 0.4 to 0.5. The frequency then decreased as Ks increased from 0.5 to 1. Ks values in the range from 0.1 to 0.2 accounted for 12.19% of the values between 0 and 1.0, which indicated relatively recent duplications in Chinese chestnut. However, the Ks range from 0.4 to 1.0 contained 35.41% of the duplication events, which occurred during relatively distant periods. Second, a curve relating Ka/Ks to Ks for non-TIR-NBS-encoding genes in Chinese chestnut was fitted to detect any relationship between evolutionary pressure and duplication time for NBS paralogs. The younger and older genes had larger Ka/Ks values than genes in the critical Ks range between 0.4 and 0.5, indicating that they were under greater selective pressure.
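The binning behind such a Ks distribution can be sketched as follows. This is a minimal illustration that assumes each duplicate pair contributes one (Ks, Ka/Ks) value, that bins are 0.1 wide, and that pairs with Ks of 1.0 or more are excluded; the function name and the input values are hypothetical and are not taken from the study.

```python
import math
from collections import defaultdict


def bin_ks(pairs, width=0.1, ks_max=1.0):
    """Bin (Ks, Ka/Ks) pairs by Ks.

    Returns {bin_start: (fraction_of_retained_pairs, mean_ka_ks)} for 0 <= Ks < ks_max.
    """
    n_bins = round(ks_max / width)
    grouped = defaultdict(list)
    for ks, ka_ks in pairs:
        # Round before flooring so that values such as 0.3 land in the 0.3 bin.
        idx = math.floor(round(ks / width, 9))
        if ks >= 0 and idx < n_bins:
            grouped[idx].append(ka_ks)
    kept = sum(len(values) for values in grouped.values())
    return {
        round(idx * width, 10): (len(values) / kept, sum(values) / len(values))
        for idx, values in sorted(grouped.items())
    }


# Hypothetical (Ks, Ka/Ks) values for illustration only.
toy_pairs = [(0.05, 0.9), (0.15, 0.7), (0.42, 0.4), (0.45, 0.5), (0.48, 0.3), (0.75, 0.8), (1.4, 0.2)]
distribution = bin_ks(toy_pairs)
peak_bin = max(distribution, key=lambda start: distribution[start][0])
print(distribution)
print("peak bin starts at Ks =", peak_bin)
```

Plotting the first element of each tuple as bars and the second as a line gives a chart of the kind described for Fig. 1.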
Figure 1

The frequency distribution of relative Ks nodes (bar chart) and the relationship between Ks and Ka/Ks (line chart).

The X-axis denotes Ks in bins of 0.1; the Y-axis denotes the frequency (bars) and the average Ka/Ks ratio (line).

However, among TIR-NBS-encoding genes, only two Ks values (0.4781 and 0.1624) were calculated from two gene families including four members, with Ka/Ks values of 0.3606 and 0.4554, respectively, which were lower than the average Ka/Ks value for non-TIR gene families, indicating greater functional constraints in TIR gene families.

Selective pressure on NBS-encoding genes in Chinese chestnut

Plant disease resistance genes have been shown to be subject to positive selection38. To better understand the evolutionary fate of NBS-encoding duplicates in Chinese chestnut, we used site and branch models in PAML4 to detect positive selection patterns. Because this analysis requires comparison of at least three genes, TIR-NBS-encoding genes were not evaluated, and 34 non-TIR gene families with at least three members each were analyzed. We calculated the ratio of nonsynonymous to synonymous nucleotide substitutions (ω), a molecular evolutionary measure of selection pressure39, to detect positive selection in NBS-encoding genes. A value of ω greater than 1 indicates that nonsynonymous substitutions are accumulating faster than synonymous substitutions, which is evidence of positive selection40. In contrast, values of ω less than 1 indicate purifying selection, and a value of 1 means that a gene is evolving neutrally. As Fig. 2 shows, most of these gene families (30/34) have undergone purifying selection. Four gene families had ω values greater than 1, which demonstrated that some NBS-encoding gene families were under positive selection.
Figure 2

Selective pressure on non-TIR-NBS-encoding genes in Chinese chestnut.

Numbers represent the dN/dS ratio for each gene family using the branch model; 2ΔlnL represents the result of the LR test for the site model; * and ** represent, respectively, significant (2ΔlnL > 5.991, p < 0.05) and highly significant (2ΔlnL > 9.210, p < 0.01) tests for positive selection between models M7 and M8.

Subsequently, the LR test statistic (2ΔlnL) was used to test for significant differences between models M7 and M8. Notably, 85.29% (29/34) of non-TIR gene families (Fig. 2) had some sites under highly significant positive selection, showing that positive selection played a role in the evolution of non-TIR-NBS-encoding genes. Two gene families (Families 30 and 33) (Fig. 2) showed significant positive selection. Further, Bayes Empirical Bayes (BEB) analysis was performed to detect amino acid sites that have been under positive selection41. The sites inferred to be under positive selection at the 95% (*) and 99% (**) posterior probability levels are shown in Table S1 (Additional file 1), which suggested that these positively selected sites had relatively high substitution rates compared with other sites in NBS-LRR genes. Taken together, 530 amino acid sites from 22 families were found to be under positive selection in non-TIR gene families, which might have driven the evolution of function in NBS-encoding genes in Chinese chestnut (Additional file 1: Table S1).

Phylogenetic analysis of NBS-encoding genes

To confirm the phylogenetic relationships among the NBS-encoding genes, the NBS domains of 519 NBS-encoding genes were analyzed and compared with those in poplar, and a phylogenetic tree was constructed. A few genes had longer branch lengths, but the majority had relatively short branches (Additional file 2: Figure S1), which indicated that two different evolutionary patterns occurred in NBS-encoding genes. In general, the majority of the NBS-encoding genes clustered according to species, which suggested species-specific duplication during the evolution of Chinese chestnut. Clades resulting from species-specific duplication were counted when they were supported by bootstrap values greater than 50%. Specifically, 81 clades comprising 401 NBS-encoding genes were identified, with 4.95 R paralogs per clade. The proportion of species-specific duplicated genes in Chinese chestnut was therefore 77.26%. This result clearly indicated that NBS-encoding genes expanded into multiple gene family members after the divergence of Chinese chestnut and poplar. The N-terminal RPW8 domain (RESISTANCE TO POWDERY MILDEW8) in NBS-encoding genes mediates broad-spectrum resistance in Arabidopsis12. Analysis of RPW8 genes helps explain their origin and their relationships to other genes. In the present study, a total of 15 Chinese chestnut RPW8-NBS-encoding genes are marked with solid circles in the phylogenetic tree (Additional file 2: Figure S1). For comparison, five NBS-encoding genes from poplar containing the RPW8 domain are marked with triangles (Additional file 2: Figure S1). Interestingly, the genes carrying the RPW8 domain clustered together (Additional file 2: Figure S1). Additionally, the 15 RPW8 genes from Chinese chestnut formed a relatively independent and monophyletic group that was not phylogenetically embedded within other clades. Similar results were obtained for RNL genes from the M. truncatula, potato, soybean, common bean, and pigeon pea genomes151617. Notably, the RPW8 genes of Chinese chestnut were located at a relatively basal, but not the most basal, position on the phylogenetic tree.
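Counting species-specific clades can be sketched as a tree traversal. The example below assumes a simple nested representation in which a leaf is a gene name and an internal node is a (bootstrap, children) pair, and it treats a clade as species-specific when every leaf belongs to the target species and the node's bootstrap support exceeds 50. The representation, the function names, and the toy tree are all hypothetical; the counting rules used in the study may differ in detail.

```python
def leaves(node):
    """Return all leaf names under a node (a leaf is a str, an internal node a (bootstrap, children) pair)."""
    if isinstance(node, str):
        return [node]
    _, children = node
    return [name for child in children for name in leaves(child)]


def species_specific_clades(node, species_of, target, min_bootstrap=50):
    """Collect maximal clades whose leaves all belong to `target` and whose support exceeds the cutoff."""
    if isinstance(node, str):
        return []
    bootstrap, children = node
    names = leaves(node)
    if bootstrap > min_bootstrap and all(species_of(n) == target for n in names):
        return [names]  # maximal clade found, so do not descend further
    found = []
    for child in children:
        found.extend(species_specific_clades(child, species_of, target, min_bootstrap))
    return found


# Hypothetical toy tree: "Cm" genes stand for chestnut, "Pt" genes for poplar.
tree = (100, [
    (90, ["Cm1", "Cm2", (80, ["Cm3", "Cm4"])]),
    (60, ["Pt1", (40, ["Cm5", "Cm6"])]),
    "Pt2",
])

clades = species_specific_clades(tree, species_of=lambda name: name[:2], target="Cm")
n_genes = sum(len(c) for c in clades)
print(len(clades), "clade(s),", n_genes, "genes,", round(n_genes / len(clades), 2), "paralogs per clade")
```

With the figures reported above, 401 genes in 81 clades give 401/81 ≈ 4.95 paralogs per clade, and 401/519 ≈ 77.26% of NBS-encoding genes.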

The cis-element analysis of NBS-encoding genes promoter sequences

Plant defense is controlled by cis-regulatory elements corresponding to key genes involved in defense and pathogen-specific responses42. Therefore, identifying cis-acting elements in the promoters of NBS-encoding genes will help us understand their function in plant defense responses. We performed a cis-element analysis of all NBS-encoding genes using PLACE. Many cis-regulatory elements associated with plant responses to pathogens, including DOFCOREZM, EECCRCAH1, GT1GMSCAM4, GT1CONSENSUS, and AGCBOXNPGLB4344, were found in the promoter regions of NBS-encoding genes (Table 3 and Additional file 3), which suggests that NBS-encoding genes are involved in responses to pathogen infection. Moreover, the results (Additional file 3) point to a way of identifying candidate genes that might be used to confer disease resistance.
Table 3

The cis-acting element analysis of NBS-encoding genes promoter sequences.

Element name | Number | Element name | Number | Element name | Number | Element name | Number
DOFCOREZM | 495 | CAREOSREP1 | 172 | MYBATRD22 | 47 | GBOXLERBCS | 4
CACTFTPPCA1 | 487 | IBOX | 172 | GARE1OSREP1 | 46 | HEXAT | 4
CAATBOX1 | 486 | TATABOX3 | 172 | DRE2COREZMRAB17 | 44 | MNF1ZMPPC1 | 4
GT1CONSENSUS | 482 | SORLIP2AT | 170 | REBETALGLHCB21 | 42 | OPAQUE2ZMB32 | 4
GTGANTG10 | 481 | MYBGAHV | 169 | ACGTABOX | 41 | RGATAOS | 4
ARR1AT | 477 | SREATMSD | 167 | HDZIP2ATATHB2 | 40 | S2FSORPL21 | 4
GATABOX | 475 | ERELEE4 | 163 | SURE1STPAT21 | 36 | SORLIP4AT | 4
POLLEN1LELAT52 | 475 | WBOXNTCHN48 | 162 | PALBOXAPC | 35 | SORLREP4AT | 4
WRKY71OS | 472 | WBBOXPCWRKY1 | 160 | TRANSINITDICOTS | 35 | TELOBOXATEEF1AA1 | 4
ROOTMOTIFTAPOX1 | 454 | SEF1MOTIF | 149 | SORLREP3AT | 34 | UPRMOTIFIAT | 4
NODCON2GM | 452 | SP8BFIBSP8BIB | 149 | ACGTABREMOTIFA2OSEM | 33 | ANAERO5CONSENSUS | 3
OSE2ROOTNODULE | 452 | ASF1MOTIFCAMV | 148 | BOXCPSAS1 | 33 | HBOXCONSENSUSPVCHS | 3
EBOXBNNAPA | 451 | ELRECOREPCRP1 | 145 | QARBNEXTA | 33 | HSELIKENTACIDICPR1 | 3
MYCCONSENSUSAT | 451 | SEBFCONSSTPR10A | 144 | CARGATCONSENSUS | 32 | MRNA3ENDTAH3 | 3
TAAAGSTKST1 | 446 | GT1CORE | 136 | ATHB5ATCORE | 29 | NONAMERATH4 | 3
WBOXNTERF3 | 432 | RYREPEATBNNAPA | 136 | UP1ATMSD | 27 | OCTAMERMOTIFTAH3H4 | 3
TATABOX5 | 431 | LECPLEACS2 | 130 | EVENINGAT | 25 | PALBOXPPC | 3
GT1GMSCAM4 | 428 | RBCSCONSENSUS | 126 | GCCCORE | 25 | PIATGAPB | 3
POLASIG1 | 428 | ARFAT | 124 | IRO2OS | 23 | SP8BFIBSP8AIB | 3
CCAATBOX1 | 423 | MYBPLANT | 122 | NRRBNEXTA | 23 | SPHCOREZMC1 | 3
INRNTPSADB | 422 | T/GBOXATPIN2 | 122 | PROXBBNNAPA | 23 | ABREATRD22 | 2
SEF4MOTIFGM7S | 422 | PYRIMIDINEBOXHVEPB1 | 119 | S1FSORPL21 | 22 | ABREMOTIFAOSOSEM | 2
RAV1AAT | 417 | PRECONSCRHSP70A | 118 | SURE2STPAT21 | 22 | BOX2PSGS2 | 2
IBOXCORE | 414 | TATAPVTRNALEU | 118 | BOXIIPCCHS | 21 | CAATBOX2 | 2
POLASIG3 | 404 | CBFHV | 115 | ATHB1ATCONSENSUS | 20 | E2F1OSPCNA | 2
WBOXATNPR1 | 392 | MYB2AT | 115 | CMSRE1IBSPOA | 20 | E2FANTRNR | 2
WBOXHVISO1 | 379 | SV40COREENHAN | 115 | AGMOTIFNTMYB2 | 19 | ELRENTCHN50 | 2
BIHD1OS | 375 | BOXLCOREDCPAL | 114 | BP5OSWX | 19 | GMHDLGMVSPB | 2
MYBCORE | 367 | IBOXCORENT | 113 | GARE2OSREP1 | 19 | JASE1ATOPR1 | 2
CURECORECR | 364 | AACACOREOSGLUB1 | 111 | ANAERO4CONSENSUS | 17 | LBOXLERBCS | 2
EECCRCAH1 | 362 | 2SSEEDPROTBANAPA | 110 | LTREATLTI78 | 17 | LREBOXIIPCCHS1 | 2
MARTBOX | 352 | CATATGGMSAUR | 108 | HEXAMERATH4 | 16 | PALINDROMICCBOXGM | 2
ACGTATERD1 | 339 | CCA1ATLHCB1 | 105 | ABREOSRAB21 | 15 | PE2FNTRNR1A | 2
MYB1AT | 336 | E2FCONSENSUS | 105 | ACGTOSGLUB1 | 15 | ABRE3HVA1 | 1
NODCON1GM | 333 | QELEMENTZMZM13 | 105 | ATHB6COREAT | 15 | ABREAZMRAB28 | 1
OSE1ROOTNODULE | 333 | LTRECOREATCOR15 | 101 | AUXREPSIAA4 | 15 | ACGTABREMOTIFAOSOSEM | 1
MYBST1 | 327 | CGCGBOXAT | 100 | BS1EGCCR | 15 | ACGTROOT1 | 1
-300ELEMENT | 314 | TATCCAYMOTIFOSRAMY3D | 100 | ABREATCONSENSUS | 13 | ACIPVPAL2 | 1
NTBBF1ARROLB | 313 | -300CORE | 93 | DRE1COREZMRAB17 | 13 | AT1BOX | 1
REALPHALGLHCB21 | 312 | ANAERO2CONSENSUS | 92 | MYB26PS | 13 | AUXRETGA2GMGH3 | 1
POLASIG2 | 300 | LTRE1HVBLT49 | 91 | SBOXATRBCS | 12 | BOX1PSGS2 | 1
DPBFCOREDCDC3 | 290 | RYREPEATLEGUMINBOX | 85 | UP2ATMSD | 12 | C2GMAUX28 | 1
PYRIMIDINEBOXOSRAMY1A | 285 | PROLAMINBOXOSGLUB1 | 83 | AMMORESIVDCRNIA1 | 11 | CONSERVED11NTZMATP1 | 1
SURECOREATSULTR11 | 276 | WUSATAg | 80 | AGCBOXNPGLB | 10 | CPRFPCCHS | 1
-10PEHVPSBD | 268 | MARABOX1 | 79 | ATHB2ATCONSENSUS | 10 | D1GMAUX28 | 1
ANAERO1CONSENSUS | 268 | CTRMCAMV35S | 72 | ACGTCBOX | 9 | DE1PSPRA2 | 1
BOXIINTPATPB | 268 | MYB1LEPR | 72 | AMMORESIIUDCRNIA1 | 9 | DR5GMGH3 | 1
TBOXATGAPB | 268 | XYLAT | 68 | CRTDREHVCBF2 | 9 | E2FBNTRNR | 1
MYB2CONSENSUSAT | 256 | MARARS | 66 | UPRMOTIFIIAT | 9 | GRAZMRAB28 | 1
CARGCW8GAT | 243 | DRECRTCOREAT | 65 | CARGNCAT | 8 | GT2OSPHYA | 1
ABRELATERD1 | 238 | INTRONLOWER | 65 | INTRONUPPER | 8 | HDMOTIFPCPR2 | 1
CIACADIANLELHC | 236 | P1BS | 63 | LRENPCABE | 8 | HSRENTHSR203J | 1
SEF3MOTIFGM | 231 | ACGTTBOX | 62 | ZDNAFORMINGATCAB1 | 8 | L1DCPAL1 | 1
SORLIP1AT | 230 | GT1MOTIFPSRBCS | 62 | EMBP1TAEM | 7 | LREBOXIPCCHS1 | 1
CPBCSPOR | 226 | L1BOXATPDF1 | 62 | GGTCCCATGMSAUR | 7 | NONAMERMOTIFTAH3H4 | 1
TATABOX2 | 220 | ANAERO3CONSENSUS | 61 | GLMHVCHORD | 7 | O2F1BE2S1 | 1
PREATPRODH | 210 | RAV1BAT | 61 | RYREPEATVFLEB4 | 7 | OPAQUE2ZM22Z | 1
TATABOX4 | 206 | CGACGOSAMY3 | 59 | -300MOTIFZMZEIN | 6 | RBCSBOX3PS | 1
GAREAT | 205 | GCN4OSGLUB1 | 58 | ABREZMRAB28 | 6 | RBCSGBOXPS | 1
RHERPATEXPA7 | 203 | SORLIP5AT | 58 | AGL2ATCONSENSUS | 6 | SGBFGMGMAUX28 | 1
TATABOXOSPAL | 203 | CEREGLUBOX2PSLEGA | 57 | D4GMAUX28 | 6 | SORLREP2AT | 1
AMYBOX1 | 201 | NAPINMOTIFBN | 56 | POLLEN2LELAT52 | 6 | SORLREP5AT | 1
CANBNNAPA | 197 | AMYBOX2 | 55 | SITEIOSPCNA | 6 | TATABOX1 | 1
S1FBOXSORPS1L21 | 191 | TRANSINITMONOCOTS | 55 | AGATCONSENSUS | 5 | TE2F2NTPCNA | 1
SITEIIATCYTC | 184 | CACGTGMOTIF | 54 | AUXRETGA1GMGH3 | 5 | TOPOISOM | 1
MYCATERD1 | 182 | RYREPEATGMGY2 | 54 | CACGCAATGMGH3 | 5 | VOZATVPP | 1
MYCATRD22 | 182 | ARE1 | 53 | PALBOXLPC | 5 | VSF1PVGRP18 | 1
MYBPZM | 181 | HEXMOTIFTAH3H4 | 51 | SORLIP3AT | 5 | WRECSAA01 | 1
MYBCOREATCYCB1 | 180 | TATCCACHVAL21 | 50 | 5659BOXLELAT5659 | 4 | |
TATCCAOSAMY | 174 | LEAFYATAG | 49 | ACIIIPVPAL2 | 4 | |
ABRERATCAL | 172 | EMHVCHORD | 47 | DREDR1ATRD29AB | 4 | |

Discussion

A small number of TIR-NBS-encoding genes in Chinese chestnut

Surveys of NBS-encoding genes in the sequenced genomes of many species, including Arabidopsis, rice, grapevine, and poplar, have found variable numbers of NBS-encoding genes2192021. Based on the structure of the N-terminal domain, NBS-encoding genes have been categorized into two classes, TIR type and non-TIR type. TNLs are absent in monocots such as rice1920, in the dicot Aquilegia coerulea14, and in the dicotyledonous order Lamiales, but are found in most dicots104546. However, only 27 TIR-type genes, including 22 TNL genes and five TN genes, were identified in Chinese chestnut. The proportion of TNLs in the entire genome of Chinese chestnut was only 0.0577%, which was much lower than that in Arabidopsis (0.348%), grapevine (0.319%), or poplar (0.171%)221. Furthermore, the percentage of TNLs among the NBS-LRR genes of poplar (23.6%) was four times that in Chinese chestnut (5.88%), which also demonstrated that TNLs were less abundant in Chinese chestnut. In Chinese chestnut, the average number of exons estimated for TNLs (2.50) was lower than that for all genes in the genome and for non-TNLs, a result that differed from those of previous studies in Arabidopsis, poplar, grapevine, apple, pear, and peach22147. Furthermore, because of the small number of TIR-type NBS-encoding genes, only two Ks values (0.4781 and 0.1623) were estimated for the two TIR gene families. The average Ka/Ks ratio for TIR-type genes, with Ks ranging from 0 to 1, was lower than that for non-TIR-type genes (0.4080 compared with 0.5871). This result was consistent with those of previous studies of TNLs and non-TNLs in Arabidopsis, grapevine, poplar, apple, pear, and peach22147. The lower Ka/Ks values for TIR gene families than for non-TIR gene families indicate greater functional constraints during the evolution of TIR gene subfamilies48 and stronger diversifying selection in non-TIR gene families, which could provide variation that would allow plants to adapt to different pathogens in their environment47.

Duplication events of NBS-encoding genes in Chinese chestnut

Gene duplication plays a critical role in the generation of new R genes, increasing the number of these genes and dispersing them in the genome7. The number of NBS-encoding genes in Chinese chestnut was 3- and 1.2-fold that in Arabidopsis and poplar, respectively. To identify duplicated gene pairs, we defined gene families at three levels of stringency. Using the 70% criterion (coverage and sequence identity both not less than 70%), 64 gene families were detected, comprising 273 NBS-encoding genes in Chinese chestnut (52.60%) (Table 2). The percentage of NBS-encoding genes occurring in multi-gene families in Chinese chestnut was nearly the same as that in the rice genome (53.7%) and lower than that in the poplar genome (78.1%). Using the 80% criterion (coverage and sequence identity both not less than 80%), 41.43% of NBS genes were found to belong to multi-gene families in Chinese chestnut, while 74.52% of NBS genes occurred in multi-gene families in the poplar genome. The proportion of genes in multi-gene families declined sharply to 19.46% when the 90% criterion was used, which showed that relatively recent duplications accounted for only a small portion of the multi-gene families in Chinese chestnut. The number of synonymous substitutions per synonymous site and the frequency distribution of Ks values can be used to infer the age of genome duplications21. The peak and distribution of Ks values differ among species. Ks peaks varied greatly, occurring at Ks values from 0 to 0.1 in grape and poplar, from 0.1 to 0.2 in Arabidopsis, apple, pear, peach, and Prunus mume, and from 0.3 to 0.4 in rice and strawberry2147. In Chinese chestnut, Ks peaked in the range from 0.4 to 0.5 (Fig. 1), a much higher Ks value than that in poplar (Ks = 0–0.1) or in other species, indicating that the gene expansions of Chinese chestnut were more ancient. Furthermore, the Ks values for Chinese chestnut were mainly distributed in the range from 0.2 to 0.5 and were rarely greater than 0.5 (Fig. 1). The fact that 63.94% of multiple NBS-encoding genes had Ks values from 0.2 to 0.5 demonstrates that relatively ancient duplications played an important role in the expansion of NBS-encoding genes in Chinese chestnut.

Species-specific duplication driving in the expansion of NBS-encoding genes in Chinese chestnut

Gene duplication has supplied raw genetic material for evolution49 and has been a major force for generating biological novelties that can lead to adaptation to environments50. To elucidate the evolutionary pattern of NBS-encoding genes in Chinese chestnut, a phylogenetic tree of the conserved NBS domains from the Chinese chestnut and poplar genomes was constructed (Fig. 3). The most distinct characteristic was species-specific duplication events, which might be responsible for the evolution of recognition of species-specific pathogens and which responded to selective pressure imposed by those pathogens21.
Figure 3

Phylogenetic tree of NBS domains from NBS-encoding genes.

Blue (Chinese chestnut), purple (poplar).

The 401 species-specific NBS-encoding genes detected in Chinese chestnut were classified into 81 clades with bootstrap values greater than 50%. The proportion of species-specific duplicates reached 77.26%, suggesting large-scale gene expansion after the divergence of Chinese chestnut and poplar. A similar result has also been reported for four gramineous plants (Zea mays, Sorghum bicolor, Brachypodium distachyon, and Oryza sativa)51 and five species in the Rosaceae (Fragaria vesca, Malus × domestica, Pyrus bretschneideri, Prunus persica, and Prunus mume)47.

The independent and relatively basal NBS group containing the RPW8 domain

The RPW8 domain (PF05659.6) mediates broad-spectrum pathogen resistance and was originally identified in the protein encoded by the polymorphic powdery mildew resistance locus RPW812. In previous studies, proteins containing the RPW8 domain have been categorized into the non-TIR-NBS-encoding gene subgroup1314. However, the 15 RPW8 domain-encoding genes in Chinese chestnut (1 XN, 2 TNLs, 3 CNLs, and 9 XNLs) clustered into an independent lineage in the phylogenetic tree (Additional file 2: Figure S1). Also, five NBS-encoding genes from poplar encoding the RPW8 domain clustered together with the 15 RPW8-NBS-encoding genes from Chinese chestnut. The clustering of RPW8-NBS-encoding genes observed in Chinese chestnut also occurred with RPW8-NBS-LRR genes from M. truncatula15, potato16, soybean, common bean, and pigeon pea17, which illustrated that RPW8-NBS-encoding genes did not comprise a special group of non-TIR genes but formed a separate group independent of the non-TIR genes. Moreover, the group of RPW8-NBS-encoding genes was located at a relatively basal, but not the most basal, position in the phylogenetic tree of Chinese chestnut (Additional file 2: Figure S1). This result is similar to findings in Arabidopsis, where RNL genes formed a monophyletic group (the CNL-A group)217. Thus, the RPW8-NBS-encoding genes were phylogenetically separate from the non-TIR genes.

Materials and Methods

Identification of NBS-encoding genes in Chinese chestnut and classification of gene family members

The entire genome sequence and annotation V1.0 of Chinese chestnut (C. mollissima) were downloaded from the hardwood genomics project (http://www.hardwoodgenomics.org/chinese-chestnut-genome). To identify NBS-encoding genes in Chinese chestnut, we used the amino acid sequence of the NB-ARC domain (PF00931) as a blastp query against all known protein sequences with the threshold expectation value set to 1.0. All hits were further submitted to Pfam analysis (http://pfam.xfam.org/) to verify the presence of the NBS (NB-ARC) domain. Furthermore, the identified NBS-encoding genes were examined to detect whether they encode the LRR or TIR domain using the merged results from Pfam analysis and SMART protein motif analysis (http://smart.embl-heidelberg.de/). Finally, all identified genes were examined for the presence of a CC domain using COILS (http://embnet.vital-it.ch/software/COILS_form.html) with a threshold of 0.9 (ref. 52) in Chinese chestnut, and for the RPW8 domain using Pfam analysis in Chinese chestnut and poplar. Genes were grouped into gene families according to two criteria, coverage (aligned sequence length/gene length) and sequence identity, both of which had to be not less than 70%. Likewise, stricter criteria of not less than 80% and 90% for both coverage and sequence identity were used to detect the relatively recent duplications among NBS-encoding genes.
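Once the domain searches are complete, assigning the letter codes used in Table 1 is a simple rule-based step. The sketch below is one way to express it; the function name, the domain labels, and the rule that TIR takes precedence over CC when both are detected are assumptions made for illustration, not details given in the text.

```python
def classify_nbs_gene(domains):
    """Map the set of detected domains to a letter code such as TNL, CN, or XNL.

    domains: a set of labels drawn from {"NBS", "LRR", "TIR", "CC"} (hypothetical labels).
    """
    if "NBS" not in domains:
        raise ValueError("not an NBS-encoding gene")
    if "TIR" in domains:
        prefix = "T"  # assumption: TIR takes precedence if CC is also detected
    elif "CC" in domains:
        prefix = "C"
    else:
        prefix = "X"  # neither TIR nor CC detected at the N-terminus
    return prefix + ("NL" if "LRR" in domains else "N")


for example in ({"NBS", "LRR", "TIR"}, {"NBS", "CC"}, {"NBS", "LRR"}, {"NBS"}):
    print(sorted(example), classify_nbs_gene(example))
```

RPW8 is not part of the code here because the RPW8-containing genes in this study are still tallied under the TNL, CNL, XNL, and XN classes.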

Sequence alignment and phylogenetic analyses

The NBS domain sequences of all identified NBS-encoding genes were aligned in MEGA 5.0 using the MUSCLE program53. A neighbor-joining (NJ) method was then applied to construct a phylogeny of NBS-encoding genes using ClustalW 2.0 with default options and 1000 bootstrap replications54. Data for NBS-encoding genes from poplar (Populus trichocarpa) were obtained from a previous study21.

The ratio of nonsynonymous substitutions to synonymous substitutions

To detect the mode of selection, we evaluated the ratio of nonsynonymous substitutions to synonymous substitutions. Firstly, based on the protein sequences, the CDSs (nucleotide coding sequences) of NBS-encoding genes in each gene family were aligned using ClustalW 2.054. Subsequently, nonsynonymous substitutions (Ka), synonymous substitutions (Ks), and the ratio between them (Ka/Ks) were calculated in each gene family using MEGA 5.053.

Detection of positive selection

The Phylogenetic Analysis by Maximum Likelihood 4 (PAML4) package55 was used for site model and branch model tests to determine selective pressure on NBS-LRR genes in Chinese chestnut. For the site model, a single dN/dS ratio (model = 0) together with models M7 (beta) and M8 (beta-ω) (NSsites = 7 8) was used in all gene families with at least three members. Subsequently, the likelihood ratio (LR) test between models M7 and M8 was carried out with chi-square critical values of 5.991 (p < 0.05, df = 2) and 9.210 (p < 0.01, df = 2). For the branch model, a single dN/dS ratio (model = 0) and model 0 (NSsites = 0) were applied in the codeml program.
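The M7 versus M8 comparison can be sketched as follows. The test statistic is twice the difference in log-likelihood between the two models, compared against the chi-square critical values quoted above. The function name and the log-likelihood values are hypothetical; the p-value uses the closed form exp(-x/2), which holds for the chi-square distribution with df = 2 only.

```python
import math

def lrt_m7_vs_m8(lnl_m7, lnl_m8):
    """Likelihood ratio test of M8 (alternative) against M7 (null), df = 2."""
    # Test statistic: 2 * (lnL_M8 - lnL_M7), floored at zero.
    stat = max(0.0, 2.0 * (lnl_m8 - lnl_m7))
    # Chi-square survival function for df = 2 has the closed form exp(-x/2).
    p_value = math.exp(-stat / 2.0)
    if stat > 9.210:
        verdict = "significant at p < 0.01"
    elif stat > 5.991:
        verdict = "significant at p < 0.05"
    else:
        verdict = "not significant"
    return stat, p_value, verdict

# Hypothetical codeml log-likelihoods, for illustration only.
print(lrt_m7_vs_m8(-1000.0, -995.0))
```

As a consistency check, exp(-5.991/2) is approximately 0.05 and exp(-9.210/2) is approximately 0.01, matching the two critical values.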

Promoter regions analysis

To characterize the cis-acting elements of NBS-encoding genes, we isolated an approximately 1000 bp promoter sequence for each NBS-encoding gene. Promoter sequences were analyzed using the SIGNALSCAN program available in the Plant cis-acting regulatory DNA Elements (PLACE) database (http://www.dna.affrc.go.jp/PLACE/), which contains mainly plant motifs extracted from published reports56.
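Extracting a promoter region of this size can be sketched as below. The text does not say how the sequence was anchored, so this sketch assumes the promoter is the stretch immediately upstream of the annotated gene start, with 1-based inclusive coordinates and strand-aware orientation; the function names and the toy sequence are hypothetical.

```python
def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    complement = str.maketrans("ACGTNacgtn", "TGCANtgcan")
    return seq.translate(complement)[::-1]

def upstream_region(chrom_seq, gene_start, gene_end, strand, length=1000):
    """Return up to `length` bp upstream of a gene, in the gene's orientation.

    Assumes 1-based, inclusive gene coordinates on the forward strand.
    The region is truncated at the ends of the sequence.
    """
    if strand == "+":
        begin = max(0, gene_start - 1 - length)
        return chrom_seq[begin:gene_start - 1]
    if strand == "-":
        # For a minus-strand gene, upstream lies after gene_end on the forward strand.
        end = min(len(chrom_seq), gene_end + length)
        return reverse_complement(chrom_seq[gene_end:end])
    raise ValueError("strand must be '+' or '-'")

# Toy sequence with a hypothetical gene at positions 5 to 8.
toy = "AACCGGTTACGT"
print(upstream_region(toy, 5, 8, "+", length=3))
print(upstream_region(toy, 5, 8, "-", length=3))
```

The resulting sequences would then be submitted to a motif scanner such as SIGNALSCAN; that step is not reproduced here.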

Conclusions

NBS-LRR genes, one of the largest families of R genes, were analyzed in Chinese chestnut (Castanea mollissima), a model species for the Fagaceae, to determine their pattern of evolution. In the present study, several TNLs, which are absent from monocot genomes, were identified in Chinese chestnut. In addition, we found a relatively ancient duplication in Chinese chestnut compared with poplar. The expansion of NBS-encoding genes could be attributed to such species-specific duplications during the evolution of Chinese chestnut. The Ka/Ks values in all TIR and most non-TIR gene families were less than 1, indicating purifying selection as the leading force in the evolution of NBS-encoding genes. However, the Ka/Ks values for four non-TIR gene families were greater than 1, indicating that their evolution was driven by positive selection. Furthermore, the relationship between Ka/Ks and Ks illustrated higher selective pressure on the newer and older genes compared with genes in the critical range of Ks from 0.4 to 0.5. Interestingly, RPW8-NBS-encoding genes clustered into an independent clade at a relatively basal, but not the most basal, position in this phylogenetic analysis. Finally, many cis-elements in the promoters of NBS-encoding genes were related to disease resistance, which suggests a role in responding to pathogens and lays a foundation for identifying candidate R genes.

Additional Information

How to cite this article: Zhong, Y. et al. Species-specific duplications of NBS-encoding genes in Chinese chestnut (Castanea mollissima). Sci. Rep. 5, 16638; doi: 10.1038/srep16638 (2015).