Literature DB >> 26449254

Genome-Wide Architecture of Disease Resistance Genes in Lettuce.

Marilena Christopoulou1, Sebastian Reyes-Chin Wo1, Alex Kozik1, Leah K McHale1, Maria-Jose Truco1, Tadeusz Wroblewski1, Richard W Michelmore2.   

Abstract

Genome-wide motif searches identified 1134 genes in the lettuce reference genome of cv. Salinas that are potentially involved in pathogen recognition, of which 385 were predicted to encode nucleotide binding-leucine rich repeat receptor (NLR) proteins. Using a maximum-likelihood approach, we grouped the NLRs into 25 multigene families and 17 singletons. Forty-one percent of these NLR-encoding genes belong to three families, the largest being RGC16 with 62 genes in cv. Salinas. The majority of NLR-encoding genes are located in five major resistance clusters (MRCs) on chromosomes 1, 2, 3, 4, and 8 and cosegregate with multiple disease resistance phenotypes. Most MRCs contain primarily members of a single NLR gene family but a few are more complex. MRC2 spans 73 Mb and contains 61 NLRs of six different gene families that cosegregate with nine disease resistance phenotypes. MRC3, which is 25 Mb, contains 22 RGC21 genes and colocates with Dm13. A library of 33 transgenic RNA interference tester stocks was generated for functional analysis of NLR-encoding genes that cosegregated with disease resistance phenotypes in each of the MRCs. Members of four NLR-encoding families, RGC1, RGC2, RGC21, and RGC12 were shown to be required for 16 disease resistance phenotypes in lettuce. The general composition of MRCs is conserved across different genotypes; however, the specific repertoire of NLR-encoding genes varied particularly of the rapidly evolving Type I genes. These tester stocks are valuable resources for future analyses of additional resistance phenotypes.
Copyright © 2015 Christopoulou et al.

Entities:  

Keywords:  Bremia lactucae; NB-LRR; gene silencing; lettuce downy mildew; reverse genetics

Mesh:

Substances:

Year:  2015        PMID: 26449254      PMCID: PMC4683639          DOI: 10.1534/g3.115.020818

Source DB:  PubMed          Journal:  G3 (Bethesda)        ISSN: 2160-1836            Impact factor:   3.154


Plants use multiple cell-surface and cytosolic receptors to perceive and react to pathogen attacks. These receptors are categorized into pattern recognition receptors that recognize conserved microbial signatures known as microbe-associated molecular patterns and nucleotide binding-leucine rich repeat receptor (NLR) proteins that recognize pathogen effector molecules (Jones and Dangl 2006; Ronald and Beutler 2010; Monaghan and Zipfel 2012). Most pattern recognition receptors are receptor-like kinases (RLKs) or receptor-like proteins (RLPs), both containing an extracellular domain that can bind ligands and a transmembrane domain (TM) but only RLKs have a cytosolic kinase domain (Monaghan and Zipfel 2012). The majority of the RLKs known to play a role in disease resistance belong to the class of non-arginine-aspartate (non-RD) kinases (Dardick and Ronald 2006; Dardick ). NLRs are cytoplasmic receptors that directly or indirectly recognize effectors and activate a strong defense reaction known as effector-triggered immunity (Jones and Dangl 2006; Bozkurt ). They often contain additional domains but usually not a kinase. NLRs typically consist of a variable N terminus, a conserved internal nucleotide binding (NB)-ARC domain, and a highly polymorphic carboxy (C) terminus with multiple leucine-rich repeats (LRRs) (Hulbert ). NLRs can be classified into two major groups, TNLs and non-TNLs, based on the presence or absence of a TOLL/interleukin-1 receptor (TIR) domain at the N-terminus (Takken and Goverse 2012). The sequence conservation of the NB domain within and between species is high and allows the classification into TNL vs. non-TNL proteins (Meyers ; McHale ). Some non-TNLs contain a coiled-coil (CC) domain and therefore collectively non-TNLs are referred to as CNLs. The NLR-encoding genes comprise one of the most polymorphic and abundant gene super-families in plants (Clark ; Karasov ). The numbers of NLR-encoding genes vary greatly among species, for example, there are only 54 in Carica papaya (Porter ), 150 in Arabidopsis thaliana (Meyers ), 571 in Medicago truncatula (Shao ), 465 in soybean (Shao ), 402 in Populus trichocarpa (Kohler ), 535 in Oryza sativa (Zhou ), and as many as 1015 in Malus domestica (Arya ). The prevalence of the different classes also can vary considerably; TNL families have expanded in eudicots but are absent in grass genomes (Meyers ; Yue ). NLR-encoding genes are clustered in the genome as a result of lineage-specific segmental duplication events followed by local rearrangements (Zhang ). Expansion of individual clusters occurs through tandem duplications resulting from unequal crossing over (Richly ; Meyers ; Zhou ). The complexity of these clusters also is affected by gene loss through natural selection acting on a birth-and-death process (Michelmore and Meyers 1998; Leister 2004; Mondragon-Palomino and Gaut 2005; Guo ). NLR-encoding genes also exhibit presence/absence variation polymorphisms within or among related species (Kuang ; Shen ; Guo ; Luo ). These genetic and evolutionary processes have resulted in complex, variable clusters of NLR genes in the genome that confer resistance to multiple diverse pathogens. Lettuce (Lactuca sativa) is one of the most valuable vegetable crops in the United States, with a value of approximately 2 billion dollars per annum and is host to a wide range of pathogens and pests. A wealth of genetic and genomic resources and tools are now available that enable genetic and functional studies of disease resistance in lettuce. More than 52 loci have been genetically defined that confer resistance phenotypes to 10 pathogens and one pest, nearly all of which cosegregate with clusters of NLR-encoding genes (McHale ; Hayes ; M. J. Truco and R. W. Michelmore, unpublished data). Lettuce downy mildew, caused by the obligate biotroph Bremia lactucae Regel, is the most economically important disease affecting the lettuce production worldwide (Davis ) that has been studied extensively (Michelmore and Wong 2008). Lettuce downy mildew is one of the most characterized gene-for-gene plant−pathogen interactions; more than 25 genes for resistance to downy mildew (Dm) have been identified, most of which are dominant genes that confer resistance in a gene-for-gene manner (Crute and Johnson 1976; Farrara ; Ilott ; Michelmore ). Many of these resistance genes have been introgressed from wild Lactuca species. Studies of sequence diversity between lettuce cultivars showed elevated levels of polymorphism for five major resistance clusters (MRCs) on chromosomes 1, 2, 3, 4, and 8 (McHale ; Truco ). A draft 2.4-Gb reference sequence assembly is now available for the 2.7-Gb genome of L. sativa cv. Salinas along with an ultradense genetic map that assigns more than 96% of the assembled sequence to 1460 genetic bins ordered along the nine chromosomal linkage groups (http://lgr.genomecenter.ucdavis.edu; S. Reyes-Chin Wo, A. Kozik, D. Lavelle, and R. W. Michelmore, unpublished data). The MRC on chromosome 2 is the most studied MRC in lettuce. MRC2 includes at least eight Dm genes (McHale ), including Dm3 the first resistance (R) gene cloned from lettuce (Shen ), as well as resistance to root aphid (Ra) (Ellis ) and a quantitative trait locus (QTL) for resistance to anthracnose (ANT1) (McHale ). Analysis of deletion mutants during the map-based cloning efforts to clone Dm3 identified 24 Resistance Gene Candidate 2 (RGC2) family members, most of which mapped within MRC2, spanning a region of at least 3 Mb in cv. Diana (Meyers ). Sequencing of dm3 mutants and transgenic complementation established RGC2B as the gene conferring Dm3 (Shen ). Genetic analysis revealed that this region was not highly recombinogenic but subject to gene conversions and spontaneous mutations associated with chromosomal deletions (Chin ). Analysis of RGC2 sequences from 47 accessions of Lactuca spp. indicated that gene conversions, mutation, and recombination events have been involved in the complex evolutionary history of this locus and distinguished two types of RGC2 genes: fast-evolving Type I RGC2 genes that are the product of frequent sequence exchange, are under diversifying selection, and exhibit low prevalence among different lettuce genotypes and slowly evolving Type II RGC2 genes that have maintained allelic/orthologous relationships and are under balancing selection (Kuang ). A fragment of the LRR-encoding sequence of RGC2B used as the trigger sequence for RNA interference (RNAi) abrogated the resistance mediated by not only Dm3 but also Dm14, Dm16, Dm18, and Ra, suggesting that these linked resistance phenotypes also are encoded by RGC2 family members (Wroblewski ). In this paper we use a combination of genetic, whole-genome, and functional genomic approaches to determine the genome-wide distribution, sequence relationships, and phenotypes of NLR-encoding genes in lettuce. Candidate R genes were identified with motif searches and a maximum-likelihood approach was used to group NLR-encoding genes into 42 families. The lettuce reference genome assembly allowed the analysis of the five MRCs in lettuce. MRC2, MRC3, and MRC8 are the focus of this paper. MRC1 and MRC4 have been described in detail previously (Christopoulou ). A library of 33 RNAi tester stocks was generated and analyzed for members of 12 RGC families; it was used to identify four families involved in 16 resistance specificities against two pathogens. This set of tester stocks is a valuable resource for future analysis of additional resistance phenotypes.

Materials and Methods

Plant materials and DNA isolation

All lettuce accessions were obtained from our collection of germplasm at UC Davis. Plants were grown under greenhouse conditions. DNA isolations were performed on seedlings or leaf tissue with a CTAB protocol (Bernatzky and Tanksley 1986) with minor modifications.

Generation and evaluation of transgenic RNAi tester stocks

Candidate genes were selected for RNAi analysis based on their cosegregation with disease resistance phenotypes and representation of as many diverse gene sequences as possible. This study was initiated before the sequencing and assembly of the lettuce genome. Thus, the majority of candidate genes could not be assigned initially to a particular family due to limited sequence available from the lettuce EST collection; the average length of the cDNA clones was 0.8 kb and the maximum 2 kb. Motif searches of the lettuce ESTs and genomic amplification using degenerate primers were used to generate a library of candidate NLR encoding genes (McHale ). Only sequences that colocalized with mapped resistance specificities were considered as candidates for silencing. The most divergent genes, based on the available sequenced fragments, were selected for RNAi, including at least one candidate RNAi construct for every cluster of mapped resistance specificities. Selected EST clones were obtained from the Arizona Genomics Institute (http://www.genome.arizona.edu/) and used as a template for amplification of RNAi trigger sequences for each gene (supporting information, Table S1). iProof high-fidelity DNA polymerase was used for all DNA amplifications according to manufacturer’s instructions (Bio-Rad Laboratories, Hercules CA). Sequences were validated by Sanger sequencing. One 400- to 500-bp fragment of each candidate gene (Table 1) was cloned as inverted repeats into either the pGollum or pSmeagol vectors. The first set of constructs used pGollum (Wroblewski , 2014) and resulted in the trigger sequence between a 400-bp fragment of the UidA, a beta-glucuronidase (GUS) encoding reporter gene and the PDK intron in an inverted repeat structure (Figure S1). The later set of constructs was made with pSmeagol by the use of the same restriction sites as in pGollum (Figure S1). Initial RNAi constructs were generated with pGollum, and later constructs were generated with pSmeagol because data indicated that the arrangement in pSmeagol (Figure S1) provided greater levels of RNAi (Wroblewski ); however, RNAi was successfully induced with both versions.
Table 1

All RNAi constructs generated and tested targeting NLR-encoding genes, the gene model of the progenitor EST sequences were derived from, gene family they belong to, resistance specificities tested, vector used, predicted number of RGCs targeted for RNAi, and resistance phenotypes abrogated

RNAi construct (Chromosomea)Similar Salinas Gene Model (% Identity), NLR FamilyNumber of RGC Members Predicted to be Targeted by RNAi/Total Number of RGC MembersDm Specificities TestedbEffectors tested by Agro-InfiltrationSilenced PhenotypesbVector
QGC20G02_LRR_RNAid (chr 1)Lsa038417.1 (99.7%), not assignedN/AcDm17, Dm43pGollum
QGD13H21_NB_RNAid (chr 1)Lsa139090.1 (87.6%), RGC1629/62 RGCC16Dm17, Dm43pGollum
QGD7B12_LRR_RNAid (chr1)Lsa037789.1 (99.8%), RGC341/4 RGC34Dm10, Dm43, Dm5/8, Dm17, Dm45, Dm36AvrB, Avrpm1, AvrRpt2pGollum
CLV_S1_Contig142_NB_RNAid (chr 1)Lsa053434.1 (91.1%), RGC118/22 RGC1Dm10, Dm43, Dm5/8, Dm17, Dm45, Dm36AvrB, Avrpm1, AvrRpt2pGollum
QGF16M04_LRR_RNAid (chr 1)Lsa039421.1 (88%), RGC115/22 RGC1Dm10, Dm43, Dm5/8, Dm17, Dm45, Dm36AvrB, Avrpm1, AvrRpt2Dm5/8, Dm45, HR-AvrB, HR-AvrRpm1, HR-AvrRpt2pGollum
QGF20G21_LRR_RNAid (chr 1)Lsa025432.1 (96%), RGC116/22 RGC1Dm10, Dm43, Dm5/8, Dm17, Dm45, Dm36AvrB, Avrpm1, AvrRpt2Dm5/8pGollum
4/16 RGC21
QGD6G21_NB_RNAid (chr 1)Lsa002259.1 (84.1%), RGC1619/62 RGC16Dm10, Dm43, Dm5/8, Dm17, Dm45, Dm36AvrB, Avrpm1, AvrRpt2pGollum
QGC12K15_TIR_RNAid (chr 1)Lsa031265.1 (100%), RGC1620/62 RGC16Dm10, Dm43, Dm5/8, Dm17, Dm45, Dm36AvrB, Avrpm1, AvrRpt2pGollum
CLRX1526_LRR_RNAid (chr 1)Lsa004568.1 (98.5%), RGC1628/62 RGC16Dm10, Dm43, Dm5/8, Dm17, Dm45, Dm36AvrB, Avrpm1, AvrRpt2pGollum
CLRX7678_NB_RNAid (chr 1)Lsa002271.1 (97.8%), RGC1635/62 RGC16Dm10, Dm43, Dm5/8, Dm17, Dm45, Dm36AvrB, Avrpm1, AvrRpt2pGollum
CLSX3769_NB_RNAid (chr 1)Lsa021698.1 (99.3%), RGC1621/62 RGC16Dm10, Dm43, Dm5/8, Dm17, Dm45, Dm36AvrB, Avrpm1, AvrRpt2pGollum
CLSM10181_NB_RNAid (chr 1)Lsa029824.1 (89.7%), RGC214/20 RGC2Dm10, Dm43, Dm5/8, Dm17, Dm45, Dm36AvrB, Avrpm1, AvrRpt2pGollum
LserNBS03_NB_RNAid (chr 1)Lsa021288.1 (100%), RGC2115/28 RGC21 4/22 RGC1Dm10, Dm43, Dm5/8, Dm17, Dm45, Dm36AvrB, Avrpm1, AvrRpt2pGollum
Contig_1306_NB_RNAid (chr 1)Lsa014706.1 (99%), RGC62/2 RGC6Dm10, Dm43, Dm5/8, Dm17, Dm45, Dm36AvrB, Avrpm1, AvrRpt2pSmeagol
Contig_7390_LRR_RNAid (chr 1)Lsa017144.3 (98.87%), not assignedN/AcDm10, Dm43, Dm5/8, Dm17, Dm45, Dm36AvrB, Avrpm1, AvrRpt2pGollum
AY153836_NB_RNAid
(chr 1)Lsa031651.1 (100%), RGC1620/62 RGC16pGollum
CLSS6596_TIR_RNAid (chr 4)Lsa011787.1 (94.6%), RGC1227/55 RGC12Dm4, Dm7, Dm11, Dm44,Dm4, Dm11, Dm44pGollum
1/62 RGC16
CLSM18700_LRR_RNAid (chr 4)Lsa025738.1 (99.4%), RGC1228/55 RGC12Dm4, Dm7, Dm11, Dm44,pGollum
2/6 RGC20
CLSX586_NB_RNAid (chr 4)Lsa007832.3 (99.9%), RGC1239/55 RGC12Dm4, Dm7, Dm11, Dm44, Dm5/8, Dm3Dm4, Dm7, Dm11, Dm44pGollum
1/6 RGC20
LsatNBS09_NB_RNAi (chr 2)Lsa125929.1 (100%), RGC431/42 RGC4Dm1pSmeagol
RGC2B_LRR_RNAie (chr 2)RGC2B11/20 RGC2Dm1, Dm3, Dm2, Dm6, Dm14, Dm15, Dm16, Dm18, RaDm3, Dm14, Dm16, Dm18, RapGollum
RGC2B_NB_RNAi (chr 2)RGC2B12/20 RGC2Dm2, Dm3, Dm6, Dm15, Dm16, Dm18Dm3, Dm6, Dm18pGollum
LsatNBS05_NB_RNAic (chr 2)Lsa009249.1 (99.32%), RGC1827/29 RGC18No specificities testedpGollum
QGC7A16_LRR_RNAi (chr 3)Lsa021288.1 (99.84%), RGC2115/28 RGC21Dm13Dm13pSmeagol
AF017754_NB_RNAi (chr 3)Lsa034516.1 (98.73%), RGC426/42 RGC4Dm13pSmeagol
LserNBS02_NB_RNAi (chr 3)Lsa002429.1 (98.84%), RGC2120/28 RGC21Dm13pSmeagol
AY153833.1_LRR_RNAi (chr 8)Lsa010510.1 (99.65%), RGC156/11 RGC15Dm5/8AvrRps4pSmeagol
LE0395_LRR_RNAi (chr 8)Lsa022448.1 (86.05%), RGC294/5 RGC29Dm5/8AvrRps4pGollum
LEO414_LRR_RNAi (chr 8)Lsa031490.1 (100%), RGC91/4 RGC7AvrRps4pGollum
LEO266_TIR_RNAi (chr 8)Lsa017362.1 (97.6%), RGC46/42 RGC4AvrRps4pGollum
Lsat11_NB_RNAi (chr 8)Lsa088367.1 (99.49), RGC343/3 RGC34AvrPpiCpGollum
Contig5632_TIR_RNAi (chr 8)Lsa022918.1 (85.28%), RGC430/42 RGC4AvrPpiCpSmeagol
QGD14O14_NB_RNAi (chr 8)Lsa083425.1 (89.1%), RGC51/1 RGC5AvrPpiCpSmeagol

RNAi, RNA interference; EST, expressed sequence tag; NLR, nucleotide binding-leucine rich repeat receptor; HR, hypersensitive response; N/A, not applicable.

Chromosome on which the phenotypes tested map to.

Genes expressed in wild-type germplasm and evaluated for abrogration by RNAi construct in stable transgenics.

Targeted NLR encoding gene cosegregates with resistance to lettuce dieback.

Described in Christopoulou .

Described in Wroblewski .

RNAi, RNA interference; EST, expressed sequence tag; NLR, nucleotide binding-leucine rich repeat receptor; HR, hypersensitive response; N/A, not applicable. Chromosome on which the phenotypes tested map to. Genes expressed in wild-type germplasm and evaluated for abrogration by RNAi construct in stable transgenics. Targeted NLR encoding gene cosegregates with resistance to lettuce dieback. Described in Christopoulou . Described in Wroblewski . Validated clones were transferred to Agrobacterium tumefaciens strain LBA4404 (Hoekema ) by electroporation (Dower ). Stable transgenics of cv. Cobham Green (that lacks any of the tested disease resistance phenotypes) expressing each RNAi construct were generated by cocultivation with A. tumefaciens by the U.C. Davis Ralph Pearson Transformation Facility (http://ucdptf.ucdavis.edu/; Michelmore )). Typically 10−15 primary (T1) transgenic lines were generated and evaluated for each construct. Efficacy of silencing was evaluated in transgenic T1 plants and their progeny by the level of silencing of the UidA gene, which is simultaneously targeted for RNAi by these constructs as described previously (Wroblewski , 2007). GUS expression levels were determined using Agrobacterium-mediated transient expression assays (Schöb ). Two leaves were infiltrated for each GUS assay and the assays were repeated at least twice (Christopoulou ). The presence of the RNAi transgene in planta was confirmed by polymerase chain reaction (PCR) using primers TrangeneF and TransgeneR, which amplify a ∼800-bp fragment spanning one half of the inverted repeat structure (Table S1). Primers Le9005L and Le9005R were used as a control to confirm successful PCR amplifications in progeny segregating for the RNAi transgene.

Phenotypic evaluation of RNAi transgenics

T1 lines lacking GUS expression, and therefore presumed to be silenced for the targeted NLR-encoding gene, were selected for assays of resistance. Two or three independent silenced T1 lines per construct were crossed to different genotypes carrying the resistance phenotypes to be tested. The F1s were tested for DM resistance and/or hypersensitive response (HR) elicited by bacterial effectors as well as for silencing of GUS expression (Table 1). The F1s were expected to segregate for the silencing phenotype because the T1 transgenics were hemizygous. On average 10 to 15 F1s, were tested per cross for each resistance phenotype (Figure 1). Resistance to DM was evaluated with B. lactucae isolates diagnostic for specific Dm genes (Table 2) on detached cotyledons and detached leaves as described previously (Christopoulou ). To evaluate HR to bacterial effectors, F1s at the four- to six-leaf stage were pressure infiltrated with A. tumefaciens strain C58 expressing AvrB, AvrRpm1, AvrRpt2, AvrRps4, or AvrPpiC as described previously (Christopoulou ). A C58 strain expressing AvrPto was used as a positive control and PTFS40 strain expressing UidA as the negative control.
Figure 1

Flowchart of the experimental design for the evaluation of RNA interference (RNAi) constructs, in particular QGC7A16_LRR_RNAi. The methodology applies to all constructs described in this study (different genetic background and isolates used in each case, as described in Table 1 and Table 2). (A) Transient beta-glucuronidase (GUS) expression assays were used to evaluate silencing efficiency of primary transgenic lines (T1s). (B) The most efficiently silenced T1s were crossed to cv. Pennlake, carrying Dm13. (C) Fifteen and ten F1s derived from each cross were evaluated for silencing efficiency [GUS assays and (PCR) polymerase chain reaction detection] and resistance to B. lactucae isolate CS12. Pathogenicity assays were performed on detached cotyledons and repeated on leaf disks of 3- to 4-week-old plants. Sporulation was only observed in the presence of the RNAi construct (C2). The presence of the transgene was detected with GUS transient assays (C1) and further confirmed by polymerase chain reaction detection (C3).

Table 2

Lettuce genotypes, their respective resistance specificities, and the diagnostic B. lactucae isolates used to test for LDM resistance

Lettuce Cultivars or Breeding LinesResistance SpecificityHR to Bacterial EffectorsIsolates Tested (Relevant Avr)
CapitanDm11CG1 (Avr11)
CGN14263Dm43, Dm44C01O879 (Avr43)
LSE57/15Dm7R60 (Avr7)
R4T57Dm4C980648ED (Avr4)
DianaDm1, Dm3, Dm5/8, Dm7IM25P11 (Avr3)
CG1 (Avr5/8),
R60 (Avr7)
NinjaDm3, Dm11, Dm36AvrB, AvrRpm1, AvrRpt2CG1 (Avr36)
ValmaineDm5/8, Tu, FusariumAvrPpiC, AvrRps4CG1 (Avr5/8)
UCDM10Dm10C83M47 (Avr10)
LSE102Dm17C01O879 (Avr17)
09G1126Dm43C01O879 (Avr43)
09G952Dm45C01O879 (Avr45)
PennlakeDm13CS12 (Avr13)
Cobham GreenNone known

LDM, lettuce downy mildew; HR, hypersensitive response.

Flowchart of the experimental design for the evaluation of RNA interference (RNAi) constructs, in particular QGC7A16_LRR_RNAi. The methodology applies to all constructs described in this study (different genetic background and isolates used in each case, as described in Table 1 and Table 2). (A) Transient beta-glucuronidase (GUS) expression assays were used to evaluate silencing efficiency of primary transgenic lines (T1s). (B) The most efficiently silenced T1s were crossed to cv. Pennlake, carrying Dm13. (C) Fifteen and ten F1s derived from each cross were evaluated for silencing efficiency [GUS assays and (PCR) polymerase chain reaction detection] and resistance to B. lactucae isolate CS12. Pathogenicity assays were performed on detached cotyledons and repeated on leaf disks of 3- to 4-week-old plants. Sporulation was only observed in the presence of the RNAi construct (C2). The presence of the transgene was detected with GUS transient assays (C1) and further confirmed by polymerase chain reaction detection (C3). LDM, lettuce downy mildew; HR, hypersensitive response.

In silico identification of RNAi targets

BLAST searches (Altschul ) using the fragment of the gene that had been cloned as an inverted repeat were performed to identify potential targets of the trigger sequences in the lettuce genome. The minimum requirement for a target was one or more contiguous sequences of at least 21 nucleotides with 100% identity to the reference genome of cv. Salinas.

Identification and classification of genes involved in pathogen recognition

Predicted gene models of the lettuce reference genome L. sativa cv. Salinas (http://lgr.genomecenter.ucdavis.edu; R. W. Michelmore and S. Reyes-Chin Wo, unpublished data) were screened using Hidden Markov Models (HMMs), using E-value 1 e−10 as the cutoff, to search for genes encoding domains characteristic of NLR, RLK, and ABC transporter proteins that are known to be involved in pathogen recognition in other plant species. The HMMs (acquired from http://pfam.xfam.org/ unless indicated otherwise) were PF00931.17 and NBS_712.hmm (HMM profile available at http://niblrrs.ucdavis.edu/At_RGenes/) for the NB domain, PF01582.15 for TIR, 9 HMMS for the LRR (PF00560.28, PF07723.8, PF007725.7, PF12799.2, PF13306.1, PF13516.1, PF13504.1, PF13855.1 and PF14580.1), PF00069.20 for protein kinase, PF00005.22 for ABC transporters, and PF00400 for the WD40 domain. The CC motifs were predicted with Paircoil2 at P scores below 0.025 (McDonnell ). TM domains were predicted using both TMHMM 2.0 (Krogh ) and SCAMPI (Bernsel ). Gene models with protein kinase, TM and LRR domains were classified as putative RLK encoding genes. The presence of an LRR domain alone classified genes in a separate category since LRR motifs can be found in many functionally unrelated proteins that are not involved in disease resistance (Kobe and Deisenhofer 1994). A few additional genes with domains characteristic of NLR- and RLP-encoding genes were identified, using InterproScan, when an improved genome annotation of the reference genome of cv. Salinas became available at the end of the study. Twenty-eight additional gene models (Table S2) predicted to have a NB domain were combined with the results of the HMM searches to generate the final list of predicted gene models involved in pathogen recognition.

Cladistic analysis of NLR-encoding genes

For analysis of the sequences of putative NLR-encoding genes, the NB domain was extracted with HMMER3 (http://hmmer.org/.) using default parameters with NBS_712.hmm. The amino acid sequences were trimmed to ∼30 aa before the Walker A and 10 aa after the MHD motifs. Multiple-sequence alignments were generated with MAFFT (version 7.1, einsi parameters, 1,00 maximum iterations) (Katoh and Standley 2013) and refined manually with the alignment editor GeneDoc (Nicholas et al. 1997). The sequence identity was estimated by pairwise comparisons by the use of ClustalX over the entire alignment. Maximum-likelihood analyses were conducted with the customizable version of RAxML 8.0 (Stamatakis 2014), hosted on the CIPRES portal (Miller ). The Jones-Taylor-Thornton protein evolutionary model with a GAMMA model of heterogeneity and empirical amino acid frequencies was selected as the best-fitting substitution matrix based on ProtTest 3.3 (Darriba ). Branch support values were calculated by 1000 replicates of fast bootstrapping (Stamatakis ). Genes were assigned to families using a cutoff of 70% nucleotide sequence identity. We used Dendroscope (Huson ) to display the sequence relationships of the RGCs.

Identification of RGC2 sequences in cv. Salinas

Sequencing and assembly of the lettuce reference genome permitted a detailed comparison of the MRC2 structure in cv. Salinas with that previously deduced for cv. Diana (Meyers ). The sequences of 22 RGC2 members obtained from cv. Diana (Meyers ) were searched by BLASTN against the cv. Salinas genome assembly. Presence/absence was determined by reciprocal best BLAST hits at a minimum of 97% identity and 97% of the query represented in the alignment. The lack of a unique best match, in terms of both sequence identity and length of alignment, was recorded as absence of that particular family member in the Salinas genome. To address possibility of miss-assembly, the raw genomic reads from cv. Salinas were mapped back to the query sequences using the CLC Genomics Workbench (http://www.clcbio.com), using parameters: mismatch cost = 2, insertion and deletion cost = 3, minimum length fraction = 0.9 and similarity 1. The genome assembly and the raw reads of cv. Diana were used as a control.

Identification of singleton NLR-encoding genes in other lettuce genotypes

The draft genome assemblies of lettuce cvs. Diana, Valmaine, Greenlake, La Brillante, and PI125246 of L. sativa and accession US96UC23 of the wild progenitor L. serriola (D. Lavelle and R. W. Michelmore, unpublished data) were mined for the presence of singleton NLR encoding genes. The identification of the best BLAST hits was performed as described in the section “Identification of RGC2 sequences in cv. Salinas.”

Statement on data and reagent availability:

All vectors, silencing constructs, and B. lactucae isolates are available upon request. File S1 describes the cloning process for the RNAi vector. Figure S1 displays the orientation of the inverted repeats in both RNAi vectors. Figure S2 shows the distribution of NLR and non-NLR encoding genes as well as the position of the scaffolds and the genetic bins of MRC2 on chromosome 2. Table S1 has all the primer sequences. Table S2 lists all the genes putatively involved in pathogen recognition ordered on chromosomal linkage groups along with all the mapped disease resistance phenotypes. Table S3 shows all the NLR encoding genes predicted to be targeted by the RNAi constructs tested for the resistance phenotypes in MRC8A. Table S4 has all the predicted RNAi targets for LEO266_TIR_RNAi. Table S5 lists all the genes predicted to be silenced by constructs QGC7A16_LRR_RNAi and LserNBS02_NB_RNAi.

Data availability

The lettuce genome sequence is available at https://lgr.genomecenter.ucdavis.edu/. The genome sequences also can be found at GenBank: http://www.ncbi.nlm.nih.gov/bioproject/68025 and http://www.ncbi.nlm.nih.gov/genome/352/.

Results

Genome-wide identification of candidate resistance genes

Motif searches and InterproScan identified 1134 gene models in the reference genome of lettuce cv. Salinas that encoded domains characteristic of proteins involved in pathogen recognition (Table S2). Although 408 gene models were designated initially as NLRs, 23 were classified as genes encoding only LRRs because the NB motifs could not be validated visually. A total of 385 genes were used to generate a multiple sequence alignment of the NB domain for cluster analysis. N- and C-terminal domains were identified for 328 of 385 genes; the remaining 57 lacked one or more characteristic domains, which could reflect the diversity of gene structures present and/or be an artifact of assembling multi-gene families from short reads (Table 3). Forty-five genes were predicted to have a TIR domain but not an NB or LRR. The average length of the coding region for genes containing at least a NB domain (385 in total) was 3064 kb.
Table 3

Classification of the 1134 genes identified with domains characteristic of R genes

Predicted Protein DomainsNo. GenesAcronym
CC, NB, LRR47CNL
TIR, NB, LRR189TNL
NB, LRR (CNL-type)78NcL
NB, LRR (TNL-type)3NtL
NB (CNL-type)40Nc
NB (TNL-type)17Nt
CC, NB2CN
TIR, NB9TN
TIR45T
LRR248L
TM, LRR155RLP
P kinase, LRR28PkinL
P kinase, TM, LRR270RLK
P kinase, TIR1PkinT
WD40, LRR2

CC, coiled-coil; NB, nucleotide binding; LRR, leucine-rich repeat; TIR, TOLL/interleukin-1 receptor; TM, transmembrane.

CC, coiled-coil; NB, nucleotide binding; LRR, leucine-rich repeat; TIR, TOLL/interleukin-1 receptor; TM, transmembrane. A total of 703 LRR-encoding gene models were identified that did not contain a NB, CC, or TIR domain; 425 of these also had a predicted TM domain (Table 3). A protein kinase domain was detected in 298 of these 703 LRR-containing genes, 270 of which also had a TM domain and were therefore classified as putative RLKs. Clusters of RLK encoding genes are located on multiple chromosomes (Figure 2). These clusters are distinct from the clusters of NLR-encoding genes. Thirty-nine RLKs were non-RD kinases, 14 of which clustered within 22 Mb of each other at the opposite end of chromosome 2 to MRC2 (Table S2); no resistance phenotypes are known to map to this RLK cluster. Three RLPs with a predicted TM domain but no kinase domain and high sequence similarity to the tomato Ve gene are located within a 100 kb region of chromosome 9 that cosegregates with resistance to race 1 of Verticillium dahliae (Hayes ).
Figure 2

Genomic distribution of candidate genes involved in disease resistance over the nine chromosomes of lettuce (LG1−LG9; assembly version Lsat_1_v6_lg). Precise coordinates and references are provided in Table S2. Track A: Disease resistance phenotypes; bars reflect the resolution determined by mapping in populations other than the core reference mapping population. Track B: NLR-encoding genes; TNLs are colored green and CNLs blue. Each pixel represents a gene. MRCs are shown as red bars below the genes. The scale bar enumerates each chromosome in Mb. Track C: Genes encoding receptor-like kinases: non - arginine-aspartate (non-RD) RLKs are colored orange. Track D: Genes encoding cytochrome P450 proteins. Track E: Gene density. Track F: Repeat density.

Genomic distribution of candidate genes involved in disease resistance over the nine chromosomes of lettuce (LG1−LG9; assembly version Lsat_1_v6_lg). Precise coordinates and references are provided in Table S2. Track A: Disease resistance phenotypes; bars reflect the resolution determined by mapping in populations other than the core reference mapping population. Track B: NLR-encoding genes; TNLs are colored green and CNLs blue. Each pixel represents a gene. MRCs are shown as red bars below the genes. The scale bar enumerates each chromosome in Mb. Track C: Genes encoding receptor-like kinases: non - arginine-aspartate (non-RD) RLKs are colored orange. Track D: Genes encoding cytochrome P450 proteins. Track E: Gene density. Track F: Repeat density.

Grouping of NLR-encoding genes into 42 multigene families and singletons based on sequence similarity within the NB domain

A maximum-likelihood approach was used to study the sequence relationships of the NLR-encoding genes. Three of the 385 sequences were considered as a single sequence for alignment purposes because they were identical at the amino acid level in the NB region although they differed in their other domains. Clades with at least 70% sequence identity in pairwise comparisons of the NB domain were designated as distinct multigene families. This grouped the 385 NLR-encoding genes into 25 multigene families and 17 singletons (Figure 3). Nomenclature for each family was kept consistent with previous assignations as much as possible (Meyers ; McHale ). Genes lacking a CC domain or TIR domain were designated either as CNL-like or TNL-like based on sequence similarity to the NB domains of CC-NB-LRR- and TIR-NB-LRR-encoding genes respectively.
Figure 3

Dendrogram of lettuce nucleotide binding-leucine rich repeat receptors generated with RaxML. Bootstrap support values on major clades generated by 1000 repetitions. Nomenclature was kept consistent with previously described families (Meyers ; McHale ).

Dendrogram of lettuce nucleotide binding-leucine rich repeat receptors generated with RaxML. Bootstrap support values on major clades generated by 1000 repetitions. Nomenclature was kept consistent with previously described families (Meyers ; McHale ). A larger number of CNL than TNL families and singletons were identified (28 and 14 respectively), although fewer CNL-encoding genes than TNL-encoding genes were identified in total (167 and 21, respectively). Seventeen of the 42 clades were singletons, whereas 12 families contained more than 10 members each and accounted for 84% of the 385 NLR-encoding genes analyzed (Table 4). Three TNL families, RGC4, RGC12, and RGC16, were greatly expanded with more than 40 members each, comprising 73% of all the TNLs in the lettuce reference genome. The largest family was RGC16 with 62 members.
Table 4

Number of NLR encoding genes per family and type

SingletonsNLR typeMultigene FamilyNumber of GenesNLR type
RGC3CNLRGC1829CNL
RGC5CNLRGC2128CNL
RGC10CNLRGC122CNL
RGC22CNLRGC220CNL
RGC24CNLRGC4114CNL
RGC26CNLRGC912CNL
RGC28CNLRGC176CNL
RGC30CNLRGC295CNL
RGC31CNLRGC74CNL
RGC39CNLRGC273CNL
RGC43CNLRGC62CNL
RGC44CNLRGC232CNL
RGC13TNLRGC252CNL
RGC35TNLRGC322CNL
RGC36TNLRGC332CNL
RGC37TNLRGC422CNL
RGC38TNLRGC1662TNL
RGC1255TNL
RGC442TNL
RGC1414TNL
RGC4013TNL
RGC1511TNL
RGC87TNL
RGC206TNL
RGC343TNL

Singletons have only a single member. NLR, nucleotide binding-leucine rich repeat receptor.

Singletons have only a single member. NLR, nucleotide binding-leucine rich repeat receptor. Draft genome assemblies of lettuce cvs. Diana, Valmaine, Greenlake, La Brillante, and PI125246 of L. sativa and accession US96UC23 of the wild progenitor L. serriola were mined for the singletons identified in cv. Salinas. Single homologs with 99–100% sequence identity for the entire length of the coding sequence at the nucleotide level were identified for all singleton RGCs in all six additional genotypes. Therefore, the RGC singletons were highly conserved and did not exhibit presence or absence polymorphism across this diverse germplasm. The complex sequence relationships of the NLR-encoding multigene families make assembly and identification of orthologs difficult; this precluded their facile analysis across these draft assemblies.

Distribution of NLR-encoding genes and MRCs in the genome of lettuce

NLR-encoding genes are found in every chromosome. Chromosomes 5 and 6 have the lowest numbers, with only 13 and 2 such genes, respectively (Figure 2; Table S2). Chromosomes 2 and 3 predominantly contain CNLs (≥50% of all CNLs), whereas chromosomes 1, 4, 5, 7, 8, and 9 contain primarily TNLs (≥89% of all TNLs). Members of each multigene family are typically found in clusters on the same chromosome (at least 68% of the gene families) with the exception of RGC12 that has a genome-wide distribution and RGC14 that is present on four chromosomes. RGC12 is the second largest family with 55 members present on six of the nine chromosomes; 20 members are present on chromosome 9, followed by 19 on chromosome 4, six on chromosomes 2 and 5, and one on chromosomes 6 and 8 (two are unmapped). Eight RGC14 genes are on chromosome 7 with one, two, and three, on chromosomes 1, 5, and 9 respectively. The genome distribution of singletons was varied; RGC37 was located with other NLR-encoding genes; however, the remainder was several Mb away from other NLRs (RGC30 has not been mapped). NLR-encoding genes are concentrated in five MRCs as defined by multiple disease resistance phenotypes on chromosomes 1, 2, 3, 4, and 8 (Table S2; Figure 2) (McHale ; Truco ). The MRCs contain both NLR-encoding genes as well as genes not related to disease resistance. They are in large gene-poor, transposon-rich regions of the cv. Salinas genome. The average gene density for the entire genome is 2.43 genes per 100 kb; however, the gene density within each MRC ranges from 1 to 2.08 and is even lower for the flanking region extending 50 Mb on either end of each MRC (Table 5). MRC1 and MRC4 both span ∼5 Mb and are composed of multiple families of NLR-encoding genes as described earlier (Christopoulou ). MRC2 extends over ∼73 Mb and cosegregates with the large cluster of Dm genes that map to chromosome 2 (Figure 4). MRC3 spans approximately 25 Mb near the telomere of chromosome 3, contains almost exclusively RGC21 genes (Figure 5), and cosegregates with Dm13. Two clusters of NLRs on chromosome 8, MRC8A (Figure S3) and MRC8B (Figure S4), are defined by the disease resistance phenotypes that map to chromosome 8. MRC8A spans ∼52 Mb; MRC8B is smaller and spans only 7 Mb. A third cluster of RGCs spans almost 26 Mb at the end of chromosome 8 and cosegregates with two QTL for resistance to Fusarium and Verticillium wilts respectively (Figure S5). Three distinct clusters of NLR encoding genes, MRC9A (Figure S6), MRC9B, and MRC9C, are present on chromosome 9; each spans approximately 72 Mb, 35 Mb, and 15 Mb. MRC9A cosegregates with two QTL for resistance to DM and the candidate RLK–encoding genes for resistance to Verticillium dahliae. MRC9B consists of a large cluster of 19 RGC12s; however, no known resistance phenotypes have so far been mapped to this locus.
Table 5

Sizes and gene densities of the MRC

MRCNumber of NLRsNumber of Non-NLRsMRC Size, MbTotal Gene Density (Genes/100 kb)Gene Density Of Flanking Regions (Genes/100 kb)
50 Mb before50 Mb after
MRC178788661.302.090.75
MRC261691731.031.82a0.97
MRC322300251.270.860.75
MRC421627631.020.821.26b
MRC8A61126522.081.69c1.44
MRC8B369371.741.681.71
MRC8C26322261.341.280.90
MRC9A111454722.062.55d0.88

The genome-wide average is 2.43 genes/100 kb. MRC, major resistance cluster; NLR, nucleotide binding-leucine rich repeat receptor.

This region is only 1.76 Mb, because MRC2 starts almost at the beginning of the chromosome.

This region is only 41 Mb, because MRC4 is at the end of the chromosome.

This region is only 3.19 Mb, because MRC8A starts almost at the beginning of the chromosome.

This region is only 19.6 Mb, because MRC9A starts almost at the beginning of the chromosome.

Figure 4

Comparison of the MRC2 locus in cvs. Diana (A) and Salinas (B and C). (A) Summary of the data previously published for the region as part of the map-based cloning of Dm3 (Meyers ). The order of 24 RGC2 members in cv. Diana, sequenced from 22 BACs, was based on the deletion breakpoints (shown as red bars) of nine fast neutron–induced dm3 mutants. Genes depicted in green are Type I and in orange Type II genes (Kuang ). RGC2DB is Dm3 and it is a Type I gene. The spacing shown is for illustrative clarity; the physical distance between the BACs is unknown. (B) The Dm3 mediated resistance was originally mapped to a region defined by markers SCV12 and SCI11 (Chin ). This region spans approximately 17 Mb in cv. Salinas. The positions of the nucleotide binding-leucine rich repeat receptor (NLR) encoding genes are shown on the lower bar and they are color-coded with a different color for each gene family. NLR genes that do not belong to the RGC2 gene family are colored light pink (RGC18) and light blue (RGC41). The upper bar shows the distribution of scaffolds for this piece of chromosome. Black represents scaffolds without any predicted genes. The rest of the scaffolds are colored randomly. (C) The distribution of NLRs across the entire MRC2 locus, including the Dm3 region, is shown to scale on the lower bar. The scaffolds are shown on the upper bar.

Figure 5

Graphical representation of the major resistance cluster on chromosome 3 (MRC3) of the reference genome assembly of L. sativa cv. Salinas. The top panel shows the position of nucleotide binding-leucine rich repeat receptor (NLR) encoding genes relative to the genetic bins and the scaffolds, whereas the bottom panel shows the position of all other genes that map to this locus. Color-coding: TNLs are colored in shades of green and CNLs in shades of blue, defense response (DR) genes in red and the remainder of the genes in yellow. The NLRs are further subdivided based on the number of characteristic domains detected. Abbreviations used: N for NB, L for leucine-rich repeat, C for coiled-coil, T for TOLL/interleukin-1 receptor. Lowercase letters used whenever the domain itself was not detected but inferred based on the phylogeny of the NB domain.

The genome-wide average is 2.43 genes/100 kb. MRC, major resistance cluster; NLR, nucleotide binding-leucine rich repeat receptor. This region is only 1.76 Mb, because MRC2 starts almost at the beginning of the chromosome. This region is only 41 Mb, because MRC4 is at the end of the chromosome. This region is only 3.19 Mb, because MRC8A starts almost at the beginning of the chromosome. This region is only 19.6 Mb, because MRC9A starts almost at the beginning of the chromosome. Comparison of the MRC2 locus in cvs. Diana (A) and Salinas (B and C). (A) Summary of the data previously published for the region as part of the map-based cloning of Dm3 (Meyers ). The order of 24 RGC2 members in cv. Diana, sequenced from 22 BACs, was based on the deletion breakpoints (shown as red bars) of nine fast neutron–induced dm3 mutants. Genes depicted in green are Type I and in orange Type II genes (Kuang ). RGC2DB is Dm3 and it is a Type I gene. The spacing shown is for illustrative clarity; the physical distance between the BACs is unknown. (B) The Dm3 mediated resistance was originally mapped to a region defined by markers SCV12 and SCI11 (Chin ). This region spans approximately 17 Mb in cv. Salinas. The positions of the nucleotide binding-leucine rich repeat receptor (NLR) encoding genes are shown on the lower bar and they are color-coded with a different color for each gene family. NLR genes that do not belong to the RGC2 gene family are colored light pink (RGC18) and light blue (RGC41). The upper bar shows the distribution of scaffolds for this piece of chromosome. Black represents scaffolds without any predicted genes. The rest of the scaffolds are colored randomly. (C) The distribution of NLRs across the entire MRC2 locus, including the Dm3 region, is shown to scale on the lower bar. The scaffolds are shown on the upper bar. Graphical representation of the major resistance cluster on chromosome 3 (MRC3) of the reference genome assembly of L. sativa cv. Salinas. The top panel shows the position of nucleotide binding-leucine rich repeat receptor (NLR) encoding genes relative to the genetic bins and the scaffolds, whereas the bottom panel shows the position of all other genes that map to this locus. Color-coding: TNLs are colored in shades of green and CNLs in shades of blue, defense response (DR) genes in red and the remainder of the genes in yellow. The NLRs are further subdivided based on the number of characteristic domains detected. Abbreviations used: N for NB, L for leucine-rich repeat, C for coiled-coil, T for TOLL/interleukin-1 receptor. Lowercase letters used whenever the domain itself was not detected but inferred based on the phylogeny of the NB domain.

Partial conservation of MRC2 structures in cvs. Salinas and Diana

Eight Dm genes and resistance to root aphid (Ra) are clustered near one end of chromosome 2 that genetically define the borders of MRC2 (McHale ) (Table S2). The structure of MRC2 had been previously characterized extensively in cv. Diana as part of the map-based cloning of Dm3 (Meyers ; Shen ). A total of 23 BAC clones from cv. Diana were isolated containing 24 RGC2 family members, 16 of which were ordered using the breakpoints in a series of deletion mutants (Meyers ). Also, seven additional RGC2 members were partially sequenced from multiple genotypes to study the evolution of this cluster (Kuang ). We revisited the structure of the MRC2 cluster taking into consideration the genome assembly of cv. Salinas. The total size of MRC2 in cv. Salinas is ∼73 Mb and contains 754 genes of which 61 are NLRs, 2 are RLKs, and 5 are putative RLPs encoding genes (Figure 4). Dm3 was originally mapped in cv. Diana to a region that spans approximately 17 Mb within MRC2 in cv. Salinas. Many rearrangements of the RGC2 genes are evident between the two haplotypes (Figure 4). Fourteen of 21 RGC2 genes present in Diana could be identified in the MRC2 assembly of cv. Salinas; 12 of these had been previously classified as conserved Type II genes in cv. Diana. Variable Type I genes, which are primarily found in the middle section of MRC2 in cv. Diana, are largely absent in cv. Salinas, possibly due to a large deletion. Only RGC2SE and RGC2SN Type I genes in cv. Diana had reciprocal best hits with single genes in cv. Salinas. There are two additional NLR genes in cv. Salinas that had high levels of sequence identity to multiple Type I genes from cv. Diana and therefore orthologous relationships could not be established. These results support the heterogeneous evolution of NLR genes at this locus with genes at the periphery evolving more slowly than genes in the middle of the cluster.

Differential silencing of NLRs encoded at MRC2 by NB and LRR trigger sequences

The role of RGC2 gene family members in determining resistance specificities cosegregating with Dm3 had been studied previously using RNAi (Wroblewski ). Not only Dm3, but also Dm14, Dm16 and Dm18, as well as Ra were abrogated when an RNAi construct targeting the LRR-encoding domain of RGC2B was tested in genotypes expressing each of these resistance phenotypes. Sequence identities typically differ between NB and LRR domains, with NB domains tending to be more conserved than LRR domains. Therefore, a fragment of the NB domain of RGC2B was used as the trigger sequence for RNAi to determine whether more, the same, or a different spectrum of Dm specificities were abrogated. Transgenics were generated in cv. Diana and T1 lines were evaluated for RNAi efficacy. The Diana T2s tested for Dm3-mediated resistance segregated for the resistant phenotype as expected from hemizygous T1s. Presence of the transgene, confirmed by both PCR and GUS expression assays, was absolutely coincident with loss of resistance (Figure 1). The two cv. Diana transgenics with the greatest levels of silencing were crossed to different genotypes to test for Dm2, Dm6, Dm15, Dm16, and Dm18 specificities. All the F1s tested segregated for the presence of the RNAi construct. Dm3 and Dm18 were abrogated by trigger sequences derived from both the NB and LRR domains of Dm3; however, Dm14 and Dm16 were abrogated by the LRR-derived trigger sequence but not the NB-derived trigger sequence (Dm14 was not tested with the NB-RNAi construct). Conversely, Dm6 was compromised by the NB-derived trigger sequences but not those derived from the LRR. This was surprising because NB domains tend to be more conserved than LRR domains; this suggests that Dm16 is a chimeric Type I gene with a LRR domain that is more closely related to Dm3 than Dm6 but with a more distantly related NB domain.

Structure and functional analysis of NLRs encoded at MRC3 in cv. Salinas

MRC3 is a tight cluster consisting primarily of RGC21 family members on chromosome 3 that cosegregates with Dm13. It spans ∼25 Mb and contains 322 genes of which 31 are predicted to be involved in pathogen recognition: 27 CNL (26 RGC21, 1 RGC40), two RLK, and two LRR encoding genes (Figure 5; Table 5). Transgenics of cv. Cobham Green were generated for two constructs that targeted members of the RGC21 family of NLR-encoding genes that mapped to chromosome 3. These trigger sequences were derived from ESTs that were subsequently shown to encode the NB and LRR domains of different RGC21 family members (Table 1). Two T1 transgenic lines per construct were silenced for GUS and therefore presumably silenced for the targeted NLR-encoding gene(s). F1s of crosses to introduce the transgenes into cv. Pennlake carrying Dm13 segregated for the transgene as evidenced by both assays of transient expression of GUS and PCR amplification of the transgene. The F1s of Pennlake x QGC7A16_LRR_RNAi also segregated 1:1 for susceptibility:resistance to isolate CS12 of B. lactucae. The loss of resistance cosegregated perfectly with the presence of the transgene for F1s derived from two independent primary transgenics (Figure 1). Therefore, QGC7A16 or another RGC21 family member(s) confers Dm13. However, LserNBS02_NB_RNAi that also targeted RGC21 family members did not abrogate Dm13-mediated resistance, even though F1s carrying the transgene were effectively silenced for GUS expression. QGC7A16_LRR_RNAi was predicted as potentially targeting 16 of 28 RGC21 family members and eight other genes encoding only an LRR domain, whereas LserNBS02_NB_RNAi was predicted to target 26 of 33 members of the RGC21 family, 3 RGC1 members and 3 unassigned CNLs. All of NLR-encoding genes predicted to be silenced by QGC7A16_LRR_RNAi also were predicted to be silenced by LserNBS02_NB_RNAi (Table S5). The lack of abrogation of resistance by LserNBS02_NB_RNAi despite it being predicted to target most known RGC21 members could be explained by additional diverse RGC21 genes unique to cv. Pennlake that are not present in the cv. Salinas assembly; however, both Salinas and Pennlake exhibit Dm13 specificity. In addition to silencing NLR-encoding genes, QGC7A16_LRR_RNAi was predicted to silence eight LRR encoding genes that lack an NB domain. Seven out of the eight LRR encoding genes only targeted by QGC7A16_LRR_RNAi have high sequence similarity to other RGC21 members. These could either be RGC21 genes that are improperly assembled in the reference assembly of cv. Salinas or correctly assembled genes that lack an NB domain, one or more of which can confer Dm13 specificity. Alternatively, Dm13 may be encoded by a full length RGC21 gene and inadequate levels of silencing occurred from LserNBS02_NB_RNAi.

Structure and functional analysis of NLRs encoded at MRC8

There are three MRCs on chromosome 8 that cosegregate with different resistance-related phenotypes. Agrobacterium-mediated transient expression of bacterial effectors AvrRps4 and AvrPpiC triggers a HR in several cultivars. MRC8A is a small cluster spread over a 54-Mb region on chromosome 8 in cv. Salinas that cosegregates with HR to AvrPpiC in cv. Valmaine. MRC8A has a total of 1132 genes, of which 23 genes are potentially involved in pathogen recognition: four TNLs (RGC4) and two CNLs (RGC27), eight RLKs, four RLPs, one TIR-, and four LRR-encoding genes (Figure S3; Table 5). Three constructs were tested for abrogation of HR to AvrPpiC: Lsat11_NB_RNAi, Contig5632_TIR_RNAi, and QGD14014_NB_RNAi (Table 1). These were predicted to potentially silence members of RGC34, RGC4, and RGC5, respectively (Table S3). Two RNAi transgenic lines exhibiting silencing per construct were crossed to cv. Valmaine and the hybrids were tested for silencing and the response to the AvrPpiC. All the tested F1 progenies segregated for high levels of silencing of GUS coincident with the presence of the transgene. A strong HR occurred in both the presence and the absence of the transgene. Therefore, there is no evidence that any of these NLRs determine the HR to AvrPpiC. MRC8B is a dense cluster cosegregating with the HR to AvrRps4. It spans ∼7 Mb in cv. Salinas and contains 127 genes, 36 of which are RGC4 members (Figure S4; Table 5). Four constructs were tested: LEO266_TIR_RNAi (RGC4), LEO414_LRR_RNAi (RGC9), AY153833.1_LRR_RNAi (RGC15), and LEO_395_LRR_RNAi (RGC29) (Table 1). The last three constructs were tested because at the time of the experiments genetic data suggested that all of these RGCs co-segregated with HR to AvrRps4; however, they were subsequently shown not to be at this location in the whole genome assembly. Two independent transgenic lines for each construct were crossed to cv. Valmaine. The leaves of F1s were pressured infiltrated with A. tumefaciens cultures expressing AvrRps4. The F1s segregated for the transgene as shown by PCR and GUS transient assays; however, all of them retained a strong HR to AvrRps4 regardless of the silencing and the presence of the transgene (Table S4). Therefore, there is no evidence that members of RGC4, RGC9, or RGC15, RGC29 targeted by these constructs have a role in the recognition of AvrRps4 in lettuce. In silico predictions of RNAi targets for these constructs showed that there are 3 and 30 NLR-encoding genes within MRC8A and MRC8B respectively that were unlikely to be silenced by these constructs (Table S3; Table S4). These are candidates for additional RNAi targets to test their involvement in HR to AvrPpiC and AvrRps4. MRC8C is a complex cluster that spans 25 Mb and contains 348 genes, 26 of which are NLR-encoding genes (Table 5). There are 9 RGC15, 11 RGC9, 4 RGC29, 1 RGC7, and 1 RGC43 members (Table S2; Figure S5). A QTL to Fusarium oxysporum and a minor QTL for resistance to V. dahliae cosegregate with MRC8C. These resistance phenotypes have to be assessed in the field and therefore no constructs were tested for these phenotypes.

Discussion

This paper provides a detailed description of the genome-wide architecture of disease resistance in lettuce. Thirty-six major phenotypic genes and 20 QTLs for resistance to 10 pathogens and one pest have been positioned relative to 1134 candidate genes in genomic scaffolds assigned to genetic bins ordered along the nine chromosomal linkage groups (Table S2). Most but not all resistance phenotypes are located with arrays of NLR-encoding genes. Conversely, of the 42 families of NLR-encoding genes in L. sativa cv. Salinas, 26 were co-located with resistance phenotypes. We therefore defined five MRCs as regions of the genome enriched for determinants of resistance based on both phenotypic information from a variety of genotypes and the genomic distribution of NLR-encoding genes. This clustered distribution is similar to that observed in other species (Zhang ). The total number of NLR-encoding genes observed in cv. Salinas (385) is in the middle of the range (54−1015) observed in other plant species (Meyers ; Zhou ; Kohler ; Porter ; Arya ; Shao ). We detected more TNL than CNL genes in total but fewer TNL than CNL families; this is due the expansion of three TNL families, RGC4, RGC12, and RGC16, that make up 68% of all NLR-encoding genes. Seventeen families were only represented by singletons, of which 12 were CNLs and 5 were TNLs (Table 4). Most families tend to be colocated in the same MRC, likely reflecting expansion by unequal crossing over (Chin ; Meyers ; Zhou ). The largest family, RGC16, has 62 members, of which 60 are at MRC1 (Table S2). However, there are exceptions to the colocation of family members. RGC14 is distributed in two clusters (on chromosomes 7 and 9), possibly as a consequence of a segmental duplication event. The second largest family, RGC12, with 55 members is widely distributed through the genome and present on six chromosomes. The multiple chromosomal locations of large families may reflect grouping of more slowly evolving genes into single families based on our 70% threshold for sequence identity. Several clusters are homogeneous, comprising of a single RGC family (MRC1, MRC4, MRC3, MRC8A, MRC8B). Some are complex comprising of multiple TNLs and CNLs; the most complex was MRC2 that had representatives of three TNL and three CNL families (Table S2). All MRCs contained genes other than NLR-encoding genes; however, they were not enriched for any classes of genes, other than NLR-encoding genes. The majority of the data presented are based on the reference genome of cv. Salinas. However, NLR-encoding gene repertoires can differ greatly between genotypes; the number of RGC2 family members can vary from ∼12 to 42 (Kuang ). The numbers detected in the genome assemblies of cvs. Salinas and Diana were in the middle of this range (20 and 29, respectively). Although all of the conserved Type II genes were present in both, cv. Diana had more of the variable Type I genes; however, the order of the Type II genes apparently differs between the haplotypes. Therefore, while the general architecture observed in cv. Salinas may be representative of L. sativa, the fine-scale details may be specific to individual genotypes; this needs to be taken into consideration when implementing functional studies and cloning strategies. Functional studies using RNAi successfully assigned 16 resistance phenotypes to four RGC families. The selection of RGCs targeted for RNAi predated the sequencing and assembly of the reference genome and was based on ESTs of NLR-encoding genes that had been mapped relative to disease resistance phenotypes (McHale ). This provided RNAi transgenics targeting 12 of the 42 RGC families representing most of the major gene families and the majority of NLRs that cosegregate with resistance phenotypes. Members of RGC1 are required for the Dm5/8- and Dm45-mediated resistance as well as for the HR to AvrB, AvrRpm1, and AvrRpt2, that map to MRC1. RGC12 family members are required for Dm4, Dm7, Dm11, and Dm44 specificities in MRC4 as described previously (Christopoulou ). For MRC2 and MRC4, members of the largest RGC family present were shown to be required for the co-segregating resistance specificities. Interestingly, six constructs predicted to target all but four members of the largest family at MRC1, RGC16, failed to abrogate any of the nine tested phenotypes that map to this cluster. RNAi was an efficient method to assign resistance phenotypes to RGC families. Fragments of the NB, LRR and TIR domains were used as trigger sequences for RNAi for the 33 constructs used in this study. Abrogation of resistance was observed with all three types of trigger sequences. The different regions appeared to confer RNAi with similar efficacy; 1 of 4 TIR, 2 of 17 NB, and 4 of 12 LRR trigger sequences abrogated resistance. Although the NB domain is the most conserved NLR domain, targeting of the TIR and the LRR domains also resulted in silencing of multiple family members. When multiple trigger sequences targeted the same gene, as for Dm3, overlapping sets of specificities were abrogated. This provides the opportunity for designing trigger sequences that can abrogate specific subsets of family members. These 33 transgenic RNAi lines are resources for assigning additional resistance phenotypes to RGC families. Also, further lines can be generated to target more of the RGC families within MRCs with resistance phenotypes for which causal RGCs have not been identified, such as MRC1. After the sequence analysis and the ultradense map were refined it is evident that some existing RNAi constructs could also be tested for other specificities that cosegregate with RGCs of the targeted family. For example constructs Contig5632_TIR_RNAi and AF017754_NB_RNAi were tested for phenotypes that map to MRC8A and MRC3 respectively but target a large subset of RGC4 members and would be good tester stocks for the HR to AvrRps4 (MRC8B). The RNAi strategy that we have implemented provides an efficient approach to attribute phenotypes to subsets of a multigene RGC family; however, achievement of single gene resolution is challenging by RNAi methods. Nonetheless, it does provide candidates for knockouts of individual genes using CRISPR/Cas9 technology and valuable information for the development of robust markers to facilitate breeding efforts. Overall, resistance phenotypes were shown to be encoded by both CNL and TNL multigene families. No resistance phenotype was encoded by a singleton such as the allelic series observed at the L locus in flax (Ellis ). Interestingly, when a Dm specificity was abrogated, other resistance phenotypes that mapped to the same locus were also compromised indicating that the same RGC family encoded multiple Dm genes. Conversely, in all cases in which silencing of a Dm specificity occurred, the causal RGC family contained more than 20 members. Together, these data suggest that the expansion of specific RGC families in lettuce is in response to evolutionary pressures exerted by variation in B. lactucae.
  55 in total

1.  Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes.

Authors:  A Krogh; B Larsson; G von Heijne; E L Sonnhammer
Journal:  J Mol Biol       Date:  2001-01-19       Impact factor: 5.469

2.  Dm3 is one member of a large constitutively expressed family of nucleotide binding site-leucine-rich repeat encoding genes.

Authors:  Katherine A Shen; Doris B Chin; Rosa Arroyo-Garcia; Oswaldo E Ochoa; Dean O Lavelle; Tadeusz Wroblewski; Blake C Meyers; Richard W Michelmore
Journal:  Mol Plant Microbe Interact       Date:  2002-03       Impact factor: 4.171

Review 3.  How to build a pathogen detector: structural basis of NB-LRR function.

Authors:  Frank L W Takken; Aska Goverse
Journal:  Curr Opin Plant Biol       Date:  2012-06-01       Impact factor: 7.834

Review 4.  Plant and animal sensors of conserved microbial signatures.

Authors:  Pamela C Ronald; Bruce Beutler
Journal:  Science       Date:  2010-11-19       Impact factor: 47.728

5.  Gene conversion and the evolution of three leucine-rich repeat gene families in Arabidopsis thaliana.

Authors:  Mariana Mondragon-Palomino; Brandon S Gaut
Journal:  Mol Biol Evol       Date:  2005-08-24       Impact factor: 16.240

Review 6.  Clusters of resistance genes in plants evolve by divergent selection and a birth-and-death process.

Authors:  R W Michelmore; B C Meyers
Journal:  Genome Res       Date:  1998-11       Impact factor: 9.043

7.  Silencing of transgenes introduced into leaves by agroinfiltration: a simple, rapid method for investigating sequence requirements for gene silencing.

Authors:  H Schöb; C Kunz; F Meins
Journal:  Mol Gen Genet       Date:  1997-11

8.  Genome-wide analysis of Carica papaya reveals a small NBS resistance gene family.

Authors:  Brad W Porter; Maya Paidi; Ray Ming; Maqsudul Alam; Wayne T Nishijima; Yun J Zhu
Journal:  Mol Genet Genomics       Date:  2009-03-05       Impact factor: 3.291

9.  Silencing of the major family of NBS-LRR-encoding genes in lettuce results in the loss of multiple resistance specificities.

Authors:  Tadeusz Wroblewski; Urszula Piskurewicz; Anna Tomczak; Oswaldo Ochoa; Richard W Michelmore
Journal:  Plant J       Date:  2007-06-22       Impact factor: 6.417

10.  An Ultra-High-Density, Transcript-Based, Genetic Map of Lettuce.

Authors:  Maria José Truco; Hamid Ashrafi; Alexander Kozik; Hans van Leeuwen; John Bowers; Sebastian Reyes Chin Wo; Kevin Stoffel; Huaqin Xu; Theresa Hill; Allen Van Deynze; Richard W Michelmore
Journal:  G3 (Bethesda)       Date:  2013-04-09       Impact factor: 3.154

View more
  13 in total

1.  Variability among Cucurbitaceae species (melon, cucumber and watermelon) in a genomic region containing a cluster of NBS-LRR genes.

Authors:  Jordi Morata; Pere Puigdomènech
Journal:  BMC Genomics       Date:  2017-02-08       Impact factor: 3.969

2.  Transfer of tomato immune receptor Ve1 confers Ave1-dependent Verticillium resistance in tobacco and cotton.

Authors:  Yin Song; Linlin Liu; Yidong Wang; Dirk-Jan Valkenburg; Xianlong Zhang; Longfu Zhu; Bart P H J Thomma
Journal:  Plant Biotechnol J       Date:  2017-11-15       Impact factor: 9.803

3.  Inflorescence Development and the Role of LsFT in Regulating Bolting in Lettuce (Lactuca sativa L.).

Authors:  Zijing Chen; Yingyan Han; Kang Ning; Yunyu Ding; Wensheng Zhao; Shuangshuang Yan; Chen Luo; Xiaotang Jiang; Danfeng Ge; Renyi Liu; Qian Wang; Xiaolan Zhang
Journal:  Front Plant Sci       Date:  2018-01-18       Impact factor: 5.753

4.  Genome reconstruction in Cynara cardunculus taxa gains access to chromosome-scale DNA variation.

Authors:  Alberto Acquadro; Lorenzo Barchi; Ezio Portis; Giulio Mangino; Danila Valentino; Giovanni Mauromicale; Sergio Lanteri
Journal:  Sci Rep       Date:  2017-07-17       Impact factor: 4.379

5.  Genome assembly with in vitro proximity ligation data and whole-genome triplication in lettuce.

Authors:  Sebastian Reyes-Chin-Wo; Zhiwen Wang; Xinhua Yang; Alexander Kozik; Siwaret Arikit; Chi Song; Liangfeng Xia; Lutz Froenicke; Dean O Lavelle; María-José Truco; Rui Xia; Shilin Zhu; Chunyan Xu; Huaqin Xu; Xun Xu; Kyle Cox; Ian Korf; Blake C Meyers; Richard W Michelmore
Journal:  Nat Commun       Date:  2017-04-12       Impact factor: 14.919

6.  Effector-mediated discovery of a novel resistance gene against Bremia lactucae in a nonhost lettuce species.

Authors:  Anne K J Giesbers; Alexandra J E Pelgrom; Richard G F Visser; Rients E Niks; Guido Van den Ackerveken; Marieke J W Jeuken
Journal:  New Phytol       Date:  2017-08-21       Impact factor: 10.151

Review 7.  The Enhancement of Plant Disease Resistance Using CRISPR/Cas9 Technology.

Authors:  Virginia M G Borrelli; Vittoria Brambilla; Peter Rogowsky; Adriano Marocco; Alessandra Lanubile
Journal:  Front Plant Sci       Date:  2018-08-24       Impact factor: 5.753

8.  Genome-wide functional analyses of plant coiled-coil NLR-type pathogen receptors reveal essential roles of their N-terminal domain in oligomerization, networking, and immunity.

Authors:  Tadeusz Wróblewski; Laurentiu Spiridon; Eliza Cristina Martin; Andrei-Jose Petrescu; Keri Cavanaugh; Maria José Truco; Huaqin Xu; Dariusz Gozdowski; Krzysztof Pawłowski; Richard W Michelmore; Frank L W Takken
Journal:  PLoS Biol       Date:  2018-12-12       Impact factor: 8.029

9.  Assessing the performance of different irrigation systems on lettuce (Lactuca sativa L.) in the greenhouse.

Authors:  Zijing Chen; Yingyan Han; Kang Ning; Chen Luo; Wei Sheng; Shenglin Wang; Shuangxi Fan; Yanfang Wang; Qian Wang
Journal:  PLoS One       Date:  2019-02-04       Impact factor: 3.240

10.  Exploring sunflower responses to Sclerotinia head rot at early stages of infection using RNA-seq analysis.

Authors:  Mónica I Fass; Máximo Rivarola; Norma B Paniego; Verónica V Lia; Guillermo F Ehrenbolger; Carla A Maringolo; Juan F Montecchia; Facundo Quiroz; Francisco García-García; Joaquín Dopazo Blázquez; H Esteban Hopp; Ruth A Heinz
Journal:  Sci Rep       Date:  2020-08-07       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.