| Literature DB >> 29467425 |
Jose V Die1, Belén Román2, Xinpeng Qi3, Lisa J Rowland3.
Abstract
Blueberry is an important crop worldwide. It is, however, susceptible to a variety of diseases, which can lead to losses in yield and fruit quality. Although screening studies have identified resistant germplasm for some important diseases, still little is known about the molecular basis underlying that resistance. The most predominant type of resistance (R) genes contains nucleotide binding site and leucine rich repeat (NBS-LRR) domains. The identification and characterization of such a gene family in blueberry would enhance the foundation of knowledge needed for its genetic improvement. In this study, we searched for and found a total of 106 NBS-encoding genes (including 97 NBS-LRR) in the current blueberry genome. The NBS genes were grouped into eleven distinct classes based on their domain architecture. More than 22% of the NBS genes are present in clusters. Ten genes were mapped onto seven linkage groups. Phylogenetic analysis grouped these genes into two major clusters based on their structural variation, the first cluster having toll and interleukin-1 like receptor (TIR) domains and most of the second cluster containing a coiled-coil domain. Our study provides new insight into the NBS gene family in blueberry and is an important resource for the identification of functional R-genes.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29467425 PMCID: PMC5821885 DOI: 10.1038/s41598-018-21738-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Number and classification of NBS-encoding genes in the blueberry genome.
| Predicted protein domain | Letter code | N. of genes |
|---|---|---|
|
| ||
| TIR-NBS-LRR subclass | ||
| TIR-NBS-LRR | TNL | 10 |
| TIR-CC-NBS-LRR | TCNL | 1 |
| non-TIR-NBS-LRR subclass | ||
| CC-NBS-LRR | CNL | 53 |
| X-NBSCC-LRR | XNL | 20 |
| NBSCC-LRR | NL | 10 |
| CC-NBS-CC-LRR | CNCL | 1 |
| NBS-LRR-NBS-LRR | NLNL | 1 |
| RPW8-NBS-LRR | RNL | 1 |
|
| ||
| CC-NBS | CN | 5 |
| TIR-CC-NBS | TCN | 2 |
| X-NBS | XN | 2 |
|
|
| |
TIR Toll/interleukin-1-receptor, CC coiled-coil domain, LRR leucine-rich repeat domain, NBS nucleotide-binding site. CN, TCN and XN are structurally incomplete genes. NL and XNL are sequences with the N-terminal region comparable in length to intact CNL but we could not identify the CC domain based on the COILS program. The CC domain was detected in XNL sequences only by the Conserved Domain database at NCBI.
Figure 1Analysis of the N-terminal domain in non-TNL sequences. (a) Amino acid position of predicted CC motif. (b) Regular expression of the 30–60 amino acids region from blueberry CNL sequences. Sequence logo representation was generated from multiple alignments with MEME software.
Figure 2Comparative analysis between blueberry NBS sequences and other species. (a) Boxplot of identity distribution scores by plant RefSeq database. Blueberry NBS sequences were used as queries against the RefSeq database and the identity from the best-matching NBS protein for each blueberry sequence was recorded. Figure shows the five species with higher number of hits. (b) Species tree of some species in which homologs of NBS-LRR genes have been identified. The data were downloaded from NCBI Common Tree in the Taxonomy section (http://www.ncbi.nlm.nih.gov/taxonomy) and the tree was constructed using the R package “ape”[30].
Figure 3Position of 10 NBS-LRR genes on the blueberry linkage groups. Linkage map of F1#10 × W85-23 diploid population. First and last marker on each linkage group (cM) are shown as references. Positions of NBS genes are shown in red color. Only linkage groups with NBS markers are shown.
Organization in families of NBS-encoding genes in five plant genomes.
| Organization | Blueberry |
| Grapevinea | Poplara | Chestnutb | |||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 70% | 80% | 70% | 80% | 70% | 80% | 70% | 80% | 70% | 80% | 70% | 80% | |
| Single-genes | 50 | 68 | 93 | 134 | 119 | 172 | 91 | 106 | 246 | 304 | 182 | 241 |
| Multi-genes | 56 | 38 | 81 | 40 | 416 | 363 | 325 | 310 | 273 | 215 | 170 | 111 |
| Gene families | 18 | 14 | 25 | 15 | 94 | 102 | 61 | 64 | 64 | 61 | 49 | 39 |
| Max. family members | 6 | 4 | 7 | 4 | 13 | 10 | 23 | 17 | 19 | 17 | 8 | 7 |
| Avg. family members | 3.11 | 2.71 | 3.24 | 2.67 | 4.43 | 3.56 | 5.33 | 4.84 | 4.27 | 3.52 | 3.47 | 2.85 |
| Multi-genes/single genes | 1.12 | 0.56 | 0.87 | 0.30 | 3.50 | 2.11 | 3.57 | 2.92 | 1.11 | 0.71 | 0.93 | 0.46 |
| % Multi-gene families | 52.8 | 35.8 | 46.6 | 23.0 | 77.8 | 67.9 | 78.1 | 74.5 | 52.6 | 41.4 | 48.3 | 31.5 |
| % TNL multi-genes | 18.2 | 0 | 14.8 | 7.4 | 53.6 | 29.4 | ||||||
| % non-TNL multi-genes | 62.8 | 44.2 | 54.7 | 43.3 | 44.2 | 33.1 | ||||||
aData from[24].
bData from[61].
cData from[25].
Figure 4Phylogenetic tree of NBS genes in the blueberry genome. The tree is based on the maximum likelihood method using MEGA software. Numbers on the branches indicate the percentage of 1000 bootstrap replicates. Gene names are intended to represent blueberry scaffolds and domain configurations. Numbers between brackets denote more than one gene per scaffold.
Conserved cis-regulatory elements (CRE) in blueberry NBS-encoding gene promoters.
| CRE | Motif | Query1 | Promoters Observed | Total Occurrences Observed2 | Avg. Occurrs. per Promoter | Total Occurrences Expected3 | Enrichment Factor4 | |
|---|---|---|---|---|---|---|---|---|
| GT1GMSCAM4 | GAAAAA | 65 | 48 | 102 | 2.13 | 50.64 | 2.01 | 0.011 |
| ABRE | ACGTGTC | 65 | 6 | 8 | 1.33 | 4.10 | 1.95 | 0.009 |
| MYCATERD1 | CATGTG | 65 | 26 | 31 | 1.19 | 18.69 | 1.66 | 0.010 |
| Motif CTCTT | CTCTT | 65 | 58 | 172 | 2.97 | 117.36 | 1.47 | 0.014 |
| CAATBOX 1 | CCAAT | 65 | 56 | 133 | 2.38 | 107.22 | 1.24 | 0.015 |
| G-box | CACATG | 65 | 21 | 24 | 1.14 | 20.05 | 1.20 | 0.012 |
| DOFCOREZM | AAAG | 65 | 65 | 621 | 9.55 | 520.10 | 1.19 | 0.015 |
| ATHB5 | CAATNATTG | 65 | 4 | 4 | 1.00 | 3.41 | 1.17 | 0.013 |
| RAV1AAT | CAACA | 65 | 54 | 111 | 2.06 | 104.03 | 1.07 | 0.015 |
| CACTFTPPCA1 | YACT | 65 | 65 | 727 | 11.18 | 930.09 | 0.78 | — |
1Total number of promoters in the query set.
2Total number of motifs in the query set.
3Total number of motifs expected to occur by chance/1.5 kb promoter based on nucleotide frequency in 65 blueberry promoter sequences.
4Number of motifs observed divided by the number of motifs expected to occur by chance.
5Probabilities based on 2,000 Monte Carlo simulations.
Figure 5Distribution of CREs in the blueberry promoters data set and simulated control set. The control dataset is based on 2000 simulations, where each simulation contains n = 65 sequences, with length = 1500 bp per sequence, and an expected frequency GC = 0.37.