| Literature DB >> 27557478 |
James K Hane1,2,3, Yao Ming4, Lars G Kamphuis1,5, Matthew N Nelson5,6, Gagan Garg1, Craig A Atkins5,6, Philipp E Bayer6, Armando Bravo7, Scott Bringans8, Steven Cannon9,10, David Edwards6,11, Rhonda Foley1, Ling-Ling Gao1, Maria J Harrison7, Wei Huang10, Bhavna Hurgobin6,11, Sean Li12, Cheng-Wu Liu13, Annette McGrath12, Grant Morahan14, Jeremy Murray12, James Weller15, Jianbo Jian4, Karam B Singh1,5.
Abstract
Lupins are important grain legume crops that form a critical part of sustainable farming systems, reducing fertilizer use and providing disease breaks. It has a basal phylogenetic position relative to other crop and model legumes and a high speciation rate. Narrow-leafed lupin (NLL; Lupinus angustifolius L.) is gaining popularity as a health food, which is high in protein and dietary fibre but low in starch and gluten-free. We report the draft genome assembly (609 Mb) of NLL cultivar Tanjil, which has captured >98% of the gene content, sequences of additional lines and a dense genetic map. Lupins are unique among legumes and differ from most other land plants in that they do not form mycorrhizal associations. Remarkably, we find that NLL has lost all mycorrhiza-specific genes, but has retained genes commonly required for mycorrhization and nodulation. In addition, the genome also provided candidate genes for key disease resistance and domestication traits. We also find evidence of a whole-genome triplication at around 25 million years ago in the genistoid lineage leading to Lupinus. Our results will support detailed studies of legume evolution and accelerate lupin breeding programmes.Entities:
Keywords: Genistoids; Legume comparative genomics; flowering time genes; polyploidy; synteny; whole-genome assembly
Mesh:
Substances:
Year: 2016 PMID: 27557478 PMCID: PMC5316927 DOI: 10.1111/pbi.12615
Source DB: PubMed Journal: Plant Biotechnol J ISSN: 1467-7644 Impact factor: 9.803
Summary of the narrow‐leafed lupin cv. Tanjil genome assembly. The assembly comprises scaffolds, the majority of which have been placed into pseudochromosomes based on genetic map and synteny data
| Assembly statistics | Total length (bp) | Average length (bp) | Maximum length (bp) | Minimum length (bp) | N50 | N50 length | Total # sequences | Total unknown N bases |
|---|---|---|---|---|---|---|---|---|
| Contigs | 810 353 784 | 758 | 922 429 | 100 | 4 246 | 45 646 | 1 068 669 | 0 |
| Scaffolds | 609 123 749 | 42 362 | 4 089 732 | 200 | 232 | 703 185 | 14 379 | 4 078 848 |
| Pseudochromosomes | 470 424 067 | 23 521 203 | 36 457 581 | 16 251 777 | 8 | 24 697 652 | 20 | 3 351 285 |
| Unplaced scaffolds | 138 780 182 | 10 239 | 1 472 692 | 200 | 610 | 45 366 | 13 554 | 808 063 |
Summary of transposon content in the narrow‐leafed lupin cv. Tanjil genome assembly
| Repbase TEs | TE Proteins |
| Combined TEs | % in Genome | Length (bp) | % in Genome | ||
|---|---|---|---|---|---|---|---|---|
| Length (bp) | % in Genome | Length (bp) | % in Genome | Length (bp) | ||||
| DNA | 8 983 926 | 1.47 | 7 351 979 | 1.20 | 23 429 353 | 3.83 | 29 084 889 | 4.76 |
| LINE | 8 299 104 | 1.35 | 10 841 081 | 1.77 | 13 051 653 | 2.13 | 16 438 300 | 2.69 |
| LTR | 79 075 250 | 12.95 | 90 533 453 | 14.83 | 154 738 027 | 25.35 | 172 348 763 | 28.23 |
| SINE | 66 384 | 0.01 | 0 | 0 | 483 328 | 0.08 | 544 025 | 0.09 |
| Other | 3917 | 0.000642 | 0 | 0 | 0 | 0 | 3917 | 0.000642 |
| Unknown | 0 | 0 | 2988 | 0.00049 | 134 545 040 | 22.04 | 134 548 028 | 22.04 |
| Total | 95 943 148 | 15.71 | 108 715 411 | 17.81 | 319 388 057 | 52.32 | 331 905 409 | 54.37 |
Summary of predicted protein‐coding gene annotations of narrow‐leafed lupin and their supporting evidence types
| Gene set | Number | Average transcript length (bp) | Average CDS length (bp) | Average exon per gene | Average exon length (bp) | Average intron length (bp) | |
|---|---|---|---|---|---|---|---|
|
|
| 34 525 | 2 983.98 | 1 252.69 | 5.49 | 228.06 | 385.36 |
|
| 29 436 | 10 570.55 | 1 367.30 | 6.22 | 219.52 | 1 760.19 | |
|
| 48 717 | 2 815.40 | 968.47 | 3.72 | 260.12 | 678.23 | |
| Homolog |
| 46 735 | 2 422.52 | 929.51 | 3.90 | 238.24 | 514.57 |
|
| 42 856 | 4 349.29 | 1 125.34 | 4.04 | 278.37 | 1 059.62 | |
|
| 39 433 | 3 648.01 | 1 245.43 | 4.55 | 273.19 | 675.09 | |
|
| 61 321 | 2 454.66 | 843.16 | 3.10 | 271.33 | 764.67 | |
|
| 68 168 | 1 936.06 | 786.66 | 3.23 | 242.93 | 513.54 | |
| EST | 1 795 | 2 134.32 | 606.22 | 3.16 | 191.27 | 704.40 | |
|
| 32 413 | 3 568.05 | 1 305.97 | 5.58 | 233.78 | 493.22 | |
| RNA‐seq | 49 946 | 2 309.00 | 803.54 | 4.01 | 199.94 | 498.02 | |
| Final set | 33 076 | 3 673.44 | 1 289.14 | 5.52 | 233.52 | 488.41 |
Figure 1Summary of sequence variability in narrow‐leafed lupin lines Unicrop, 83A:476 and P27255, relative to pseudochromosomes (corresponding to linkage groups) of the reference genome of cv. Tanjil. (A) Pseudochromosomes (black), with sequence‐based genetic markers relevant to this study highlighted. (B–D) Per cent of 100‐Kb windows covered by ≥5× read depth for resequencing data from lines Unicrop (b, blue), 83A:476 (C, red) and P27255 (D, light green). (E–G) Density of polymorphic sequence sites ranging from 0 to 20 000 variants/Mb calculated within 100‐Kb windows, for lines Unicrop (E, blue), 83A:476 (F, red) and P27255 (G, light green). (H) Per cent of 100‐kb windows representing annotated genes in cv. Tanjil (dark green). (i) Per cent of 100‐kb windows represented annotated repetitive DNA in cv. Tanjil (purple). (j) Per cent G:C content ranging from 0% to 50%, calculated in 100‐Kb windows, in cv. Tanjil (black).
Figure 2Synonymous substitution (Ks) analysis, showing proportion of values per Ks bin. Ks values are medians from synteny blocks for the indicated comparisons, and values in these plots are scaled to the total number of Ks counts for each comparison. (a) Orthologous comparisons between narrow‐leafed lupin (NLL) and Glycine max (Gm, red line), Phaseolus vulgaris (Pv, green dots), Lotus japonicus (Lj, purple dashed line) and Medicago truncatula (Mt, blue dashed lines). Asterisks show a primary peak for the speciation‐derived orthologs, and a probable smaller secondary peak for the papilionoid whole‐genome duplication‐derived ‘old orthologs’. (b) Paralogous genome self‐comparisons for narrow‐leafed lupin, Medicago truncatula, Phaseolus vulgaris, Glycine max and Lotus japonicus. Coloured arrows show two peaks in both Glycine max and narrow‐leafed lupin: the first peak in each case represents independent whole‐genome duplications in these lineages (Glycine at ~11 Mya and Lupinus at ~24 Mya), and the second peaks correspond to the shared papilionoid whole‐genome duplication. See Figure S7 and Data S7 for additional rate and date estimations.
Overview of genes associated with arbuscular mycorrhizal and rhizobial associations in the genomes of Medicago truncatula and narrow‐leafed lupin
| Symbiotic component | Gene product | Medicago | Lupin | Reference |
|---|---|---|---|---|
| NUP85 | Nucleoporin |
|
| 1 |
| NUP133 | Nucleoporin |
|
| 2 |
| NENA | Nucleoporin |
|
| 3 |
| MCA8 | Calcium pump |
|
| 4 |
| DELLA1 | Transcriptional regulators |
|
| 5 |
| DELLA2 | Transcriptional regulators |
|
| 5 |
| CCD7 | Carotenoid cleavage dioxygenase |
|
| 6 |
| CCD8 | Carotenoid cleavage dioxygenase |
|
| 6 |
| PDR1 | ABC transporter |
|
| 7 |
| D27 | Carotenoid isomerase |
|
| 8 |
| SUT2 | Sucrose transporter |
|
| 9 |
| DMI1 (Pollux) | Cation channel |
|
| 10 |
| NSP1 | GRAS transcription factor |
|
| 11 |
| NSP2 | GRAS transcription factor |
|
| 12 |
| DMI3 (CCaMK) | Calcium/calmodulin‐dependent protein kinase |
|
| 13 |
| DMI2 (SYMRK/NORK) | Receptor‐like kinase |
|
| 14 |
| Castor | Cation channel |
|
| 15 |
| VAPYRIN | MSP and ANK repeat‐containing protein |
|
| 16 |
| IPD3 (Cyclops) | Coiled‐coil domain containing protein |
|
| 17 |
| NFP | LysM receptor‐like kinase |
|
| 18 |
| PP2AB'1 | Protein phosphatase 2A |
|
| 19 |
| LYK3 | LysM receptor kinase |
|
| 20 |
| ERN1 | Transcription factor |
|
| 21 |
| ERN2 | Transcription factor |
|
| 22 |
| NIN | Transcription factor |
|
| 23 |
| NF‐YA1 | Transcription factor |
|
| 24 |
| NF‐YA2 | Transcription factor |
|
| 25 |
| RPG | Coiled‐coil protein |
|
| 26 |
| LIN | E3 ubiquitin ligase |
|
| 27 |
| PUB1 | E3 ubiquitin ligase |
|
| 28 |
| SUNN | LRR receptor kinase |
|
| 29 |
| NPL | Pectate lyase |
|
| 30 |
| CRE1 | Cytokinin receptor |
|
| 31 |
| FLOT4 | Flotillin |
|
| 32 |
| SYP132A | Syntaxin |
|
| 33 |
| SbtM1 |
|
| 34 | |
| SbtM3 |
|
| 34 | |
| HA1 |
|
| 35,36 | |
| Exo70 |
|
| 37 | |
| RAM2 |
|
| 38 | |
| PT4 |
|
| 39 | |
| STR1 |
|
| 40 | |
| STR2 |
|
| 40 | |
| RAM1 |
|
| 41 | |
| ERF1 |
|
| 42 | |
| RAD1 |
|
| 43 | |
| DIP1 |
|
| 44 | |
| FatM |
|
| 45 | |
| KIN2 |
|
| 45 | |
| KIN3 |
|
| 45 | |
| KIN5 |
|
| 45 | |
| RFCb |
|
| 45 | |
| CYT733A1 |
|
| 45 |
References for each of the functionally characterized genes in relation to either AM symbiosis or Rhizobia symbiosis can be found in Data S10.