| Literature DB >> 25060816 |
Lars G Kamphuis1, James K Hane, Matthew N Nelson, Lingling Gao, Craig A Atkins, Karam B Singh.
Abstract
Narrow-leafed lupin (NLL; Lupinus angustifolius L.) is an important grain legume crop that is valuable for sustainable farming and is becoming recognized as a human health food. NLL breeding is directed at improving grain production, disease resistance, drought tolerance and health benefits. However, genetic and genomic studies have been hindered by a lack of extensive genomic resources for the species. Here, the generation, de novo assembly and annotation of transcriptome datasets derived from five different NLL tissue types of the reference accession cv. Tanjil are described. The Tanjil transcriptome was compared to transcriptomes of an early domesticated cv. Unicrop, a wild accession P27255, as well as accession 83A:476, together being the founding parents of two recombinant inbred line (RIL) populations. In silico predictions for transcriptome-derived gene-based length and SNP polymorphic markers were conducted and corroborated using a survey assembly sequence for NLL cv. Tanjil. This yielded extensive indel and SNP polymorphic markers for the two RIL populations. A total of 335 transcriptome-derived markers and 66 BAC-end sequence-derived markers were evaluated, and 275 polymorphic markers were selected to genotype the reference NLL 83A:476 × P27255 RIL population. This significantly improved the completeness, marker density and quality of the reference NLL genetic map.Entities:
Keywords: SNP; indel; lupin; molecular marker; next-generation sequencing
Mesh:
Substances:
Year: 2014 PMID: 25060816 PMCID: PMC4309465 DOI: 10.1111/pbi.12229
Source DB: PubMed Journal: Plant Biotechnol J ISSN: 1467-7644 Impact factor: 9.803
Lupinus angustifolius cv. Tanjil de novo transcriptome assembly statistics
| Total length | 51.6 Mb |
| Components | 63 271 |
| Components + isoforms | 104 766 |
| Components w. 1 isoform | 53 761 |
| Components w. >1 isoform (alternate mRNA-splicing or misassembled multigene family or heterozygous allele) | 9150 |
Transcripts (Trinity components) with significantly tissue-specific mRNA expression predicted by cufflinks. T × U = Tanjil × Unicrop; D × W = Domestic × Wild = 83A:476 × P27255
| Components with candidate markers | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| mRNA expression specificity | Count of Trinity components | With functional annotation | Standard SNP: TxU only | Standard SNP: DxW only | Standard SNP: TxU + DxW | Standard Length: TxU only | Standard Length: DxW only | Standard Length: TxU + DxW | Fluidigm SNP: TxU only | Fluidigm SNP: DxW only | Fluidigm SNP: TxU + DxW |
| Constitutive (no significant differences in expression between all tissues) | 10 240 | 5257 | 1471 | 4736 | 1769 | 194 | 752 | 208 | 411 | 1667 | 289 |
| Root | 7 | 7 | 1 | 3 | – | – | – | – | 1 | 1 | – |
| Stem | – | – | – | – | – | – | – | – | – | – | – |
| Leaf | – | – | – | – | – | – | – | – | – | – | – |
| Flower | 28 | 18 | 9 | 12 | 6 | – | – | – | 2 | 3 | 1 |
| Seed | 17 | 14 | 3 | 5 | 2 | 1 | – | 1 | 1 | – | 3 |
| Root + stem | – | – | – | – | – | – | – | – | – | – | – |
| Root + leaf | – | – | – | – | – | – | – | – | – | – | – |
| Root + flower | – | – | – | – | – | – | – | – | – | – | – |
| Root + seed | 1 | 1 | – | – | – | – | – | – | – | 1 | – |
| Stem + leaf | – | – | – | – | – | – | – | – | – | – | – |
| Stem + flower | – | – | – | – | – | – | – | – | – | – | – |
| Stem + seed | – | – | – | – | – | – | – | – | – | – | – |
| Leaf + flower | – | – | – | – | – | – | – | – | – | – | – |
| Leaf + seed | – | – | – | – | – | – | – | – | – | – | – |
| Flower + seed | 2 | 1 | – | 1 | – | – | – | – | – | – | – |
| Stem + leaf + flower (above ground) | – | – | – | – | – | – | – | – | – | – | – |
| Not conforming to above categories | 52 976 | 43 | 24 057 | 12 853 | 4092 | 462 | 1730 | 413 | 1559 | 5845 | 1065 |
| Total | 63 271 | 5341 | 25 541 | 17 610 | 5869 | 657 | 2482 | 622 | 1974 | 7517 | 1358 |
Figure 1Heat-map depicting normalized log2-fold changes in mRNA expression inferred from RNAseq data for transcripts (Trinity components) predicted to be significantly tissue-specific by cufflinks.
Lupinus angustifolius cv. Tanjil draft survey assembly statistics
| Contig | Scaffold | |
|---|---|---|
| N90 | 698 252 | 47 079 |
| N50 | 93 792 | 10 137 |
| L90 | 133 bp | 1837 bp |
| L50 | 1088 bp | 13 762 bp |
| Max | 33 271 bp | 206 256 bp |
| Total length | 453 133 600 bp | 521 188 753 bp |
| Number ≥100 bp | 1 096 703 | 191 701 |
| Number ≥2 kb | 43 040 | 45 210 |
Figure 2Comparison of the distribution and density of RNAseq-derived markers to the previously published genetic map (Kroc et al., 2014), using the draft genome assembly scaffolds as a point of reference to relate to the existing genetic map. Each series of 3 black concentric rings indicate different marker types. (A) Fluidigm-compatible bi-allelic SNP markers. (B) Standard PCR indel markers. (C) Standard PCR-compatible SNP markers. Marker polymorphism relative to the cv Tanjil reference transcriptome is indicated by colour as 83A:476 × P27255 (red), Tanjil × Unicrop (green) and both Tanjil × Unicrop and 83A:476 × P27255 (blue). Normalized marker density per Mbp is indicated within each ring from 0 to 100+ along the vertical axis.
Figure 3Venn diagrams demonstrating the in silico polymorphism identified in the L. angustifolius RIL populations derived from Tanjil and Unicrop or P27255 and 83A:476 were used for candidate marker discovery. (a) Standard PCR-based SNP polymorphic primers. (b) Standard PCR-based length-polymorphic primers. (c) Fluidigm platform biallelic SNP polymorphic primers. Overlapping markers are polymorphic in both RIL populations.
Lupinus angustifolius molecular marker statistics of markers derived from BAC-end sequencing and RNAseq datasets
| Designed | Tested | Polymorphic | Mapped onto the reference genetic map | |
|---|---|---|---|---|
| BAC-end sequence-derived SSR markers | 66 | 66 | 25 (37.9%) | 17 |
| RNAseq length polymorphisms (Indels) | 239 | 239 | 194 (81.2%) | 170 |
| RNAseq SNP polymorphisms | 2126 | 96 | 88 (91.6%) | 88 |
See Gao et al. (2011) for marker design details.
Indicates polymorphism between both parent-pairs of the two NLL RIL populations.
Lupinus angustifolius linkage group details. Skeleton markers were used for locus ordering and interval size calculation (Kosambi cM). Redundant markers mapped to the same location as skeleton markers. Attached markers mapped to approximate intervals on the genetic map
| Linkage group | Skeleton | Redundant | Attached | Total loci | Length (cM) |
|---|---|---|---|---|---|
| NLL-01 | 58 | 24 | 40 | 122 | 158.3 |
| NLL-02 | 34 | 28 | 18 | 80 | 119.9 |
| NLL-03 | 39 | 14 | 34 | 87 | 128.1 |
| NLL-04 | 41 | 14 | 13 | 68 | 145.7 |
| NLL-05 | 43 | 21 | 13 | 77 | 119.1 |
| NLL-06 | 60 | 20 | 35 | 115 | 137.2 |
| NLL-07 | 56 | 28 | 16 | 100 | 132.3 |
| NLL-08 | 38 | 20 | 17 | 75 | 140.4 |
| NLL-09 | 26 | 10 | 20 | 56 | 103.3 |
| NLL-10 | 42 | 19 | 9 | 70 | 91.6 |
| NLL-11 | 52 | 34 | 21 | 107 | 144.7 |
| NLL-12 | 24 | 9 | 18 | 51 | 95.4 |
| NLL-13 | 34 | 9 | 8 | 51 | 104.7 |
| NLL-14 | 27 | 7 | 18 | 52 | 91.7 |
| NLL-15 | 37 | 10 | 12 | 59 | 102.2 |
| NLL-16 | 29 | 14 | 14 | 57 | 93.8 |
| NLL-17 | 35 | 30 | 16 | 81 | 102.4 |
| NLL-18 | 38 | 8 | 8 | 54 | 93.1 |
| NLL-19 | 24 | 6 | 10 | 40 | 74.2 |
| NLL-20 | 41 | 14 | 12 | 67 | 82.8 |
| Cluster-2 | 5 | 1 | 0 | 6 | 3.0 |
| Total | 783 | 340 | 352 | 1475 | 2263.9 |
Distribution of new BAC-end-derived SSR (LaSSR) and transcriptome-derived length (indel; LaIND) and SNP (LaSNP) markers over 20 L. angustifolius linkage groups (prefixed NLL) and Cluster-2
| Marker type | ||||
|---|---|---|---|---|
| NLL linkage group | LaSSR | LaSNP | LaIND | Total number of new markers |
| NLL-01 | 1 | 3 | 10 | 14 |
| NLL-02 | 1 | 7 | 21 | 29 |
| NLL-03 | 1 | 7 | 5 | 13 |
| NLL-04 | 1 | 0 | 0 | 1 |
| NLL-05 | 1 | 4 | 10 | 15 |
| NLL-06 | 0 | 8 | 15 | 23 |
| NLL-07 | 1 | 3 | 4 | 8 |
| NLL-08 | 2 | 1 | 2 | 5 |
| NLL-09 | 0 | 2 | 3 | 5 |
| NLL-10 | 0 | 8 | 13 | 21 |
| NLL-11 | 0 | 3 | 16 | 19 |
| NLL-12 | 0 | 5 | 4 | 9 |
| NLL-13 | 0 | 4 | 4 | 8 |
| NLL-14 | 3 | 6 | 11 | 20 |
| NLL-15 | 2 | 5 | 8 | 15 |
| NLL-16 | 0 | 7 | 8 | 15 |
| NLL-17 | 0 | 8 | 11 | 19 |
| NLL-18 | 3 | 2 | 5 | 10 |
| NLL-19 | 0 | 1 | 2 | 3 |
| NLL-20 | 1 | 3 | 17 | 21 |
| Cluster-2 | 0 | 1 | 1 | 2 |
| Total | 17 | 88 | 170 | 275 |