| Literature DB >> 23734219 |
Huaan Yang1, Ye Tao, Zequn Zheng, Qisen Zhang, Gaofeng Zhou, Mark W Sweetingham, John G Howieson, Chengdao Li.
Abstract
Lupin (Lupinus angustifolius L.) is the most recently domesticated crop in major agricultural cultivation. Its seeds are high in protein and dietary fibre, but low in oil and starch. Medical and dietetic studies have shown that consuming lupin-enriched food has significant health benefits. We report the draft assembly from a whole genome shotgun sequencing dataset for this legume species with 26.9x coverage of the genome, which is predicted to contain 57,807 genes. Analysis of the annotated genes with metabolic pathways provided a partial understanding of some key features of lupin, such as the amino acid profile of storage proteins in seeds. Furthermore, we applied the NGS-based RAD-sequencing technology to obtain 8,244 sequence-defined markers for anchoring the genomic sequences. A total of 4,214 scaffolds from the genome sequence assembly were aligned into the genetic map. The combination of the draft assembly and a sequence-defined genetic map made it possible to locate and study functional genes of agronomic interest. The identification of co-segregating SNP markers, scaffold sequences and gene annotation facilitated the identification of a candidate R gene associated with resistance to the major lupin disease anthracnose. We demonstrated that the combination of medium-depth genome sequencing and a high-density genetic linkage map by application of NGS technology is a cost-effective approach to generating genome sequence data and a large number of molecular markers to study the genomics, genetics and functional genes of lupin, and to apply them to molecular plant breeding. This strategy does not require prior genome knowledge, which potentiates its application to a wide range of non-model species.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23734219 PMCID: PMC3667174 DOI: 10.1371/journal.pone.0064799
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Genome sequence assembly and annotation statistics for Lupinus angustifolius.
| All scaffold | Scaffolds longer than 2 kb | |
| Number of scaffolds | 234,534 | 51,867 |
| Total span | 598 Mb | 538 Mb |
| Scaffold N50 | 12.546 kb | 15.597 kb |
| Longest scaffold | 151.003 kb | 151.003 kb |
| Average scaffold length | 2.511 kp | 10.372 kp |
| Longest contig | 70.567 kb | 70.567 kb |
| Average contig length | 927 bp | 6,470 bp |
| Number of contigs | 457,917 | 59,292 |
| Contig N50 | 5,806 bp | |
| GC content | 33.61% | |
| Gene models | 57,806 | |
| Mean transcript length | 2,038.6 bp | |
| Mean coding sequence length | 1,033.5 bp | |
| Mean number of exons per gene | 4.09 | |
| Mean exon length | 252 bp | |
| GC content in exons | 41.32% | |
| Mean intron length | 325 bp | |
| GC content in introns | 32.11% |
Summary of the genetic linkage map constructed based on 8,246 sequence-defined molecular markers in Lupinus angustifolius.
| Linkage groups | Number of sequence-defined markers | Number of loci | Genetic length (cM) | Number of integrated scaffolds | Scaffold length span (bp) |
| SLG-1 | 1498 | 235 | 234.3 | 763 | 129,562,88 |
| SLG-2 | 1496 | 140 | 156.7 | 724 | 11,025,395 |
| SLG-3 | 414 | 151 | 149.0 | 236 | 4,368,678 |
| SLG-4 | 814 | 120 | 144.2 | 400 | 6,428,848 |
| SLG-5 | 785 | 143 | 101.9 | 365 | 6,243,667 |
| SLG-6 | 242 | 73 | 89.0 | 129 | 2,743,736 |
| SLG-7 | 218 | 81 | 86.5 | 114 | 2,326,261 |
| SLG-8 | 539 | 64 | 85.0 | 289 | 4,760,577 |
| SLG-9 | 299 | 95 | 83.5 | 155 | 2,970,440 |
| SLG-10 | 232 | 104 | 82.6 | 138 | 2,151,365 |
| SLG-11 | 690 | 85 | 82.2 | 344 | 6,059,460 |
| SLG-12 | 246 | 34 | 64.9 | 143 | 2,103,598 |
| SLG-13 | 266 | 36 | 52.2 | 155 | 2,321,041 |
| SLG-14 | 102 | 40 | 51.1 | 57 | 1,150,524 |
| SLG-15 | 69 | 23 | 34.5 | 32 | 631,989 |
| SLG-16 | 81 | 23 | 33.3 | 47 | 733,299 |
| SLG-17 | 82 | 24 | 32.4 | 40 | 766,926 |
| SLG-18 | 68 | 16 | 26.6 | 28 | 733,186 |
| SLG-19 | 23 | 16 | 20.6 | 13 | 572,743 |
| SLG-20 | 82 | 14 | 19.4 | 42 | 703,582 |
| Sub total | 8,246 | 1,517 | 1629.9 | 4,214 | 71,751,603 |
Figure 1A framework linkage map of the Lupinus angustifolius genome.
The framework map is extracted from the full map which consisted of 8,246 sequence-defined molecular markers. Detailed lists of all the molecular markers in the sequence-defined lupin map, including their genetic distance in each linkage group, the DNA sequences, and their corresponding scaffolds in the genome sequence assembly, are presented in Table S2.
Identification of scaffolds containing molecular markers linked to key agronomic genes in Lupinus angustifolius.
| Agronomic traits | Name of markers | Distance between marker and target gene (cM) | Reference source | Scaffold identified | Scaffold size (bp) |
| Disease resistance gene |
| 0 | This study | scaffold 31581 | 15,706 |
| Disease resistance gene |
| 0 | This study | scaffold 31581 | 15,706 |
| Disease resistance gene | AntjM1 | 3.5 |
| scaffold83350 | 11,407 |
| Disease resistance gene | AntjM2 | 2.3 |
| scaffold2992 | 33,979 |
| Disease resistance gene | AnSeq3 | 0.9 |
| Scaffold33942 | 64,039 |
| Disease resistance gene | AnSeq4 | 0.9 |
| Scaffold31346 | 33,727 |
| Disease resistance gene |
| 0 | This study | Scaffold98007 | 17,744 |
| Disease resistance gene |
| 0 | This study | Scaffold98007 | 17,744 |
| Disease resistance gene |
| 0 | This study | Scaffold84773 | 33,448 |
| Disease resistance gene |
| 0 | This study | Scaffold84773 | 33,448 |
| Disease resistance gene |
| 0 | This study | Scaffold84773 | 33,448 |
| Seed coat colour |
| 0 | This study | C28631230 | 1,673 |
| Seed coat colour |
| 0 | This study | scaffold10511 | 21,570 |
| Seed coat colour |
| 0 | This study | scaffold11205 | 12,869 |
| Seed coat colour |
| 0 | This study | scaffold11676 | 22,481 |
| Seed coat colour |
| 0 | This study | scaffold13708 | 44,176 |
| Seed coat colour |
| 0 | This study | scaffold18757 | 3,783 |
| Seed coat colour |
| 0 | This study | scaffold18978 | 10,158 |
| Seed coat colour |
| 0 | This study | scaffold19305 | 14,364 |
| Seed coat colour |
| 0 | This study | scaffold23047 | 5,252 |
| Seed coat colour |
| 0 | This study | scaffold30742 | 10,173 |
| Seed coat colour |
| 0 | This study | scaffold35025 | 4,343 |
| Seed coat colour |
| 0 | This study | scaffold7657 | 3,023 |
| Seed coat colour |
| 0 | This study | scaffold87947 | 4,118 |
| Seed coat colour |
| 0 | This study | scaffold13764 | 19,254 |
| Seed coat colour |
| 0 | This study | scaffold40404 | 6,621 |
| Seed coat colour |
| 0 | This study | Scaffold48877 | 15,086 |
| Disease resistance gene | AnManM1 | 5.1 |
| scaffold36514 | 50,220 |
| Disease resistance gene | Ph258M1 | 5.7 |
| scaffold84752 | 21,471 |
| Disease resistance gene | Ph258M2 | 2.1 |
| scaffold16252 | 15,559 |
| Resistance gene against lupin rust disease | RustM1 | unknown | Yang et al unpublished data | scaffold15347 | 42,210 |
| Early flowering gene | KuH | 0 |
| scaffold21489 | 30,923 |
| Soft-seed coat gene | MoA | 0 |
| scaffold75616 | 14,783 |
| Soft-seed coat gene | MoLi | 0 |
| scaffold75616 | 14,783 |
| Pod-non-shattering | LeLi | 6.0 |
| scaffold87978 | 9,909 |
| Pod-non-shattering gene | LeM2 | 1.3 |
| scaffold79908 | 20,738 |
| Pod-non-shattering gene | TaM1 | 2.1 |
| scaffold15347 | 21,529 |
| Pod-non-shattering gene | TaLi | 1.4 |
| scaffold36274 | 8,191 |
| Low alkaloid gene | IucLi | 0.9 |
| scaffold30160 | 20,677 |
Markers co-segregating (0 cM) with the Lanr1 gene conferring resistance to anthracnose disease are highlighted in italics; markers co-segregating with the PhtjR gene conferring resistance against phomopsis stem blight are highlighted in bold; and markers co-segregating with the seed coat colour gene are highlighted in italics and bold. These highlighted markers were developed in this study. The remaining 16 markers were developed previously. All the markers linked to agronomic traits of interest in lupin were developed in the Lupin Molecular Laboratory at Department of Agriculture and Food Western Australia. Scaffold sequences from the lupin genome sequence assembly bearing these molecular markers have been deposited at Genbank (Submission number: SUB139069; Project number: PRJNA179231).