| Literature DB >> 26329386 |
Huaan Yang1, Jianbo Jian2, Xuan Li3, Daniel Renshaw4, Jonathan Clements5, Mark W Sweetingham6, Cong Tan7, Chengdao Li8,9.
Abstract
BACKGROUND: Molecular marker-assisted breeding provides an efficient tool to develop improved crop varieties. A major challenge for the broad application of markers in marker-assisted selection is that the marker phenotypes must match plant phenotypes in a wide range of breeding germplasm. In this study, we used the legume crop species Lupinus angustifolius (lupin) to demonstrate the utility of whole genome sequencing and re-sequencing on the development of diagnostic markers for molecular plant breeding.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26329386 PMCID: PMC4557927 DOI: 10.1186/s12864-015-1878-5
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Statistics of denovo genome sequence assembly of re-sequenced nine cultivars of Lupinus angustifolius
| Unicrop | Yorrel | Merrit | Kalya | Tallerack | Quilinock | Mandelup | Coromup | Jenabilup | |
|---|---|---|---|---|---|---|---|---|---|
| Raw data (Mbp) | 13,334 | 14,322 | 15,958 | 15,760 | 11,043 | 17,275 | 17,727 | 15,242 | 14,588 |
| Clean data (Mbp) | 12,714 | 13,642 | 15,275 | 15,069 | 10,524 | 16,471 | 16,936 | 14,605 | 14,003 |
| Q20 base rate (%) | 95.3 | 96.6 | 96.9 | 97.0 | 95.9 | 96.8 | 97.0 | 97.0 | 97.1 |
| Number of scaffolds | 208,181 | 277,622 | 309,904 | 371,733 | 256,387 | 279,705 | 383,911 | 268,036 | 363,979 |
| Total scaffold span (Mbp) | 485 | 497 | 501 | 513 | 488 | 500 | 512 | 504 | 498 |
| Scaffold N50 (bp) | 10,864 | 9,463 | 8,814 | 9,307 | 9,070 | 9,835 | 9,423 | 10,487 | 7,633 |
| Average scaffold length (bp) | 2,332 | 1,789 | 1,617 | 1,380 | 1,905 | 1,789 | 1,332 | 1,882 | 1,369 |
| Longest scaffold (bp) | 305,995 | 183,544 | 191,423 | 156,385 | 229,074 | 228,256 | 147,382 | 211,945 | 125,123 |
| GC content (%) | 32.96 | 32.70 | 32.87 | 32.87 | 32.62 | 32.72 | 32.90 | 32.65 | 32.89 |
Numbers of SNP markers and InDel markers discovered by pairwise comparison of whole genome sequencing and re-sequencing data among 10 cultivars of Lupinus angustifolius*
| Lupin cultivars | Unicrop | Yorrel | Merrit | Kalya | Tallerack | Quilinock | Mandelup | Coromup | Jenabillup | |
|---|---|---|---|---|---|---|---|---|---|---|
| Yorrel | SNP | 361,783 | ||||||||
| InDel | 74,074 | |||||||||
| Merrit | SNP | 387,619 | 379,884 | |||||||
| InDel | 42,670 | 53,825 | ||||||||
| Kalya | SNP | 231,674 | 363,644 | 399,442 | ||||||
| InDel | 50,771 | 70,606 | 40,572 | |||||||
| Tallerack | SNP | 457,861 | 516,424 | 581,288 | 466,314 | |||||
| InDel | 84,239 | 100,668 | 71,220 | 80,863 | ||||||
| Quilinock | SNP | 358,425 | 402,839 | 386,350 | 370,174 | 521,485 | ||||
| InDel | 60,592 | 74,470 | 39,501 | 56,952 | 86,272 | |||||
| Mandelup | SNP | 383,509 | 333,375 | 363,518 | 405,193 | 525,458 | 399,216 | |||
| InDel | 59,611 | 62,906 | 34,838 | 57,048 | 83,158 | 57,406 | ||||
| Coromup | SNP | 358,729 | 318,466 | 338,840 | 377,613 | 509,809 | 365,480 | 210,394 | ||
| InDel | 59,469 | 61,381 | 35,666 | 57,069 | 84,167 | 55,839 | 39,077 | |||
| Jenabillup | SNP | 325,324 | 360,401 | 312,064 | 330,028 | 452,170 | 180,596 | 287,423 | 266,773 | |
| InDel | 52,035 | 65,258 | 27,406 | 48,075 | 75,939 | 33,094 | 42,073 | 41,398 | ||
| Tanjil (Reference) | SNP | 644,901 | 510,722 | 432,717 | 564,221 | 795,735 | 609,359 | 601,497 | 543,048 | 467,465 |
| InDel | 93,730 | 105,235 | 59,780 | 90,986 | 122,513 | 93,675 | 88,261 | 88,910 | 79,623 | |
* SNP markers are presented in black; InDel markers are in green
Fig. 1Genome-wide genetic diversity as measured by SNP abundance along each linkage group between reference cultivar Tanjil and nine re-sequenced cultivars of Lupinus angustifolius. Twenty linkage groups (SLG) were displayed in a circle. The inner number was SLG index and the outer was physical position (Mb). The circular histograms from circular 1 to 9 with different filling colour were SNP frequency distributions of nine cultivars in whole genome and the response relationship was given in the core area. Higher peaks indicated larger number of SNPs in the interval and lower troughs meant low abundance of SNP. The SNP frequency was counted in non-overlapping 100 kb intervals along each chromosome
Summary of SNP markers and InDel markers integrated into the sequence-defined genetic linkage map through sequence comparison on scaffolds in Lupinus angustifolius a
| Linkage groups | Genetic length (cM) | Number of anchored scaffoldsb | Number of SNP markers detected | Number of InDel markers detected |
|---|---|---|---|---|
| SLG-1 | 234.3 | 763 | 35,605 | 5,036 |
| SLG-2 | 156.7 | 724 | 24,158 | 5,190 |
| SLG-3 | 149 | 236 | 8,071 | 2,027 |
| SLG-4 | 144.2 | 400 | 14,160 | 3,202 |
| SLG-5 | 101.9 | 365 | 13,028 | 2,654 |
| SLG-6 | 89 | 129 | 4,830 | 1,437 |
| SLG-7 | 86.5 | 114 | 6,959 | 1,512 |
| SLG-8 | 85 | 289 | 13,761 | 1,688 |
| SLG-9 | 83.5 | 155 | 8,772 | 1,578 |
| SLG-10 | 82.6 | 138 | 6,230 | 1,132 |
| SLG-11 | 82.2 | 344 | 13,869 | 2,164 |
| SLG-12 | 64.9 | 143 | 5,778 | 1,094 |
| SLG-13 | 52.2 | 155 | 6,566 | 1,022 |
| SLG-14 | 51.1 | 57 | 2,806 | 735 |
| SLG-15 | 34.5 | 32 | 1,676 | 430 |
| SLG-16 | 33.3 | 47 | 1,468 | 443 |
| SLG-17 | 32.4 | 40 | 1,612 | 549 |
| SLG-18 | 26.6 | 28 | 1,616 | 478 |
| SLG-19 | 20.6 | 13 | 1,499 | 416 |
| SLG-20 | 19.4 | 42 | 2,175 | 461 |
| Sub total | 1629.9 | 4,214 | 174,639 | 33,248 |
aThe sequence-defined genetic linkage map has been published previously [38]
bFull list of scaffolds anchored on the genetic linkage map, and the number of markers detected from each scaffold are presented in Additional file 2
Marker mining on 23 genome sequence assembly scaffolds bearing 24 markers linked to 11 key genes of agronomic traits of interest by sequence alignments among 10 sequenced cultivars of Lupinus angustifolius
| Agronomic traits | Name of markers | Distance between marker and target gene (cM) | Reference | Scaffold identified | Scaffold size (bp) | Number of SNP markers from scaffold sequence alignment | Number of InDel markers from scaffold sequence alignment |
|---|---|---|---|---|---|---|---|
| Disease resistance gene | DAFWA6895 | 0 | [ | Scaffold84773 | 33,448 | 489 | 101 |
| Disease resistance gene | PhtjM1 | 1.3 | [ | scaffold70674 | 11,068 | 102 | 39 |
| Disease resistance gene | PhtjM4 | 1.1 | [ | scaffold16849 | 40,716 | 526 | 259 |
| Disease resistance gene | PhtjM6 | 1.9 | [ | scaffold2572 | 55,753 | 808 | 263 |
| Disease resistance gene | PhtjM7 | 1.1 | [ | scaffold57606 | 13,893 | 188 | 62 |
| Disease resistance gene | DAFWA5820 | 0 | [ | scaffold 31581 | 15,706 | 225 | 33 |
| Disease resistance gene | AntjM1 | 3.5 | [ | scaffold83350 | 11,407 | 74 | 35 |
| Disease resistance gene | AntjM2 | 2.3 | [ | scaffold2992 | 33,979 | 341 | 188 |
| Disease resistance gene | AnSeq3 | 0.9 | [ | Scaffold33942 | 64,039 | 716 | 138 |
| Disease resistance gene | AnSeq4 | 0.9 | [ | Scaffold31346 | 33,727 | 221 | 158 |
| Seed coat colour | DAFWA6428 | 0 | [ | scaffold11676 | 22,481 | 588 | 154 |
| Seed coat colour | DAFWA4544 | 0 | [ | scaffold13708 | 44,176 | 821 | 81 |
| Disease resistance gene | AnManM1 | 5.0 | [ | scaffold36514 | 50,220 | 311 | 213 |
| Disease resistance gene | Ph258M1 | 5.7 | [ | scaffold84752 | 21,471 | 292 | 94 |
| Disease resistance gene | Ph258M2 | 2.1 | [ | scaffold16252 | 15,559 | 212 | 25 |
| Resistance gene against lupin rust disease | RustM1 | Unknown | Unpublished | scaffold15347 | 42,210 | 578 | 25 |
| Early flowering gene | KuH | 0 | [ | scaffold21489 | 30,923 | 676 | 23 |
| Soft-seed coat gene | MoA, MoLi | 0 | [ | scaffold75616 | 14,783 | 63 | 16 |
| Pod-non-shattering | LeLi | 6.0 | [ | scaffold87978 | 9,909 | 59 | 17 |
| Pod-non-shattering gene | LeM2 | 1.3 | [ | scaffold79908 | 20,738 | 103 | 22 |
| Pod-non-shattering gene | TaM1 | 2.1 | [ | scaffold15347 | 21,529 | 578 | 25 |
| Pod-non-shattering gene | TaLi | 1.4 | [ | scaffold36274 | 8,191 | 62 | 4 |
| Low alkaloid gene | IucLi | 0.9 | [ | scaffold30160 | 20,677 | 667 | 22 |
| Average scaffold size and marker numbers | 27,687 | 378 | 87 | ||||
The list of 10 sequenced cultivars is presented in Tables 2 and 5
Identification of candidate diagnostic markers through genotyping sequence-defined markers with whole genome sequencing data from 10 cultivars on genetic linkage map flanking the R gene PhtjR conferring resistance to phomopsis in Lupinus angustifolius
aMarkers showing genotypes completely consistent with PSB disease phenotypes on all 10 cultivars are considered candidate diagnostic markers and are highlighted in green
bTwo nucleotides separated by a stroke line in brackets are SNP markers; nucleotides in brackets without a stroke line are InDel markers
cMarker positions are the nucleotide positions on the reference genome sequence assembly from cultivar Tanjil (Genbank BioProject number PRJNA179231)
dMarkers showing R-allele genotype on cultivars without the R gene Phtj (false positives) are highlighted in red
eMarker sequences missing in genome re-sequencing were recorded as missing data “-”
fGenotypes of R gene PhtjR on sequenced cultivars presented in blue: R = presence of PhtjR gene; S = absence of PhtjR gene [44]
Conversion of SNP markers identified from genotyping markers on genetic linkage map flanking the R gene PhtjR into sequence-specific PCR markers suitable for genotyping by high resolution melting (HRM) with LightScanner
| Marker | Primers | Primer sequence (5′-3′) |
|---|---|---|
| DAFWA926 | DAFWA926F | GGTTGGGTTAACTTTTATGTCTAAAATC |
| DAFWA926R | GGTAAGTTTATTTTTCTAAAGTTGAAC | |
| DAFWA2836 | DAFWA2836F | CACATAAGAATATGGAAATGGAGA |
| DAFWA2836R | CTGTAAACTGAAGGTGGGCATT | |
| DAFWA3794 | DAFWA3794F | GAAAGGAGAAAACTAATCAACATAAG |
| DAFWA3794R | ATTAGGGTTTGAGATAGAGTAACAT | |
| DAFWA2747 | DAFWA2747F | CCTAACTTCCGATCCAGTAAGC |
| DAFWA2747R | CTTTGATCGCTTGGGTTTC | |
| DAFWA6277 | DAFWA6277F | TTCGGGAATTTGTATGAGCT |
| DAFWA6277R | GGATGGATTCAAAGGTTCAAG | |
| DAFWA8077 | DAFWA8077F | GAGATTATTTTCACAAGCTTCCTC |
| DAFWA8077R | CCTTTTAGCTTATTCAATTAGCTTG | |
| DAFWA6895 | DAFWA6895F | TGAAGGTCCAATACCAGCAAG |
| DAFWA6895R | CAACTTCCCTGGAGCAAAA | |
| DAFWA4020 | DAFWA4020F | CTAGATAGTTTCGTTTTATCATAC |
| DAFWA4020R | GACATAAAGCTTATATATTTGCA | |
| DAFWA3123 | DAFWA3123F | CCCTGGACTCTCTCCCTGTATT |
| DAFWA3123R | GAATGAAAGTTTGATATGCATAATAA | |
| DAFWA4021 | DAFWA4021F | GCTCAGAAACGGTGTCGTT |
| DAFWA4021R | GAAGACCTCCAAAACCAAAGC |
Validation of sequence-specific SNP markers identified from genotyping markers on a genetic linkage map flanking the R gene PhtjR conferring resistance to phomopsis stem blight disease on all historical and current commercial cultivars of Lupinus angustifolius released in Australia
aGenotypes of R gene PhtjR on commercial cultivars are presented as: R = presence of PhtjR gene; S = absence of PhtjR gene [44]
bGenetic distance of the marker to the R gene PhtjR in centiMorgans (cM) was adapted from the mapping studies [38]
cMarkers showing R-allele genotype on cultivars without the R gene (false positives) are in highlighted in red
dSNP markers showing marker genotypes completely consistent with the PhtjR gene phenotypes in all 27 commercial cultivars (no false positive) are diagnostic markers, and are highlighted in green
Fig. 2Validation of simple PCR-based SNP markers linked to the R gene PhtjR conferring phomopsis stem blight disease resistance on all 27 historical and current cultivars of Lupinus angustifolius released in Australia by high resolution melting (HRM) on LightScanner. SNP marker DAFWA6277 (left) was confirmed as diagnostic for the PhtjR gene, as the three cultivars (Wonga, Tanjil and Barlock) showed the resistance marker allele (melting curves in blue), while all the other 23 cultivar not possessing the R gene has the susceptible marker allele (melting curves in red). In contrast, SNP marker DAFWA3123 (right) was confirmed as non-diagnostic, since six cultivars (Table 7) without the R gene had the resistance marker allele (melting curves in blue). Detailed records of genotypes for 27 cultivars of these two markers are presented in Table 7
List of a small portion of SNP markers and InDel markers discovered by marker mining on scaffold84773 (Genbank accession # AOCW01145302) showing large variation in marker genotypes among 10 sequenced cultivars and identification of candidate diagnostic markers for the R gene PhtjR of Lupinus angustifolius a
aThe full lists of the 489 SNP markers and 101InDel markers discovered from sequence alignment on scaffold84773 are markers in Additional file 3. Names of identified markers are consistent with the names labelled numerically in Additional file 3
bMarkers showing R-allele genotypes on cultivars without the R gene PhtjR (false positives) are in highlighted in red
cMarkers showing genotypes consistent with disease resistance phenotypes on all 10 sequenced cultivars are considered as candidate diagnostic markers, and are highlighted in green
Conversion of SNP markers and InDel markers arising from marker mining on scaffold84773 into sequence-specific PCR markers in Lupinus angustifolius
| Marker name | Primers | Primer sequence (5′-3′) |
|---|---|---|
| SNP 20 | SNP20F | GTCCCTGCCATTATTAATAGTTACT |
| SNP20R | CATCATGAGTCAATTTACCACTTA | |
| SNP 25 | SNP25F | GTCACTAATTTTATCTTTGCAAGA |
| SNP25R | GATCATAAGAATAATAATAATAATTTGGT | |
| SNP 250 | SNP250F | GACTTAGTAATGTGCAACAAGAG |
| SNP250R | CTGACACTACAGGTTCGCCT | |
| SNP 263 | SNP263F | GGAACATTGTGATTCAGTCACC |
| SNP263R | GATAGGTTTGTTGCAATAAGCG | |
| SNP264 | SNP264F | GTTTCTTAGTTGCATAGTTGCAA |
| SNP264R | CAAAACATTCATAAGTAACAAGG | |
| SNP271 | SNP271F | CGACACCATCTGATATATGAAAATAA |
| SNP271R | ACCGGAAATCTGTGTTTTTC | |
| InDel2 | InDel2F | GATAAAGTATATCTAAATTATGTTTGC |
| InDel2R | CTATATTTTGTATCAATTATAACAAATT | |
| InDel10 | InDel10F | GTTAAGTGGTAAATTGACTCATG |
| InDel10R | GTTTTRCATTCTTGCAAAGATAAAATTAG | |
| InDel28 | InDel28F | CTACAATAGCCACACAAATAG |
| InDel28R | GTTTAGATGGCCMTGTGC | |
| InDel66 | InDel66F | CTTCTGAGTTGGACCATAAAC |
| InDel66R | ACTCACATTTACAGAACTTTAACT |
Validation of sequence-specific SNP and InDel markers arising from marker mining on scaffold84773 linked the R gene PhtjR conferring resistance to PSB disease on all historical and current commercial cultivars of Lupinus angustifolius released in Australia
aGenotypes of R gene PhtjR on commercial cultivars: R = presence of PhtjR gene; S = absence of PhtjR gene [44]
bMarkers showing R-allele genotype on cultivars without the R gene (false positives) are highlighted in red
cMarkers showing genotypes completely consistent with PhtjR gene phenotypes in all 27 commercial cultivars are diagnostic markers, and are highlighted in green
Fig. 3Validation of InDel markers arising from marker mining on genome sequence assembly scaffold84773 linked to the R gene PhtjR conferring phomopsis stem blight disease resistance on all 27 historical and current cultivars of Lupinus angustifolius by polyacrylamide electrophoresis gels. The 27 cultivars are: Uniwhite (Lane 1), Uniharvest (Lane 2), Unicrop (Lane 3), Marri (Lane 4), Illyarrie (Lane 5), Yandee (Lane 6), Chittick (Lane 7), Danja (Lane 8), Geebung (Lane 9), Gungurru (Lane 10), Yorrel (Lane 11), Warrah (Lane 12), Merrit (Lane 13), Myallie (Lane 14), Kalya (Lane 15), Wonga (Lane 16), Belara (Lane 17), Tallerack (Lane 18), Tanjil (Lane 19), Moonah (Lane 20), Quilinock (Lane 21), Jindalee (Lane 22), Mandelup (Lane 23), Coromup (Lane 24), Jenabillup (Lane 25), Gunyidi (Lane 26) and Barlock (Lane 27). Disease phenotypes of the cultivars are presented as “S” (susceptible) or “R” (resistant) in blue letters. Marker “InDel10” was confirmed as diagnostic for the PhtjR gene, since it showed the marker genotypes consistent with PSB phenotypes on all cultivars. In comparison, marker “InDel 66” was confirmed non-diagnostic, since eight cultivars (arrowed in red) without the R gene had the resistance marker allele (“false positives”)