| Literature DB >> 28077090 |
Thiago Willian Almeida Balsalobre1,2,3, Guilherme da Silva Pereira4, Gabriel Rodrigues Alves Margarido4, Rodrigo Gazaffi1, Fernanda Zatti Barreto1, Carina Oliveira Anoni4, Cláudio Benício Cardoso-Silva2,3, Estela Araújo Costa2,3, Melina Cristina Mancini2,3, Hermann Paulo Hoffmann1, Anete Pereira de Souza2,3, Antonio Augusto Franco Garcia4, Monalisa Sampaio Carneiro5.
Abstract
BACKGROUND: Sugarcane (Saccharum spp.) is predominantly an autopolyploid plant with a variable ploidy level, frequent aneuploidy and a large genome that hampers investigation of its organization. Genetic architecture studies are important for identifying genomic regions associated with traits of interest. However, due to the genetic complexity of sugarcane, the practical applications of genomic tools have been notably delayed in this crop, in contrast to other crops that have already advanced to marker-assisted selection (MAS) and genomic selection. High-throughput next-generation sequencing (NGS) technologies have opened new opportunities for discovering molecular markers, especially single nucleotide polymorphisms (SNPs) and insertion-deletion (indels), at the genome-wide level. The objectives of this study were to (i) establish a pipeline for identifying variants from genotyping-by-sequencing (GBS) data in sugarcane, (ii) construct an integrated genetic map with GBS-based markers plus target region amplification polymorphisms and microsatellites, (iii) detect QTLs related to yield component traits, and (iv) perform annotation of the sequences that originated the associated markers with mapped QTLs to search putative candidate genes.Entities:
Keywords: Allelic dosage; Molecular markers; Polyploidy; Quantitative traits; SNPs; Saccharum spp
Mesh:
Substances:
Year: 2017 PMID: 28077090 PMCID: PMC5225503 DOI: 10.1186/s12864-016-3383-x
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Number of markers generated after GBS-Tassel pipeline analyses to map the GBS sugarcane population data
|
| SNPs | Indels | Total | Excluded data | Filtered polymorphic sites | |
|---|---|---|---|---|---|---|
| Missing dataa | Low coverage locib | |||||
| Methyl-filtered sugarcane genome | 110,261 | 41,494 | 151,755 | 16,815 (11.1%) | 96,562 (63.6%) | 38,378 (25.3%) |
|
| 84,757 | 35,447 | 120,204 | 13,773 (11.5%) | 78,914 (65.6%) | 27,517 (22.9%) |
| RNA-seq sugarcane transcriptome | 73,275 | 26,778 | 100,053 | 11,809 (11.8%) | 63,658 (63.6%) | 24,586 (24.6%) |
| SUCEST project sequences | 29,238 | 9,820 | 39,058 | 4,878 (12.5%) | 25,295 (64.8%) | 8,885 (22.7%) |
Notes
a More than 25% of the population is missing data
b Less than 50 reads on average for the reference alleles
Fig. 1Mosaic plot showing the ploidy levels that produced the highest posterior probabilities for the mapping of GBS sugarcane population data considering the following four pseudo-references: the methyl-filtered sugarcane genome, Sorghum bicolor genome, RNA-seq sugarcane transcriptome and sequences from the SUCEST project. The areas of the rectangles are proportional to the number of loci that have the same ploidy level, as indicated within each rectangle in parentheses. According to the posterior probabilities calculated for each even-numbered ploidy level within a range from 2 to 20, each locus was classified into one category using the following ad hoc criteria: Category A (green), when the highest posterior probability was greater than or equal to 0.80; Category B (yellow), when no single value of the posterior probability was higher than 0.80 but the sum of the two highest ones was greater than or equal to 0.80; and Category C (red), which included all other cases. In parentheses: the number of loci as a percentage within the given ploidy level and category
Selected loci with good quality (category A) and ploidy (6 through 14) were classified as single-dose markers (SDMs) or multi-dose markers (MDMs). High-quality SDMs (median of all individual a posteriori probabilities > 0.80) were also characterized according to their segregation pattern in the sugarcane mapping population
| Reference | Total | Dosage | High-quality SDM | Segregation pattern | ||
|---|---|---|---|---|---|---|
| MDM | SDM | 1:2:1 | 1:1 | |||
| Methyl-filtered sugarcane genome | 15,906 | 7,014 (44.1%) | 8,892 (55.9%) | 5,266 | 912 (17.3%) | 4,354 (82.7%) |
|
| 11,789 | 5,784 (49.1%) | 6,005 (50.9%) | 3,433 | 605 (17.6%) | 2,828 (82.4%) |
| RNA-seq sugarcane transcriptome | 9,808 | 4,959 (50.6%) | 4,849 (49.4%) | 2,869 | 469 (16.4%) | 2,400 (83.6%) |
| SUCEST project sequences | 3,433 | 1,736 (50.6%) | 1,697 (49.4%) | 983 | 141 (14.3%) | 842 (85.7%) |
Fig. 2Circular plot showing the redundancy between single-dose markers from four pseudo-references (methyl-filtered sugarcane genome, Sorghum bicolor genome, RNA-seq sugarcane transcriptome and SUCEST project sequences) that were used to align the GBS sugarcane tags. The red regions represent redundancy within each pseudo-reference, whereas the green, orange and blue regions represent redundancy between four, three and two pseudo-references, respectively. The remaining grey regions represent loci that are unique to each pseudo-reference
Overall single-dose gel-based and GBS-based markers screened for the progeny of the cross between sugarcane cultivars SP80-3280 and RB835486
| Markers | Gel-based markers | GBS-based markers | |||
|---|---|---|---|---|---|
| Genomic SSR | Genic SSR | TRAP | Total | ||
| Number of SDMs evaluated (gel-based and GBS-based markers) | 109 | 842 | 80 | 7,049 | 8,080 |
| SDMs with 1:1 segregation | 66 | 456 | 23 | 5,757 | 6,302 |
| SDMs with 1:2:1 segregation (GBS-based markers) | - | - | - | 1,292 | 1,292 |
| Double SDMs (gel-based markers) with 3:1 segregation | 43 | 386 | 57 | - | 486 |
| Number of markers with distorted segregation | 25 | 336 | 41 | 0 | 402 |
| Total number (1:1, 1:2:1 and 3:1) feasible for linkage analysis | 84 | 506 | 39 | 7,049 | 7,678 |
Distribution of the different marker types as mapped according to their cross type
| Cross type | Number of markers | ||||
|---|---|---|---|---|---|
| Gel-based markers | |||||
| Genomic SSR | Genic SSR | TRAP | GBS-based markers | Total | |
| D1.10 ( | - | - | - | 500 | 500 |
| D1.13 ( | 4 | 14 | 0 | - | 18 |
| D2.15 ( | - | - | - | 180 | 180 |
| D2.18 ( | 4 | 22 | 0 | - | 26 |
| B3.7 ( | - | - | - | 254 | 254 |
| C.8 ( | 2 | 13 | 0 | - | 15 |
| Total | 10 | 49 | 0 | 993 | 993 |
Number of each type of mapped marker within each homo(eo)logous group (HG), number of linkage groups (LGs) within each HG, the length of each HG in centimorgans (cM) and the marker density in cM of each HG for the genetic map construct from a progeny of a cross between sugarcane cultivars SP80-3280 and RB835486
| HG | No. LGs | No. SSR | No. GBS-based markers | No. mapped markers | Length of HG (cM) | Marker density (cM) |
|---|---|---|---|---|---|---|
| 1 | 2 | 0 | 11 | 11 | 48.90 | 4.44 |
| 2 | 2 | 0 | 8 | 8 | 61.72 | 7.71 |
| 3 | 3 | 5 | 21 | 26 | 157.14 | 6.04 |
| 4 | 2 | 0 | 9 | 9 | 59.09 | 6.56 |
| 5 | 4 | 5 | 17 | 22 | 100.45 | 4.56 |
| 6 | 2 | 2 | 14 | 16 | 144.24 | 9.01 |
| 7 | 2 | 1 | 9 | 10 | 21.45 | 2.14 |
| 8 | 2 | 0 | 8 | 8 | 37.84 | 4.73 |
| 9 | 2 | 4 | 5 | 9 | 53.72 | 5.96 |
| 10 | 3 | 0 | 19 | 19 | 120.08 | 6.32 |
| 11 | 5 | 13 | 31 | 44 | 273.66 | 6.22 |
| 12 | 3 | 5 | 7 | 13 | 40.64 | 3.12 |
| 13 | 2 | 0 | 7 | 7 | 2.91 | 0.41 |
| 14 | 2 | 0 | 9 | 9 | 60.21 | 6.69 |
| 15 | 3 | 0 | 13 | 13 | 69.55 | 5.35 |
| 16 | 4 | 10 | 8 | 18 | 120.73 | 6.70 |
| 17 | 3 | 0 | 14 | 14 | 15.35 | 1.09 |
| 18 | 2 | 4 | 3 | 7 | 37.26 | 5.32 |
| Unassigned in HG | 175 | 10 | 721 | 730 | 2,257.10 | 3.09 |
| Total | 223 | 59 | 934 | 993 | 3,682.04 | 3.70 |
QTLs mapped for BRIX, POL%C, SD and FIB traits by applying a CIM model in Araras (Location 1) and Ipaussu (Location 2)
| Location | QTL | Trait | LGa | Position (cM)a | Flanking Markersb | Global LODb | R2 c | Additive effect SP80-3280d | LODe | Additive effect RB835486d | LODe | Dominance effectd | LODe | Segregf |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | B1 | BRIX | 4 | 43.32 | mf60753_2013 (QTL) | 5.75 | 9.19 | −0.31 | 5.22 | 0.08 | 0.25 | 0.04 | 0.09 | 1:1 |
| 1 | B2 | BRIX | 47 | 40.00 | SCSFAM1074E10_287 - QTL - mf16592_3766 | 4.79 | 4.78 | −0.23 | 2.22 | 0.28 | 3.77 | −0.17 | 1.26 | 3:1 |
| 2 | B3 | BRIX | 4 | 43.32 | mf60753_2013 (QTL) | 4.24 | 7.65 | −0.23 | 3.84 | 0.06 | 0.16 | 0.01 | 0.01 | 1:1 |
| 1 | P1 | POL%C | 4 | 43.32 | mf60753_2013 (QTL) | 4.18 | 8.10 | −0.23 | 4.14 | 0.14 | 0.84 | 0.08 | 0.27 | 1:2:1 |
| 2 | P2 | POL%C | 4 | 43.32 | mf60753_2013 (QTL) | 4.17 | 8.09 | −0.23 | 4.13 | 0.14 | 0.83 | 0.08 | 0.27 | 1:1 |
| 2 | SD1 | SD | 29 | 2.00 | sb2_61882838 - QTL - sb2_61882853 | 6.02 | 5.38 | 0.36 | 1.66 | 0.64 | 4.57 | −0.54 | 3.32 | 3:1 |
| 1 | FIB1 | FIB | 46 | 171.00 | sb1_29954417 - QTL - mf125302_409 | 4.77 | 2.71 | −0.47 | 3.11 | −0.75 | 3.95 | 0.38 | 3.93 | 3:1 |
aLG: Linkage groups; Position (cM): QTL position on LG; bAdjacent markers for QTLs and associated LODs; cExplained phenotypic variation; dAdditive effects of parents and dominance effects; eLODs of the additive and dominance effects; fEstimation of the segregation pattern of the QTLs
Fig. 3Composite interval mapping (CIM) for soluble solid content (BRIX, in °Brix), sucrose content of cane (POL%C, in %), stalk diameter (SD, in mm) and fiber content (FIB, in %) from the SP80-3280 and RB835486 F1 population. Blue and yellow dotted lines indicate the LOD thresholds for Ipaussu-SP and Araras-SP, respectively, obtained after permutation tests. The portions highlighted in gray in the linkage groups show the positions of the QTLs
Functional description of the sequences that gave rise to adjacent markers of the mapped QTLs for the traits BRIX, POL%C, SD and FIB, and references regarding their functions in plants
| Marker | QTLs | LG | Locations | Traits | Description | e-value | Reference |
|---|---|---|---|---|---|---|---|
| mf60753_2013 | B1, B3, P1, P2 | 4 | 1 and 2 | BRIX and POL%C | No homology found | - | - |
| SCSFAM1074E10_287 | B2 | 47 | 1 | BRIX | Extended synaptotagmin-1-like [ | 2.6e-26 | [ |
| mf16592_3766 | B2 | 47 | 1 | BRIX | Hypothetical protein SORBIDRAFT_03 g038130 [ | 1.2e-18 | Unknown function |
| sb2_61882838 | SD1 | 29 | 2 | SD | Hypothetical protein SORBIDRAFT_02 g026690 [ | 4.0e-13 | Unknown function |
| sb2_61882853 | SD1 | 29 | 2 | SD | Hypothetical protein SORBIDRAFT_02 g026690 [ | 4.0e-13 | Unknown function |
| mf125302_409 | FIB1 | 46 | 1 | FIB | Zinc finger protein CONSTANS-LIKE 15 [ | 0.0 | [ |
| sb1_29954417 | FIB1 | 46 | 1 | FIB | Transposon mutator sub-class [ | 2.0e-177 | [ |