| Literature DB >> 22726804 |
José Blanca1, Cristina Esteras, Pello Ziarsolo, Daniel Pérez, Victoria Fernã Ndez-Pedrosa, Carmen Collado, Raquel Rodrã Guez de Pablos, Alida Ballester, Cristina Roig, Joaquín Cañizares, Belén Picó.
Abstract
BACKGROUND: Melon (Cucumis melo L.) is a highly diverse species that is cultivated worldwide. Recent advances in massively parallel sequencing have begun to allow the study of nucleotide diversity in this species. The Sanger method combined with medium-throughput 454 technology were used in a previous study to analyze the genetic diversity of germplasm representing 3 botanical varieties, yielding a collection of about 40,000 SNPs distributed in 14,000 unigenes. However, the usefulness of this resource is limited as the sequenced genotypes do not represent the whole diversity of the species, which is divided into two subspecies with many botanical varieties variable in plant, flowering, and fruit traits, as well as in stress response. As a first step to extensively document levels and patterns of nucleotide variability across the species, we used the high-throughput SOLiD™ system to resequence the transcriptomes of a set of 67 genotypes that had previously been selected from a core collection representing the extant variation of the entire species.Entities:
Mesh:
Year: 2012 PMID: 22726804 PMCID: PMC3473316 DOI: 10.1186/1471-2164-13-280
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Origin and characteristics of genotypes included in the 8 pools sequenced with SOLiD
| Pool 1: African | |||||
| Tibish/CO199 | Sudan | MELRIP | Mostly monoecious. Mostly small inedible fruits (<5 cm). Round to oval. Light green-white flesh. Non climacteric. No aroma. No sugar. | 43.7 | 30,620,160 |
| Fadasi/CO133 | Sudan | MELRIP | | | |
| HSD/CO145 | Sudan | MELRIP | | | |
| Tayer/CO195 | Cameroon | MELRIP | | | |
| Agrestis/CUM 287 | Nigeria | IPK | | | |
| Pool 2: Asian | |||||
| Agrestis Wild chibbar/CO204 | India | COMAV | Monoecious. Small to medium sized fruits. Oval, elliptic to elongated. Mostly non climacteric. White-light orange flesh. No aroma. No sugar. Low pH. | 56.7 | 15,779,803 |
| Acidulus SLK/CO187 | Sri Lanka | MELRIP | | | |
| Agrestis Meloncito/CO153 | India | COMAV | | | |
| Acidulus TGR 1551/PI 482420 | Zimbabwe | NPGS | | | |
| Voatango/CO202 | Madagascar | MELRIP | | | |
| Arya/CO115 | India | COMAV | | | |
| Pool 3: Far East | |||||
| Korea | COMAV | Andromonoecious-hermaphroditic. Medium sized fruits. Flat, round to elongated. White-green-light orange flesh. Non climacteric-medium climacteric. Medium aroma. Medium sugar. | 56.1 | 17,962,640 | |
| Freemans´s Cucumber/CO136 | Japan | COMAV | | | |
| Korea | NPGS | | | ||
| Nabunkin/CO153 | China | MELRIP | | | |
| Paul/CO169 | Poland | MELRIP | | | |
| Pool 4: Middle East and Indian | |||||
| Momordica/PI124112 | India | NPGS | Andromonoecious-monoecious. Round, flat, oval to very elongated fruits. White-light orange flesh. Climacteric. No to intermediate sugar. Medium to strong aroma. Low pH. | 55.3 | 23,320,668 |
| Momordica/PI124111 | India | NPGS | | | |
| Momordica/CUM 438 | India | IPK | | | |
| Snakemelon/CO188 | Saudi Arabia | MELRIP | | | |
| Flexuosus/CUM 353 | Iraq | IPK | | | |
| Flexuosus/CUM 225 | India | IPK | | | |
| Dudaim Queen´s pocket melon/CO180 | Afghanistan | COMAV | | | |
| Pool 5: Group | |||||
| Israel | COMAV | Andromonoecious-monoecious- gynoecious. Medium to large size fruits. Flat to oval. Round or ribbed. Green- orange flesh. Climacteric. Sweet. Aromatic. | 48.1 | 23,237,004 | |
| Noir des carmes/CO161 | France | COMAV | | | |
| Prescott Fond Blanc/CO179 | France | COMAV | | | |
| TopMARK/NSL30032 | USA | NPGS | | | |
| Nantais Oblong/CO159 | France | MELRIP | | | |
| Gynadou/CO141 | France | MELRIP | | | |
| Cantalupd´alger/CO121 | France | MELRIP | | | |
| USA | MELRIP | | | | |
| 5 Charentais breeding lines | Spain | S.Fitó S.A | | | |
| Pool 6: Group | |||||
| Honeydew/CO143 | USA | COMAV | Monoecious-andromonoecious. Medium-size fruits. Oval, flat to elongated. White-green-light orange flesh. Climacteric. Sweet, variable sugar content. Medium to low aroma. | 34.3 | 8,367,385 |
| Kirkagac/CO150 | Turkey | COMAV | | | |
| Muchanesvi/CO156 | Georgia | MELRIP | | | |
| Baskavas/CO118 | Greece | MELRIP | | | |
| Korca/Cum168 | Rusia | IPK | | | |
| Kiziluruk/CO96 | Uzbequistan | COMAV | | | |
| Hami melon/CO142 | China | COMAV | | | |
| Winter type/PI169329 | Turkey | NPGS | | | |
| Maazoon/CO85 | Egypt | COMAV | | | |
| Blanco/CO67 | Spain | COMAV | | | |
| Carosello/CO122 | Italy | COMAV | | | |
| Pool 7: | |||||
| Cañadulce/CO48 | Spain | COMAV | Andromonoecious. Large-sized fruits. Round to elliptic. White-Green flesh. Non climacteric. Sweet. Low aroma. | 50.6 | 17,485,023 |
| Madura amarilla/CO58 | Spain | COMAV | | | |
| Erizo/CO75 | Spain | COMAV | | | |
| Amarillo oro/CO79 | Spain | COMAV | | | |
| Escrito oloroso/CO50 | Spain | COMAV | | | |
| Tendral/CO59 | Spain | COMAV | | | |
| Verde pinto/CO73 | Spain | COMAV | | | |
| Coca/CO49 | Spain | COMAV | | | |
| Mochuelo/CO48 | Spain | COMAV | | | |
| Largo de Villaconejos/CO69 | Spain | COMAV | | | |
| 5 Amarillo breeding lines | Spain | S.Fitó S.A | | | |
| Pool 8: | |||||
| Spain | S.Fitó S.A | Andromonoecious. Large-sized fruits. Round to elliptic. White-green flesh. Non climacteric. Sweet. No aroma. | 37.2 | 13,809,773 | |
| 5 Piel de Sapo breeding lines | Spain | S.Fitó S.A | |||
a1 Total number of reads and percentage of reads mapped on to the reference melon genome for SNP mining.
Genotypes marked in bold letters are those used for SNP mining trough transcriptome resequencing with Sanger and 454 in previous assays [20,21,27]. Seed source codes: IPK: Institut ftir Pflanzengenetik und Kulturpflanzenforschung, Gatersleben, Germany; NPGS: GRIN NPGS, National Plant System, USDA, USA; MELRIP (2007–2010): ERA-PG project (GEN2006-27773-C2-2-E); COMAV, Institute for the Conservation and Breeding of Agricultural Biodiversity, UPV, Spain; Semillas Fitó S.A. (Barcelona).
Figure 1MAF distribution of SNPs selected in the different sequenced pools. The number of SNPs with different MAF values is represented for each pool.
SNPs identified in the eight pools of genotypes resequenced with SOLiD
| 1) African | 13,230,637 bp | 117,204 | 8.9 | 9,133 (7.8) |
| 2) Asian | 14,275,353 bp | 96,460 | 6.8 | 10,197 (10.6) |
| 3) Far East | 13,218,638 bp | 81,807 | 6.2 | 1,305 (1.6) |
| Intermediate types | ||||
| 4) Middle East and Indian | 15,745,206 bp | 132,792 | 8.4 | 13,826 (10.1) |
| 5) Group | 13,982,666 bp | 102,565 | 7.3 | 6,317 (6.2) |
| 6) Group | 4,430,082 bp | 40,762 | 9.2 | 2,417 (5.9) |
| 7) | 12,505,399 bp | 79,551 | 6.4 | 3,210 (4.0) |
| 8) | 8,680,064 bp | 43,363 | 4.9 | 1,396 (3.2) |
1 Number of nucleotides sequenced at least 6 times used for SNP mining in each pool.
2 Total number of SNPs detected within each pool (SNPs with two or more alleles within the corresponding group).
3 Total number of highly variable SNPs (those with a frequency of the major allele, MAF, <0.7). In parentheses the percentage over the total number of SNPs is indicated.
Number of SNPs shared and differential between groups
| | Pool8 | Pool7 | Pool6 | Pool5 | Pool4 | Pool3 | Pool2 | Pool1 |
| | 15,560 | 9,149 | 19,451 | 21,566 | 14,168 | 14,488 | 18,172 | |
| Pool8 | | (36/20) | (21/22) | (45/19) | (50/16) | (33/17) | (33/15) | (42/16) |
| 1,564 | | 16,255 | 33,966 | 40,668 | 25,165 | 26,914 | 32,236 | |
| Pool7 | | | (20/40) | (43/33) | (51/31) | (32/31) | (34/28) | (41/26) |
| 3, 260 | 2,722 | | 19,727 | 23488 | 13,541 | 15,589 | 17,157 | |
| Pool6 | | | | (19/48) | (58/18) | (33/17) | (38/16) | (42/15) |
| 4,735 | 4,353 | 3, 178 | | 52,514 | 31,461 | 34,510 | 39,016 | |
| Pool5 | | | | | (51/40) | (31/39) | (34/36) | (38/33) |
| 4,441 | 4, 484 | 2, 417 | 4,224 | | 38,384 | 47,491 | 46,865 | |
| Pool4 | | | | | | (47/29) | (49/36) | (40/35) |
| 19,942 | 20,281 | 12,628 | 14,978 | 9,576 | | 27,670 | 33,783 | |
| Pool3 | | | | | | | (29/34) | (35/29) |
| 11,402 | 12,009 | 6,837 | 9,577 | 5,273 | 6,474 | | 36,162 | |
| Pool2 | | | | | | | | (44/31) |
| 20,501 | 20,931 | 14,132 | 18,294 | 13, 070 | 21,490 | 11,180 | | |
| Pool1 |
Numbers in the upper half of the table indicate the number of common SNPs between each pair of libraries. Numbers in brackets indicate percentages these common SNPs represent of the total SNP set detected within the corresponding library (row/column). For example, there are 15,560 SNPs common between pools 8 and 7 (that is with two alleles or more in each of these pools). This number represents the 36% and 20% of the total SNPs detected within Pool 8 and pool 7 (indicated in Table 2) respectively.
Numbers in the lower part of the table indicate the SNPs that are fixed within each pair of libraries, but polymorphic between them. For example, there are 1,564 SNPs for which all reads from pool 8 have one allele and all reads from pool 7 have the alternative allele.
Figure 2Degree of shared polymorphism between the group and the 7 pools of both subspecies. Total number of SNPs in each group is indicated in the center of each circle and the number of shared SNPs in the intersection. Numbers in brackets show the percentage of shared SNPs (first number referred to the total number in each group and second number in the momordica).
Figure 3SNPs detected in the coding region of and Short reads generated by SOLiD in the different pools are represented mapped to the genomic sequence (whole genome draft version 3.5 available in MELONOMICS) of both genes. Coverage in exonic and UTRs regions is shown for each nucleotide. SNPs detected by SOLiD and EcoTILLING are represented by colored bars in the different exons (red, green and yellow for mutations detected only by SOLiD, only by EcoTILLING and by both methods). The structure of each gene as annotated in the genome is shown below. Data are visualized with IGV (Integrative Genomics Viewer)[65].
Polymorphism in and detected by SOLiD sequencing and EcoTILLING[59]
| Coding region of | |||||||||
| 3015744 | 41 | A/G | D14G | Tolerated (1.00/3.02) | 1 | Yes/yes | G:6,7,4 | A:21 | A:5,2,8,- |
| | | 0.68 | | | | | | | |
| 3015944 | 241 | A/G | L46L | | 2 | Yes/yes | G:15,6,5 | A:15 | A:5,5,8,1 |
| | | 0.54 | | | | | | G:1 | G:1,0,1,0 |
| 3016016 | 313 | T/G | L70L | | 2 | Yes/yes | G:0,3,2 | T:12 | T:8,2,11,6 |
| | | 0.99 | | | | | | | |
| 3016091 | 388 | T/A | L95L | | 2 | Yes/yes | T:3,1,0 | T:10 | T:7,-,2,2 |
| | | 0.80 | | | | | A:0,0,6 | | |
| 3016304 | 601 | C/A | L131M | Not tolerated | 3 | Yes/yes | C:8,4,6 | C:12 | C:9,4,20,5 |
| | | 0.87 | | (0.03/3.02) | | | | A:10 | |
| - | 747 | C/T | D179D | | 3 | No/yes | | | |
| 3016920 | 1216 | T/C | V294A | Tolerated | 4 | Yes/no | T:11,7,7 | T:29 | T:18,5,20,7 |
| | | 0.97 | | (0.12/3.03) | | | | C:1 | C:1,0,0,1 |
| Coding region of | |||||||||
| - | 26 | G/A | G9D | Not tolerated | 1 | No/yes | | | |
| | | | | (0.00/4.32)* | | | | | |
| 1030561 | 41 | C/T | A14V | Not tolerated | 1 | Yes/yes | G:2,3,3 | G:3 | G:5,1,5,1 |
| | | 0.90 | | (0.00/4.32)* | | | A:4,0,0 | | |
| 1030440 | 162 | T/C | L54L | | 1 | Yes/yes | G:31,22,3 | G:12 | G:31,1,5,0 |
| | | 0.70 | | | | | T:0,1,0 | T:1 | T:3,3,21,7 |
| 1029938 | 664 | C/T | S112N | Tolerated | 2 | Yes/no | C:66,33,26 | C:25 | C:0,15,55,19 |
| | | 0.99 | | (0.56/3.11) | | | T:2,0,0 | | T:4,0,1,0 |
| 1029710 | 892 | A/G | L153L | | 3 | Yes/no | A:91,49,30 | A:46 | A:75,17,54,30 |
| | | 0.99 | | | | | G:1,0,0 | | G:1,1,0,0 |
| 1029697 | 905 | C/T | K158E | Not tolerated | 3 | Yes/no | T:0,38,13 | T:31 | T:67,10,39,26 |
| | | 0.99 | | (0.02/3.12) | | | C:5,0,0 | | C:3,0,0,0 |
| 1028781 | 1810 | C/T | D178G | Tolerated | 4 | Yes/no | T:0,23,17 | T:17 | T:57,17,27,26 |
| | | 0.99 | | (0.65/3.12) | | | C:7,0,0 | | C:1,0,2,0 |
| 1028629 | 1962 | C/T | K198K | | 5 | Yes/no | T:33,24,26 | T:34 | T:34,4,35,17 |
| | | 0.99 | | | | | | | C:2,0,1,0 |
| 1028619 | 1972 | C/T | S202G | Not tolerated | 5 | Yes/no | T:42,32,33 | T:37 | T:52,8,49,21 |
| 0.99 | (0.00/3.14) | C:2 | C:1,0,0,0 | ||||||
1 Position in the melon genome assembly v3.5 available at MELONOMICS [36].
2 The effect of mutations was analyzed with SIFT [64]. Prediction score and median sequence conservation, respectively, are indicated in brackets. * Low confidence in the prediction (few sequences represented at those positions).
3 Number of reads of each allele are indicated in each pool, (−) means that this nucleotide has not been sequenced in the corresponding pool, numbers are ordered according to pool number, agrestis (pools 1, 2 and 3), momord (pool 4), melo (pools 5, 6, 7, 8).
Number of SNPs meeting different criteria for optimizing validation with the sequenom genotyping array
| MAF 1 | 55,000 | 28,996 | 9 |
| | | (10.21%) | |
| MAF 0.99 | 108,731 | 65,500 | 158 |
| | | (23.07%) | |
| MAF 0.98 | 136,694 | 86,103 | 211 |
| | | (30.32%) | |
| MAF 0.97 | 150,590 | 96,657 | 231 |
| | | (34,04%) | |
| MAF 0.96 | 160,231 | 103,976 | 260 |
| | | (36,61%) | |
| MAF 0.95 | 167,718 | 109,734 | 277 |
| | | (38,64%) | |
| MAF 0.7 | 178,107 | 168,726 | 431 |
| (59,42%) |
1 Those SNPs having the filter CS60_MAF in Additional file 4 are flanked in a window of 60 bp by other SNPs with values of MAF over the threshold indicated. Columns indicate the number of these SNPs in the whole collection, in the subset free from introns in a flanking window of 60 bp (no IS60), and in the subset variable in all groups. These filters for each SNP of the whole collection are included in Additional file 4.
SNPs variable within and between different groups of botanical varieties selected for validation
| | | | | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Piel de Sapo (8) | 13,168 | 4,000 | 6,240 | 5,361 | 1,659 | 2,690 | 34 | 33 (97%) | |
| Piel de Sapo (8) | African | 15,261 | 4,226 | 7,095 | 6,724 | 1,894 | 3,322 | 24 | 24 (100%) |
| African | 13,168 | 3,559 | 5,972 | 5,052 | 1,354 | 2,284 | 12 | 12 (100%) | |
| African | 5,822 | 1,739 | 2,265 | 2,848 | 879 | 1,139 | 15 | 14 (93%) | |
| African | 5,102 | 1,544 | 2,006 | 2,305 | 744 | 954 | 24 | 24 (100%) | |
| Polymorphic in Piel de Sapo | 43,363 | | | 1,305 | | | 19 | 12 (63%) | |
| Polymorphic in all groups | 668 | 9 | 431 | 3 | 0 | 0 | 16 | 13 (81%) | |
1 Pool number indicated.