| Literature DB >> 25393538 |
Newton Medeiros Vidal1, Ana Laura Grazziotin1, Helaine Christine Cancela Ramos2, Messias Gonzaga Pereira2, Thiago Motta Venancio1.
Abstract
Carica papaya (papaya) is an economically important tropical fruit. Molecular marker-assisted selection is an inexpensive and reliable tool that has been widely used to improve fruit quality traits and resistance against diseases. In the present study we report the development and validation of an atlas of papaya simple sequence repeat (SSR) markers. We integrated gene predictions and functional annotations to provide a gene-centered perspective for marker-assisted selection studies. Our atlas comprises 160,318 SSRs, from which 21,231 were located in genic regions (i.e. inside exons, exon-intron junctions or introns). A total of 116,453 (72.6%) of all identified repeats were successfully mapped to one of the nine papaya linkage groups. Primer pairs were designed for markers from 9,594 genes (34.5% of the papaya gene complement). Using papaya-tomato orthology assessments, we assembled a list of 300 genes (comprising 785 SSRs) potentially involved in fruit ripening. We validated our atlas by screening 73 SSR markers (including 25 fruit ripening genes), achieving 100% amplification rate and uncovering 26% polymorphism rate between the parental genotypes (Sekati and JS12). The SSR atlas presented here is the first comprehensive gene-centered collection of annotated and genome positioned papaya SSRs. These features combined with thousands of high-quality primer pairs make the atlas an important resource for the papaya research community.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25393538 PMCID: PMC4231050 DOI: 10.1371/journal.pone.0112654
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Distribution of SSR type according to genomic location.
Number of simple sequence repeats by genomic location.
| SSR type | ||||||
| Genomic location | Di | Tri | Tetra | Penta | Hexa | Total |
| Exon | 130 | 3588 | 117 | 206 | 1303 | 5344 |
| Exon-intron | 7 | 27 | 4 | 4 | 31 | 73 |
| Intron | 7158 | 1959 | 2343 | 1878 | 2476 | 15814 |
| Intergenic | 54853 | 24841 | 19584 | 19956 | 19853 | 139087 |
|
|
|
|
|
|
|
|
Distribution of SSRs by genomic location.
| Motif | Exon | Exon-intron | Intron | Intergenic | Total | |||||
| # SSRs | Percent(%) | # SSRs | Percent(%) | # SSRs | Percent(%) | # SSRs | Percent(%) | # SSRs | Percent(%) | |
|
| 130 | 2.4 | 7 | 9.6 | 7158 | 45.3 | 54853 | 39.4 | 62148 | 38.8 |
| AT/TA | 6 | 4.6 | 0 | 0.0 | 4554 | 63.6 | 38906 | 70.9 | 43466 | 69.9 |
| AC/CA/GT/TG | 8 | 6.2 | 1 | 14.3 | 1049 | 14.7 | 7307 | 13.3 | 8365 | 13.5 |
| AG/GA/CT/TC | 115 | 88.5 | 6 | 85.7 | 1546 | 21.6 | 8584 | 15.6 | 10251 | 16.5 |
| CG/GC | 1 | 0.8 | 0 | 0.0 | 9 | 0.1 | 56 | 0.1 | 66 | 0.1 |
|
| 3588 | 67.1 | 27 | 37.0 | 1959 | 12.4 | 24841 | 17.9 | 30415 | 19.0 |
| AAC/ACA/CAA/GTT/TGT/TTG | 153 | 4.3 | 0 | 0.0 | 111 | 5.7 | 728 | 2.9 | 992 | 3.3 |
| AAG/AGA/GAA/CTT/TCT/TTC | 1236 | 34.4 | 7 | 25.9 | 546 | 27.9 | 5387 | 21.7 | 7176 | 23.6 |
| AAT/ATA/TAA/ATT/TAT/TTA | 78 | 2.2 | 0 | 0.0 | 913 | 46.6 | 13250 | 53.3 | 14241 | 46.8 |
| GGA/GAG/AGG/TCC/CTC/CCT | 406 | 11.3 | 3 | 11.1 | 44 | 2.2 | 537 | 2.2 | 990 | 3.3 |
| GGC/GCG/CGG/GCC/CGC/CCG | 102 | 2.8 | 1 | 3.7 | 10 | 0.5 | 260 | 1.0 | 373 | 1.2 |
| GGT/GTG/TGG/ACC/CAC/CCA | 316 | 8.8 | 7 | 25.9 | 49 | 2.5 | 350 | 1.4 | 722 | 2.4 |
| ACG/CGA/GAC/CGT/TCG/GTC | 89 | 2.5 | 2 | 7.4 | 7 | 0.4 | 107 | 0.4 | 205 | 0.7 |
| ACT/CTA/TAC/AGT/TAG/GTA | 225 | 6.3 | 1 | 3.7 | 88 | 4.5 | 2326 | 9.4 | 2640 | 8.7 |
| AGC/GCA/CAG/GCT/TGC/CTG | 372 | 10.4 | 4 | 14.8 | 26 | 1.3 | 278 | 1.1 | 680 | 2.2 |
| ATC/TCA/CAT/GAT/TGA/ATG | 611 | 17.0 | 2 | 7.4 | 165 | 8.4 | 1618 | 6.5 | 2396 | 7.9 |
|
| 117 | 2.2 | 4 | 5.5 | 2343 | 14.8 | 19584 | 14.1 | 22048 | 13.8 |
|
| 206 | 3.9 | 4 | 5.5 | 1878 | 11.9 | 19956 | 14.3 | 22044 | 13.8 |
|
| 1303 | 24.4 | 31 | 42.5 | 2476 | 15.7 | 19853 | 14.3 | 23663 | 14.8 |
|
|
|
|
|
|
|
|
|
|
|
|
Figure 2Percentage frequency of SSR according to SSR size.
Distribution of Class I and Class II SSRs in different genomic regions.
| Exon | Exon-Intron | Intron | Intergenic | ||||||
| SSR size (bp) | # SSRs | Percent (%) | # SSRs | Percent (%) | # SSRs | Percent (%) | # SSRs | Percent (%) | |
|
| 12 | 1100 | 20.6 | 13 | 17.8 | 1450 | 9.2 | 13749 | 9.9 |
| 13 | 683 | 12.8 | 9 | 12.3 | 1883 | 11.9 | 16031 | 11.5 | |
| 14 | 665 | 12.4 | 10 | 13.7 | 2237 | 14.1 | 19308 | 13.9 | |
| 15 | 796 | 14.9 | 9 | 12.3 | 2474 | 15.6 | 20879 | 15.0 | |
| 16 | 433 | 8.1 | 8 | 11.0 | 1542 | 9.8 | 13481 | 9.7 | |
| 17 | 456 | 8.5 | 6 | 8.2 | 1280 | 8.1 | 11760 | 8.5 | |
| 18 | 203 | 3.8 | 3 | 4.1 | 861 | 5.4 | 7970 | 5.7 | |
| 19 | 158 | 3.0 | 2 | 2.7 | 763 | 4.8 | 6957 | 5.0 | |
|
|
| 84.1 |
| 82.2 |
| 79.0 |
| 79.2 | |
|
| 20 | 178 | 3.3 | 4 | 5.5 | 522 | 3.3 | 5275 | 3.8 |
| 21 | 104 | 1.9 | 2 | 2.7 | 449 | 2.8 | 4145 | 3.0 | |
| 22 | 81 | 1.5 | 2 | 2.7 | 382 | 2.4 | 3223 | 2.3 | |
| 23 | 91 | 1.7 | 1 | 1.4 | 340 | 2.1 | 3174 | 2.3 | |
| 24 | 52 | 1.0 | 1 | 1.4 | 236 | 1.5 | 2095 | 1.5 | |
| 25 | 54 | 1.0 | 1 | 1.4 | 213 | 1.3 | 1741 | 1.3 | |
| 26 | 36 | 0.7 | 0 | 0.0 | 176 | 1.1 | 1725 | 1.2 | |
| 27 | 30 | 0.6 | 0 | 0.0 | 169 | 1.1 | 1340 | 1.0 | |
| 28 | 28 | 0.5 | 1 | 1.4 | 121 | 0.8 | 953 | 0.7 | |
| 29 | 47 | 0.9 | 0 | 0.0 | 104 | 0.7 | 953 | 0.7 | |
| 30 | 24 | 0.4 | 1 | 1.4 | 80 | 0.5 | 686 | 0.5 | |
| >30 | 125 | 2.3 | 0 | 0.0 | 532 | 3.4 | 3642 | 2.6 | |
|
|
|
|
|
|
|
|
|
| |
|
|
|
|
|
|
|
|
|
| |
Number of successfully designed primer pairs for SSR type and linkage group.
| SSRType | LG1 | LG2 | LG3 | LG4 | LG5 | LG6 | LG7 | LG8+LG10 | LG9 | Un | Total | |||||||||||
| # | % | # | % | # | % | # | % | # | % | # | % | # | % | # | % | # | % | # | % | # | % | |
|
| 3669 | 41.3 | 4254 | 41.3 | 4212 | 40.4 | 3433 | 40.9 | 3154 | 39.5 | 4429 | 41.2 | 3221 | 41.7 | 3988 | 41.7 | 4446 | 41.1 | 10200 | 35.6 | 45008 | 39.7 |
|
| 1645 | 18.5 | 1922 | 18.7 | 1892 | 18.2 | 1565 | 18.6 | 1414 | 17.7 | 1871 | 17.4 | 1307 | 16.9 | 1639 | 17.1 | 1937 | 17.9 | 5483 | 19.2 | 20677 | 18.2 |
|
| 1233 | 13.9 | 1357 | 13.2 | 1351 | 13.0 | 1163 | 13.8 | 1068 | 13.4 | 1436 | 13.4 | 1003 | 13.0 | 1242 | 13.0 | 1458 | 13.5 | 3780 | 13.2 | 15093 | 13.3 |
|
| 1066 | 12.0 | 1271 | 12.4 | 1430 | 13.7 | 1048 | 12.5 | 1114 | 14.0 | 1342 | 12.5 | 1029 | 13.3 | 1252 | 13.1 | 1347 | 12.4 | 4539 | 15.9 | 15441 | 13.6 |
|
| 1264 | 14.2 | 1486 | 14.4 | 1531 | 14.7 | 1193 | 14.2 | 1225 | 15.4 | 1659 | 15.5 | 1158 | 15.0 | 1449 | 15.1 | 1639 | 15.1 | 4621 | 16.1 | 17227 | 15.2 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
LG represents the Linkage Groups associated to the nine papaya chromosomes.
LG8 and LG10 are associated to the same chromosome.
SSRs located in scaffolds/contigs were classified as unplaced.
Number of successfully designed primer pairs for each genomic context and linkage group.
| Genomic location | LG1 | LG2 | LG3 | LG4 | LG5 | LG6 | LG7 | LG8+LG10 | LG9 | Un | Total | |||||||||||
| # | % | # | % | # | % | # | % | # | % | # | % | # | % | # | % | # | % | # | % | # | % | |
|
| 443 | 5.0 | 576 | 5.6 | 466 | 4.5 | 385 | 4.6 | 349 | 4.4 | 540 | 5.0 | 316 | 4.1 | 414 | 4.3 | 520 | 4.8 | 693 | 2.4 | 4702 | 4.1 |
|
| 10 | 0.1 | 7 | 0.1 | 9 | 0.1 | 5 | 0.1 | 8 | 0.1 | 8 | 0.1 | 5 | 0.1 | 4 | 0.0 | 2 | 0.0 | 5 | 0.0 | 63 | 0.1 |
|
| 1457 | 16.4 | 1468 | 14.3 | 1442 | 13.8 | 1231 | 14.7 | 1140 | 14.3 | 1589 | 14.8 | 1029 | 13.3 | 1346 | 14.1 | 1440 | 13.3 | 2018 | 7.1 | 14160 | 12.5 |
|
| 6967 | 78.5 | 8239 | 80.1 | 8499 | 81.6 | 6781 | 80.7 | 6478 | 81.2 | 8600 | 80.1 | 6368 | 82.5 | 7806 | 81.6 | 8865 | 81.9 | 25907 | 90.5 | 94521 | 83.3 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
LG represents the Linkage Groups associated to the nine papaya chromosomes.
LG8 and LG10 are associated to the same chromosome.
SSRs located in scaffolds/contigs were classified as unplaced.