| Literature DB >> 34611275 |
Deden Derajat Matra1,2, Muh Agust Nur Fathoni3, Muhammad Majiidu4, Hanif Wicaksono5, Agung Sriyono6, Gunawan Gunawan7, Hilda Susanti8, Rismita Sari9, Fitmawati Fitmawati10, Iskandar Zulkarnaen Siregar4,11, Winarso Drajad Widodo3, Roedhy Poerwanto3.
Abstract
Mangifera casturi Kosterm., a mango plant from Kalimantan Selatan, Indonesia, has limited genetic information, severely limiting the research on its genetic variation and phylogeny. We collected M. casturi's genomic information using next-generation sequencing, developed microsatellite markers and performed Sanger sequencing for DNA barcoding analysis. These markers were used to confirm parental origin and genetic diversity of M. casturi hybrids. The clean reads of the Kasturi accession were assembled de novo, producing 259 872 scaffolds (N50 = 1 445 bp). Fourteen polymorphic microsatellite markers were developed from 11 040 microsatellite motif-containing sequences. In total, 58 alleles were produced with a mean of 4.14 alleles per locus. Microsatellite marker analysis revealed broad genetic variation in M. casturi. Phylogenetic analysis was performed using internal transcribed spacers (ITS), matK, rbcL, and trnH-psbA. The phylogenetic tree of chloroplast markers placed Kasturi, Cuban, Pelipisan, Pinari, and Hambawang in one group, with M. indica as the female ancestor. Meanwhile, the phylogenetic tree of ITS markers indicated several Mangifera species as ancestors of M. casturi. Thus, M. casturi very likely originated from the cross-hybridization of multiple ancestors. Furthermore, crossing the F1 hybrids of M. indica and M. quadrifida with other Mangifera spp. may have generated much genetic variation. The genetic information for M. casturi will be a resource for breeding improvement, and conservation studies.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34611275 PMCID: PMC8492637 DOI: 10.1038/s41598-021-99381-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Appearance Fruits of M. casturi hybrids (from left to right: Pinari, Pelipisan, Kasturi, and Cuban) and closely related species (Rawa-rawa). This photo was taken by DDM.
Statistics of de novo assembly from M. casturi using Ray Assembler.
| Features | Number |
|---|---|
| Raw reads (bases) | 73.44 million (11.01 Gbp) |
| Clean reads (bases) | 73.10 million (10.95 Gbp) |
| Number of Scaffolds | 259 872 |
| N50 | 1 445 bp |
| Mean length | 947.68 bp |
| Longest Scaffold | 144 601 bp |
Summarized benchmarks in BUSCO annotation from M. casturi scaffolds.
| No | Categories | Number | Ratio (%) |
|---|---|---|---|
| 1 | Complete and single-copy BUSCOs (S) | 608 | 42.3 |
| 2 | Complete and duplicated BUSCOs (D) | 36 | 2.5 |
| 3 | Fragmented BUSCOs (F) | 241 | 16.7 |
| 4 | Missing BUSCOs (M) | 555 | 38.5 |
Number of microsatellite regions observed in M. casturi scaffolds, total and subdivided by motif.
| Characteristics | Number |
|---|---|
| Total number of identified SSRs | 11 040 |
| Number of SSR containing contig | 10 160 |
| Contig containing more than 1 SSR | 770 |
| SSRs present in compound formation | 272 |
| Dinucleotide | 3 680 |
| Trinucleotide | 5 826 |
| Tetranucleotide | 1 194 |
| Pentanucleotide | 213 |
| Hexanucleotide | 102 |
| Heptanucleotide | 25 |
Summary statistics of the fourteen analyzed microsatellite loci.
| No | Locus | Primer sequence (5'-3') | Sequence Motif | Allele size range | NA | DDBJ accession | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| 1 | mc122955 | F : TGTTGATGGTAAGGATTTGGTGT | (GGATG)6 | 168–178 | 2 | 0.50 | 0.50 | 0.60 | 0.69 | LC594546 |
| R : TCAGGTGAGTATGTATTGTGCA | ||||||||||
| 2 | mc148231 | F : TCCCTCCCCTAAACCCTTCT | (ACCCTAA)5 | 188–209 | 4 | 0.68 | 0.68 | 0.76 | 1.24 | LC594549 |
| R : GCTTCTCCTTGCCTCTAAATCCT | ||||||||||
| 3 | mc151578 | F : GAGCCTTGTACTCGTTCAATGA | (CAAGCT)8 | 273–279 | 5 | 0.25 | 0.78 | 0.68 | 1.56 | LC594547 |
| R : ACGAGCTTAAAATGAGTTTGACT | ||||||||||
| 4 | mc167596 | F : AGCTGAACCTTGTTGCCCTT | (GA)27 | 192–224 | 3 | 0.16 | 0.40 | 0.58 | 0.72 | LC594539 |
| R : TCTGCTTGTTGGAACTGAACA | ||||||||||
| 5 | mc176197 | F : TGTATGCCCGAATTGTTCCAAC | (AC)19 | 237–250 | 3 | 0.50 | 0.62 | 0.20 | 1.04 | LC594537 |
| R : GCTGGCTTTAATGGAAGTTGCA | ||||||||||
| 6 | mc211123 | F : GGATGGTGGATGTCAGATTTTCG | (TGAAGT)6 | 323–339 | 6 | 0.60 | 0.76 | 0.21 | 1.60 | LC594548 |
| R : CGAAGAGAACGGGTCCCTTG | ||||||||||
| 7 | mc21672 | F : TGGTTGGTAAGAAGTAGGATTC | (ATAC)11 | 263–264 | 4 | 0.00 | 0.61 | 1.00 | 1.15 | LC594543 |
| R : CACAATGCAAATCACTCCTC | ||||||||||
| 8 | mc230178 | F : AGACAGCCATAATTTGCCCCA | (ATG)12 | 162–188 | 7 | 0.62 | 0.80 | 0.22 | 1.75 | LC594541 |
| R : GCTGGAGGTTGATCAGGGTC | ||||||||||
| 9 | mc28107 | F : GGTGTGCGTTCTGTTTTGACA | (TG)28 | 211–250 | 5 | 0.00 | 0.78 | 1.00 | 1.55 | LC594540 |
| R : CAGCAGCATCAACACAAGCA | ||||||||||
| 10 | mc4673 | F : TTTCCAAAGCCAAGACTCTC | (TAAACCC)5 | 231–245 | 3 | 0.25 | 0.66 | 0.62 | 1.08 | LC594550 |
| R : AAAATTGTATTCATTAAGCCCCT | ||||||||||
| 11 | mc58089 | F : TCTTGTCGTCGAATCAAACTCA | (AT)22 | 264–287 | 7 | 0.38 | 0.76 | 0.51 | 1.66 | LC594538 |
| R : CTCGGTCTATCAATGGTGTAGGT | ||||||||||
| 12 | mc8693 | F : CGAAGGGTTGAGGTTTGGGT | (CTTTT)7 | 159–183 | 4 | 0.62 | 0.68 | 0.08 | 1.22 | LC594545 |
| R : AAAGAGTGAGAGGGTTGCGT | ||||||||||
| 13 | mc88075 | F : CTCCAATCGAACAACCCAGC | (TTA)15 | 278–286 | 3 | 0.00 | 0.57 | 1.00 | 0.96 | LC594542 |
| R : AGGGGTGCATATGGAGGATT | ||||||||||
| 14 | mc88387 | F : CCATTTCGACGATGTTGGAAGT | (TATG)10 | 251–252 | 2 | 0.00 | 0.50 | 1.00 | 0.69 | LC594544 |
| R : GCAACCCTTACCAACAAGCA |
F forward primer sequence, R reverse primer sequence, NA number of alleles, H observed heterozygosity, H expected heterozygosity, F fixation index, I Shannon's information index.
Allele size information of microsatellite loci.
| No | Locus | Allele size (bp) |
|---|---|---|
| 1 | mc122955 | 168,178 |
| 2 | mc148231 | 188,196,203,209 |
| 3 | mc151578 | 256,270,273,279,284 |
| 4 | mc167596 | 193,224,226 |
| 5 | mc176197 | 235,237,249 |
| 6 | mc211123 | 318,323,333,335,339, 340 |
| 7 | mc21672 | 255,256,257,263 |
| 8 | mc230178 | 162,164,170,173,178,185,188 |
| 9 | mc28107 | 204,211,214,225,250 |
| 10 | mc4673 | 231,238,246 |
| 11 | mc58089 | 261,264,273,279,282,283,287 |
| 12 | mc8693 | 157,160,167,182 |
| 13 | mc88075 | 267,278,286 |
| 14 | mc88387 | 251,253 |
Figure 2Principal Coordinates Analysis (PCoA) via covariance matrix with data standardization in four M. casturi hybrids and three closely related species using 14 microsatellite loci (1: Hambawang (M. foetida), 2: Pelipisan, 3: Cuban, 4: Pinari, 5: Kasturi, 6: M. quadrifida, 7: Rawa-rawa, 8: M. indica).
Figure 3Dendrogram for UPGMA analysis in four M. casturi hybrids and four closely related species using 14 microsatellite loci.
Figure 4Phylogenetic analysis of M. casturi hybrids compared to the other Mangifera species (deposited in NCBI, Supplementary Table 2) using (a) matK, (b) rbcL, (c) trnH-psbA, and (d) internal transcribed spacers (ITS) by Maximum Likelihood method. (The evolutionary history was inferred by using the Maximum Likelihood method and Tamura 3-parameter model for matK, trnH-psbA, ITS and Jukes-Cantor model for rbcL. The bootstrap consensus tree was inferred from 10 000 replicates. The tree is rooted with the outgroup, Anacardium occidentale.).