| Literature DB >> 33028920 |
Jeremy R Shearman1, Chutima Sonthirod1, Chaiwat Naktang1, Duangjai Sangsrakru1, Thippawan Yoocha1, Ratchanee Chatbanyong2, Siriporn Vorakuldumrongchai2, Orwintinee Chusri2, Sithichoke Tangphatsornruang1, Wirulda Pootakham3.
Abstract
We have assembled the complete sequence of the Durio zibethinus chloroplast genome using long PacBio reads. Durian is a valuable commercial tree that produces durian fruit, which is popular in Southeast Asia. The chloroplast genome assembled into a single 143 kb cyclic contig that contained 111 genes. There were 46 short direct repeats (45 to 586 bp) and five short inverted repeats (63 to 169 bp). The long reads that were used for the assembly span the entire chloroplast with > 10 kb overlaps and multiple long reads join the start of the contig to the end of the contig. The durian chloroplast was found to lack the large inverted repeat that is common in chloroplast genomes. An additional 24 durian varieties were sequenced and compared to the assembly and found to also lack the large inverted repeat. There were nine SNPs among the varieties.Entities:
Mesh:
Year: 2020 PMID: 33028920 PMCID: PMC7541610 DOI: 10.1038/s41598-020-73549-4
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Structure of the Durio zibethinus chloroplast genome showing gene location and exon structure. Gray arrows at the top indicate transcription direction and gene location on the plus or minus strand is indicated by the exon being outside or inside the circle, respectively. GC content is indicated as a histogram on the inner circle. The sequence that typically comprises the IR is marked using the black line.
Genes encoded on the Durio zibethinus chloroplast genome, grouped according to function.
| Category | Gene groups | Name of genes |
|---|---|---|
| Self-replication | Large subunit of ribosomal proteins | rpl2, rpl14, rpl16, rpl20, rpl232, rpl32, rpl33, rpl36 |
| Small subunit of ribosomal proteins | rps2, rps3, rps4, rps7, rps8, rps11, rps12, rps14, rps15, rps16, rps18, rps19 | |
| DNA-dependent RNA polymerase | rpoA, rpoB, rpoC1, rpoC2 | |
| Ribosomal RNA genes | rrn4.5, rrn5, rrn16, rrn23 | |
| Transfer RNA genes | trnA-UGC, trnC-GCA, trnD-GUC, trnE-UUC, trnF-GAA, trnfM-CAU, trnG-GCC, trnG-UCC, trnH-GUG, trnI-CAU, trnI-GAU, trnL-CAA, trnL-UAG, trnL-UAA, trnM-CAU, trnN-GUU, trnP-UGG, trnQ-UUG, trnR-ACG, trnR-UCU, trnS-GCU, trnS-UGA, trnS-GGA, trnT-UGU, trnT-GGU, trnV-GAC, trnV-UAC, trnW-CCA, trnY-GUA | |
| Photosynthesis | Photosystem I | psaA, psaB, psaC, psaI, psaJ |
| Photosystem II | psbA, psbB, psbC, psbD, psbE, psbF, psbH, psbI, psbJ, psbK, psbL, psbM, psbN, psbT, psbZ | |
| NADH dehydrogenase | NADH dehydrogenase | ndhA, ndhB, ndhC, ndhD, ndhE, ndhF, ndhG, ndhH, ndhI, ndhJ, ndhK |
| Cytochrome b/f complex | petA, petB, petD, petG, petL, petN | |
| ATP synthase | atpA, atpB, atpE, atpF, atpH, atpI | |
| RubisCo large subunit | rbcL | |
| Other genes | Maturase K | matK |
| Envelope membrane protein | cemA | |
| Subunit of acetyl-CoAcarboxylase | accD | |
| C-type cytochrome synthesis gene | ccsA | |
| Protease | clpP1 | |
| Conserved hypothetical chloroplast open reading frames | ycf1, ycf2, ycf3, ycf4, ycf15 |
Figure 2Read depth of long PacBio reads mapped against the published durian chloroplast genome (MG138151.1).
Figure 3Read depth of Musang King (SRX3204603) against our chloroplast and the published chloroplast genome sequences (MG138151.1).
Chloroplast SNPs identified from 24 Thai varieties and the Musang King variety of duran.
| POS | REF | ALT | X41 | X43 | X46 | X62 | X79 | Mk |
|---|---|---|---|---|---|---|---|---|
| 4449 | C | T | Ref (243,5) | Alt (29,216) | Alt (31,216) | Alt (39,210) | Ref (243,5) | Alt (82,168) |
| 33,109 | G | C,T | Ref (242,6,0) | Alt2 (1,24,221) | Alt2 (3,10,236) | Alt2 (8,10,229) | Ref (245,1,0) | Alt2 (60,0,190) |
| 33,936 | G | A,T | Ref (36,0,0) | Alt2 (0,11,76) | Alt2 (0,18,108) | Alt2 (1,8,54) | Ref (89,2,5) | Alt2 (19,0,25) |
| 36,163 | A | T | Alt (33,216) | Ref (245,5) | Ref (247,1) | Ref (249,1) | Ref (247,3) | Ref (245,1) |
| 36,164 | A | T | Alt (5,244) | Ref (239,11) | Ref (245,5) | Ref (248,2) | Ref (241,7) | Ref (246,0) |
| 36,165 | A | T | Alt (14,233) | Ref (240,9) | Ref (244,5) | Ref (246,3) | Ref (243,7) | Ref (248,1) |
| 37,110 | G | T | Ref (236,5) | Alt (8,201) | Alt (0,213) | Alt (1,207) | Ref (232,7) | Alt (44,151) |
| 37,111 | A | C,G,T | Ref (214,0,16,7) | Alt (22,201,1,25) | Alt (19,213,1,16) | Alt (16,213,0,18) | Ref (231,0,9,6) | Ref (144,87,0,19) |
| 134,508 | T | A | Ref (248,1) | Alt (38,205) | Alt (27,211) | Alt (31,211) | Ref (248,2) | Alt (77,171) |
Call per sample is indicated as Ref: same as our chloroplast assembly; or Alt, Alt2; first or second allele in the ALT column, respectively. Number of reads that support each allele are given in the brackets in the order Ref, Alt, Alt2, Alt3.
Figure 4Phylogenetic tree using chloroplast genes.
List of genes that were used to construct a phylogenetic tree.
| Genes used to construct phylogenetic tree | ||||
|---|---|---|---|---|
| rpl21 | rpoC2 | trnR-ACG | psbI | petL |
| rpl14 | rrn4.5 | trnR-UCU | psbJ | petN |
| rpl16 | rrn5 | trnS-GCU | psbK | atpA |
| rpl20 | rrn16 | trnS-UGA | psbL | atpB |
| rpl232 | rrn23 | trnS-GGA | psbM | atpE |
| rpl32 | trnA-UGC | trnT-UGU | psbN | atpF1 |
| rpl33 | trnC-GCA | trnT-GGU | psbT | atpH |
| rpl36 | trnD-GUC | trnV-GAC | psbZ | atpI |
| rps2 | trnE-UUC | trnV-UAC | ndhA | rbcL |
| rps3 | trnF-GAA | trnW-CCA | ndhB1 | matK |
| rps4 | trnfM-CAU | trnY-GUA | ndhC | cemA |
| rps7 | trnG-GCC | psaA | ndhD | accD |
| rps8 | trnG-UCC | psaB | ndhE | ccsA |
| rps11 | trnH-GUG | psaC | ndhF | clpP1 |
| rps121 | trnI-CAU | psaI | ndhG | ycf1 |
| rps14 | trnI-GAU | psaJ | ndhH | ycf2 |
| rps15 | trnL-CAA | psbA | ndhI | ycf31 |
| rps161 | trnL-UAG | psbB | ndhJ | ycf4 |
| rps18 | trnL-UAA | psbC | ndhK | ycf15 |
| rps19 | trnM-CAU | psbD | petA | |
| rpoA | trnN-GUU | psbE | petB | |
| rpoB | trnP-UGG | psbF | petD | |
| rpoC11 | trnQ-UUG | psbH | petG | |