| Literature DB >> 30131893 |
Ashley M Thomson1,2, Oscar M Vargas1, Christopher W Dick1,3.
Abstract
PREMISE OF THE STUDY: The tropical tree family Lecythidaceae has enormous ecological and economic importance in the Amazon basin. Lecythidaceae species can be difficult to identify without molecular data, however, and phylogenetic relationships within and among the most diverse genera are poorly resolved.Entities:
Keywords: Amazonian trees; Bertholletia excelsa; DNA barcoding; Lecythidaceae; genetic markers; plastome
Year: 2018 PMID: 30131893 PMCID: PMC5991589 DOI: 10.1002/aps3.1151
Source DB: PubMed Journal: Appl Plant Sci ISSN: 2168-0450 Impact factor: 1.936
Figure 1Plastome map of the Brazil nut tree, Bertholletia excelsa. Genes outside of the circle are transcribed clockwise; genes inside of the circle are transcribed counterclockwise. Gray bars in the inner ring show the GC content percentage.
Genes contained within the chloroplast genome of Bertholletia excelsa
| Function | Gene group | Gene name |
|---|---|---|
| Self‐replication | Ribosomal proteins (large subunit) |
|
| Ribosomal proteins (small subunit) |
| |
| RNA polymerase subunits |
| |
| Ribosomal RNAs |
| |
| Transfer RNAs |
| |
| Photosynthesis | Photosystem I |
|
| Photosystem II |
| |
| NADH dehydrogenase |
| |
| Cytochrome b/f complex |
| |
| ATP synthase |
| |
| RuBisCO large subunit |
| |
| Other genes | Subunit of acetyl‐CoA‐carboxylase |
|
| Envelope membrane protein |
| |
| Protease |
| |
| c‐type cytochrome synthase |
| |
| Translational initiation factor |
| |
| Maturase |
| |
| Unknown function | Hypothetical chloroplast reading frames |
|
Comparison for plastome subunits for the samples for which the inverted repeats were completely assembled.a
| Species | LSC length (bp) | SSC length (bp) | IR length (bp) | GC content (%) | Protein‐coding genes | rRNAs | tRNAs |
|---|---|---|---|---|---|---|---|
|
| 85,269 | 18,738 | 27,618 | 36.9 | 81 | 4 | 30 |
|
| 85,119 | 18,756 | 27,635 | 36.9 | 81 | 4 | 30 |
|
| 85,840 | 18,950 | 27,841 | 36.4 | 81 | 4 | 30 |
|
| 85,861 | 18,778 | 27,638 | 36.7 | 81 | 4 | 30 |
|
| 85,673 | 18,759 | 27,594 | 36.7 | 81 | 4 | 30 |
|
| 83,785 | 18,728 | 27,614 | 37.0 | 81 | 4 | 30 |
|
| 85,547 | 18,491 | 27,576 | 36.9 | 81 | 4 | 30 |
|
| 85,056 | 18,721 | 27,635 | 36.6 | 81 | 4 | 30 |
|
| 84,713 | 18,759 | 27,638 | 37.0 | 81 | 4 | 30 |
|
| 84,815 | 18,167 | 27,715 | 37.1 | 81 | 4 | 30 |
|
| 84,688 | 18,796 | 27,592 | 36.9 | 81 | 4 | 30 |
|
| 85,286 | 18,719 | 27,668 | 36.8 | 81 | 4 | 30 |
|
| 85,378 | 18,815 | 27,603 | 36.8 | 81 | 4 | 30 |
|
| 85,506 | 18,845 | 27,622 | 36.7 | 81 | 4 | 30 |
IR = inverted repeat; LSC = large single‐copy region; SSC = small single‐copy region.
Length and GC content of the large single‐copy and small single‐copy regions in partial plastomes are estimates only.
Primers for the amplification of simple sequence repeats in the plastome of Bertholletia excelsa. All primer pairs amplify non‐coding sequences with the exception of ndhD.a
| Forward primer sequence (5′–3′) | Reverse primer sequence (5′–3′) | Repeat unit | Location | Region | No. of repeats | Product size (bp) |
|---|---|---|---|---|---|---|
| CCAAAATCATGAACTAACCCCCA | ACCAAGAGGGCGTTATTGCT | A | 396–409 |
| 14 | 226 |
| TGAAGTCGTGTTGCTGAGATCT | CTGTTGATAAGTTTGCCGAGGT | C | 3686–3702 |
| 17 | 197 |
| GAGGTTTTCTCCTCGGACGG | ACCACTCATTAAACGAAATGCCT | A | 5680–5691 |
| 12 | 244 |
| GTCCACTCAGCCATCTCTCC | AGCCCGGCCATAGGAATAAA | AAAG | 9396–9407 |
| 3 | 297 |
| TTTATTCCTATGGCCGGGCT | TGCATTGTTTAAGAATCCATAGTTTCA | A | 9769–9780 |
| 12 | 246 |
| TTTTCCCCACACTTCCCCTC | TGTCCGGTCATTTGATTTGGT | A | 17,925–17,938 |
| 14 | 192 |
| AAGAGAGGAGAAGTTTTTAGGCA | CCTTACCACTCGGCCATGTC | A | 29,392–29,403 |
| 12 | 232 |
| GGGATGCGAGAAAGAGACTT | CAAAAGTATATCTTTCTACGGGTCG | AAAG | 34,775–34,786 |
| 3 | 250 |
| TACCGGTTTTCAAGACCGGG | TCACAAATGGGCATGCTGGA | AAAAT | 38,160–38,174 |
| 3 | 201 |
| ACCCATCAATCATTCGATTCGT | GAAAGATCTTTCCTTGGGGGA | AAAG | 47,627–47,638 |
| 3 | 168 |
| No suitable primers found | No suitable primers found | AAAT | 49,610–49,625 |
| 4 | NA |
| No suitable primers found | No suitable primers found | AATT | 50,016–50,027 |
| 3 | NA |
| CCACTGAACAAGGGAGAGCC | ACCAAGGCAAACCCATGGAA | AAAATT | 75,475–75,492 |
| 3 | 128 |
| TGAATCACTGCTTTTCTTTGACTCT | AGGCGGTTCTCGAAAGAAGA | AAAAT | 77,155–77,169 |
| 3 | 183 |
| TTCAATCTCGGGATTCTTTGAGA | TCGCCTGCGAAAACTTAACT | A | 85,073–85,085 |
| 13 | 246 |
| TCGATCAATCCCTTTGCCCT | CGTACTCCTCGCTCAATGAGA | AAAT | 102,172–102,183 |
| 3 | 248 |
| TGGAGCACCTAACAACGCAT | AGACCTCCGGGAAAAGCATG | A | 106,208–106,219 |
| 12 | 119 |
| AGAGTAAACACAAGATACAAGGGT | GTGGGTTAGGTCAATCGGGA | AACTT | 117,345–117,359 |
| 3 | 194 |
| AGTCAACGTCAAAATTAATGAATGGT | AGGTTGAACGCGAGCGATAT | AT | 117,609–117,622 |
| 7 | 177 |
| AAATAACTCCCGCGGTCCAG | GCTTCTCTTGCATTACCGGG | AAAT | 119,729–119,740 |
| 3 | 240 |
| No suitable primers found | No suitable primers found | AAT | 122,820–122,831 |
| 4 | NA |
| No suitable primers found | No suitable primers found | AAT | 122,843–122,854 |
| 4 | NA |
| AACCCGCTTCAAGCCATGAT | AAACGGCTTATAAATTCGCAGT | AATC | 125,271–125,282 |
| 3 | 130 |
NA = not applicable.
Sequences have been deposited to GenBank (BioProject SUB2740669).
Figure 2Sliding 600‐site window analyses on the Lecythidaceae plastome alignment of 24 species showing nucleotide diversity (π) (top) and alignment site‐wise differences in log‐likelihood (LD) calculated from the chloroplast topology versus the average scores of 1000 random trees (bottom). Regions with π and LD above the 95th percentiles are indicated with dashed lines. Continuous vertical lines indicate the boundaries, from left to right, among the large single copy, the inverted repeat, and the small single copy.
Regions of the plastome alignment (windows of 600 sites) with significantly high (above the 95th percentile) nucleotide diversity and/or site‐wise log‐likelihood score differences.b
| Location in the alignment |
| Closest flanking expressed region | Region | π | LD | |
|---|---|---|---|---|---|---|
| 5′ | 3′ | |||||
| 1–600 | 1–490 |
|
| LSC |
| |
| 5401–6000 | 4885–5373 |
|
| LSC |
| |
| 34,801–35,400 | 30,925–31,450 |
|
| LSC |
| |
| 35,401–36,000 | 31,451–31,967 |
|
| LSC |
|
|
| 37,201–37,800 | 33,027–33,573 |
|
| LSC |
|
|
| 39,601–40,200 | 34,893–35,433 |
|
| LSC |
| |
| 43,801–44,400 | 38,798–39,254 |
|
| LSC |
|
|
| 44,401–45,000 | 39,255–39,744 |
|
| LSC |
|
|
| 61,201–61,800 | 54,771–55,275 |
|
| LSC |
| |
| 78,601–79,200 | 70,230–70,771 |
|
| LSC |
| |
| 89,401–90,000 | 80,536–81,103 |
|
| LSC |
| |
| 95,401–96,000 | 85,455–85,906 |
|
| LSC |
| |
| 131,401–132,000 | 119,237–119,759 |
|
| SSC |
| |
| 140,401–141,000 | 127,827–128,402 |
|
| SSC |
| |
| 144,001–144,600 | 131,283–131,868 |
|
| SSC |
|
|
| 144,601–145,200 | 131,869–132,446 |
|
| SSC |
|
|
π = nucleotide diversity (see main text); LD = log‐likelihood score differences; LSC = large single copy; SSC = small single copy.
Signifies regions with high (above the 95th percentile) nucleotide diversity or site‐wise log‐likelihood score differences.
Coding regions are indicated in windows that have the same 5ʹ‐ and 3ʹ‐expressed flanking regions in column 3. Notice that no regions are reported for the inverted repeat (IR). Coordinates are given on the alignment and the Bertholletia excelsa plastome that are assembled with the standard LSC‐SSC‐IR structure.
Nucleotide diversity and differences in log‐likelihood scores of the informative windows identified in this study and of previously proposed barcode markers
| Region | π | LD |
|---|---|---|
|
|
| 247.12 |
|
| 0.0153 | 136.92 |
|
|
| 260.79 |
|
| 0.0228 |
|
|
| 0.0176 |
|
|
|
|
|
|
|
|
|
|
| 0.0105 | 95.03 |
|
|
| 275.89 |
|
| 0.0097 | 120.53 |
|
| 0.0103 | 178.60 |
|
| 0.0212 |
|
|
|
|
|
|
|
|
|
|
| 0.0126 |
|
|
| 0.0164 |
|
|
| 0.0106 | 192.27 |
|
| 0.0239 |
|
|
| 0.0128 |
|
|
|
|
|
|
|
|
|
π = nucleotide diversity; LD = differences in log‐likelihood scores.
Informative windows identified in this study are indicated in bold.
High values (above the 95th percentile) for π and LD are indicated in bold.
Primer sequences designed to amplify the 10 most polymorphic Lecythidaceae plastome regions, as sorted by decreasing nucleotide diversity
| Window in the alignment | π | Region | Forward primer sequence (5′–3′) | Reverse primer sequence (5′–3′) | Length (bp) |
|---|---|---|---|---|---|
| 144,103–145,487 | 0.04691 |
| AGAACCTTTGATTATGTCTCGACG | AGAGACATGCTATAAAAATAGCCCA | 1186 |
| 95,034–95,741 | 0.03446 |
| AGAGTTTCTTCTCATCCAGCTCC | GCTTAGTGTGTGACTCGTTGG | 1014 |
| 35,585–36,413 | 0.02920 |
| CCGTTCTTTCTTTTCTATAACCTACCC | ACGCTGGTTCAAATCCAGCT | 1093 |
| 143,235–144,102 | 0.02733 |
| TGATTCGAATCTTTTAGCATTAKAACT | KCGTCGAGACATAATCAAAGGT | 1189 |
| 131,180–132,054 | 0.02576 |
| CCGAGTGGTTAATAATGCACGT | GCTTCTCTTGCATTACCGGG | 1180 |
| 44,398–45,132 | 0.02537 |
| TCGATYCCCGCTATCCGCC | GCCAATTTGATTCGATGGAGAGA | 883 |
| 89,032–89,688 | 0.02464 |
| TGGGAGTGTGTGACTTGAACT | TGACCCATCCCTTTAGCCAA | 824 |
| 43,412–44,397 | 0.02456 |
| TCCAATTGRCTGTTTTTGCATTAATTG | CCTTGAGGTCACGGGTTCAA | 706 |
| 37,444–38,345 | 0.02409 |
| AGACGATGGGGGCATACTTG | CCACTTACTTTTTCTTTTGTTTGTTGA | 1324 |
| 38,346–40,085 | 0.02391 |
| GGCGTAAGTCATCGGTTCAA | CCCAAAGCGAAATAGGCACA | 1717 |
π = nucleotide diversity.
The product size (length) references the Bertholletia excelsa plastome.
Figure 3Maximum likelihood phylogeny inferred from plastomes of 23 Neotropical Lecythidaceae. Numbers at nodes indicate bootstrap support.
Figure 4Average bootstrap support for trees inferred from either independent or concatenated regions with high nucleotide diversity, sorted in ascending order.
Figure 5Robinson–Foulds distances (RF) for trees inferred from either independent or concatenated regions with high nucleotide diversity, sorted in descending order. Lower RF distances, which measure the number of different taxa bipartitions from the complete plastome topology, indicate better accuracy.
| Species | Voucher | No. of reads | % of ref seq | Mean coverage | No. of contigs | Average length of assembled contigs (bp) | Minimum contig length (bp) | Maximum contig length (bp) | N50 | GenBank accession no. |
|---|---|---|---|---|---|---|---|---|---|---|
|
| Mori 25640 | 527,449 | 99.20 | 213 | 9 | 157,957 | 1052 | 42,447 | 22,733 |
|
|
| Chevalier 10101 | 697,746 | 99.60 | 295 | 8 | 158,449 | 400 | 34,633 | 32,463 |
|
|
| Tsou 1552 | 519,377 | 96.10 | 230 | 10 | 152,805 | 2636 | 46,991 | 32,608 |
|
|
| Mori 25637 | 1,036,874 | 100 | 646 | 14 | 160,472 | 461 | 160,472 | 160,472 |
|
|
| Nee 52828 | 759,042 | 85 | 292 | 70 | 130,037 | 278 | 22,237 | 3803 |
|
|
| Mori 24148 | 690,545 | 99.60 | 302 | 4 | 159,222 | 6643 | 75,002 | 42,750 |
|
|
| Mori 25518 | 606,728 | 99.60 | 260 | 5 | 158,819 | 6596 | 74,896 | 42,691 |
|
|
| Janovec 2506 | 340,696 | 99 | 144 | 9 | 156,981 | 1107 | 45,975 | 42,740 |
|
|
| Mori 24093 | 493,777 | 99.40 | 211 | 6 | 158,312 | 1374 | 109,344 | 109,344 |
|
|
| Mori 25516 | 503,417 | 96.50 | 314 | 12 | 154,792 | 1071 | 47,693 | 18,977 |
|
|
| Prévost 4607 | 851,683 | 100 | 358 | 2 | 158,981 | 61,051 | 97,930 | 97,929 |
|
|
| Cornejo 8185 | 273,053 | 98.30 | 116 | 11 | 156,630 | 1117 | 37,154 | 21,598 |
|
|
| Cornejo 8211 | 440,144 | 99 | 187 | 9 | 157,206 | 1105 | 42,497 | 21,591 |
|
|
| Mori 25410 | 289,775 | 98.90 | 120 | 11 | 157,890 | 1143 | 42,551 | 18,021 |
|
|
| Cornejo 8208 | 160,625 | 97 | 166 | 8 | 154,547 | 551 | 74,876 | 24,636 |
|
|
| Prévost 4252 | 367,631 | 98.90 | 151 | 11 | 157,757 | 1179 | 37,141 | 21,748 |
|
|
| Aguilar 7961 | 520,480 | 94.90 | 326 | 41 | 150,768 | 314 | 28,189 | 8102 |
|
|
| Mori 24255 | 761,640 | 95.50 | 476 | 33 | 152,601 | 358 | 28,353 | 11,657 |
|
|
| Cornejo 8184 | 534,143 | 98.90 | 334 | 10 | 157,746 | 1035 | 35,586 | 22,762 |
|
|
| Cornejo 8229 | 606,518 | 94.10 | 241 | 39 | 149,464 | 348 | 28,759 | 6527 |
|
|
| Molino 2019 | 1,073,567 | 96.90 | 405 | 27 | 154,882 | 302 | 21,264 | 11,580 |
|
|
| Mori 24265 | 544,831 | 90.70 | 243 | 50 | 143,859 | 317 | 28,741 | 4496 |
|
|
| Tsou 1542 | 666,355 | 94.80 | 416 | 26 | 151,568 | 354 | 31,188 | 14,768 |
|
|
| Mori 25728 | 690,202 | 99.50 | 301 | 4 | 158,832 | 11,782 | 75,019 | 49,184 |
|
Percentage of the sequence recovered in relation to Bertholletia excelsa.