| Literature DB >> 27917315 |
Marcelo Reginato1, Kurt M Neubig2, Lucas C Majure3, Fabian A Michelangeli1.
Abstract
BACKGROUND: In the past three decades, several studies have predominantly relied on a small sample of the plastome to infer deep phylogenetic relationships in the species-rich Melastomataceae. Here, we report the first full plastid sequences of this family, compare general features of the sampled plastomes to other sequenced Myrtales, and survey the plastomes for highly informative regions for phylogenetics.Entities:
Keywords: Chloroplast; Genome skimming; Melastomataceae; Myrtales; NGS; Phylogenomics; Plastome
Year: 2016 PMID: 27917315 PMCID: PMC5131623 DOI: 10.7717/peerj.2715
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Voucher information and GenBank accessions of the chloroplast sequenced in the Melastomataceae.
| Species | Tribe/“clade” | GenBank | Voucher | Herbarium |
|---|---|---|---|---|
| Cyphostyleae | David, H. 2188 | HUA, NY | ||
| Bertolonieae | Goldenberg, R. 810 | NY, UPCB | ||
| Blakeeae | Michelangeli, F.A. 1227 | NY | ||
| “Eriocnema” | Almeda, F. 8416 | CAS | ||
| Merianieae | Michelangeli, F.A. 832 | NY | ||
| Henrietteeae | Ionta, G. 2029 | FLAS | ||
| “Cambessedesia” | Goldenberg, R. 1153 | NY, UPCB | ||
| Miconieae | Michelangeli, F.A. 758 | NY | ||
| “Marcetia” | Michelangeli, F.A. 1998 | NY | ||
| Unplaced | Caddah, M.K. 578 | NY, UPCB | ||
| Melastomeae | Michelangeli, F.A. 540 | NY | ||
| Rhexieae | Michelangeli, F.A. 1448 | NY | ||
| Microlicieae | Zenteno, F. 8801 | NY | ||
| Merianieae | Clark, J.L. 13577 | UNA | ||
| Melastomeae | Majure, L. 4277 | FLAS | ||
| “Triolena” | Michelangeli, F.A. 1366 | NY |
Note:
Informal clades are quoted.
Figure 1Map of the Miconia dodecandra plastid genome.
Genes shown outside the outer circle are transcribed counterclockwise and genes inside the outer circle are transcribed clockwise. Genes in different functional groups are color coded following the legend. The shaded area inside the inner circle indicates the GC content, with dark shading indicating GC percent.
Figure 2Maximum likelihood tree recovered with the full data set (left).
On the right, linear plastid maps of the 16 Melastomataceae species. All genes are depicted as arrows (indicating transcription direction) and color coded following the legend of Fig. 1. Gray lines link the same genes on contiguous maps. LSC, long single copy region; SSC, small single copy region; IRA, inverted repeat A; IRB, inverted repeat B.
Comparison of plastid genome size and GC content across different regions in the 16 Melastomataceae species.
| Species | Coverage (mean) | LSC | SSC | IR | Full | ||||
|---|---|---|---|---|---|---|---|---|---|
| bp | GC | bp | GC | bp | GC | bp | GC | ||
| 278 | 85,915 | 0.347 | 16,975 | 0.306 | 26,781 | 0.425 | 156,452 | 0.369 | |
| 189 | 85,571 | 0.347 | 17,008 | 0.308 | 26,733 | 0.425 | 156,045 | 0.370 | |
| 170 | 85,370 | 0.349 | 16,998 | 0.308 | 26,747 | 0.425 | 155,862 | 0.370 | |
| 42 | 85,431 | 0.348 | 16,953 | 0.308 | 26,805 | 0.425 | 155,994 | 0.370 | |
| 683 | 85,341 | 0.347 | 16,924 | 0.309 | 26,734 | 0.425 | 155,733 | 0.370 | |
| 130 | 85,991 | 0.347 | 17,036 | 0.306 | 26,750 | 0.425 | 156,527 | 0.369 | |
| 56 | 85,621 | 0.348 | 17,001 | 0.307 | 26,773 | 0.424 | 156,168 | 0.370 | |
| 318 | 86,609 | 0.348 | 16,999 | 0.310 | 26,804 | 0.425 | 157,216 | 0.370 | |
| 705 | 84,644 | 0.348 | 17,066 | 0.310 | 26,700 | 0.426 | 155,110 | 0.371 | |
| 100 | 85,866 | 0.348 | 16,942 | 0.309 | 26,772 | 0.425 | 156,352 | 0.370 | |
| 184 | 84,718 | 0.351 | 17,156 | 0.312 | 26,537 | 0.425 | 154,948 | 0.372 | |
| 683 | 84,459 | 0.351 | 16,924 | 0.311 | 26,626 | 0.425 | 154,635 | 0.372 | |
| 304 | 85,093 | 0.347 | 16,729 | 0.307 | 26,643 | 0.426 | 155,108 | 0.370 | |
| 537 | 85,128 | 0.353 | 16,653 | 0.317 | 25,765 | 0.428 | 153,311 | 0.374 | |
| 195 | 86,297 | 0.349 | 17,124 | 0.311 | 26,684 | 0.425 | 156,789 | 0.371 | |
| 48 | 86,200 | 0.347 | 16,970 | 0.307 | 26,741 | 0.425 | 156,652 | 0.369 | |
Notes:
Length, length in bp; GC, GC content %.
LSC, long single copy region; SSC, small single copy region; IR, inverted repeat; Full, full plastome.
Comparison of length and GC content across different sequence classes in the plastome of the 16 Melastomataceae species.
| Species | Protein-coding | tRNA | rRNA | Intron | Intergenic | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| bp | GC | bp | GC | bp | GC | bp | GC | bp | GC | |
| 80,826 | 0.374 | 3,348 | 0.497 | 9,050 | 0.425 | 20,553 | 0.347 | 42,675 | 0.316 | |
| 80,670 | 0.375 | 3,356 | 0.497 | 9,050 | 0.425 | 20,437 | 0.347 | 42,532 | 0.316 | |
| 80,742 | 0.375 | 3,348 | 0.498 | 9,050 | 0.425 | 20,541 | 0.347 | 42,181 | 0.319 | |
| 80,628 | 0.375 | 3,354 | 0.497 | 9,050 | 0.425 | 20,540 | 0.347 | 42,422 | 0.318 | |
| 80,286 | 0.375 | 3,349 | 0.497 | 9,050 | 0.425 | 19,691 | 0.347 | 43,357 | 0.317 | |
| 80,781 | 0.374 | 3,363 | 0.495 | 9,050 | 0.425 | 20,571 | 0.347 | 42,762 | 0.315 | |
| 80,751 | 0.375 | 3,364 | 0.498 | 9,050 | 0.425 | 20,478 | 0.347 | 42,525 | 0.318 | |
| 80,586 | 0.376 | 3,354 | 0.498 | 9,050 | 0.425 | 20,548 | 0.347 | 43,678 | 0.317 | |
| 80,646 | 0.375 | 3,370 | 0.496 | 9,050 | 0.425 | 20,619 | 0.347 | 41,425 | 0.318 | |
| 80,643 | 0.376 | 3,360 | 0.496 | 9,050 | 0.425 | 20,641 | 0.347 | 42,658 | 0.317 | |
| 80,427 | 0.377 | 3,339 | 0.498 | 9,050 | 0.425 | 19,911 | 0.347 | 42,221 | 0.318 | |
| 80,466 | 0.377 | 3,353 | 0.496 | 9,050 | 0.425 | 20,260 | 0.347 | 41,506 | 0.319 | |
| 80,415 | 0.375 | 3,241 | 0.502 | 9,048 | 0.425 | 20,538 | 0.347 | 41,866 | 0.317 | |
| 79,326 | 0.376 | 3,349 | 0.500 | 9,050 | 0.425 | 18,991 | 0.347 | 42,595 | 0.326 | |
| 80,682 | 0.377 | 3,348 | 0.497 | 9,050 | 0.425 | 20,666 | 0.347 | 43,043 | 0.317 | |
| 80,619 | 0.375 | 3,337 | 0.496 | 9,050 | 0.425 | 20,476 | 0.347 | 43,170 | 0.316 | |
Note:
Length, length in bp; GC, GC content %.
Comparison of plastid genome size of one Melastomataceae species (Miconia dodecandra) with eight other Myrtales.
| Family | Species | Coding | tRNA | rRNA | LSC | SSC | IR | Full |
|---|---|---|---|---|---|---|---|---|
| Melastomataceae | 84 | 37 | 8 | 86,609 | 16,999 | 26,804 | 157,216 | |
| Myrtaceae | 84 | 37 | 8 | 88,218 | 18,571 | 26,402 | 159,563 | |
| Myrtaceae | 84 | 37 | 8 | 88,769 | 18,773 | 26,392 | 160,326 | |
| Myrtaceae | 84 | 37 | 8 | 88,310 | 17,197 | 27,603 | 160,713 | |
| Myrtaceae | 84 | 37 | 8 | 88,944 | 18,530 | 26,397 | 160,268 | |
| Myrtaceae | 84 | 37 | 8 | 87,459 | 18,318 | 26,334 | 158,445 | |
| Lythraceae | 84 | 37 | 8 | 83,923 | 16,933 | 25,792 | 152,440 | |
| Onagraceae | 84 | 38 | 8 | 89,862 | 19,035 | 28,824 | 166,545 | |
| Myrtaceae | 84 | 37 | 8 | 88,247 | 18,544 | 26,385 | 159,561 |
Notes:
Protein-coding, tRNA and rRNA (number of genes); LSC, long single copy region, length in bp; SSC, small single copy region, length in bp; IR, inverted repeat, length in bp and Full (length in bp).
Figure 3Maximum likelihood trees of the all non-coding–ncs (A) and all coding genes–cds (B) data sets.
Bootstrap support is given adjacent to the nodes. (C) Tree distance (RF) pairwise matrix between all six schemes analyzed. (D) Mean bootstrap support of all six schemes analyzed.
Summary statistics for the top 10 introns and intergenic spacers with suitable size for PCR amplification.
Markers are ranked by phylogenetic information based on a weighed mean of relative values of number of variable sites (weight = 1), mean bootstrap (weight = 2) and distance to the full plastid tree (weight = 3).
| Marker | Bases | Aligned (bp) | Variable sites | PIS | DNA distance (mean) | Tree distance | Bootstrap (mean) |
|---|---|---|---|---|---|---|---|
| 1. | 780 [628, 884] | 1,125 | 438 (38.9%) | 128 (11.4%) | 0.104 | 4 | 82 |
| 2. | 898 [849, 965] | 1,266 | 507 (40%) | 171 (13.5%) | 0.114 | 6 | 71 |
| 3. | 762 [743, 790] | 846 | 236 (27.9%) | 76 (9%) | 0.059 | 4 | 75 |
| 4. | 734 [504, 821] | 991 | 330 (33.3%) | 98 (9.9%) | 0.081 | 4 | 63 |
| 5. | 1,016 [939, 1,045] | 1,127 | 250 (22.2%) | 74 (6.6%) | 0.046 | 4 | 64 |
| 6. | 641 [550, 750] | 895 | 353 (39.4%) | 136 (15.2%) | 0.114 | 6 | 65 |
| 7. | 898 [638, 980] | 1,178 | 323 (27.4%) | 92 (7.8%) | 0.062 | 8 | 76 |
| 8. | 1,058 [570, 1,165] | 1,396 | 381 (27.3%) | 132 (9.5%) | 0.068 | 8 | 70 |
| 9. | 736 [420, 944] | 1,062 | 285 (26.8%) | 90 (8.5%) | 0.076 | 8 | 76 |
| 10. | 842 [478, 1,029] | 1,345 | 406 (30.2%) | 121 (9%) | 0.089 | 8 | 63 |
Note:
PIS, parsimony informative sites; Tree distance, RF distance.
Primer pair sequences for the identified top five highly informative markers across the 16 plastomes of Melastomataceae.
| Marker | Primer forward (5′–3′) | Primer reverse (5′–3′) | Ta (°C) |
|---|---|---|---|
| 1. | CACTCAGCCATCTCTCCCAA | ACCCGCTACAATGCCATTATTG | 55 |
| 2. | AGGAAAGGACCACATACGTCG | TCCTTGCTCATTGATTTTGATCCA | 55 |
| 3. | GGTCCCTCGGATTTGCTTCA | GAACCCGCATCGTTAGCTTG | 55 |
| 4. | AGATGAACTCCTAGGGAATGTGA | CCGAGAAGGTCTACGGTTCG | 55 |
| 5. | CGCTAGTCCAGAACCGTACA | ACCCCATGATTGGTTGATTAGTGA | 55 |