| Literature DB >> 32466772 |
Monika Ślipiko1, Kamil Myszczyński2, Katarzyna Buczkowska3, Alina Bączkiewicz3, Monika Szczecińska2, Jakub Sawicki2.
Abstract
BACKGROUND: Molecular research revealed that some of the European Calypogeia species described on the basis of morphological criteria are genetically heterogeneous and, in fact, are species complexes. DNA barcoding is already commonly used for correct identification of difficult to determine species, to disclose cryptic species, or detecting new taxa. Among liverworts, some DNA fragments, recommend as universal plant DNA barcodes, cause problems in amplification. Super-barcoding based on genomic data, makes new opportunities in a species identification.Entities:
Keywords: Calypogeia; DNA barcode; Super-barcoding; ndhB; ndhH; trnT-trnL
Mesh:
Substances:
Year: 2020 PMID: 32466772 PMCID: PMC7257191 DOI: 10.1186/s12870-020-02435-y
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
Fig. 1Gene map of the Calypogeia muelleriana chloroplast genome. Genes inside and outside the outer circle are transcribed in counterclockwise and clockwise directions, respectively. The genes are color-coded based on their function. The dashed area in the inner circle visualizes the G/C content. Pseudogenes have been marked with Ѱ
Fig. 2SNP and indel variation among plastomes of Calypogeia. Track A shows nonsynonymous SNP occurrence within genes. Track B and C represent identified SNPs (small blue dots) and indels (small orange dots) per 100 bp window size (maximum value = 40). Line plot, comprising B and C track, represents SNPs (blue line) and indels (orange line) within each exon, intron or intergenic spacer (snp max. Value = 400, indel max. Value = 100). Track D represents percent of SNPs per CDS length (maximum value = 22) while track E represents percent of indels per CDS length (maximum value = 2). Track F represents percent of SNPs per noncoding region length (max. Value = 30) while track G represents percent of indels per noncoding region length (max value = 20)
The top of the most variable chloroplast regions of Calypogeia species.
Fig. 3CDS-based phylogram derived from a Bayesian analysis. The posterior probability value lower than 1 is given at the node
Species delimited from set of analyzed individuals by PTP simple heuristic search
| Species delimited by PTP analysis | Posterior delimitation probability |
|---|---|
| 1.0 | |
| 1.0 | |
| 0.999 | |
| 0.999 | |
| 0.999 | |
| 0.998 | |
| 0.998 | |
| 0.992 | |
| 0.991 | |
| 0.922 | |
| 0.891 | |
| 0.125 | |
| 0.125 | |
| 0.125 | |
| 0.078 | |
| 0.078 | |
| 0.009 | |
| 0.009 | |
| 0.002 | |
| 0.002 | |
| 0.002 | |
| 0.002 | |
| 0.002 | |
| 0.001 | |
| 0.001 | |
| 0.001 | |
| 0.001 | |
| 0.001 | |
| 0.001 |
Fig. 4Results of several analyzes across the plastome sequences of Calypogeia sp. using the sliding window method. a- the plot of the mean Kimura 2-parameter distance matrix for each 500 bp- long window. The greatest one is at about 30, 000 bp position and the lowest one at the 90, 000 bp position. b- the proportion of zero non- conspecific distances, which find their maximum at around 90, 000 bp. c- the proportion of zero cells in the distance matrix. This is maximized around 90, 000 bp. The unbroken horizontal line crossing the y-axis at 0 is the proportion of zero cells in the distance matrix created from the full dataset. d- the proportion of clades that are identical between the windows and the full dataset. It is pretty high at most windows and visibly low at around 90, 000 bp position. e- the sum of diagnostic nucleotide positions for all species. The most of these nucleotides are at around 30, 000 bp position and the least at around 90, 000 bp position. f- the proportion of species that are monophyletic. The position around 90, 000 bp is clearly in the doldrums, but most positions distinguish species pretty well.
Fig. 5Lineplot of the barcode gap for the 22 Calypogeia species. For each individual in the dataset, the grey lines represent the furthest intraspecific distance (bottom of line value), and the closest interspecific distance (top of line value). The red lines show where this relationship is reversed (no barcoding gap)