| Literature DB >> 35260700 |
Chiun-Jr Huang1,2,3, Fang-Hua Chu4, Yi-Shiang Huang5, Yu-Ching Tu6, Yu-Mei Hung6, Yu-Hsin Tseng7, Chang-En Pu6, Cheng Te Hsu8, Chi-Hsiang Chao6, Yu-Shyang Chou6, Shau-Chian Liu9, Ya Ting You10, Shuo-Yu Hsu4, Hsiang-Chih Hsieh4, Chieh-Ting Wang11, Chi-Tsong Chen12.
Abstract
Chamaecyparis formosensis is an endemic species of Taiwan, threatened from intensive use and illegal felling. An individual identification system for C. formosensis is required to provide scientific evidence for court use and deter illegal felling. In this study, 36 polymorphic simple sequence repeat markers were developed. By applying up to 28 non-linked of the developed markers, it is calculated that the cumulative random probability of identity (CPI) is as low as 1.652 × 10-12, and the identifiable population size is up to 60 million, which is greater than the known C. formosensis population size in Taiwan. Biogeographical analysis data show that C. formosensis from four geographic areas belong to the same genetic population, which can be further divided into three clusters: SY (Eastern Taiwan), HV and GW (Northwestern Taiwan), and MM (Southwestern Taiwan). The developed system was applied to assess the provenance of samples with 88.44% accuracy rate and therefore can serve as a prescreening tool to reduce the range required for comparison. The system developed in this study is a potential crime-fighting tool against illegal felling.Entities:
Mesh:
Year: 2022 PMID: 35260700 PMCID: PMC8904461 DOI: 10.1038/s41598-022-07870-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1Flowchart of Chamaecyparis formosensis individual identification system development. N: the number of individuals; P: the number of populations; MISA: MicroSAtellite software; CP : combined probability of identity.
Characteristics of 9 SSR loci developed in Chamaecyparis formosensis.
| Locus | Primer sequences (5’ -3’) | Repeat motif | Fluorescent label | Allele size (bp) | GenBank accession no | Putative function [organism] | |
|---|---|---|---|---|---|---|---|
| Cred603 | TTGCTACATTAGCACTAGATAGCAAAGAAA | (AAG)13 | 6-FAM | 106 | 60 | MW052386 | No hit |
| ACTGAAGATACTGAGGATATTGAAGAGGAA | |||||||
| Cred610 | TGAGATATACATGTGTGAAAGAGAGTGAAGC | (GTAT)5 | PET | 166 | 60 | MW052387 | No hit |
| TGCAATAATTTCTTCAGTGTTACCACTACC | |||||||
| Cred628 | GCTGGAGTCATTATAGTGCCATGTCTTTGT | (GCCC)3 | 6-FAM | 142 | 60 | MW052388 | No hit |
| TTTTCAAATAGCCGACCGACCTATGTAGAG | |||||||
| Cred640 | ACCCATATCTTCCTTCCCAACCATTAAGAT | (TCTT)5 | 6-FAM | 137 | 60 | MW052389 | No hit |
| CTTTCAGTGGAATGGAAGAAAGCCCTACTA | |||||||
| Cred641 | ACTTCTAATGAATCCCCATGCCGAATTGTA | (GC)19 | VIC | 193 | 60 | MW052390 | No hit |
| CTGTTCGCGATAAGATAATTGGCTAGTGTG | |||||||
| Cred674 | TAAAGAGGCTCTGCTACTGGCTTTTCAACT | (GGGC)4 | NED | 147 | 60 | MW052391 | No hit |
| GTGGGTGGCCCTCTATTCTATTGTTGAT | |||||||
| Cred678 | GGTCCATATCCTGGAGTAGAACCTCCCTAC | (GGGC)5 | PET | 162 | 60 | MW052392 | No hit |
| GTGTCGCAGGCATAGACTTCTCCCTATATT | |||||||
| Cred682 | CCGCCCTTCTAATAACAGGGAAGATAAGTT | (CCCT)5 | NED | 147 | 60 | MW052393 | No hit |
| CCGCCCTTCTAATAACAGGGAAGATAAGTT | |||||||
| Cred683 | GCAGCCTAAATAAACAATAGGGGGATTGAT | (GCCT)4 | NED | 146 | 60 | MW052394 | No hit |
| CATGTTACGTATAGAATCGAGTGCAGGTCA |
Figure 2The biogeographic information of Chamacyparis formosensis in this study. A total of 92 samples composed of 20 MM, 25 HV, 23 GW, and 24 SY individuals were analyzed (a) Biogeographic analysis data suggests that the samples fall into three genetical categories: SY (Eastern Taiwan), HV & GW (Northwestern Taiwan), and MM (Southwestern Taiwan) (b)–(e) The red spots represent the individuals that have been mis-assigned (denoted as M in figure legend) from provenance simulation result. N: the number of individuals, M: the number of mis-assigned.
Genetic characterization of 36 polymorphic SSR loci of 92 Chamaecyparis formosensis individuals. A: number of alleles, Ho: observed heterozygosity, He: expected heterozygosity, PIC: polymorphism information content or power of information content, PD: power of discrimination, P: the probability of identity, PD is equal to 1 – P. *Highly significant from Hardy–Weinberg equilibrium (P < 0.001). Significant linkage disequilibrium (P < 0.001) was detected in the same colored pairs (Groups 1–6).
The discrimination power in SSR marker combination. CP cumulative random probability of identity, CL = (1- CP), N number of individuals.
Detailed genetic diversity parameters were identified at 36 simple sequence repeat (SSR) in four populations (MM, HV, GW, SY) of Chamaecyparis formosensis. N: the number of individuals, A: number of alleles, Ho: observed heterozygosity, He: expected heterozygosity, Fis: inbreeding coefficient. When Ho < He, Fis > 0, indicating that the population is inbreeding[67].
| Population | Fis | |||
|---|---|---|---|---|
| MM ( | 4.417 | 0.376 | 0.474 | 0.206 |
| HV ( | 4.417 | 0.390 | 0.508 | 0.231 |
| GW ( | 5.444 | 0.380 | 0.503 | 0.242 |
| SY ( | 5.278 | 0.506 | 0.583 | 0.132 |
| Average | 4.889 | 0.413 | 0.517 | 0.200 |
Pairwise Fst and Nm among four populations (MM, HV, GW, SY) of Chamaecyparis formosensis using 36 simple sequence repeat (SSR) data. N: the number of individuals; Fst: the fixation index. Fst < 0.05, no differentiation among populations. 0.05 < Fst < 0.15, low differentiation among populations. Nm: the gene flow value. Nm > 1 represents the frequent exchange of genes, which counteracts the genetic drift and prevents the population differentiation[48]. When Nm is greater than 4, it would be a random mating population[49].
| Nm | Fst | |||
|---|---|---|---|---|
| MM ( | HV ( | GW ( | SY ( | |
| MM ( | – | 3.141 | 4.022 | 3.798 |
| HV ( | 0.074 | – | 6.832 | 3.603 |
| GW ( | 0.059 | 0.035 | – | 4.199 |
| SY ( | 0.062 | 0.065 | 0.056 | – |
Figure 3Genetic composition of Chamaecyparis formosensis. (a) The scatter plots of Delta K. (b) The 2, 3 and 4 clusters obtained from STRUCTURE analyses.
Chamaecyparis formosensis individual provenance simulation result. A total of 92 samples composed of 20 MM, 25 HV, 23 GW, and 24 SY individuals were subjected to provenance simulation. Correct provenance is 95.00% (MM), 88.00% (HV), 69.57% (GW), and 100.00% (SY), with an overall mean correct rate of 88.04%.
| MM ( | HV ( | GW ( | SY ( | Over all ( | |
|---|---|---|---|---|---|
| Sum of correct samples | 19 | 22 | 16 | 24 | 81 |
| Correct cluster (%) | 95.00 | 88.00 | 69.57 | 100.00 | 88.04 |