| Literature DB >> 34828262 |
Francesco Scariolo1, Fabio Palumbo1, Alessandro Vannozzi1, Gio Batta Sacilotto2, Marco Gazzola2, Gianni Barcaccia1.
Abstract
Lavender species are widely distributed in their wild forms around the Mediterranean Basin and they are also cultivated worldwide as improved and registered clonal varieties. The economic interest of the species belonging to the Lavandula genus is determined by their use as ornamental plants and important source of essential oils that are destinated to the production of cosmetics, pharmaceuticals and foodstuffs. Because of the increasing number of cases of illegal commercialization of selected varieties, the protection of plant breeders' rights has become of main relevance for the recognition of breeding companies' royalties. With this aim, genomic tools based on molecular markers have been demonstrated to be very reliable and transferable among laboratories, and also much more informative than morphological descriptors. With the rising of the next-generation sequencing (NGS) technologies, several genotyping-by-sequencing approaches are now available. This study deals with a deep characterization of 15 varietal clones, belonging to two distinct Lavandula species, by means of restriction-site associated DNA sequencing (RAD-Seq). We demonstrated that this technology screens single nucleotide variants that enable to assess the genetic identity of individual accessions, to reconstruct genetic relationships among related breeding lines, to group them into genetically distinguishable main subclusters, and to assign their molecular lineages to distinct ancestors. Moreover, a number of polymorphic sites were identified within genes putatively involved in biosynthetic pathways related to both tissue pigmentation and terpene production, useful for breeding and/or protecting newly registered varieties. Overall, the results highlighted the presence of pure ancestries and interspecific hybrids for the analyzed Lavandula species, and demonstrated that RAD-Seq analysis is very informative and highly reliable for characterizing Lavandula clones and managing plant variety protection.Entities:
Keywords: Lavandula; NGS; ancestry reconstruction; chloroplast DNA barcoding; flavonoids; genotyping by RAD sequencing; interspecific crosses; plant breeder’s rights; terpenes
Mesh:
Substances:
Year: 2021 PMID: 34828262 PMCID: PMC8621978 DOI: 10.3390/genes12111656
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
List of primers used for each chloroplast (cpDNA) and nuclear (nuDNA) marker with their nucleotide sequence, and reference source.
| Marker | Primer Name | Primer Sequence (5′-3′) * | T | References |
|---|---|---|---|---|
| rbcL_F | GCAGCATTYCGAGTAASTCCYCA | 55 | [ | |
| rbcL_R | GAAACGYTCTCTCCAWCGCATAAA | [ | ||
| matK4La | CCTTCGATACTGGGTGAAAGAT | 55 | [ | |
| matK1932Ra | CCAGACCGGCTTACTAATGGG | [ | ||
| psbA3′f | GTTATGCATGAACGTAATGCTC | 55 | [ | |
| trnHf | CGCATGGTGGATTCACAATCC | [ | ||
| ITS1 | ITS5 | GGAAGTAAAAGTCGTAACAAGG | 55 | [ |
| ITS2 | GCTGCGTTCTTCATCGATGC | [ |
* Y: C or T; S: G or C; W: A or T; T: primers’ annealing temperature.
Genetic Similarity matrix of 15 Lavandula individuals based on 16,228 SNPs with no missing data, and relative observed homozygosity (Obs. Ho) and heterozygosity (Obs. He).
| Obs. Ho | Obs. He | Sample | Genetic Similarity (GS) | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1603 | 66.1% | 33.9% | Cluster A | 1603 | 100.0% | ||||||||||||||
| 2601 | 60.1% | 39.9% | 2601 | 82.8% | 100.0% | ||||||||||||||
| 2604 | 72.8% | 27.2% | 2604 | 78.9% | 77.6% | 100.0% | |||||||||||||
| 1605 | 76.4% | 23.6% | Cluster B | 1605 | 79.8% | 77.8% | 73.8% | 100.0% | |||||||||||
| 1841 | 78.8% | 21.2% | 1841 | 77.9% | 75.7% | 71.1% | 86.4% | 100.0% | |||||||||||
| 1826 | 77.9% | 22.1% | 1826 | 79.2% | 76.9% | 74.1% | 87.4% | 86.5% | 100.0% | ||||||||||
| SD-014 | 85.5% | 14.5% | SD-014 | 76.5% | 74.3% | 70.6% | 88.5% | 83.9% | 83.3% | 100.0% | |||||||||
| BPI | 90.1% | 9.9% | Cluster C | BPI | 74.6% | 72.3% | 68.2% | 82.8% | 79.1% | 83.7% | 83.8% | 100.0% | |||||||
| ST-913 | 84.8% | 15.2% | ST-913 | 75.0% | 74.2% | 70.4% | 85.7% | 81.1% | 86.5% | 83.5% | 93.3% | 100.0% | |||||||
| SD-332 | 82.4% | 17.6% | SD-332 | 75.9% | 74.4% | 70.5% | 83.5% | 79.1% | 84.8% | 85.4% | 93.7% | 93.5% | 100.0% | ||||||
| 1811 | 89.7% | 10.3% | 1811 | 75.1% | 72.1% | 67.7% | 86.0% | 85.6% | 85.9% | 86.4% | 89.7% | 92.2% | 89.5% | 100.0% | |||||
| ST-103 | 77.6% | 22.4% | Cluster D | ST-103 | 75.7% | 72.9% | 69.4% | 80.7% | 79.4% | 76.9% | 82.8% | 83.2% | 82.2% | 82.3% | 82.0% | 100.0% | |||
| 3601 | 78.1% | 21.9% | 3601 | 72.2% | 70.4% | 67.8% | 78.0% | 76.5% | 75.2% | 80.0% | 76.3% | 77.2% | 77.2% | 80.9% | 86.0% | 100.0% | |||
| 2603 | 87.8% | 12.2% | Cluster E | 2603 | 63.0% | 64.0% | 65.9% | 55.4% | 58.8% | 56.2% | 54.0% | 53.9% | 53.9% | 53.4% | 51.6% | 54.9% | 53.1% | 100.0% | |
| 2605 | 71.6% | 28.4% | 2605 | 67.6% | 68.9% | 69.3% | 62.0% | 66.4% | 62.9% | 60.0% | 58.9% | 61.0% | 60.3% | 59.7% | 64.0% | 63.8% | 73.7% | 100.0% | |
| 1603 | 2601 | 2604 | 1605 | 1841 | 1826 | SD-014 | BPI | ST-913 | SD-332 | 1811 | ST-103 | 3601 | 2603 | 2605 | |||||
| Cluster A | Cluster B | Cluster C | Cluster D | Cluster E | |||||||||||||||
Figure 1(a) UPGMA dendrogram based on the pair-wise genetic similarity matrix highlighting five main “Clusters” for the no missing values containing dataset. (b) STRUCTURE software histogram for K = 3 of 15 individuals of Lavandula with a no missing values containing dataset (“red star” symbol labels individuals with homozygosity >80%).
Average genetic similarity of clusters identified through the construction of the UPGMA dendrogram, and average observed homozygosity (Avg. Obs. Ho).
| Avg. Obs. Ho | Cluster | Avg. Genetic Similarity (GS) | |||||
|---|---|---|---|---|---|---|---|
| 66.4% ± 3.7% | Cluster A | 79.8% ± 1.6% | |||||
| 79.7% ± 2.0% | Cluster B | 75.6% ± 0.9% | 86.0% ± 0.8% | ||||
| 86.7% ± 1.9% | Cluster C | 72.5% ± 0.8% | 83.9% ± 0.6% | 92.0% ± 0.8% | |||
| 77.9% ± 0.2% | Cluster D | 71.4% ± 1.1% | 78.7% ± 0.9% | 80.2% ± 1.0% | 86.0% ± N/A | ||
| 79.7% ± 8.1% | Cluster E | 66.4% ± 1.1% | 59.4% ± 1.5% | 56.6% ± 1.3% | 58.9% ± 2.9% | 73.7% ± N/A | |
| 78.5% ± 2.4% | A + B + C + D | 60.1% ± 1.0% | 79.7% ± 0.7% | ||||
| Cluster A | Cluster B | Cluster C | Cluster D | Cluster E | A + B + C + D | ||
Figure 2Principal Coordinate Analysis (PCoA) [24], based on the eigenvectors calculated starting from the genetic similarity matrix and highlighting the 5 mains “Clusters” (A to E) identified for the 15 analysed samples of Lavandula.
Summary statistics of the BLASTN analysis of the RAD-Seq reads against the exomes of S. indicum and S. splendens. Statistics information of the flavonoids and terpenes pathways involved genes is also reported.
| BLASTn | RAD-Tags ( | CDS ( | Protein Products | Avg. Identity (%) | Avg. Length (bp) | Avg. E-Value | Avg. Bitscore | Avg. Score | Avg. | Avg. Identity ( | Avg. |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Exome | 2618 | 2907 | 2077 | 87.3 | 64.4 | 5.33 × 10−12 | 80.2 | 87.5 | 8.2 | 56.2 | 87.3 |
| Flavonoids | 15 | 14 | 10 | 86.7 | 67.1 | 1.04 × 10−12 | 82.1 | 89.6 | 8.9 | 58.2 | 86.7 |
| Terpenes | 20 | 24 | 19 | 86.0 | 62.9 | 6.20 × 10−12 | 74.3 | 81.0 | 9.0 | 53.9 | 86.0 |
| Exome | 4239 | 6534 | 1215 | 88.7 | 64.2 | 2.90 × 10−12 | 83.8 | 91.5 | 7.3 | 56.9 | 88.7 |
| Flavonoids | 33 | 40 | 18 | 87.4 | 66.0 | 2.41 × 10−12 | 82.5 | 90.1 | 8.3 | 57.6 | 87.4 |
| Terpenes | 61 | 65 | 28 | 88.9 | 65.6 | 1.45 × 10−12 | 86.6 | 94.7 | 7.3 | 58.3 | 88.9 |
Genetic Similarity matrix of 15 Lavandula individuals based the BLASTN analysis against S. indicum exome, and relative observed homozygosity (Obs. Ho) and heterozygosity (Obs. He).
| Obs. Ho | Obs. He | Genetic Similarity (GS) | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 68.3% | 31.7% | Cluster A | 1603 | 100.0% | ||||||||||||||
| 60.4% | 39.6% | 2601 | 83.1% | 100.0% | ||||||||||||||
| 73.7% | 26.3% | 2604 | 79.6% | 77.3% | 100.0% | |||||||||||||
| 78.3% | 21.7% | Cluster B | 1605 | 81.2% | 78.5% | 74.8% | 100.0% | |||||||||||
| 86.1% | 13.9% | SD-014 | 77.9% | 75.5% | 72.3% | 88.9% | 100.0% | |||||||||||
| 78.2% | 21.8% | 1841 | 79.7% | 77.5% | 73.1% | 86.9% | 85.1% | 100.0% | ||||||||||
| 79.5% | 20.5% | 1826 | 81.0% | 78.3% | 75.9% | 87.0% | 84.5% | 88.1% | 100.0% | |||||||||
| 90.4% | 9.6% | Cluster C | BPI | 76.4% | 74.0% | 70.6% | 83.5% | 85.8% | 80.6% | 84.3% | 100.0% | |||||||
| 85.3% | 14.7% | ST-913 | 76.9% | 76.1% | 72.7% | 86.2% | 85.5% | 82.0% | 87.4% | 94.2% | 100.0% | |||||||
| 83.2% | 16.8% | SD-332 | 77.3% | 76.3% | 72.9% | 83.9% | 86.9% | 80.4% | 85.1% | 93.9% | 93.7% | 100.0% | ||||||
| 89.7% | 10.3% | 1811 | 77.5% | 74.3% | 70.4% | 86.8% | 87.9% | 86.2% | 86.6% | 90.4% | 92.6% | 90.0% | 100.0% | |||||
| 78.5% | 21.5% | Cluster D | ST-103 | 77.5% | 74.7% | 71.3% | 82.5% | 84.2% | 81.6% | 79.0% | 84.0% | 83.8% | 83.3% | 84.1% | 100.0% | |||
| 77.3% | 22.7% | 3601 | 75.0% | 73.2% | 70.4% | 79.4% | 80.9% | 78.5% | 76.6% | 77.8% | 78.9% | 78.4% | 82.2% | 87.0% | 100.0% | |||
| 87.0% | 13.0% | Cluster E | 2603 | 65.8% | 67.4% | 68.9% | 59.0% | 58.9% | 63.0% | 61.1% | 57.9% | 58.2% | 58.1% | 56.4% | 58.7% | 57.8% | 100.0% | |
| 70.2% | 29.8% | 2605 | 69.1% | 70.4% | 70.9% | 64.1% | 63.5% | 69.2% | 65.9% | 61.5% | 63.6% | 63.3% | 62.9% | 66.5% | 67.4% | 74.3% | 100.0% | |
| 1603 | 2601 | 2604 | 1605 | SD-014 | 1841 | 1826 | BPI | ST-913 | SD-332 | 1811 | ST-103 | 3601 | 2603 | 2605 | ||||
| Cluster A | Cluster B | Cluster C | Cluster D | Cluster E | ||||||||||||||
Average genetic similarity of clusters identified through the construction of the UPGMA dendrogram, and average observed homozygosity (Avg. Obs. Ho) The standard error is also reported.
| Avg. Obs. Ho | Sample | Avg. Genetic Similarity (GS) | |||||
|---|---|---|---|---|---|---|---|
| 67.4% ± 3.9% | Cluster A | 80.0% ± 1.7% | |||||
| 80.5% ± 1.9% | Cluster B | 77.1% ± 0.8% | 86.7% ± 0.7% | ||||
| 87.2% ± 1.7% | Cluster C | 74.6% ± 0.7% | 84.9% ± 0.6% | 92.5% ± 0.8% | |||
| 77.9% ± 0.4% | Cluster D | 73.7% ± 1.1% | 80.3% ± 0.9% | 81.6% ± 1.0% | 87.0% ± N/A | ||
| 78.6% ± 8.4% | Cluster E | 68.8% ± 0.8% | 63.1% ± 1.2% | 60.2% ± 1.0% | 62.6% ± 2.5% | 74.3% ± N/A | |
| 79.1% ± 2.3% | A + B + C + D | 63.4% ± 0.9% | 81.0% ± 0.7% | ||||
| Cluster A | Cluster B | Cluster C | Cluster D | Cluster E | A + B + C + D | ||
Multiple alignments results reporting read ID, S. indicum (GCF_000512975.1) accession number on NCBI database, Flavonoid/Terpenes product, KEGG ID, amino acid substitution based on the polymorphic SNP in the 15 individuals of Lavandula.
| FLAVONOIDS | ||||
|---|---|---|---|---|
|
|
|
|
|
|
| 3043 | XP_011100449.1 | anthocyanidin 3-O-glucosyltransferase 2 | K12930 | Ile -> Met |
| XP_011100453.1 | anthocyanidin 3-O-glucosyltransferase 2-like | |||
| 6706 | XP_011090466.1 | aspartate aminotransferase and glu/asp-prephenate aminotransferase | K15849 | Val -> Ala |
| 7480 | XP_011089364.1 | arogenate dehydratase/prephenate dehydratase 2, chloroplastic | K05359 | Glu -> Val |
| XP_011089363.1 | ||||
| 7969 | XP_011094662.1 | phenylalanine ammonia-lyase | K10775 | Gln -> Arg |
| 9011 | XP_011089239.2 | LOW QUALITY PROTEIN: 4-coumarate--CoA ligase-like 7 | K01904 | Gln -> Gln |
| 9012 | Gln -> Arg | |||
| 9955 | XP_020554052.1 | putative anthocyanidin reductase isoform X2 | K08695 | Uncertain |
| XP_011095308.1 | X -> Leu | |||
| 10947 | XP_011069886.1 | anthocyanidin 3-O-glucosyltransferase-like | K12930 | Arg -> Pro |
| 11587 | XP_011077338.1 | phenylalanine ammonia-lyase | K10775 | His -> Tyr |
|
| ||||
|
|
|
|
|
|
| 8036 | XP_011071094.1 | 1,4-dihydroxy-2-naphthoyl-CoA synthase, peroxisomal | K01661 | X -> X |
| 14576 | XP_011096130.1 | α-farnesene synthase | K14173 | Gly -> Glu |
| 6208 | XP_011093795.1 | β-amyrin synthase | K15813 | Lys -> Glu |
| 8386 | XP_011093795.1 | X -> Arg | ||
| 6208 | XP_011085901.1 | β-amyrin synthase-like | K15813 | Lys -> Glu |
| 8386 | XP_011085901.1 | X -> Arg | ||
| 6276 | XP_011095756.1 | ent-kaur-16-ene synthase, chloroplastic | N/A | Pro -> Ala |
| 7199 | XP_011083784.1 | ent-kaurene oxidase, chloroplastic-like | K04122 | Val -> Met |
| 3576 | XP_020550121.1 | geranylgeranyl transferase type-2 subunit α 1 | K09833 | Leu -> Ser |
| 10802 | XP_011092247.1 | gibberellin 20-oxidase-like protein | K05282 | Gln -> Gln |
| 11279 | XP_011096560.1 | gibberellin 2-β-dioxygenase | K04125 | Phe -> Leu |
| 10014 | XP_011098626.1 | gibberellin-regulated protein 4-like | N/A | Arg -> Gln |
| XP_011071640.1 | ||||
| 4578 | XP_011084658.1 | isopentenyl-diphosphate Delta-isomerase I | K01823 | Uncertain |
| 6515 | Phe -> Leu | |||
| 13525 | XP_011077171.1 | Pro -> Pro | ||
| 9817 | XP_011075409.1 | probable NAD(P)H dehydrogenase subunit CRR3, chloroplastic | N/A | Trp -> Leu |
| 14513 | XP_011082816.1 | probable solanesyl-diphosphate synthase 3, chloroplastic | K05356 | Leu -> Phe |
| 14513 | XP_011098150.1 | probable solanesyl-diphosphate synthase 3, chloroplastic isoform X2 | Leu -> Phe | |
| 5640 | XP_020551000.1 | protein prenyltransferase α subunit, isoform X6 | K14137 | Pro -> Gln |
| XP_020551002.1 | ||||
| 3603 | XP_011078470.1 | squalene monooxygenase | K00511 | Asn -> Thr |
| 9296 | XP_011092466.1 | squalene monooxygenase-like | Asp -> His | |
| 5280 | XP_011092839.1 | squalene synthase | K00801 | Pro -> Ser |
| XP_011092841.1 | ||||
| 4990 | XP_011082248.1 | vetispiradiene synthase 3 isoform X2 | K14182 | Asp -> Glu |
| 14152 | XP_020548233.1 | isochorismate synthase, chloroplastic-like | K01851 | Arg -> Met |
| 14154 | Gln -> Pro | |||
| 14685 | XP_020548234.1 | isochorismate synthase, chloroplastic-like | K01851 | Val -> Leu |
| 14687 | Thr -> Thr | |||
| 15015 | Lys -> Lys |
Figure 3(a) Neighbour Joining tree based on the polymorphic sites among ITS nuclear region, and matK, trnH-psbA and rbcL chloroplast barcoding regions. Bootstrap values are reported. (b) LOGO representation of polymorphic sites identified among the 15 Lavandula accessions analysed for the DNA barcoding.