| Literature DB >> 35628103 |
Felipe López-Hernández1, Andrés J Cortés1.
Abstract
Mint (Mentha L., Lamiaceae) is a strongly scented herb of the family Lamiaceae that is grown mostly by clonal propagation, making it a valuable species for the study of somaclonal variation and its phenotypic consequences. The recent introduction of a few species of mint in South America, followed by a presumably rampant propagation, make this region particularly ideal for studying the extent of somaclonal genetic diversity. Hence, the objective of this work was to offer a preliminary characterization of somaclonal genetically coding diversity of the mint in the northern Andes in order to address the question of whether somaclonal variants may have emerged despite relatively recent introductions in a region where mint is not native. A total of 29 clonally propagated specimens, collected in mint export farms in the province of Antioquia, a major region for mint production in the northwest Andes of Colombia, were genotyped using RNA sequencing (RNA-Seq). SNP calling was carried out from the leaves' transcriptome profiles of each plant by combining the GATK4 and TRINITY protocols, obtaining a total of 2033 loci across 912 transcripts with a minimum read depth of 20X and 4% of missing data. Unsupervised machine learning algorithms considered the K-means, AGNES and UPGMA approaches, all of which suggested three genetic clusters for M. spicata and a unique cluster for M. × piperita. The results indicate that at least two different origins of M. spicata reached the eastern region of the Antioquia province, clonally propagated in the locality ever since for local consumption and export. One of these ancestries had more population structure, possibly due to environmental or anthropological pressures that intervened in the fragmentation of this genetic group or to a higher somaclonal mutation rate. This work offers a first step into the study of the accumulation and transmission of presumably quasi-neutral somatic mutations at coding regions in an herbaceous clonally propagated scented species such as mint, likely favored by an expected population expansion after its Andean introduction. These ad hoc hypotheses warrant further study as part of future research.Entities:
Keywords: M. spicata; M. × piperita L.; RNA-Seq; comparative transcriptomics; crop biodiversity
Mesh:
Year: 2022 PMID: 35628103 PMCID: PMC9141282 DOI: 10.3390/ijms23105291
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 6.208
Mapping statistics against the supertranscript. Mapping of each specimen’s transcript profile used as reference the supertranscript and the GATK4 protocol. The total number of sequences per sample, mapped sequences, duplicates and purified sequences in the refinement of the protocol are shown in this table. For details on the sampling information of each specimen please refer to Materials and Methods.
| Sample ID | Total Transcripts | Supplementary Transcripts | Duplicate Transcripts | Mapped Transcripts | Unmapped Duplicates | Mapped Ratio |
|---|---|---|---|---|---|---|
| MLV20-15244 | 13,595,799 | 438,550 | 10,261,916 | 13,431,369 | 3,169,453 | 0.988 |
| MLV20-15245 | 13,893,704 | 422,388 | 9,887,386 | 13,560,195 | 3,672,809 | 0.976 |
| MLV20-15246 | 19,286,261 | 628,197 | 14,805,807 | 18,976,542 | 4,170,735 | 0.984 |
| MLV20-15247 | 13,264,697 | 388,203 | 9,755,435 | 12,984,582 | 3,229,147 | 0.979 |
| MLV20-15248 | 14,873,365 | 403,064 | 10,434,595 | 14,174,092 | 3,739,497 | 0.953 |
| MLV20-15249 | 11,814,041 | 375,642 | 8,206,501 | 11,568,380 | 3,361,879 | 0.979 |
| MLV20-15250 | 12,501,455 | 260,480 | 9,220,160 | 11,900,514 | 2,680,354 | 0.952 |
| MLV20-15251 | 12,138,901 | 348,593 | 8,715,938 | 11,859,700 | 3,143,762 | 0.977 |
| MLV20-15252 | 15,738,616 | 474,121 | 11,982,280 | 15,212,909 | 3,230,629 | 0.967 |
| MLV20-15253 | 13,479,991 | 429,102 | 9,136,387 | 13,252,005 | 4,115,618 | 0.983 |
| MLV20-15254 | 14,800,126 | 468,816 | 10,804,162 | 14,579,876 | 3,775,714 | 0.985 |
| MLV20-15255 | 15,818,146 | 444,333 | 11,731,011 | 15,593,508 | 3,862,497 | 0.986 |
| MLV20-15256 | 15,727,298 | 509,431 | 11,699,111 | 15,532,158 | 3,833,047 | 0.988 |
| MLV20-15257 | 21,828,779 | 704,838 | 17,154,954 | 21,564,209 | 4,409,255 | 0.988 |
| MLV20-15258 | 14,676,563 | 452,082 | 10,462,672 | 14,444,502 | 3,981,830 | 0.984 |
| MLV20-15259 | 14,468,839 | 474,835 | 10,691,456 | 14,284,865 | 3,593,409 | 0.987 |
| MLV20-15260 | 15,790,191 | 453,092 | 9,426,258 | 12,650,256 | 3,223,998 | 0.801 |
| MLV20-15261 | 9,381,824 | 276,062 | 5,290,265 | 8,045,530 | 2,755,265 | 0.858 |
| MLV20-15262 | 15,512,116 | 493,130 | 10,567,253 | 15,326,192 | 4,758,939 | 0.988 |
| MLV20-15263 | 20,188,093 | 592,807 | 14,099,354 | 19,789,887 | 5,690,533 | 0.98 |
| MLV20-15264 | 12,867,595 | 441,468 | 8,952,424 | 12,227,682 | 3,275,258 | 0.95 |
| MLV20-15265 | 7,777,046 | 231,316 | 4,725,157 | 7,386,866 | 2,661,709 | 0.95 |
| MLV20-15266 | 20,521,417 | 686,319 | 15,064,734 | 20,332,633 | 5,267,899 | 0.991 |
| MLV20-15267 | 20,002,666 | 669,043 | 14,947,013 | 19,630,558 | 4,683,545 | 0.981 |
| MLV20-15268 | 24,788,824 | 718,176 | 19,701,023 | 24,412,087 | 4,711,064 | 0.985 |
| MLV20-15269 | 19,153,299 | 556,314 | 14,032,872 | 18,839,624 | 4,806,752 | 0.984 |
| MLV20-15270 | 11,730,613 | 372,805 | 8,680,285 | 11,611,304 | 2,931,019 | 0.99 |
| MLV20-15271 | 16,841,331 | 556,417 | 11,787,644 | 16,511,475 | 4,723,831 | 0.98 |
| MLV20-15272 | 19,610,849 | 640,811 | 14,233,830 | 19,274,477 | 5,040,647 | 0.983 |
Figure 1Overall diversity patterns in mint from the northwest Andes. (A) PCA analysis from 2033 loci distributed in 912 transcripts. The first component explained 58.56% of the variance, and the second component explained 22.27% of the variance (both totaling 80.83% of explained variance). Clustering validation was performed using the algorithms NbClust and optCluster, which suggested a total of four clusters: three of M. spicata (spicata A, spicata B and spicata C) and one of M. × piperita. All clusters were recovered by the first two components. (B) Dendrogram carried out by UPGMA analysis from 2033 variants distributed in 912 transcripts using Nei’s distance and bootstrap as resampling method with 10,000 replicates. These results also suggested a total of four clusters: three from M. spicata (spicata A, spicata B and spicata C) and one for M. × piperita. Clusters under both approaches in (A) and (B) are fully concordant.
Figure 2GO pathway to analyze polymorphic biological processes via GO codes from Blast2GO. GO terms ranged from 35,822 to 1471.06 and the number of associated transcripts ranging from 365 to 428 sequences. Main GO terms related to carbohydrate and energy metabolism (GO:0044237 and GO:0044238) and cysteine and methionine synthesis (GO:0008152 and GO:0009987). We also explored the enzyme codes of the 52 KEEG pathways associated with all polymorphic transcripts (Table 2, extended in Table S5). Of the KEEG pathways, 25% were related to carbohydrate metabolism, 19.23% of the KEEG pathways were related to amino acid metabolism, 11.54% of the KEEG pathways were related to energy metabolism and the other pathways were associated with less than 10% of the target queries. Main pathways of carbohydrate/amino acid synthesis were linked to glycolysis/gluconeogenesis (Figure S6), and cysteine/methionine metabolism (Figure S7) [14].
Mentha spp. used for RNA-Seq and sampling localities in Colombia, Antioquia province.
| Sequencing ID | ID UBV UCO | Species | Conditions | Municipality | Latitude | Longitude | Elevation |
|---|---|---|---|---|---|---|---|
| MLV20-15244_s | 1 |
| Open field | La Ceja | 6.067500 | −75.415056 | 2208 |
| MLV20-15245_s | 2 |
| Open field | La Ceja | 6.067500 | −75.415056 | 2208 |
| MLV20-15246_s | 3 |
| Protected cultivation | El Retiro | 6.066111 | −75.449528 | 2218 |
| MLV20-15247_s | 4 |
| Open field | El Retiro | 6.066111 | −75.449528 | 2218 |
| MLV20-15248_s | 5 |
| Protected cultivation | Medellin | 6.198222 | −75.511889 | 2554 |
| MLV20-15249_s | 6 |
| Protected cultivation | Medellin | 6.198222 | −75.511889 | 2554 |
| MLV20-15250_s | 10 |
| Open field | Marinilla | 6.167222 | −75.325861 | 2131 |
| MLV20-15251_s | 11 |
| Open field | Marinilla | 6.167222 | −75.325861 | 2131 |
| MLV20-15252_s | 12 |
| Protected cultivation | Rionegro | 6.192972 | −75.360972 | 2128 |
| MLV20-15253_s | 14 |
| Open field | Rionegro | 6.192972 | −75.360972 | 2128 |
| MLV20-15254_s | 15 |
| Open field | Rionegro | 6.198111 | −75.350639 | 2118 |
| MLV20-15255_s | 16 |
| Open field | Rionegro | 6.198111 | −75.350639 | 2118 |
| MLV20-15256_s | 17 |
| Protected cultivation | El Retiro | 6.066111 | −75.449528 | 2218 |
| MLV20-15257_s | 18 |
| Protected cultivation | Medellin | 6.198222 | −75.511889 | 2554 |
| MLV20-15258_s | 19 |
| Protected cultivation | Medellin | 6.198222 | −75.511889 | 2554 |
| MLV20-15259_s | 20 |
| Protected cultivation | Medellin | 6.198222 | −75.511889 | 2554 |
| MLV20-15260_s | 21 |
| Open field | Rionegro | 6.067500 | −75.415056 | 2208 |
| MLV20-15261_s | 22 |
| Open field | Rionegro | 6.067500 | −75.415056 | 2208 |
| MLV20-15262_s | 24 |
| Protected cultivation | La Unión | 5.910333 | −75.412278 | 2277 |
| MLV20-15263_s | 25 |
| Protected cultivation | La Unión | 5.910333 | −75.412278 | 2277 |
| MLV20-15264_s | 26 |
| Protected cultivation | San Vicente Ferrer | 6.325194 | −75.335694 | 2251 |
| MLV20-15265_s | 27 |
| Protected cultivation | San Vicente Ferrer | 6.325194 | −75.335694 | 2251 |
| MLV20-15266_s | 28 |
| Protected cultivation | San Vicente Ferrer | 6.325194 | −75.335694 | 2251 |
| MLV20-15267_s | 29 |
| Protected cultivation | Carmen de Viboral | 6.082287 | −75.316147 | 2260 |
| MLV20-15268_s | 33 |
| Protected cultivation | Carmen de Viboral | 6.082287 | −75.316147 | 2260 |
| MLV20-15269_s | 44 |
| Protected cultivation | Guarne | 6.213960 | −75.417330 | 2200 |
| MLV20-15270_s | 45 |
| Protected cultivation | Medellin | 6.198222 | −75.511889 | 2554 |
| MLV20-15271_s | Reference piperita |
| Protected cultivation | - | - | - | 2118 |
| MLV20-15272_s | Reference spicata |
| Protected cultivation | - | - | - | 2118 |
Related KEEG pathways (52) across transcripts using enzyme codes in Blas2GO outputs.
| KEGG Pathway | # of | # of Enzymes | KEEG Label |
|---|---|---|---|
| Glycolysis/Gluconeogenesis | 15 | 16 | Carbohydrate metabolism |
| Cysteine and methionine metabolism | 13 | 13 | Amino acid metabolism |
| Carbon fixation in photosynthetic organisms | 11 | 13 | Energy metabolism |
| Methane metabolism | 10 | 10 | Energy metabolism |
| Pyruvate metabolism | 9 | 12 | Carbohydrate metabolism |
| Glycine, serine and threonine metabolism | 9 | 9 | Amino acid metabolism |
| Glyoxylate and dicarboxylate metabolism | 9 | 9 | Carbohydrate metabolism |
| Starch and sucrose metabolism | 8 | 8 | Carbohydrate metabolism |
| Galactose metabolism | 7 | 8 | Carbohydrate metabolism |
| Nitrogen metabolism | 7 | 8 | Energy metabolism |
| Citrate cycle (TCA cycle) | 7 | 7 | Carbohydrate metabolism |
| Oxidative phosphorylation | 7 | 7 | Energy metabolism |
| Tyrosine metabolism | 7 | 7 | Amino acid metabolism |
| Amino sugar and nucleotide sugar metabolism | 6 | 7 | Carbohydrate metabolism |
| Phenylalanine, tyrosine and tryptophan biosynthesis | 6 | 7 | Amino acid metabolism |
| Terpenoid backbone biosynthesis | 6 | 7 | Metabolism of terpenoids and polyketides |
| Carbon fixation pathways in prokaryotes | 6 | 6 | Energy metabolism |
| Glycerolipid metabolism | 6 | 6 | Lipid metabolism |
| Pentose phosphate pathway | 6 | 6 | Carbohydrate metabolism |
| Alanine, aspartate and glutamate metabolism | 5 | 7 | Amino acid metabolism |
| Tryptophan metabolism | 5 | 5 | Amino acid metabolism |
| Ubiquinone and other terpenoid-quinone synthesis | 5 | 5 | Metabolism of cofactors and vitamins |
| Ascorbate and aldarate metabolism | 5 | 4 | Carbohydrate metabolism |
| Glutathione metabolism | 5 | 4 | Metabolism of other amino acids |
| Phenylalanine metabolism | 4 | 5 | Amino acid metabolism |
| Phenylpropanoid biosynthesis | 4 | 5 | Biosynthesis of other secondary metabolites |
| alpha-Linolenic acid metabolism | 4 | 4 | Lipid metabolism |
| Fructose and mannose metabolism | 4 | 4 | Carbohydrate metabolism |
| O-Antigen nucleotide sugar biosynthesis | 4 | 4 | Glycan biosynthesis and metabolism |
| Porphyrin metabolism | 4 | 4 | Metabolism of cofactors and vitamins |
| Cyanoamino acid metabolism | 3 | 4 | Metabolism of other amino acids |
| Inositol phosphate metabolism | 3 | 4 | Carbohydrate metabolism |
| Pentose and glucuronate interconversions | 3 | 4 | Carbohydrate metabolism |
| Glycerophospholipid metabolism | 3 | 3 | Lipid metabolism |
| Selenocompound metabolism | 3 | 3 | Metabolism of other amino acids |
| Arginine biosynthesis | 2 | 3 | Amino acid metabolism |
| Carotenoid biosynthesis | 2 | 2 | Metabolism of terpenoids and polyketides |
| Drug metabolism—cytochrome P450 | 2 | 2 | Xenobiotics biodegradation and metabolism |
| Drug metabolism—other enzymes | 2 | 2 | Xenobiotics biodegradation and metabolism |
| Fatty acid degradation | 2 | 2 | Lipid metabolism |
| Isoquinoline alkaloid biosynthesis | 2 | 2 | Biosynthesis of other secondary metabolites |
| Lysine degradation | 2 | 2 | Amino acid metabolism |
| Metabolism of xenobiotics by cytochrome P450 | 2 | 2 | Xenobiotics biodegradation and metabolism |
| Nicotinate and nicotinamide metabolism | 2 | 2 | Metabolism of cofactors and vitamins |
| One carbon pool by folate | 2 | 2 | Metabolism of cofactors and vitamins |
| Propanoate metabolism | 2 | 2 | Carbohydrate metabolism |
| Steroid biosynthesis | 2 | 2 | Metabolism of terpenoids and polyketides |
| Styrene degradation | 2 | 2 | Xenobiotics biodegradation and metabolism |
| Sulfur metabolism | 2 | 2 | Energy metabolism |
| Thiamine metabolism | 2 | 2 | Metabolism of cofactors and vitamins |
| Tropane, piperidine and pyridine alkaloid synthesis | 2 | 2 | Biosynthesis of other secondary metabolites |
| Valine, leucine and isoleucine degradation | 2 | 2 | Amino acid metabolism |