| Literature DB >> 26184134 |
Guojun Zhang1,2,3, Zhenzhen Sun4, Di Zhou5, Min Xiong6, Xian Wang7, Junming Yang2, Zunzheng Wei8,9.
Abstract
A set of 899 L. gmelinii expression sequence tags (ESTs), available at the National Center of Biotechnology Information (NCBI), was employed to address the feasibility on development of simple sequence repeat (SSR) markers for Larch species. Totally, 634 non-redundant unigenes including 145 contigs and 489 singletons were finally identified and mainly involved in biosynthetic, metabolic processes and response to stress according to BLASTX results, gene ontology (GO) categories and Kyoto Encyclopedia of Genes and Genomes (KEGG) maps. Approximately 11.7% (74) unigenes contained 90 candidate SSRs, which were mainly trinucleotides (29, 32.2%) and dinucleotides (26, 28.9%). A relatively high frequency of SSRs was respectively found in the Open Reading Frame (ORF, about 54.4%) and 5'-untranslated region (5'-UTR, 31.2%), while a low frequency was observed in the 3'-untranslated region (3'-UTR, about 14.4%). Of the 45 novel EST-SSRs markers, nine were found to be polymorphic at two L. gmelinii populations. The number of alleles per locus (Na) ranged from two to four, and the observed (Ho) and expected (He) heterozygosity values were 0.200-0.733 and 0.408-0.604, respectively. The inbreeding coefficients (FIS) for all loci were more than zero except Lg41. Most of these 9EST-SSR markers were transferable to its related species L. kaempferi, L. principis-rupprechtii and L. olgensis. These novel EST-SSRs will be useful for further research on comparative genomics, genetic resources conservation and molecular breeding in larch trees.Entities:
Keywords: EST-SSRs; Larix gmelinii; cross-species transferability; genetic diversity
Mesh:
Substances:
Year: 2015 PMID: 26184134 PMCID: PMC6332378 DOI: 10.3390/molecules200712469
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Figure 1Gene ontology distribution of the L. gmelinii 634 EST-SSR unigenes into biological process, molecular function and cellular component. The number of transcripts encoded for each category is represented.
Figure 2Distribution of SSR motif frequency and typein L. gmelinii 634 EST-SSR unigenes. (A) SSR motif type and repeats numbers; (B) Percentage of dinucleotide SSR type; (C) Percentage of trinucleotide SSR type.
Characteristics of 10 polymorphic EST-SSR markers from L. gmelinii unigenes.
| Locus | Primers Sequence (5′-3′) | Repeats Motif | Size Range | Ta (°C) | SSR Locations | BLAST Top Hit Accession No. | Description of Putative Function | |
|---|---|---|---|---|---|---|---|---|
| Lg01 | F: CAGTGGTGTCCGTGGTGTA | (AGC)4 | 141–160 | 51.3 | ORF | XP_006375910.1 | hypothetical protein POPTR_0013s05890g ( | 2.00 × 10−20 |
| Lg02 | F: CTCTGTGACCAAGAAACCAA | (AGG)4 | 120–140 | 51.8 | ORF | XP_002306980.2 | hypothetical protein POPTR_0005s27390g ( | 3.00 × 10−38 |
| Lg06 | F: CAAGGATGGAGCAGACGAT | (AGA)5 | 135–150 | 50.4 | ORF | None | None | None |
| Lg14 | F: GGGGATTGCAGAGTAGAAA | (TC)6 | 140–150 | 49.5 | 5′UTR | XP_002319953.1 | Xyloglucan endotransglucosylase/hydrolase protein 9 precursor ( | 1.00 × 10−164 |
| Lg25 | F: GTGAGAGGTCAAACCCCAA | (AAG)4 | 105–125 | 53.8 | ORF | XP_003608708.1 | 40S ribosomal protein S30 ( | 2.00 × 10−24 |
| Lg32 | F: CTCTGTCGCACCAGCATTG | (AT)6 | 105–115 | 47.5 | ORF | XP_002307364.1 | 2-dehydro-3-deoxyphosphoheptonate aldolase family protein ( | 4.00 × 10−7 |
| Lg36 | F: TGCCCATCCTCTTTGTTTA | (GA)5 | 175–190 | 50.6 | ORF | None | None | None |
| Lg37 | F: ACAATGGCTTCCTTCAACA | (CT)6 | 160–170 | 47.1 | ORF | XP_002299125.2 | hypothetical protein POPTR_0001s04570g ( | 3.00 × 10−46 |
| Lg41 | F: ACTTCCACTAAGGTTGACA | (AGA)4 | 147–180 | 49.4 | ORF | XP_002313280.1 | 60S ribosomal protein L6 ( | 2.00 × 10−49 |
EST-SSR Genetic diversity for two populations of L. gmelinii and its cross-species transferability for L. kaempferi, L. principis-rupprechtii and L. olgensis.
| Locus | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ( | ( | ( | ( | ( | |||||||||||
| Lg01 | 2 | 0.467/0.704 | 0.338 | 2 | 0.200/0.278 | 0.280 | 3 | 0.000/0.406 | 1.000 ** | 2 | 0.100/0.375 | 0.733 * | 2 | 0.111/0.500 | 0.778 * |
| Lg02 | 2 | 0.600/0.464 | −0.292 | 4 | 0.733/0.560 | −0.310 | 2 | 0.500/0.469 | −0.067 | 3 | 0.800/0.535 | −0.495 | 2 | 0.667/0.494 | −0.350 |
| Lg06 | 2 | 0.267/0.498 | 0.464 | 2 | 0.267/0.498 | 0.464 | 2 | 0.000/0.219 | 1.000 ** | 2 | 0.000/0.180 | 1.000 ** | 2 | 0.000/0.346 | 1.000 ** |
| Lg14 | 2 | 0.200/0.358 | 0.441 | 2 | 0.267/0.480 | 0.444 | 1 | 0.000/0.000 | -- | 2 | 0.100/0.375 | 0.733 * | 2 | 0.111/0.278 | 0.600 |
| Lg25 | 3 | 0.400/0.371 | −0.078 | 3 | 0.733/0.598 | −0.227 | 2 | 0.250/0.375 | 0.333 | 3 | 0.400/0.465 | 0.140 | 2 | 0.111/0.278 | 0.600 |
| Lg32 | 2 | 0.267/0.444 | 0.400 | 3 | 0.467/0.558 | 0.163 | 1 | 0.000/0.000 | -- | 2 | 0.000/0.180 | 1.000 ** | 2 | 0.000/0.198 | 1.000 ** |
| Lg36 | 2 | 0.267/0.498 | 0.464 | 2 | 0.267/0.320 | 0.167 | 1 | 0.000/0.000 | -- | 2 | 0.100/0.255 | 0.608 | 2 | 0.333/0.401 | 0.169 |
| Lg37 | 2 | 0.200/0.358 | 0.441 | 3 | 0.400/0.611 | 0.345 | 1 | 0.000/0.000 | -- | 2 | 0.100/0.255 | 0.608 | 1 | 0.000/0.000 | -- |
| Lg41 | 3 | 0.467/0.584 | −0.167 * | 3 | 0.600/0.620 | −0.150 * | 3 | 0.625/0.617 | −0.013 | 2 | 0.500/0.495 | −0.010 | 2 | 0.333/0.500 | 0.333 * |
**: p ˂ 0.01; *: p ˂ 0.05; --: No available data.