| Literature DB >> 26243662 |
Temesgen Magule Olango1,2, Bizuayehu Tesfaye3, Mario Augusto Pagnotta4, Mario Enrico Pè5, Marcello Catellani6,7.
Abstract
BACKGROUND: Enset (Ensete ventricosum (Welw.) Cheesman; Musaceae) is a multipurpose drought-tolerant food security crop with high conservation and improvement concern in Ethiopia, where it supplements the human calorie requirements of around 20 million people. The crop also has an enormous potential in other regions of Sub-Saharan Africa, where it is known only as a wild plant. Despite its potential, genetic and genomic studies supporting breeding programs and conservation efforts are very limited. Molecular methods would substantially improve current conventional approaches. Here we report the development of the first set of SSR markers from enset, their cross-transferability to Musa spp., and their application in genetic diversity, relationship and structure assessments in wild and cultivated enset germplasm.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26243662 PMCID: PMC4524394 DOI: 10.1186/s12863-015-0250-8
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Enset and Musa plant materials used for marker validation, cross-transferability evaluation and genetic diversity analysis
| Genus and species | Biological type/taxonomic section | Number of accessions | Geographical origin | Source |
|---|---|---|---|---|
|
| ||||
|
| Wild | 6 | Dawro, Ethiopia | AARC |
|
| Cultivated | 5 | Ari, Ethiopia | HwU |
|
| Cultivated | 14 | Gamo Gofa, Ethiopia | AARC |
|
| Cultivated | 5 | Sidama, Ethiopia | HwU |
|
| Cultivated | 40 | Wolaita, Ethiopia | AARC |
|
| ||||
|
| Musa | 4 | India, Indonesia, Indonesia, NA | ITC |
|
| Musa | 8 | India, Malaysia, Papua New Guinea, Thailand, Philippines, Indonesia, Guadeloupe, NA | ITC |
|
| Musa | 1 | NA | ITC |
|
| Musa | 1 | Papua new Guinea | ITC |
|
| Callimusa | 1 | NA | ITC |
|
| 3 | Papua New Guinea, India, India | ITC | |
NA Not Available, AARC Areka Agricultural Research Center, HwU Hawassa University, ITC International Transit Center for Musa collection
Fig. 1Read length distribution and SSR composition of generated sequences from enriched enset genomic libraries. a. Read length for overall generated reads, quality reads with minimum size of 80 bp, reads containing SSRs and bearing primer pairs, b. Relative frequency (%) of SSRs (di-, tri- and tetranucleotide SSRs of size > 20 bp) and number of repeats in the sequences. Repeat number with C/I indicates compound or interrupted SSRs
Summary of pyrosequencing data and number of identified di-, tri- and tetra- nucleotide SSR loci
| Category | Numbers |
|---|---|
| Total number of reads | 9,483 |
| Total number of base-pairs | 1.9 Mbp |
| Number of quality readsa | 8,649 |
| Average length quality reads | 214 bp |
| Reads containing di- tri- and tetra-nucleotide SSR motifs with a size of > 20 bp | 840 |
| Sequence reads with SSR flanking region | 215 |
| SSR loci identified for primer-pair design | 217 |
| Perfect motif types in the identified loci | 208 |
| Dinucleotide motifs | 192 |
| Trinucleotide motifs | 14 |
| Tetranucleotide motifs | 2 |
| Compound motif types in the identified loci | 9 |
aquality reads = reads with minimum size of > 80 bp
Characteristics of 34 polymorphic SSR markers developed in enset (Ta = annealing temperature)
| Marker name | Forward primer sequence (5'–3') | Reverse primer sequence (5'–3') | Repeat motif | Size range (bp) | Ta (°C) |
|---|---|---|---|---|---|
| Evg-01 | AGTCATTGTGCGCAGTTTCC | GGAGGACTCCATGTGGATGAG | (CTT)8 | 100–120 | 60 |
| Evg-02 | GGAGAAGCATTTGAAGGTTCTTG | TTCGCATTTATCCCTGGCAC | (AG)12 | 118–153 | 55 |
| Evg-03 | ACAGCATAAGCGAAATAGCAG | ACAGCATAAGCGAAATAGCAG | (AG)12 | 107–123 | 60 |
| Evg-04 | GCCATCGAGAGCTAAGGGG | GGCAAGGCCGTAAGATCAAC | (AG)21 | 113–147 | 60 |
| Evg-05 | AGTTGTCACCAATTGCACCG | CCATCCTCCACACATGCC | (GA)22 | 103–141 | 60 |
| Evg-06 | CCGAAGTGCAACACCAGAG | TCGCTTTGCTCAACATCACC | (GAA)9 | 202–211 | 60 |
| Evg-07 | GGTTGTCCTCAAGAACGTGG | TGATGCCTAATGCCTCTCCC | (GTG)9 | 73–94 | 60 |
| Evg-08 | CCATCGACGCCTTAACAGAG | TGAACCTCGGGAGTGACATAAG | (GA)21 | 164–190 | 60 |
| Evg-09 | GCCTTTCGTATGCTTGGTGG | ACGTTGTTGCCGACATTCTG | (GA)13 | 141–175 | 60 |
| Evg-10 | CAGCCTGTGCAGCTAATCAC | CAGCAGTTGCAGATCGTGTC | (AG)21 | 191–210 | 60 |
| Evg-11 | GGCCTAGTGACATGATGGTG | TGATGCTAGATTCAAAGTCAAGG | (AC)13 | 135–160 | 60 |
| Evg-12 | TGCAACCCTTTGCTGCATTC | AGCATCATTCGCCATGGTTG | (TG)14 | 135–154 | 60 |
| Evg-13 | CTTGAAAGCATTGCATGTGGC | TCACCACTGTAGACCTCAGC | (CA)14 | 189–229 | 60 |
| Evg-14 | AACCAATCTGCCTGCATGTG | GCCAGTGATTGTTGAGGTGG | (TGA)8 | 153–159 | 60 |
| Evg-15 | TCCTTTAGGTTATTTGGTTGCC | CCTTGGACATGCCTCACATC | (AG)15 | 110–134 | 55 |
| Evg-16 | GGCTAGTCCAGTTGGAAAGAG | GTAATCACCTCTGCCTTCACC | (AG)13 | 109–117 | 60 |
| Evg-17 | GCGTCTGGTATGCTCAACTG | TCGGGAATGATACAGAGGCG | (TCA)8 | 111–154 | 60 |
| Evg-18 | TCACTCCGATGGAAGGGATG | TCTCCACCATTTTAGTTGGCAC | (GAG)7 | 181–188 | 60 |
| Evg-19 | GGTATGAAAGCCACACCACC | AGTTCACCCACGCCTCAC | (GT)16 | 234–255 | 60 |
| Evg-20 | TTGCTCTCTGCTACTGACGG | CCGGTAACTTGGTGGAAGTC | (CA)17 | 138–148 | 60 |
| Evg-21 | CAGGCAACCACTGCGATATG | CAGTTGTCTCCCCAGGTGC | (CA)12 | 106–116 | 60 |
| Evg-22 | CTATCCAGGAGCCCATCTCG | ACTCTTCTCTTCGCCTGTGG | (CA)15 | 88–94 | 60 |
| Evg-23 | CCACCAAAGGGCTCCTCG | TCGGATTCTCCCGCTATTGG | (AC)13 | 129–143 | 60 |
| Evg-24 | TTTTCGGACGGTCTCTGTGG | TTCTTCTGCTGGCGTTTGAG | (TTG)8 | 155–162 | 60 |
| Evg-25 | CACGTTGATGTCGTTCCGTC | GAATCGCTTCAAGGCGTAGG | (CT)13 | 201–229 | 60 |
| Evg-26 | AAGCCATTGATGACTCCCCG | CAGTTGCACGCAGAGAAAAC | (AC)12 | 110–139 | 60 |
| Evg-27 | GCAATAGAATGGTACGGAGCG | TTTTGACTGTTCCGACGGTG | (AG)16 | 103–123 | 60 |
| Evg-28 | AAGCCACGGAATCAGCAAAC | ACCCACTACCTTTCCCTAAGC | (AC)12 | 201–209 | 60 |
| Evg-29 | GTTCGACTCGTCCAAGAAGG | ACTGTCTTAGTGATAGCCATGC | (AC)15 | 103–113 | 60 |
| Evg-48 | TAATTCTTCCCACCGGGGTC | GACCACTTACTTTTTGCACGC | (TG)12 | 127–133 | 60 |
| Evg-49 | TCCTGCACCCTCCATATTCC | TCTCTCTCTCTGATCTTCGTAGC | (GA)13 | 226–234 | 60 |
| Evg-50 | ATCTTGAACGTGGGGAAGGG | TGATACCTGGTGAGGATGCG | (TG)13 | 162–188 | 60 |
| Evg-51 | TGAATGAGTGGGGGATGCTG | AATGGATCGTTATCCAACGTG | (CAT)9 | 145–148 | 60 |
| Evg-52 | TATGGGAAGGGGATCCACAC | CAAATGCCGATAGGGACAGC | (CA)13 | 212–231 | 60 |
Characteristics of the 34 polymorphic enset SSR markers used to assess genetic diversity in enset
| Marker name | Number of alleles | Ho | GD | PIC |
|---|---|---|---|---|
| Evg-01 | 9 | 0.64 | 0.67 | 0.63 |
| Evg-02 | 8 | 0.70 | 0.75 | 0.72 |
| Evg-03 | 6 | 0.64 | 0.64 | 0.58 |
| Evg-04 | 9 | 0.87 | 0.77 | 0.73 |
| Evg-05 | 4 | 0.49 | 0.65 | 0.58 |
| Evg-06 | 3 | 0.37 | 0.52 | 0.41 |
| Evg-07 | 6 | 0.82 | 0.72 | 0.67 |
| Evg-08 | 11 | 0.42 | 0.79 | 0.77 |
| Evg-09 | 9 | 0.83 | 0.79 | 0.76 |
| Evg-10 | 8 | 0.49 | 0.73 | 0.70 |
| Evg-11 | 6 | 0.78 | 0.66 | 0.62 |
| Evg-12 | 12 | 0.78 | 0.75 | 0.72 |
| Evg-13 | 7 | 0.58 | 0.60 | 0.52 |
| Evg-14 | 3 | 0.96 | 0.52 | 0.41 |
| Evg-15 | 5 | 0.41 | 0.68 | 0.62 |
| Evg-16 | 3 | 0.21 | 0.23 | 0.20 |
| Evg-17 | 8 | 0.72 | 0.72 | 0.68 |
| Evg-18 | 4 | 0.69 | 0.66 | 0.60 |
| Evg-19 | 5 | 0.13 | 0.12 | 0.12 |
| Evg-20 | 6 | 0.24 | 0.71 | 0.67 |
| Evg-21 | 5 | 0.79 | 0.69 | 0.65 |
| Evg-22 | 4 | 0.74 | 0.63 | 0.57 |
| Evg-23 | 6 | 0.57 | 0.64 | 0.59 |
| Evg-24 | 4 | 0.10 | 0.25 | 0.24 |
| Evg-25 | 5 | 0.58 | 0.60 | 0.53 |
| Evg-26 | 6 | 0.80 | 0.68 | 0.64 |
| Evg-27 | 8 | 0.40 | 0.66 | 0.61 |
| Evg-28 | 4 | 0.59 | 0.59 | 0.51 |
| Evg-29 | 5 | 0.51 | 0.60 | 0.55 |
| Evg-48 | 3 | 0.70 | 0.59 | 0.51 |
| Evg-49 | 4 | 0.10 | 0.10 | 0.09 |
| Evg-50 | 5 | 0.29 | 0.27 | 0.25 |
| Evg-51 | 2 | 0.32 | 0.50 | 0.37 |
| Evg-52 | 9 | 0.44 | 0.62 | 0.55 |
| Mean | 5.94 | 0.55 | 0.59 | 0.54 |
Diversity parameters estimated for enset population using 34 SSR markers
| Diversity parameters | Cultivated and wild population | Cultivation regions | ||||||
|---|---|---|---|---|---|---|---|---|
| Cultivated ( | Wild ( | Mean ± SE |
aAri ( | Gamo Gofa ( | Sidama ( | Wolaita ( | Mean ± SE | |
| Percentage of polymorphic loci (PPL%) | 100 | 91 | 96 ± 4.41 | 59 | 97 | 88 | 100 | 86 ± 9.41 |
| Number of different alleles (Na) | 5.88 | 2.56 | 4.22 ± 0.29 | 1.62 | 3.82 | 3.00 | 4.91 | 3.34 ± 0.16 |
| Rarefied allelic richness (Ar) | 3.56 | 2.32 | 2.94 ± 0.44 | 1.62 | 3.00 | 3.00 | 3.15 | 2.69 ± 0.36 |
| Number of effective alleles (Ne) | 2.79 | 1.88 | 2.34 ± 0.11 | 1.59 | 2.41 | 2.52 | 2.64 | 2.29 ± 0.09 |
| Shannon’s information index (I) | 1.16 | 0.67 | 0.91 ± 0.06 | 0.41 | 0.96 | 0.90 | 1.07 | 0.83 ± 0.04 |
| Observed heterozygosity (Ho) | 0.55 | 0.55 | 0.55 ± 0.04 | 0.53 | 0.53 | 0.56 | 0.55 | 0.54 ± 0.03 |
| Expected heterozygosity (He) | 0.59 | 0.40 | 0.49 ± 0.03 | 0.29 | 0.52 | 0.51 | 0.56 | 0.47 ± 0.02 |
| Private Na | 3.38 | 0.06 | 1.72 ± 0.21 | 0.03 | 0.44 | 0.15 | 1.06 | 0.42 ± 0.10 |
| Private Ar | 1.51 | 0.28 | 0.89 ± 0.43 | 0.14 | 0.41 | 0.46 | 0.36 | 0.34 ± 0.07 |
aAri population is represented by 5 individuals of the same landrace Entada which produces spontaneous suckers unlike other cultivated landraces
n = number of individuals per population
SE standard error
Analysis of Molecular Variance among and within populations of wild and cultivated enset as well as different growing regions
| Source of variation |
| Sum of squares | Variance component | Percentage variation (%) |
|
|---|---|---|---|---|---|
| Wild and cultivated enset | |||||
| Among Pops | 1 | 100.11 | 7.06 | 24 | 0.238 |
| Within Pops | 68 | 1537.61 | 22.61 | 76 | |
| Growing regions | |||||
| Among Pops | 3 | 199.15 | 3.91 | 16 | 0.16 |
| Within Pops | 60 | 1235.33 | 20.59 | 84 | |
P value is based on 1000 permutations; df = degree of freedom
Fig 2Genetic relationship and its pattern across sampling regions. a: UPGMA phylogenetic tree of individuals based on 34 polymorphic markers, b: Geographical location of sampling distribution. The colors of the dots in the tree correspond to the sampling location Southern Nations Nationalities and People’s Region, in southern Ethiopia
Fig 3Population structure based on 34 polymorphic SSR markers. a: phylogenetic tree of 5 enset groups and out-grouping Musa accessions inferred from DAPC, b: The estimated group structure with individual group membership values, c: DAPC scatter plot for 70 enset and 5 Musa accessions using the first two PCs. The inset indicates the number of PCs retained to describe the relationship between the clusters. The DAPC population numbers in each of the clusters correspond to group numbers of the phylogenetic tree
Fig 4Alignment and comparison of SSR containing homologous sequences between E. ventricosum landrace Gena (G) and M. acuminata ssp. malaccensis (M). Rectangular boxes indicate the occurrence of a variable number of repeat motifs between the two species along with multiple point mutations and INDELs both in SSR repeat block and flanking regions