Literature DB >> 25389527

Identification and characterization of microsatellites in expressed sequence tags and their cross transferability in different plants.

Shamshad Ul Haq1, Rohit Jain2, Meenakshi Sharma1, Sumita Kachhwaha1, S L Kothari3.   

Abstract

Expressed sequence tags (EST) are potential source for the development of genic microsatellite markers, gene discovery, comparative genomics, and other genomic studies. In the present study, 7630 ESTs were examined from NCBI for SSR identification and characterization. A total of 263 SSRs were identified with an average density of one SSR/4.2 kb (3.4% frequency). Analysis revealed that trinucleotide repeats (47.52%) were most abundant followed by tetranucleotide (19.77%), dinucleotide (19.01%), pentanucleotide (9.12%), and hexanucleotide repeats (4.56%). Functional annotation was done through homology search and gene ontology, and 35 EST-SSRs were selected. Primer pairs were designed for evaluation of cross transferability and polymorphism among 11 plants belonging to five different families. Total 402 alleles were generated at 155 loci with an average of 2.6 alleles/locus and the polymorphic information content (PIC) ranged from 0.15 to 0.92 with an average of 0.75. The cross transferability ranged from 34.84% to 98.06% in different plants, with an average of 67.86%. Thus, the validation study of annotated 35 EST-SSR markers which correspond to particular metabolic activity revealed polymorphism and evolutionary nature in different families of Angiospermic plants.

Entities:  

Year:  2014        PMID: 25389527      PMCID: PMC4217358          DOI: 10.1155/2014/863948

Source DB:  PubMed          Journal:  Int J Genomics        ISSN: 2314-436X            Impact factor:   2.326


1. Introduction

The flowering plants are extremely diverse in their morphology, growth habit, environmental adaptation, and nuclear genome content [1]. Plant genomes tend to be large and complex, varying in size from 125 million base pairs (Mbp) for Arabidopsis thaliana [2] to 124,852 Mbp for Fritillaria assyriaca [3]. Despite so much diversity, plants do exhibit conservation of both gene content and gene order [4]. This diversity in the genomes makes comparative studies involving data from smaller genomes important for accelerating the study of larger genomes. More interesting, it can relate evolutanory consequences of diverse plant taxa. Comparison could also be made about the conserved sequences and information on the regulatory elements for extending the genetic information from model to more complicated species [5, 6]. Moreover, comparative genetic analyses have shown that different plants species comprise homologous genes for very similar functions [1, 7–9]. The DNA based markers are routinely used in ecological, evolutionary, taxonomical, comparative biology, diversity, phylogenic and genetic studies [10]. Among all the markers, microsatellites are preferred in plant genetics due to their hypervariability, relative abundance, multiallelic nature, high reproducibility, codominant inheritance, high polymorphism, high transferability, chromosome-specific location, extensive genome coverage, and highly informative and wide genomic distribution [10-12]. Microsatellites or simple sequence repeats (SSRs) are sequences in which one or few bases are tandemly repeated, ranging from 1 to 6 base pair (bp) long units which are dispersed randomly and ubiquitously throughout the genomes including both prokaryotes and eukaryotes [13-15]. Microsatellites arose from ESTs called as EST-SSRs or genic SSRs that represent functional molecular markers as “putative function or particular enzymatic activity” that can be deduced by public data base through computational approaches. With the development of functional genomics, a huge number of expressed sequence tags (ESTs) have been deposited in the public database (NCBI) [16]. An in silico approach, for retrieving EST sequences from NCBI, provides a potential source of EST-SSRs, and computational methods could assign putative functions of the ESTs to various metabolic pathways. SSRs in the transcribed region are expected to be more conserved, significant, and more transferable across taxonomic boundaries than anonymous SSRs [17, 18]. Thus, the development of SSR through searching the database of EST has become a fast, efficient, and low-cost option for many studies [12, 19, 20]. The assessments of EST-SSRs, in polymorphism, diversity, and transferability have been carried out in different plant species, namely, rice [21], grape [22], sugarcane [23], tomato [24], loblolly pine [25], barley [26], rye [27], cereals [28], leguminous and nonleguminous plants [29], medicinal plants [30], and the millet and nonmillet species [31-33]. In the present study, 7630 EST sequences were retrieved from NCBI for SSR identification and characterization. Functional annotations of the sequences were assigned for the development of informative EST-SSR markers and assessment of their transferability in different families.

2. Material and Methods

2.1. Plant Material and DNA Extraction and Purification

Young juvenile, disease free, immature leaves from various therapeutic plants such as Datura metel, Datura innoxia, Withania coagulans, Withania somnifera, Capsicum annuum, Eclipta alba, Stevia rebaudiana, Citrullus colocynthis, Ocimum sanctum, Catharanthus roseus, and Moringa oleifera were collected from the University of Rajasthan campus. These plants belong to five distinct families. DNA was extracted from leaves using CTAB method [34]. DNA sample was treated with RNAase for 1 h at 37°C and purified by phenol extraction (25 phenol : 24 chloroform : 1 isoamyl alcohol, v/v/v) followed by ethanol precipitation [35] and stored at −80°C for long period. DNA was checked on a 0.8% agarose gel for confirmation of quality and concentration and final adjustments were made in 10 mM Tris HCl buffer to obtain the working concentration of 25 ng/μL.

2.2. Mining of EST Sequences, ESTs Assembling, and Microsatellites Identification

Total 7630 putative or enzyme-encoding EST sequences were retrieved as FASTA format from the National Center for Biotechnology Information (NCBI) of different plants sources because our selected plants do not have much sequencing data in public database. ESTs assembling were carried out using CAP3 programme through online web tool (http://mobyle.pasteur.fr/cgi-bin/portal.py#forms::cap3), for identification of nonredundancy. Microsatellite identification was carried out using MISA (http://pgrc.ipk-gatersleben.de/misa/) software tool and criteria for SSRs detection were 6, 4, 3, 3, and 3 repeat units for di-, tri-, tetra-, penta-, and hexanucleotides, respectively. SSR primer pairs (forward and reverse) were designed for the selected sequence using online web tool batch primer 3 from the flanking sequences of the identified microsatellite motifs [36].

2.3. EST-SSR Sequences Annotation

To decipher informative assessment of SSR containing ESTs was done using Blastn/Blastx analysis for homology search and the nonredundant protein (NR) at the NCBI and functional annotation pipeline was also run at FastAnnotator (http://fastannotator.cgu.edu.tw/) for gene ontology (GO) system to the different GO functional classes that were displayed as horizontal bar chart in addition to detailed chart [37].

2.4. PCR Amplification and Electrophoresis

PCR reaction was carried out in a total of 10 μL volume containing 25 ng template DNA, 1.0 μL of each forward and reverse primers (at a concentration of 10 pmole/μL) [31-33], 0.2 μL of 100 mM of dNTPs, 0.5 U of taq DNA polymerase, 1.0 μL of 10X PCR buffer, and 2.5 mM of MgCl2. Amplification was performed in a thermal cycler (Bio Rad, UK) in the following conditions: initial denaturation at 94°C for 5 min followed by 30 amplification cycles for denaturation for 1 min at 94°C followed by annealing for 1 min then extension for 2 min at 72°C; final extension at 72°C for 7 min was allowed. The PCR conditions particularly the annealing temperatures (varying from 52°C to 58°C) for each primer were standardized (Table 1). All the designed primers were surveyed in the selected plants, for 2-3 times, and amplified products were stored at 4°C. PCR products were used for electrophoresis on 1.5% high resolution agarose gel (Merk bioscience) at 70 V for approximately 3.5 hours, made in 0.5X TBE (Tris-Borate-EDTA) buffer. Ethidium bromide was used in agarose gel electrophoresis as intercalating dye then gel was subjected to photograph under UV light.
Table 1

Details of 35 EST-SSR primer pairs sequences.

Serial numberTypePrimer pairs sequenceTa (°C)SSRPIC value E valuePutative identities
EMS-1FCTCAGAGTTTGCTCACCCCTAC56(GAA)4 0.8341.74E − 11 Chalcone-flavonone isomerase
EMS-1RTGCTGGGAAATGGTATGTGA
EMS-2FTTAGACTGTGACACTGCGAAGC52(CAA)4 0.919.59E − 81Chalcone synthase
EMS-2RACTTGGCCGAAAAACAACAC
EMS-3FCCACCCACTAAATCACTTGACA52(CCT)4 0.7482.6E − 113Cinnamate 4-hydroxylase
EMS-3RGATCCTCCTCGCTCTCGAAT
EMS-4FAGCCACTGCACTTCTTGTTCTT52(GCC)7 0.25526.41E − 38Cinnamyl alcohol dehydrogenase
EMS-4RGGGATCTTCACCACAAACTTCT
EMS-5FCGTGGTGGAAAAGCTATTTAGG56(AAT)4 0.72682.62E − 82Flavonol synthase
EMS-5RAAACAAAACCACCCACTTCA
EMS-6FAATCATGGCCCTTTACCCTAGT52(CTTT)3 0.15162.19E − 68Endoxyloglucan transferase
EMS-6RCTTTGGCTTCCATTGTTCTTGT
EMS-7FGAAACTGAGGCACACAAAAA52(TTC)4 0.89792.55E − 76Endoxyloglucan transferase
EMS-7RTCATCAGCGTTCCATAGACTGT
EMS-8FGAAGAAACGCACTCTCTCTCTCTT58(CTCT)3 0.72072.72E − 18Oligosaccharyl transferase
EMS-8RATATGAAGGGACCATAGCCAGA
EMS-9FGTCGTTGCTTCTTTCTGCTTTT53(CTTT)3 0.70641.03E − 62Xyloglucan transferase
EMS-9RTAAGGGTGGGTAAATGGGATAC
EMS-10FCGTACGAGTTGAATCTCAGGA56(GAGA)4 0.61979.17E − 80Glutamine synthetase
EMS-10RCCAACAGGCCAGTTCACTTC
EMS-11FCCAACAACCACAAGAAACTGG56(CCAA)3 0.84817E − 63Isoflavone reductase
EMS-11RACGACCTCCGAACATTCAAC
EMS-12FCACATGCAACAGCACATCAAT56(CTT)6 0.92445.63E − 68Isopentenyl-diphosphate isomerase
EMS-12RCATCGACTGGTACATCTTCAGC
EMS-13FAGGGGAGAGGAAGGTGGAGT52(TGC)4 0.89965.34E − 22Isopentenyl-diphosphate delta isomerase
EMS-13RCAATTTCATGTTCTCCCCAGAT
EMS-14FAATCAGAAAATGGCACAGTCCT54(CTCT)3 0.62216.61E − 28Isopentenyl pyrophosphate
EMS-14RTCAGAAGGTGCCTTGTAAAGAA
EMS-15FGTTAAAGGTTCAGGAACGAACG54(CTT)4 0.7772.41E − 38Isopentenyl pyrophosphate
EMS-15RGCAGATAGTCCAGTTCATGCTC
EMS-16FGTTAAAGGTTCAGGAACGAACG54(CTT)4 0.8288.73E − 36Dimethyllallyl pyrophosphate isomerase
EMS-16RAACAGGTGTTTGTCCACACAAG
EMS-17FGTTAAAGGTTCAGGAACGAACG54(CTT)4 0.83441.3E − 20Dimethyllallyl pyrophosphate isomerase
EMS-17RCAACAATACCAAGCTCATCCAA
EMS-18FCACGAGGCTTTTTGAGAAAT54(CTAG)3 0.41592.05E − 45Isopentenyl transferase
EMS-18RCCCCATAAAAAGCAACAGTCAT
EMS-19FCAATAATTAATCCAGGCGGTTC52(TGGC)3 0.83076.23E − 55Enoyl-[acyl-carrier-protein]
EMS-19RCAAAGACATCGGTGATTCAGTG
EMS-20FGATCAAGGAATGCCGATCTTAC52(TTC)3 0.83331.86E − 36Phenylalanine ammonia-lyase
EMS-20RCGAGGTGCCATTTTATTTCC
EMS-21FAATCCTCCTCCTCCTTCCTC52(CGC)6 0.84253E − 18Farnesyltransferase alpha subunit
EMS-21RCGCTATGAGACCAAGCATGTAA
EMS-22FACGATTGTTTGATTGCCAAG52(GGAT)3 0.88924.94E − 52Alanine aminotransferase
EMS-22RTCCAAATGTTCCAGCACAAA
EMS-23FGCACCTTTTCAACCGTTCAC52(CTT)4 0.89665.32E − 88Alanine aminotransferase
EMS-23RGGCTTGCACCATCTGTCAT
EMS-24FCCTCTCTCTCCCTCTCTCCAA54(CAC)4 0.91271.44E − 07Isopropylmalate dehydratase
EMS-24RAAATATCCGAGGGTACGATTCC
EMS-25FCATTTTCCTTGCCTCTCTCTCT54(CGC)8 0.88317E − 29Chorismate mutase
EMS-25RCACTGAGCGCATGTCTTTTG
EMS-26FCGCTCTAGTTTCACATAAGCAGTC54(TCTT)3 0.78551.74E − 11Serine/threonine-protein kinase
EMS-26RGAATCATTTGCTTTTGGGTGTC
EMS-27FAGGAAGGAAGGAAGGAGAGGAG56(CCG)4 0.8281.93E − 24Putative thiolase
EMS-27RCTGAGAAAACAGCTCAACTTGG
EMS-28FGCTTCCGTCTCCTTGGATAAC52(GTC)4 0.92313.18E − 15Acetyl-CoA acyltransferase
EMS-28RAACAGAACAAGCCCAGAACATT
EMS-29FCCAACTCCTCTCAACTTCTTGAT53(CGG)4 0.85251.6E − 101Putative alanine transaminase
EMS-29RGGGAAATGCCAGCAGAATAA
EMS-30FCAAAGGCGGATGAGTTCAAGT56(TGG)3 0.92521.2E − 115Flavonoid 3′-hydroxylase
EMS-30RCCATACGTTGACCAAGAGAGTG
EMS-31FTGTCTACTATCCAGCGAAACCA54(AAG)3 0.75493E − 18Cytochrome p450 monooxygenase
EMS-31RCAGCTAGTAAGATCGAGTTCAAACA
EMS-32FAAGCTATACGGCCCGATTTT52(AGA)3 0.8611.28E − 21Cytochrome p450 monooxygenase
EMS-32RTTTGATGGAGAAAGGTTGGTCT
EMS-33FTTGAAATCTGGTCAAGAGGATG52(TA)8 0.58964.81E − 13Methylenetetrahydrofolate reductase
EMS-33RAAGGCTGCACTTTGTATTGTCC
EMS-34FTGGGACATGTGCTATTTGCTAC52(ACTG)3 0.89911.00E − 79Squalene synthase
EMS-34RGCAGGTGAGAGCCAAGATAAAT
EMS-35FACACAACAGGATCCTTCGAAAT54(AT)5 0.87222.18E − 39Putative tryptophan synthase alpha chain
EMS-35RAAGCGAGGCTGCCAAGAAC

2.5. Genetic Relationship with EST-SSR Primer

Amplified bands were scored as binary data in the form of present (1) or absent (0). Dendrogram was constructed by neighbor-joining and Jaccard's algorithm using free tree/tree view free software [38, 39]. The polymorphism information content (PIC) values were calculated for each primer by using the online resource of PIC calculator (http://www.liv.ac.uk/~kempsj/pic.html).

3. Results

3.1. Frequency of Microsatellites in Expressed Sequence Tags

A total of 7630 EST sequences of putative function (enzyme-encoding sequences) involved in different plant metabolic pathways were retrieved from NCBI for microsatellite (SSR) identification. Nonredundant 1749 (1117 kb) sequences were identified comprising 884 contigs and 865 singlets, in which 263 SSRs were having 220 perfect SSRs, 38 sequences containing more than 1 SSR, and 26 SSRs present in compound formation. The frequency of EST-SSR was 3.4% or density was one SSR per 4.2 kb. Among all SSRs, trinucleotide repeats were highly abundant (47.52%) followed by tetranucleotide (19.77%), dinucleotide (19.01%), pentanucleotide (9.12%), and hexanucleotide (4.56%) repeats. A total of 58 different types of motifs were identified which belonged to three different types of dinucleotides repeats, nine different types of trinucleotides, sixteen different types of tetranucleotides, eighteen different types of pentanucleotides, and twelve different types of hexanucleotide repeats. The most frequent repeat motifs were AG/CT and AT/AT in dinucleotide, motifs AAG/CTT, CCG/CGG, and AGC/CTG in trinucleotide, motifs AAAT/ATTT and AAAG/CTTT in tetranucleotide, and motif AAAAC/GTTTT in pentanucleotide (Figure 1).
Figure 1

Details of motifs comprising di-, tri-, tetra- and pentanucleotides with sequence complementarity.

3.2. Expressed Sequence Tags (ESTs) Annotation and Primer Designing

EST sequences, from which the SSR markers developed, were examined by functional annotation (blastn/blastx/gene ontology) and to identify 35 EST-SSR markers, on the basis of their presence in primary metabolic process, secondary metabolic process, biosynthetic process, nitrogen compound metabolic process, oxidation-reduction process, transferase activity, oxidoreductase activity, lyase activity, nucleotide binding activity, and others (Figure 2). Primer pairs could be designed for functionally annotated 35 EST-SSRs that were 13.30% of the total microsatellites (263) identified and evaluated for polymorphic nature, cross transferability, and genetic relationships in 11 plant species of five different families. Trinucleotide repeats were highly abundant in 35 EST-SSRs followed by tetra- and dinucleotide repeats (Table 1). All these were associated with common metabolic pathways such as GO:0009813 flavonoid biosynthetic process, GO:0045430 chalcone isomerase activity, GO:0016114 terpenoid biosynthetic process, GO:0004452 isopentenyl-diphosphate delta isomerase activity, GO:0046653 tetrahydrofolate metabolic process, GO:0004489 methylenetetrahydrofolate reductase (NADPH) activity, GO:0006694 steroid biosynthetic process, GO:0008483 transaminase activity, GO:0000162 tryptophan biosynthetic process, GO:0006571 tyrosine biosynthetic process, GO:0009094 L-phenylalanine biosynthetic process, GO:0006633 fatty acid biosynthetic process, GO:0009809 lignin biosynthetic process, GO:0009695 jasmonic acid biosynthetic process, GO:0004310 farnesyl-diphosphate farnesyltransferase activity, GO:0004311 farnesyltranstransferase activity, GO:0004713 protein tyrosine kinase activity, GO:0045548 phenylalanine ammonia-lyase activity, GO:0009821 alkaloid biosynthetic process, and GO:0006695 cholesterol biosynthetic process (see supplementary table available online at http://dx.doi.org/10.1155/2014/863948).
Figure 2

Partial results of GO annotations obtained using FastAnnotator. These horizontal bar charts represent the distribution of GO terms categorized as biological process (a), cellular components (b), and molecular functions (c).

3.3. Amplification and Polymorphism of Annotated EST-SSR Markers in Selected Plants

A set of 35 primer pairs from different microsatellites in EST was tested for PCR optimization, characterization, and amplification with 11 plants belonging to different families. All markers produced polymorphic amplification profile in selected plants (Figure 3), which ranged from 50 to 1050 bp. DNA finger printing data of 35 EST-SSR with eleven plants revealed a total of 402 alleles at 155 loci with an average of 2.6 alleles per locus. The markers designed in this study had potential of showing polymorphism among different plants and the polymorphic information content (PIC) of 35 EST-SSR ranged from 0.15 to 0.93 with an average 0.77.
Figure 3

PCR amplification of ESM-28 primer in eleven plants belonging to five different families. Lane 1 Datura metel, 2 Datura innoxia, 3 Withania coagulans, 4 Withania somnifera, 5 Capsicum annuum, 6 Stevia rebaudiana, 7 Eclipta alba, 8 Citrullus colocynthis, 9 Ocimum sanctum, 10 Catharanthus roseus, and 11 Moringa oleifera.

3.4. Cross Transferability

All 35 annotated EST-SSR markers were assessed for cross transferability in the selected plants. The cross transferability of these markers was found to be 86.45% in Datura metel, 81.29% in Datura innoxia, 96.77% in Withania coagulans, 98.06% in Withania somnifera, 85.16% in Capsicum annuum, 34.84% in Stevia rebaudiana, 49.68% in Eclipta alba, 54.19% in Citrullus colocynthis, 43.23% in Ocimum sanctum, 58.71% in Catharanthus roseus, and 58.66% in Moringa oleifera, with an average of 67.86% (Table 2). These markers were found to be more transferable in Solanaceous plants (Datura metel, Datura innoxia, Withania coagulans, Withania somnifera, Capsicum annuum), ranging from 81.29% to 98.06% with an average of 89.55% as compared to other plants showing variable transfer rates. Thus, all markers showed reliable amplification pattern in different plants and were scored as transferable.
Table 2

Details of cross transferability of 35 EST-SSR markers in eleven plants belonging to five different families.

Serial numberAmplification range Datura metel Datura innoxia Withania coagulans Withania somnifera Capsicum annuum Eclipta alba Stevia rebaudiana Citrullus colocynthis Ocimum sanctum Catharanthus roseus Moringa oleifera Primer transferability
EMS-1104–8918343420312190.91
EMS-2129–8840879760363481.82
EMS-3117–79464221413112100.00
EMS-492–35822121111111100.00
EMS-5135–80234423211112100.00
EMS-694–1642220211111190.90
EMS-7105–91912925712182100.00
EMS-898–68062213112141100.00
EMS-9130–71732532131221100.00
EMS-1092–91352322112211100.00
EMS-11135–7245354501411290.90
EMS-12121–881763114328557100.00
EMS-1375–6290789272526081.81
EMS-14106–6493223011001063.64
EMS-1599–5175302211513290.90
EMS-16105–85354263521231100.00
EMS-1777–93043756322111100.00
EMS-1883–7871242301211490.90
EMS-19111–60332972214133100.00
EMS-20117–560541046113241100.00
EMS-21101–86365654112121100.00
EMS-22108–7743675620126790.90
EMS-23101–83703661011733890.90
EMS-2461–7852435510565590.90
EMS-25124–7905797512032690.90
EMS-2650–42573344111121100.00
EMS-27124–5645444030411181.81
EMS-2850–100044562232349100.00
EMS-2991–504224610421211100.00
EMS-30139–9008767440266590.90
EMS-3197–3884120220112381.81
EMS-3284–7742457510210281.81
EMS-33119–41941522111111100.00
EMS-34117–795551157112121100.00
EMS-35127–9513334330022281.81
  134126150152132775484679190
Percent of transferability of each plant 86.45 81.29 96.77 98.06 85.16 49.68 34.84 54.19 43.23 58.71 58.06 67.86%

3.5. Genetic Diversity Analysis by EST-SSRs

Genetic relationship among selected plants was further analyzed by construction of dendrogram through allelic data obtained from EST-SSR primer amplification. All the plants were grouped into two major clusters. Cluster I contained 5 plants of Solanaceae family with two subgroups (Ia and Ib); each subgroup comprised same genus plants clustered together (Datura metel, Datura innoxia (Ia) and Withania coagulans, Withania somnifera (Ib)). Cluster II contained 6 plant species classified into two major subgroups (IIa and IIb). Subgroup IIa comprised Asteraceous plants (Eclipta alba and Stevia rebaudiana) clustered together and subgroup IIb comprised four plant species into three separate edges of the dendrogram, exception with one plant (Figure 4). Thus, the annotated 35 EST-SSR markers showed discriminatory potential to some extent and showed close intimacy amongst Solanaceous and between Asteraceous plants.
Figure 4

A dendrogram of genetic relationships revealed by 35 annotated EST-SSR markers, based on neighbor-joining and Jaccard's algorithm using free tree and tree view software.

4. Discussion

The present study intended to utilize publicly available EST sequences from different plant sources for functional annotation of EST sequences to decode informative EST-SSR markers using in silico approach. Experimental methods to develop SSR markers are laborious, time consuming, and expensive; therefore use of publicly available EST libraries which reduce time and expenses is now being used as an alternative for marker identification [16, 20, 40, 41]. We identified nonredundant 263 microsatellites having di-, tri-, tetra-, penta-, and hexanucleotide repeats. The SSR frequency in the ESTs collection was 3.4% which is close to earlier reports in other plants species, namely, 3.4% in Physcomitrella patens and 3.5% in Oryza sativa [20] and 3.2% in cereals [42] and 4.1% in almond [43]. Other studies also reported SSRs in various frequencies, namely, 2.5% in grapes [22], 2.88% in sugarcane [23], 4.7% in rice [44], and 2.8% in barely [45]. In general, about 5% of ESTs contained SSRs in diverse plant species [46]. The differences in the frequency of EST-SSRs could be attributed to the “search criteria” used, type of SSR motif, size of sequence data, and the mining tools used [31, 47]. An average density of one SSR per 4.2 kb was detected which is closely comparable to earlier reported in date palm [48] and in cereals [42]. Among 263 microsatellites, trinucleotide repeat motifs were the most abundant, with a frequency of 47.52% followed by tetra- (19.77%), di- (19.01%), penta- (9.12%), and hexanucleotide (4.56%) repeats. Varshney et al. [42] reported that trinucleotide repeats (TNRs) are the most common, followed by either dinucleotide repeats (DNRs) or tetranucleotide (TTNRs) repeats. Our result of trinucleotide repeat frequency is in close agreement with previous studies reporting 48.5% in sugarcane [49] and 48% in Setaria italic [32]. Some other studies also reported high TNRs, namely, cereals [50], Ricinus communis [51], Eucalyptus globulus [52], sugarcane [12], and Setaria italica [31]. The reason for the abundance of trinucleotide repeats in plants might be attributed to absence of frameshift mutations [53]. Among all types of trinucleotide motifs, AAG/CTT, CCG/CGG, and AGC/CTG were in high proportion. Motifs GGA/TTC, CCT/AGG, GAA/TTC, and CCG/GGC were also detected. These motifs can form hairpin-like structures, which stabilize and allow them to escape from repair mechanisms [15, 54]. Each trinucleotide motif encodes a particular amino acid including stop codon which participates within protein in various metabolic activities [20, 55]. Predictable, twenty different types of amino acids were detected in trinucleotide motifs including one stop codon (Figure 5). Amino acids (leucine, serine, alanine, and arginine) encoded by trinucleotide motifs are in agreement with earlier studies [20, 30, 55, 56].
Figure 5

Details of predicted amino acids encoded by trinucleotide motifs.

According to functional annotation, 35 EST-SSRs were identified due to their direct involvment in metabolic pathways through blastn/blastx and gene ontology (GO). As observed in earlier studies, relavant transcripts were detected using functional annotation pipelines for various applications [57]. Most of these were involved in biological processes and molecular function such as primary metabolism, secondary metabolism, nitrogen compound metabolism, oxidation-reduction process, and transferase activity. The 35 EST-SSR primer pairs were designed and surveyed in different plants. All primers produced clear PCR amplification profiles in all the selected plants and produced 402 alleles at 155 loci with an average of 2.6 alleles/locus. This result is in close agreement with earlier study reported in chickpea (2.6 alleles/locus) [58]. A set of 35 EST-SSR markers produced a clear amplification profile and these were found to be transferable among the selected plant species. The frequency of cross transferability ranged from high in W. somnifera (98.06%) to a low (34.84%) in S. rebaudiana with an average of 67.86%. This result is in conformity with earlier report on cross transferability of Medicago truncatula EST-SSRs into four leguminous and 3 non-leguminous plants [29]. The transferability (70%) of castor bean SSRs was reported in J. curcas and other Jatropha species [59]. Mishra et al. [30] reported cross transferability (31–57%) of Madagascar periwinkle EST-SSR markers in other medicinal plants. Choudhary et al. [58] also observed cross transferability (68.3% to 96.6%) of chickpea EST-SSR marker across 6 annual Cicer species and also reported 29.4% to 61.7% transferability in seven legume genera. Foxtail millet derived EST-SSR markers showed cross transferability of approximately 85 to 89% in different types of millets and nonmillets [31-33]. Saha et al. [60] also reported approximately 92% transferability from tall fescue to 7 grass species. Some other higher level of transferability was reported in other studies, namely, 86.6% transferability of wheat EST-SSRs to other cereal plants [28], 96.5% cross species amplification among 22 Gossypium species [61], 95.2% cross transferability between Saccharum complex and cereals [49], and 90% transferability of Vigna radiata derived EST-SSR in other Vigna species [62]. Some Lower frequency of transferability was also reported in earlier studies. Gutierrez et al. [63] reported that approximately 40.6% transferability of Medicago truncatula EST-SSR markers amplified across 3 pulse crops (faba bean, chickpea, and pea). In this study, 35 EST-SSR markers were found to be more transferable (89.54%) among Solanaceous plant species than other plant taxa and these markers can give credence to various genetic applications in Solanaceous plants. Further, the genetic relationships among the eleven plants species were evaluated by construction of dendrogram (neighbor-joining/jaccard's algorithm) using allelic data amplified through 35 EST-SSR markers. Here these markers showed close intimacy amongst Solanaceous plants (D. metel, D. innoxia, W. coagulans, W. somnifera, and C. annuum) and between Asteraceous plants (E. alba and S. rebaudiana) and also showed discrimination to some extent in other selected plants (C. colocynthis, O. sanctum, C. roseus, and M. oleifera). Similar relationship was shown by Gupta and Prasad [29] who evaluated the genetic relationships between leguminous (M. truncatula, lentil, pea, and chickpea) and nonleguminous plants (A. thaliana, tomato, wheat). Some other studies also reported genetic relationships using EST-SSR markers in other plant species such as in bread wheat [50], Grasses [60], sugarcane [49] and millets and nonmillets [31-33].

5. Conclusion

This study revealed the insight of abundance and distribution of microsatellites in the expressed sequence tags, retrieved from public data base. Further, functional annotation was feasible to develop and select the informative EST-SSR markers for various genomic applications. This is a bypass approach to reduce cost and time and it is an efficient way to analyze the transcribed portion of genome besides development of own libraries. Finally, 35 EST-SSR markers were developed and experimentally validated for their polymorphic nature, cross transferability, and genetic relationship in eleven different plants species. On the basis of amplification profiles, all these markers were found to be transferable. Genetic relations were established to unambiguously differentiate selected plants species. Supplementary Table: The complete details of most promising hits of gene ontology of 35 EST-SSRs given in the supplementary table.
  51 in total

Review 1.  Comparative sequence analysis of plant nuclear genomes:m microcolinearity and its many exceptions.

Authors:  J L Bennetzen
Journal:  Plant Cell       Date:  2000-07       Impact factor: 11.277

2.  Data mining for simple sequence repeats in expressed sequence tags from barley, maize, rice, sorghum and wheat.

Authors:  Ramesh V Kantety; Mauricio La Rota; David E Matthews; Mark E Sorrells
Journal:  Plant Mol Biol       Date:  2002 Mar-Apr       Impact factor: 4.076

Review 3.  Advances in molecular marker techniques and their applications in plant sciences.

Authors:  Milee Agarwal; Neeta Shrivastava; Harish Padh
Journal:  Plant Cell Rep       Date:  2008-02-02       Impact factor: 4.570

4.  Slippage synthesis of simple sequence DNA.

Authors:  C Schlötterer; D Tautz
Journal:  Nucleic Acids Res       Date:  1992-01-25       Impact factor: 16.971

5.  Comparative analysis of polymorphism and chromosomal location of tomato microsatellite markers isolated from different sources.

Authors:  T. Areshchenkova; M. W. Ganal
Journal:  Theor Appl Genet       Date:  2002-02       Impact factor: 5.699

6.  Development of polymorphic markers from expressed sequence tags of Manihot esculenta Crantz.

Authors:  S Tangphatsornruang; S Sraphet; R Singh; E Okogbenin; M Fregene; K Triwitayakorn
Journal:  Mol Ecol Resour       Date:  2008-05       Impact factor: 7.090

7.  Informative genomic microsatellite markers for efficient genotyping applications in sugarcane.

Authors:  Swarup K Parida; Sanjay K Kalia; Sunita Kaul; Vivek Dalal; G Hemaprabha; Athiappan Selvi; Awadhesh Pandit; Archana Singh; Kishor Gaikwad; Tilak R Sharma; Prem Shankar Srivastava; Nagendra K Singh; Trilochan Mohapatra
Journal:  Theor Appl Genet       Date:  2008-10-23       Impact factor: 5.699

8.  Development of eSSR-Markers in Setaria italica and Their Applicability in Studying Genetic Diversity, Cross-Transferability and Comparative Mapping in Millet and Non-Millet Species.

Authors:  Kajal Kumari; Mehanathan Muthamilarasan; Gopal Misra; Sarika Gupta; Alagesan Subramanian; Swarup Kumar Parida; Debasis Chattopadhyay; Manoj Prasad
Journal:  PLoS One       Date:  2013-06-21       Impact factor: 3.240

9.  In silico comparative analysis of SSR markers in plants.

Authors:  Filipe C Victoria; Luciano C da Maia; Antonio Costa de Oliveira
Journal:  BMC Plant Biol       Date:  2011-01-19       Impact factor: 4.215

10.  Repertoire of SSRs in the Castor Bean Genome and Their Utilization in Genetic Diversity Analysis in Jatropha curcas.

Authors:  Arti Sharma; Rajinder Singh Chauhan
Journal:  Comp Funct Genomics       Date:  2011-05-22
View more
  9 in total

1.  Exploring the heat-responsive chaperones and microsatellite markers associated with terminal heat stress tolerance in developing wheat.

Authors:  Ranjeet R Kumar; Suneha Goswami; Mohammad Shamim; Kavita Dubey; Khushboo Singh; Shweta Singh; Yugal K Kala; Ravi R K Niraj; Akshay Sakhrey; Gyanendra P Singh; Monendra Grover; Bhupinder Singh; Gyanendra K Rai; Anil K Rai; Viswanathan Chinnusamy; Shelly Praveen
Journal:  Funct Integr Genomics       Date:  2017-06-01       Impact factor: 3.410

Review 2.  Cultivation, Genetic, Ethnopharmacology, Phytochemistry and Pharmacology of Moringa oleifera Leaves: An Overview.

Authors:  Alessandro Leone; Alberto Spada; Alberto Battezzati; Alberto Schiraldi; Junior Aristil; Simona Bertoli
Journal:  Int J Mol Sci       Date:  2015-06-05       Impact factor: 5.923

3.  Microsatellite loci for Urochloa decumbens (Stapf) R.D. Webster and cross-amplification in other Urochloa species.

Authors:  Rebecca C U Ferreira; Letícia J Cançado; Cacilda B do Valle; Lucimara Chiari; Anete P de Souza
Journal:  BMC Res Notes       Date:  2016-03-10

4.  Development and characterization of microsatellite markers in Campomanesia adamantium, a native plant of the Cerrado ecoregions of South America.

Authors:  Bruno do Amaral Crispim; Thamiris Gatti Déo; Juliana Dos Santos Fernandes; Adrielle Ayumi de Vasconcelos; Maria do Carmo Vieira; Thiago de Oliveira Carnevali; Miklos Maximiliano Bajay; Maria Imaculada Zucchi; Alexeia Barufatti
Journal:  Appl Plant Sci       Date:  2019-09-22       Impact factor: 1.936

5.  De novo transcriptome assembly and analysis of gene expression in different tissues of moth bean (Vigna aconitifolia) (Jacq.) Marechal.

Authors:  Sandhya Suranjika; Seema Pradhan; Soumya Shree Nayak; Ajay Parida
Journal:  BMC Plant Biol       Date:  2022-04-15       Impact factor: 5.260

6.  Assessment of Functional EST-SSR Markers (Sugarcane) in Cross-Species Transferability, Genetic Diversity among Poaceae Plants, and Bulk Segregation Analysis.

Authors:  Shamshad Ul Haq; Pradeep Kumar; R K Singh; Kumar Sambhav Verma; Ritika Bhatt; Meenakshi Sharma; Sumita Kachhwaha; S L Kothari
Journal:  Genet Res Int       Date:  2016-06-01

7.  First Microsatellite Markers Developed from Cupuassu ESTs: Application in Diversity Analysis and Cross-Species Transferability to Cacao.

Authors:  Lucas Ferraz Dos Santos; Roberta Moreira Fregapani; Loeni Ludke Falcão; Roberto Coiti Togawa; Marcos Mota do Carmo Costa; Uilson Vanderlei Lopes; Karina Peres Gramacho; Rafael Moyses Alves; Fabienne Micheli; Lucilia Helena Marcellino
Journal:  PLoS One       Date:  2016-03-07       Impact factor: 3.240

8.  Genome-Wide Discovery of Microsatellite Markers from Diploid Progenitor Species, Arachis duranensis and A. ipaensis, and Their Application in Cultivated Peanut (A. hypogaea).

Authors:  Chuanzhi Zhao; Jingjing Qiu; Gaurav Agarwal; Jiangshan Wang; Xuezhen Ren; Han Xia; Baozhu Guo; Changle Ma; Shubo Wan; David J Bertioli; Rajeev K Varshney; Manish K Pandey; Xingjun Wang
Journal:  Front Plant Sci       Date:  2017-07-18       Impact factor: 5.753

9.  Genome-wide identification of microsatellite markers from cultivated peanut (Arachis hypogaea L.).

Authors:  Qing Lu; Yanbin Hong; Shaoxiong Li; Hao Liu; Haifen Li; Jianan Zhang; Haofa Lan; Haiyan Liu; Xingyu Li; Shijie Wen; Guiyuan Zhou; Rajeev K Varshney; Huifang Jiang; Xiaoping Chen; Xuanqiang Liang
Journal:  BMC Genomics       Date:  2019-11-01       Impact factor: 3.969

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.