| Literature DB >> 29443955 |
Manosh Kumar Biswas1, Ujjal Kumar Nath2,3, Jewel Howlader4, Mita Bagchi5, Sathishkumar Natarajan6, Md Abdul Kayum7, Hoy-Taek Kim8,9, Jong-In Park10, Jong-Goo Kang10, Ill-Sup Nou11.
Abstract
Lilies (Lilium sp.) are commercially important horticultural crops widely cultivated for their flowers and bulbs. Here, we conducted large-scale data mining of the lily transcriptome to develop transcription factor (TF)-associated microsatellite markers (TFSSRs). Among 216,768 unigenes extracted from our sequence data, 6966 unigenes harbored simple sequence repeats (SSRs). Seventy-one SSRs were associated with TF genes, and these were used to design primers and validate their potential as markers. These 71 SSRs were accomplished with 31 transcription factor families; including bHLH, MYB, C2H2, ERF, C3H, NAC, bZIP, and so on. Fourteen highly polymorphic SSRs were selected based on Polymorphic Information Content (PIC) values and used to study genetic diversity and population structure in lily accessions. Higher genetic diversity was observed in Longiflorum compared to Oriental and Asiatic populations. Lily accessions were divided into three sub-populations based in our structure analysis, and an un-rooted neighbor-joining tree effectively separated the accessions according to Asiatic, Oriental, and Longiflorum subgroups. Finally, we showed that 46 of the SSR-associated genes were differentially expressed in response to Botrytiselliptica infection. Thus, our newly developed TFSSR markers represent a powerful tool for large-scale genotyping, high-density and comparative mapping, marker-aided backcrossing, and molecular diversity analysis of Lilium sp.Entities:
Keywords: Lilium species; SSR markers; genetic diversity; transcription factor
Year: 2018 PMID: 29443955 PMCID: PMC5852593 DOI: 10.3390/genes9020097
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
In silico characterization of transcription factor simple sequence repeats (SSR) (TFSSR) extracted from transcription factor (TF) sequences of lily for marker development.
| Item | Count | % |
|---|---|---|
| No. of sequences searched | 216,768 | |
| SSR-containing sequences | 6966 | 3.21 |
| Transcription factor SSRs | 71 | 1.99 |
| Di-nucleotide repeats | 20 | 28.17 |
| Tri-nucleotide repeats | 47 | 66.20 |
| Tetra-nucleotide repeats | 0 | 0.00 |
| Penta-nucleotide repeats | 1 | 1.41 |
| Hexa-nucleotide repeats | 3 | 4.23 |
| Class I members | 11 | 15.49 |
| Class II members | 60 | 84.51 |
| GC-rich SSRs | 47 | 66.20 |
| AT-rich SSRs | 3 | 4.23 |
| AT/GC-balanced SSRs | 21 | 29.58 |
Figure 1Distribution of TFSSRs in different TF family genes. (a) Distribution by family in terms of repeat unit size; (b) Distribution by family in terms of motif nucleotide base composition.
Figure 2Summary of differential expression of TFSSR-associated genes in response to biotic stress. (a) Venn diagram represents time-course specific distribution of TFSSR-associated gene expression among up- and down-regulated categories; (b) Heat map showing hierarchical cluster analysis based on the log2fragments per kilobase of transcript per million mapped reads(FPKM) values.
Evaluation of TFSSR primer pairs for the different repeat classes (based on 8 genotypes).
| Parameters | Di | Tri | Tetra | Penta | Hexa | Total/Average |
|---|---|---|---|---|---|---|
| Tested primer | 20 | 47 | na | 1 | 3 | 71 |
| PCR amplification | 18 | 47 | na | 1 | 3 | 69 |
| Band Specific | 9 | 28 | na | 0 | 2 | 39 |
| Scorable Primer | 11 | 31 | na | 0 | 2 | 44 |
| Polymorphic | 10 | 29 | na | 0 | 2 | 41 |
| Range of Alleles No. | 2–5 | 2–6 | na | na | 2–3 | 2–6 |
| Total No. of Alleles | 50 | 146 | na | 4 | 7 | 207 |
| No. of Homozygous | 12 | 40 | na | 1 | 2 | 55 |
| No. of Heterozygous | 6 | 7 | na | 0 | 1 | 14 |
| Homo:Hetero Ratio | 2:1 | 6:1 | na | na | 2:1 | 4:1 |
| Mean of Alleles ±SD | 2.78 ± 0.94 | 3.11 ± 1.5 | na | na | 2.33 ± 0.58 | 3.06 ± 0.755 |
| PIC±SD | 0.55 ± 0.17 | 0.64 ± 0.15 | na | 0.47 ± 0 | 0.65 ± 0.16 | 0.58 ± 0.12 |
PIC: Polymorphic Information Content; SD: Standard Deviation; na: not available.
Figure 3Experimental evaluations of TFSSR markers for 31 TF families. (a) Evaluation of homozygosity and heterozygosity by TF family; (b) Evaluation of mono and polymorphism by TF family.
Figure 4Population structure of Lily germplasm. (a) Principal Coordinates Analysis (PCoA) using distance matrix values for 39 lily accessions obtained from primers for 14 TFSSRs, colored circles represent the groups of lily accessions; (b) Population structure analysis of 39 lily accessions using STRUCTURE V2.3.4, each vertical bar represents one accession; (c) Phylogenetic tree generated by using the variations of PCR amplicon with the 39 lily accessions.