| Literature DB >> 35024520 |
Ge Bai1,2,3, Da-Hai Yang1,2,3, Peijian Chao4, Heng Yao1,2,3, MingLiang Fei1,2,3, Yihan Zhang1,2,3, Xuejun Chen1,2,3, Bingguang Xiao1,2,3, Feng Li4, Zhen-Yu Wang5, Jun Yang4, He Xie1,2,3.
Abstract
The bHLH transcription factors play pivotal roles in plant growth and development, production of secondary metabolites and responses to various environmental stresses. Although the bHLH genes have been well studied in model plant species, a comprehensive investigation of the bHLH genes is required for tobacco with newly obtained high-quality genome. In the present study, a total of 309 NtbHLH genes were identified and can be divided into 23 subfamilies. The conserved amino acids which are essential for their function were predicted for the NtbHLH proteins. Moreover, the NtbHLH genes were conserved during evolution through analyzing the gene structures and conserved motifs. A total of 265 NtbHLH genes were localized in the 24 tobacco chromosomes while the remained 44 NtbHLH genes were mapped to the scaffolds due to the complexity of tobacco genome. Moreover, transcripts of NtbHLH genes were obviously tissue-specific expressed from the gene-chip data from 23 tobacco tissues, and expressions of 20 random selected NtbHLH genes were further confirmed by quantitative real-time PCR, indicating their potential functions in the plant growth and development. Importantly, overexpressed NtbHLH86 gene confers improve drought tolerance in tobacco indicating that it might be involved in the regulation of drought stress. Therefore, our findings here provide a valuable information on the characterization of NtbHLH genes and further investigation of their functions in tobacco.Entities:
Keywords: Characterization; Development; Genome-wide analysis; bHLH gene family
Year: 2020 PMID: 35024520 PMCID: PMC8720692 DOI: 10.1016/j.pld.2020.10.004
Source DB: PubMed Journal: Plant Divers ISSN: 2468-2659
Fig. 1Phylogenetic analysis of NtbHLH proteins from cultivated tobacco. A total of 309 NtbHLH proteins from cultivated tobacco (Nicotiana tabacum) were used to generate the unrooted neighbor-joining (NJ) tree with 1000 bootstrap replicates. The bHLH proteins were classified into 23 subfamilies and distinguished by different colors.
Fig. 2Intron number and Number of length distribution of NtbHLH gene family in tobacco. Exon-intron analyses of identical tobacco NtbHLH genes were performed with GSDS 2.0. Introns number and number of length distribution were calculated for 23 clades in tobacco, respectively.
Fig. 3Conserved motifs of tobacco NtbHLH proteins were predicted by MEME. Colored boxes indicate different motifs.
Consensus motif of bHLH domain in tobacco.
| Atchley et al. | Toledo-Ortiz et al. | Hua Sun et al. | this study | |||||
|---|---|---|---|---|---|---|---|---|
| Position in the alignment | Consensus motif amino acid frequency within the bHLH domain | Position in the alignment | Amino acid frequency within theArabidopsis bHLH domains | Position in the alignment | Amino acid frequency within the tomato bHLH domains | Position in the alignment | Amino acid frequency within the tobacco bHLH domains | |
| Basic | 1 | R (61%), K (27%) | 1 | R (24%), K (22%) | 1 | K (28%), R (25%),N (11%) | 1 | K (27%), R (30%) |
| 2 | R (77%), K (16%) | 2 | R (35%) | 2 | R (32%), K (11%) | 2 | R (37%) | |
| 9 | E (93%) | 13 | E (76%), A (10%) | 13 | E (75%), A (11%) | 13 | E (77%), A (10%) | |
| 10 | R (81%), K (14%) | 14 | R (74%), K (14%) | 14 | R (76%), K (18%) | 14 | R (80%), K (15%) | |
| 12 | R (91%) | 16 | R (91%) | 16 | R (94%) | 16 | R (92%) | |
| Helix | 16 | I (35%),L (33%),V (23%) | 20 | I (52%), L (27%), M (12%) | 20 | I (53%), L (28%), M(17%) | 20 | I (53%), L (26%), M(17%) |
| 17 | N (74%) | 21 | N (51%), S (19%) | 21 | N (45%), S (26%) | 21 | N (51%), S (25%) | |
| 20 | F (72%), L (14%), I (9%) | 24 | F (26%), L (26%), M (20%), I (14%) | 24 | L (28%), F (26%), M(19%), I (16%) | 24 | L (27%), F (22%), M(21%), I (11%) | |
| 23 | L (98%) | 27 | L (100%) | 27 | L (99%) | 27 | L (97%) | |
| 24 | R (44%), K (35%) | 28 | Q (42%), R (35%) | 28 | Q (41%), R (37%) | 28 | Q (44%), R (36%) | |
| Loop | 47 | K (58%), R (24%) | 39 | K (66%) | 36 | K (68%) | 36 | K (85%) |
| 50 | K (93%) | 42 | K (45%), T (13%) | 47 | K (45%), T (21%) | 40 | K (47%), T (18%), R (10%) | |
| 53 | I (74%), T (15%), V (7%) | 45 | M (33%), I (27%), V (16%), L (14%) | 50 | M(33%), I (28%), V (15%), L (14%) | 43 | M(35%), I (27%), V (15%), L (11%) | |
| 54 | L (98%) | 46 | L (76%), V (14%) | 51 | L (78%), I (11%) | 44 | L (80%), I (10%), V (7%) | |
| 57 | A (76%) | 49 | A (60%), I (16%), V (12%) | 54 | A (60%), I (18%), V (11%), T (10%) | 47 | A (55%), I (17%), V (13%), T (14%) | |
| 58 | I (31%), V (27%),T (23%) | 50 | I (63%), V (22%) | 55 | I (60%), V (25%) | 48 | I (60%), V (22%) | |
| 60 | Y (77%) | 52 | Y (78%) | 57 | Y (74%), H (13%) | 50 | Y (78%), H (8%) | |
| 61 | I (69%), L (16%), V 8%) | 53 | I (40%), V (33%), L (13%) | 58 | I (43%), V (38%), L (13%) | 51 | I (38%), V (41%), L (15%) | |
| 64 | L (80%), M (7%) | 56 | L (93%) | 61 | L (97%) | 54 | L (95%) | |
Fig. 4The bHLH domain is highly conserved across all NtbHLH proteins. The overall height of each stack represents the conservation of sequence at that position. Capital letters indicate over 50% conservation of amino acids among 309 NtbHLH domains.
Predicted DNA-binding categories based on the tobacco bHLH domain.
| Predicted activity | Predicted motif | Number of AtbHLHs (Toledo-Ortiz) | Number of SlbHLHs | This study | |||
|---|---|---|---|---|---|---|---|
| E-box | bHLH | 237 | 74.15% | 98 | 61.63% | 241 | 78.00% |
| G-box | bHLH | 187 | 60.54% | 72 | 45.28% | 191 | 61.81% |
| Non-G-box | bHLH | 50 | 13.61% | 26 | 16.35% | 50 | 16.18% |
| Non-E-box | bHLH | 11 | 7.48% | 12 | 7.55% | 45 | 14.56% |
| Total | 120 | 81.63% | 110 | 69.18% | 286 | 92.55% | |
| Non-DNA binding | HLH | 27 | 18.37% | 49 | 30.82% | 23 | 7.40% |
Fig. 5Expression profile of 303 NtbHLH genes in tissues at different developmental stages. The relative transcript abundances of 303 NtbHLH genes were examined via microarray and visualized as a heatmap. The expression profiles of NtbHLH genes in 23 different samples, including dry seeds, germination seeds, cotyledons, leaves from two-true leaf stage (labeled as two true leaf_leaf), roots from two-true leaf stage (two true leaf_root), leaves from four-true leaf stage (four true leaf_leaf), roots from four-true leaf stage (four true leaf_root), leaves from six-true leaf stage (six true leaf_leaf), roots from six-true leaf stage (six true leaf_root), leaves from ten-true leaf stage (ten ture leaf_leaf), roots from ten-true leaf stage (ten ture leaf_root), and flowers at squaring stage (squaring stage_flower). The X axis is the samples in tissues at different developmental stages. The color scale represents Log2 expression values.
Fig. 6Expression patterns of 20 randomly selected NtbHLH genes in tabcco. The relative transcript abundances of 20 randomly selected NtbHLH genes were examined via qPCR and visualized as a histogram. Tobacco flower and 6–7 weeks old seedings grown in the soil were collected for RNA extraction and qPCR analysis. 26S was used as an internal control. Error bars represent SD (n = 3).
Fig. 7Collinear analysis for the NtbHLH gene family in tobacco. The annulus represents chromosomes of tobacco (Nicotiana tabacum), and scale on the annulus is labeled in megabases (Mb). Homoeologous genes are linked by lines. The figure was generated and modified using the Circus program.
Fig. 8Phenotypes of NtbHLH86 overexpression transgenic lines in tobacco. (A) Expression of NtbHLH86 was induced by drought treatment. (B) Five independent NtbHLH86 overexpression plants were obtained and analyzed by qPCR. (C) Represented picture of wild-type and five independent NtbHLH86 overexpressors under drought treatment for 14 days. (D) Survival ratio of wild-type and five independent NtbHLH86 overexpressors seedlings under drought treatment for 20 days. For (A) and (B), 26S was used as an internal control. For (A), (B) and (D), error bars represent SD (n = 3). Asterisks indicate significant differences (∗p < 0.05, ∗∗p < 0.01, ∗∗∗p < 0.001) as determined by a two-tailed paired Student's t-test.