| Literature DB >> 31905649 |
Shiya Shen1, Qianru Zhang1, Yu Shi1, Zhenmei Sun1, Qianqian Zhang1, Sijia Hou1, Rongling Wu1, Libo Jiang1, Xiyang Zhao2, Yunqian Guo1.
Abstract
As a plant-specific transcription factor, the NAC (NAM, ATAF1/2 and CUC2) domain protein plays an important role in plant growth and development, as well as stress resistance. Based on the genomic data of the cacao tree, this study identified 102 cacao NAC genes and named them according to their location within the genome. The phylogeny of the protein sequence of the cacao tree NAC family was analyzed using various bioinformatic methods, and then divided into 12 subfamilies. Then, the amino-acid composition, physicochemical properties, genomic location, gene structure, conserved domains, and promoter cis-acting elements were analyzed. This study provides information on the evolution of the TcNAC gene and its possible functions, laying the foundation for further research on the NAC family.Entities:
Keywords: Arabidopsis thaliana; NAC transcription factors; Theobroma cacao; bioinformatics; genome-wide analysis
Mesh:
Substances:
Year: 2019 PMID: 31905649 PMCID: PMC7017368 DOI: 10.3390/genes11010035
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Phylogenetic tree of NAC (NAM, ATAF1/2 and CUC2) domain protein from Arabidopsis and Theobroma cacao. The phylogenetic tree was constructed using the maximum parsimony (MP) method with 1000 bootstrap replications. The 16 subfamilies are distinguished in different colors, and the unclassified TcNACs are represented by the abbreviation “UN”.
Figure 2Phylogenetic relationships, gene structure and architecture of conserved protein motifs in NAC genes from Theobroma cacao. (A) The phylogenetic tree was constructed based on the full-length sequences of Theobroma cacao NAC proteins using MEGA 7.0 software. (B) Exon–intron structure of Theobroma cacao NAC genes. Blue boxes indicate untranslated 5′- and 3′-regions, red boxes indicate exons, and black lines indicate introns. (C) The motif composition of Theobroma cacao NAC proteins. The motifs are displayed in different colored boxes. The sequence information for each motif is provided in Supplementary File 3. The length of the protein can be estimated using the scale at the bottom.
Figure 3Distribution of TcNAC genes among 10 chromosomes. Vertical bars represent the chromosomes of Theobroma cacao. The chromosome number is to the top of each chromosome. The scale on the left represents chromosome length.
The Ka/Ks values of Theobroma cacao tandem repeat sequences.
| Tandem Repeat Sequence | Ka | Ks | Ka/Ks |
|---|---|---|---|
| TcNAC048/TcNAC047 | 0.069652 | 0.204593 | 0.34044 |
| TcNAC055/TcNAC057 | 0.022392 | 0.046771 | 0.47876 |
| TcNAC055/TcNAC056 | 0.031369 | 0.066627 | 0.47082 |
| TcNAC056/TcNAC057 | 0.020842 | 0.045468 | 0.45839 |
| TcNAC047/TcNAC046 | 0.071078 | 0.056378 | 1.26074 |
| TcNAC085/TcNAC084 | 0.057197 | 0.128216 | 0.44609 |
| TcNAC085/TcNAC086 | 0.051726 | 0.110318 | 0.46888 |
| TcNAC003/TcNAC004 | 0.028539 | 0.129133 | 0.22101 |
| TcNAC100/TcNAC101 | 0.014445 | 0.046379 | 0.33115 |
| TcNAC084/TcNAC086 | 0.053422 | 0.090511 | 0.59023 |
| TcNAC063/TcNAC005 | 0.111896 | 0.215804 | 0.51851 |
| TcNAC048/TcNAC046 | 0.077084 | 0.210968 | 0.36538 |