| Literature DB >> 31906076 |
Bingqian Tang1, Lingling Xie2, Ting Yi1, Junheng Lv1, Huiping Yang1, Xiang Cheng1, Feng Liu1,2, Xuexiao Zou1,3.
Abstract
Mitochondrial transcription termination factors (mTERFs) regulate the expression of mitochondrial genes and are closely related to the function of the mitochondrion and chloroplast. In this study, the mTERF gene family in capsicum (Capsicum annuum L.) was identified and characterized through genomic and bioinformatic analyses. Capsicum was found to possess at least 35 mTERF genes (CamTERFs), which were divided into eight major groups following phylogenetic analysis. Analysis of CamTERF promoters revealed the presence of many cis-elements related to the regulation of cellular respiration and photosynthesis. In addition, CamTERF promoters contained cis-elements related to phytohormone regulation and stress responses. Differentially expressed genes in different tissues and developmental phases were identified using RNA-seq data, which revealed that CamTERFs exhibit various expression and co-expression patterns. Gene ontology (GO) annotations associated CamTERFs primarily with mitochondrion and chloroplast function and composition. These results contribute towards understanding the role of mTERFs in capsicum growth, development, and stress responses. Moreover, our data assist in the identification of CamTERFs with important functions, which opens avenues for future studies.Entities:
Keywords: GO analysis; abiotic stress; capsicum; expression analysis; mTERF gene family; phylogenetic analysis
Mesh:
Substances:
Year: 2019 PMID: 31906076 PMCID: PMC6982079 DOI: 10.3390/ijms21010269
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
The basic information of the mitochondrial transcription termination factor (mTERF) gene family in capsicum. List of predicted genes and related information include gene name (Zunla_1), CM334 homologous gene and E-value, gene locus, molecular details.
| Gene | Gene Accession No. | CM334 Homelog | E-Value | Chr | Start | End | ORF bp | Size aa | Protein Molecular Weight/KD | PI |
|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
| 0 | Chr00 | 544847272 | 544848822 | 1551 | 516 | 59.52 | 9.24 |
|
|
|
| 0 | Chr00 | 545339784 | 545340731 | 948 | 315 | 35.86 | 9.3 |
|
|
|
| 1.25 × 10−12 | Chr00 | 591637478 | 591637978 | 501 | 166 | 19.33 | 7.6 |
|
|
|
| 0 | Chr01 | 227806286 | 227808022 | 1737 | 578 | 66.29 | 8.02 |
|
|
|
| 6.03 × 10−41 | Chr01 | 232164334 | 232164957 | 624 | 207 | 24.25 | 9.13 |
|
|
|
| 0 | Chr02 | 107229127 | 107230005 | 879 | 292 | 33.69 | 9.73 |
|
|
|
| 0 | Chr02 | 119529666 | 119531393 | 1728 | 575 | 65.45 | 8.98 |
|
|
|
| 0 | Chr02 | 143793081 | 143794292 | 1212 | 403 | 45.81 | 9.87 |
|
|
|
| 1.22 × 10−17 | Chr02 | 163004467 | 163007189 | 1455 | 484 | 54.96 | 9.2 |
|
|
|
| 4.29 × 10−69 | Chr03 | 95975 | 106725 | 1782 | 593 | 67.82 | 9.18 |
|
|
|
| 0 | Chr03 | 36374556 | 36376173 | 1533 | 510 | 57.4 | 8.83 |
|
|
|
| 1.51 × 10−17 | Chr03 | 168574645 | 168575497 | 729 | 242 | 28.26 | 9.22 |
|
|
|
| 0 | Chr03 | 250309314 | 250310699 | 1386 | 461 | 53.08 | 9.25 |
|
|
|
| 0 | Chr04 | 12169693 | 12170571 | 879 | 292 | 33.97 | 9.86 |
|
|
|
| 0 | Chr04 | 36527349 | 36528682 | 1263 | 420 | 47.68 | 9.46 |
|
|
|
| 0 | Chr04 | 36553095 | 36554216 | 1122 | 373 | 42.93 | 9.65 |
|
|
|
| 0 | Chr04 | 36586260 | 36587354 | 1095 | 364 | 41.92 | 9.28 |
|
|
|
| 6.61 × 10−18 | Chr04 | 36664744 | 36665883 | 1140 | 379 | 43.25 | 9.49 |
|
|
|
| 0 | Chr04 | 36679829 | 36682422 | 1143 | 380 | 43.5 | 9.47 |
|
|
| Chr04 | 36685395 | 36685721 | 327 | 108 | 12.4 | 9.52 | ||
|
|
|
| 0 | Chr04 | 36719507 | 36720607 | 1101 | 366 | 42.01 | 9.51 |
|
|
|
| 0 | Chr04 | 36830156 | 36837562 | 1185 | 394 | 45.17 | 10.02 |
|
|
|
| 0 | Chr04 | 36952799 | 36953611 | 813 | 270 | 30.41 | 9.42 |
|
|
|
| 0 | Chr04 | 37032362 | 37033440 | 981 | 326 | 38 | 9.64 |
|
|
|
| 1.02 × 10−15 | Chr04 | 37046910 | 37047527 | 618 | 205 | 23.63 | 9.52 |
|
|
|
| 7.52 × 10−78 | Chr05 | 6905983 | 6907173 | 1191 | 396 | 45.16 | 9.67 |
|
|
|
| 0 | Chr05 | 6908441 | 6909864 | 1173 | 390 | 44.77 | 10.02 |
|
|
|
| 0 | Chr05 | 112851518 | 112852951 | 1434 | 477 | 53.95 | 6.1 |
|
|
|
| 0 | Chr08 | 148385074 | 148388277 | 1971 | 656 | 75.9 | 9.16 |
|
|
|
| 0 | Chr09 | 32649482 | 32650606 | 1125 | 374 | 43.07 | 9.65 |
|
|
|
| 0 | Chr10 | 3405765 | 3407096 | 1332 | 443 | 50.49 | 6.12 |
|
|
|
| 0 | Chr11 | 79029802 | 79030677 | 876 | 291 | 33.23 | 9.57 |
|
|
|
| 4.08 × 10−15 | Chr11 | 197890867 | 197891505 | 639 | 212 | 24.48 | 8.93 |
|
|
|
| 0 | Chr12 | 40961437 | 40962249 | 813 | 270 | 31.42 | 9.1 |
|
|
|
| 6.69 × 10−15 | Chr12 | 119671476 | 119672487 | 930 | 309 | 35.48 | 9.55 |
Figure 1Chromosomal location and gene duplication of capsicum mTERFs (CamTERFs). The tandem duplicated genes are marked by blue rectangles.
Subcellular localization of CamTERFs. Listed are the predicted protein output scores for chloroplast transit peptide (C), mitochondrial targeting peptide (M), signal peptide (S), or any other location (-). Location (Loc) details the subcellular localization predicted based on the prediction scores and reliability class (RC, 1–5) indicates the strength of the prediction, with 1 indicating the strongest prediction. RC is a measure of the size of the difference (‘diff’) between the highest (winning) and the second highest output scores.
| Protein ID | Chloroplast Transit Peptide | Mitochondrial Targeting Peptide | Signal Peptide | Other Location | Loc | RC |
|---|---|---|---|---|---|---|
| CamTERF1 | 0.064 | 0.339 | 0.114 | 0.612 | _ | 4 |
| CamTERF2 | 0.888 | 0.047 | 0.016 | 0.07 | C | 1 |
| CamTERF3 | 0.079 | 0.536 | 0.06 | 0.568 | _ | 5 |
| CamTERF4 | 0.062 | 0.883 | 0.008 | 0.21 | M | 2 |
| CamTERF5 | 0.24 | 0.049 | 0.029 | 0.749 | _ | 3 |
| CamTERF6 | 0.098 | 0.127 | 0.165 | 0.532 | _ | 4 |
| CamTERF7 | 0.852 | 0.021 | 0.082 | 0.165 | C | 2 |
| CamTERF8 | 0.049 | 0.681 | 0.007 | 0.028 | M | 2 |
| CamTERF9 | 0.014 | 0.823 | 0.026 | 0.349 | M | 3 |
| CamTERF10 | 0.862 | 0.067 | 0.01 | 0.138 | C | 2 |
| CamTERF11 | 0.733 | 0.043 | 0.021 | 0.102 | C | 2 |
| CamTERF12 | 0.018 | 0.567 | 0.125 | 0.16 | M | 3 |
| CamTERF13 | 0.178 | 0.302 | 0.108 | 0.04 | M | 5 |
| CamTERF14 | 0.106 | 0.27 | 0.019 | 0.732 | _ | 3 |
| CamTERF15 | 0.177 | 0.12 | 0.054 | 0.147 | C | 5 |
| CamTERF16 | 0.082 | 0.857 | 0.006 | 0.031 | M | 2 |
| CamTERF17 | 0.009 | 0.176 | 0.763 | 0.07 | S | 3 |
| CamTERF18 | 0.014 | 0.797 | 0.163 | 0.042 | M | 2 |
| CamTERF19 | 0.261 | 0.635 | 0.021 | 0.098 | M | 4 |
| CamTERF20 | 0.06 | 0.164 | 0.142 | 0.883 | _ | 2 |
| CamTERF21 | 0.007 | 0.194 | 0.85 | 0.058 | S | 2 |
| CamTERF22 | 0.086 | 0.773 | 0.008 | 0.082 | M | 2 |
| CamTERF23 | 0.005 | 0.419 | 0.44 | 0.085 | S | 5 |
| CamTERF24 | 0.274 | 0.078 | 0.024 | 0.39 | _ | 5 |
| CamTERF25 | 0.685 | 0.039 | 0.176 | 0.248 | C | 3 |
| CamTERF26 | 0.069 | 0.284 | 0.085 | 0.06 | M | 5 |
| CamTERF27 | 0.131 | 0.598 | 0.034 | 0.07 | M | 3 |
| CamTERF28 | 0.212 | 0.659 | 0.026 | 0.029 | M | 3 |
| CamTERF29 | 0.665 | 0.268 | 0.01 | 0.103 | C | 4 |
| CamTERF30 | 0.009 | 0.538 | 0.611 | 0.025 | S | 5 |
| CamTERF31 | 0.029 | 0.588 | 0.064 | 0.492 | M | 5 |
| CamTERF32 | 0.071 | 0.339 | 0.012 | 0.707 | _ | 4 |
| CamTERF33 | 0.034 | 0.174 | 0.109 | 0.785 | _ | 2 |
| CamTERF34 | 0.815 | 0.029 | 0.203 | 0.017 | C | 2 |
| CamTERF35 | 0.139 | 0.095 | 0.053 | 0.679 | _ | 3 |
Figure 2CamTERFs protein predicted of the Subcellular localization.
Figure 3Phylogenetic relationship, conserved motif, and gene structure analysis of CamTERFs. (A) Phylogenetic tree of 35 CamTERFs. Depicted is an unrooted neighbor-joining phylogenetic tree; (B) Distributions of conserved motifs in CamTERFs. Ten putative motifs are indicated with different colored boxes. For details of motifs refer to Table 2; (C) Exon/intron organization in CamTERFs. Yellow boxes represent exons and black lines represent introns. The numbers 0, 1, and 2 represent the intron splicing phase. Exon length can be inferred by the scale at the bottom.
List of putative motifs in CamTERFs.
| Motif | Width | Sequence |
|---|---|---|
| 1 | 21 | YLVSHPALLMYSLEKRIKPRY |
| 2 | 29 | PKLLFYDVEKTLKPKLZFLKELGLSGSDL |
| 3 | 50 | VPPDSPMFLHGVQVLSSLKKSKLDRKJGIFKSFGWSDDDILTMFRKLPYC |
| 4 | 31 | YLRSLGGSDEBVVKLJKRCPWLLSYSLEKTL |
| 5 | 21 | YLINSLGFSKQEAJSASAKVT |
| 6 | 22 | SESKFLEKYVLPYKDELPDLYE |
| 7 | 21 | SEARIQTALTFFMKELGYKSA |
| 8 | 28 | LLRNFGFSNDKIRKMVLRCPQLLTQNPE |
| 9 | 21 | LVVNFFKQTGFBBTQIKKLVS |
| 10 | 21 | VKVIARDPKLLTRSLDTHJKP |
Figure 4Phylogenetic tree of mTERFs from Arabidopsis, rice, and capsicum. The phylogenetic tree was constructed using the NJ (neighbor-joining) method with 1000 bootstrap replications. The eight subfamilies are distinguished with different colors.
Figure 5Predicted cis-elements in CamTERFs promoters. Promoter sequences (−2000 bp of translation start site) of 35 CamTERFs (promoter sequence of CamTERF22 was omitted) and 6 housekeeping genes were analyzed by PlantCARE.
Figure 6(A) Expression profiles of CamTERFs in different tissues and organs. CR0: root control; L1–L9: Leaf from 0 DAG to 50 DAG; F1–F9: Flower from 0 DAG to 50 DAG; P10: Petal; O10: Ovary; STA10: Stamen; FST0 and FST1: Whole fruit from 3 DAP and 7 DAP; G1–G11: Pericarp from 10 DAP to 60 DAP; ST1 and ST2: Placenta and Seed from 10 DAP to 15 DAP; S3–S11: Seed from 20 DAP to 60 DAP; T3–T10: Placenta from 20 DAP to 60 DAP. DAG: Days after germination; DAP: Days after pollination; (B) Analysis of gene ontology (GO) enrichment for co-expressing CamTERFs. GO term categorization includes biological process (BP), cellular component (CC), and molecular function (MF).
Figure 7(A) Expression profiles of CamTERFs untreated following a 24 h time course; (B) expression profiles of CamTERFs under ABA (Abscisic acid) stress following a 24 h time course; (C) expression profiles of CamTERFs under salt stress and following a 24 h time course.