| Literature DB >> 30258345 |
Susan D Lawrence1, Nicole G Novak1.
Abstract
BACKGROUND: Brassica oleracea, B. rapa and B. napus encompass many economically important vegetable and oil crops; such as cabbage, broccoli, canola and Chinese cabbage. The genome sequencing of these species allows for gene discovery with an eye towards discerning the natural variability available for future breeding. The Q-type C2H2 zinc-finger protein (ZFP) transcription factors contain zinc finger motifs with a conserved QALGGH as part of the motif and they may play a critical role in the plants response to stress. While they may contain from one to five ZF domains (ZFD) this work focuses on the ZFPs that contain two zinc-fingers, which bind to the promoter of genes, and negatively regulate transcription via the EAR motif. B. oleracea and rapa are diploid and evolved into distinct species about 3.7 million years ago. B. napus is polyploid and formed by fusion of the diploids about 7500 years ago.Entities:
Keywords: Brassica; Cabbage; Canola; Q-type C2H2 zinc finger transcription factors
Mesh:
Substances:
Year: 2018 PMID: 30258345 PMCID: PMC6150991 DOI: 10.1186/s41065-018-0065-5
Source DB: PubMed Journal: Hereditas ISSN: 0018-0661 Impact factor: 3.271
Fig. 1Skeletonized phylogenetic tree of Brassica and Arabidopsis (Zats) Q-type ZFPs form 5 groups. The tree shows the position of the 18 Zats and the major bootstrap values. Groups are delineated as 2i-B, 2i-A, 2i-C, 2i-D and 2i-X in pink blue, green, yellow and turquoise, respectively. The proteins were aligned with CLUSTALW and the phylogenetic tree was generated using the maximum likelihood method with 1000 bootstrap replicates. A complete tree with the names of all Brassica and Arabidopsis proteins are in (Additional file 3)
Fig. 2Comparison of ZFP proteins on diploid Brassica species and the polyploid Brassica napus. Chromosomes of Brassica rapa in dark gray (BrZFP) Brassica oleracea in dark blue (BoZFP) to Brassica napus with the A genome in light gray (BnaAZFP) and C genome in light blue (BnaCZFP). BnaAZFP36, 37 and 38 are unmapped. Within the 35 BnaAZFP mapped genes, 83% (29) map to similar positions as BrZFP genes. Of the 38 BnaAZFP proteins 84% (32) have 97–100% (red) identity and 3 have 93–96% (purple). The final 3 proteins (black) have 81% (BnaAZFP9), 56% (BnaAZFP21) and 85% (BnaAZFP34) identity to the Br homolog. Generally, the BoZFP and the BnaCZFP genes map to the same location on the chromosome with either 96–100% (red) or 93–95% (purple) identity. BoZFP20 and BoZFP32 have homologous proteins that are currently unmapped in the B. napus genome (BnaCZFP32 and BnaCZFP33), while BoZFP4 and BoZFP8 (black) have no homolog in the B. napus genome. BoZFP36 and BoZFP37 (black) are currently unmapped on the B. oleracea genome and they are homologous to BnaCZFP34 and BnaCZFP35, which are also unmapped. BoZFP7 (black) is a homolog and maps to a B. napus gene that contains only a portion of the ORF
Fig. 3Comparison of map position and percent identity between the diploid Brassicas. Brassica rapa (top) and Brassica oleracea (bottom) chromosomes with red gene names containing protein sequences that are 92–98% identical, map to the same chromosome and are in identical order. BrZFP10 and BrZFP11 are tandem duplications. Genes in black are not present in the other species (BoZFP23, BoZFP29, BoZFP36). BrZFP14 is a homolog of BoZFP1, which is currently unmapped. Additional homologous pairs between the two species are identified by like colors and with or without underline or italics. These additional genes generally do not appear in the same genomic location. An exception is the 3 genes in orange on either chr 9 of B. oleracea or chr 10 of B. rapa
Number of Arabidopsis ZFP homologs in the Brassica A and C genome
| Gene Name | Arabidopsis Gene ID | # of Bo homologs | # of Br homologs | # of BnaA homologs | # of BnaC homologs |
|---|---|---|---|---|---|
| 2i-X | At1g02040 | 3 | 2 | 1 | 3 |
| Zat10 (2i-D) | At1g27730 | 3 | 2 | 2 | 3 |
| ZAT5 (2i-C) | At2g28200 | 2 | 2 | 2 | 2 |
| ZAT17 (2i-B) | At2g28710 | 2 | 2 | 2 | 2 |
| ZAT11 (2i-A) | At2g37430 | 2 | 2 | 3 | 3 |
| ZAT15 (2i-C) | At3g10470 | 2 | 2 | 2 | 1 |
| AZF2 (2i-D) | At3g19580 | 3 | 3 | 3 | 3 |
| ZAT16 (2i-B) | At3g46070 | 2 | 2 | 3 | 2 |
| ZAT8 (2i-B) | At3g46080 | 0 | 0 | 0 | 0 |
| ZAT7 (2i-B) | At3g46090 | 0 | 0 | 0 | 0 |
| ZAT13 (2i-D) | At3g49930 | 2 | 2 | 2 | 3 |
| ZAT18 (2i-A) | At3g53600 | 2 | 2 | 2 | 2 |
| ZAT14 (2i-C) | At5g03510 | 3 | 3 | 4 | 3 |
| ZAT6 (2i-D) | At5g04340 | 1 | 0 | 1 | 1 |
| (2i-C) | At5g04390 | 3 | 3 | 3 | 2 |
| AZF3 (2i-D) | At5g43170 | 1 | 1 | 1 | 1 |
| ZAT12 (2i-B) | At5g59820 | 3 | 4 | 4 | 2 |
| AZF1 (2i-D) | At5g67450 | 3 | 3 | 3 | 3 |
Homologs of ZFPs from Arabidopsis and Brassica A and C genome
*members of tandem duplications in square boxes
Fig. 4Conserved motifs identified in 146 Brassica ZFP proteins and in all five separate ZFP groups. A cartoon (top) shows the position of the conserved motifs on the protein sequence. Consensus motifs common to all 146 Q-type ZFPs from Brassica oleracea, rapa and napus include, 2 zinc finger motifs with invariant QALGGH (center) and an L-box (left) and EAR-motif (right). Amino acids in bold black are invariant within the composite motifs. Red and green amino acids are common to more than one of the subgroups within either the first or second zinc finger motifs. Alignment of all Brassica proteins in each subgroup confirms the presence of the motif variations (see Additional file 2)
B. oleracea Q-type C2H2 TF transcript length and expression
| BoZFP | Ensembla | Bolbasea | Transcription Notesb,c |
|---|---|---|---|
| BoZFP1 | cannot find transcript info for this gene | ||
| BoZFP2 | 534 | found EAR motif in club root SRR but there is a gap between 112-132 | |
| BoZFP3 | 669 | locus_18655, bud SRR630928 1–643 100% ID, seedling 102043,7–637, SNPs at 207, 537 and 541 | |
| BoZFP4 | 1170 | bud and flower, missing 1–47 in bud with gap at 793–815, or 1–168 in flower up to 1100 | |
| BoZFP5 | 786 | locus_27313, 2–746 with gap at 273–277 | |
| BoZFP6 | 807 | locus_29791 in seedling 102043, 5–775 | |
| BoZFP7 | 1050 | bud missing 1–66 bases, SNP at 372 | |
| BoZFP8 | 474 | 102043 flower has most of the transcript seedling is missing L-box | |
| BoZFP9 | 708 | locus_24740 and 40724, seedling 102043, 10 SNPs, 13–677 | |
| BoZFP10 | 831 | locus_9351 stem also found in stem of 2, 1–824 | |
| BoZFP11 | 1092 | flower with gap at 344–357 | |
| BoZFP12 | 477 | 420 | flower |
| BoZFP13 | 807 | 107140 leaf 100% ID, flower of 2 is 1–763 and bud is 7–772 with a gap 672–683 | |
| BoZFP14 | 726 | locus_2060, seedling 107140, 1–718 SNP at 492, in 102043 find gap at ~ 450–550 | |
| BoZFP15 | 615 | cannot find this transcript | |
| BoZFP16 | 702 | seedling 102043, 2–670 | |
| BoZFP17 | 543 | 102043 seedling | |
| BoZFP18 | 453 | 102043 complete coverage of entire transcript | |
| BoZFP19 | 882 | leaf of 107140 and flower of 2 covering 1–878 | |
| BoZFP20 | 864 | 102043 flower 2–841 includes the EAR motif | |
| BoZFP21 | 447 | 102043 seedling 3–447 only 1 SNP found | |
| BoZFP22 | 540 | 102043 seedling | |
| BoZFP23 | 717 | C1220 clubroot resistant, 1–717 with gap 327–352 and SNPs 173, 213, 471, 503, 653. 1–400 bases no gap or SNPs in 7 day cotyledon of broccoli | |
| BoZFP24 | 792 | bud of SRR630928 | |
| BoZFP25 | 690 | seedling 102043, 14–684 | |
| BoZFP26 | 810 | 801 | locus_47213, seedling 102043 |
| BoZFP27 | 1155 | 936 | clubroot resistant 27–862, gap 604–639 |
| BoZFP28 | 507 | 107140 seedling | |
| BoZFP29 | 696 | seedling 102043 | |
| BoZFP30 | 570 | flower | |
| BoZFP31 | 639 | 468 | flower M? Does not seem to coincide with map |
| BoZFP32 | 777 | 96–618, 635–750 of 777 in bud does have 2ZFPs and EAR motif maybe even L-box. | |
| BoZFP33 | 696 | locus_45812, stem 12–696 100% ID, leaf of 107140, 1–672 | |
| BoZFP34 | 486 | 102043 seedling great coverage 100% 3–480 of 486 | |
| BoZFP35 | 1038 | locus_43474 in bud 6–971 100% ID SRR630928, 102043 seedling, 1–893, 17 SNPs and a 3 base gap | |
| BoZFP36 | 708 | 720 | locus_43800, leaf |
| BoZFP37 | 798 | locus_ 26577, seedling 102043, 5–794 |
aLength of transcript in Ensembl Plants http://plants.ensembl.org/Brassica_oleracea/Info/Index
or Bolbase http://www.ocri-genomics.org/bolbase/genes.htm
bSRA (sequence read archive) data on the B. oleracea transcriptome 1) SRP032830; 9d old seedlings of the cabbage cultivars 102043 and 107140. Flowers were isolated from 102043 and leaf and root tissue were isolated from 107140. Loci were identified for 35,274 genes and presented in Kim et al. [34] Additional file 1: Table S1. 2) SRP017530; 7 week old plants isolated 7 different organs. 3) SRP029141; line C1220 clubroot disease resistant cabbage. 4) SRP034015; germinating broccoli
clocus # from Kim et al. [34] Additional file 1: Table S1, organ and/or cultivar and sequence ID = 100% unless noted otherwise