| Literature DB >> 21286311 |
Shih-Chi Hsu1, Mark F Belmonte, John J Harada, Kentaro Inoue.
Abstract
The plastid is an organelle vital to all photosynthetic and some non-photosynthetic eukaryotes. In the model plant Arabidopsis thaliana, a number of nuclear genes encoding plastid proteins have been found to be necessary for embryo development. However, the exact roles of plastids in this process remain largely unknown. Here we use publicly available datasets to obtain insights into the relevance of plastid activities to A. thaliana embryogenesis. By searching the SeedGenes database (http://www.seedgenes.org) and recent literature, we found that, of the 339 non-redundant genes required for proper embryo formation, 108 genes likely encode plastid-targeted proteins. Nineteen of these genes are necessary for development of preglobular embryos and/or their conversion to globular embryos, of which 13 genes encode proteins involved in non-photosynthetic metabolism. By contrast, among 38 genes which are dispensable for globular embryo formation but necessary for further development, only one codes for a protein involved in metabolism. Products of 21 of the 38 genes play roles in plastid gene expression and maintenance. Examination of RNA profiles of embryos at distinct growth stages obtained in laser-capture microdissection coupled with DNA microarray experiments revealed that most of the identified genes are expressed throughout embryo morphogenesis and maturation. These findings suggest that metabolic activities are required at preglobular and throughout all stages of embryo development, whereas plastid gene expression becomes necessary during and/or after the globular stage to sustain various activities of the organelle including photosynthetic electron transport.Entities:
Keywords: Arabidopsis thaliana; SeedGenes.; embryogenesis; globular embryo; microarray; plastid; preglobular embryo
Year: 2010 PMID: 21286311 PMCID: PMC2944999 DOI: 10.2174/138920210791616716
Source DB: PubMed Journal: Curr Genomics ISSN: 1389-2029 Impact factor: 2.236
Nuclear Genes Encoding Plastid Proteins Required for Arabidopsis thaliana Embryogenesis
| Gene | ID | Function (Cat) | References | |||
|---|---|---|---|---|---|---|
| EMB | F | L | ||||
| At1g34430 | NC | dihydrolipoamide S-acetyltransferase# | (M) | S | [ | P |
| At4g33680 | C | aminotransferase class I and II family protein# | (M) | [ | [ | [ |
| At1g74960 | C | ketoacyl-acyl carrier protein synthase# | (M) | [ | – | P |
| At4g26900 | C | imidazole glycerol phosphate synthase# (His biosynthesis) | (M) | [ | – | P |
| At2g36230 | C | N'-5'-phosphoribosyl-formimino-5-aminoimidazole-4-carboxamide ribonucleotide isomerase# (His biosynthesis) | (M) | [ | [ | P |
| At1g31860 | C2’ | phosphoribosyl-ATP pyrophosphohydrolase/phosphoribosyl-AMP cyclohydrolase (His biosynthesis) | (M) | [ | – | P |
| At5g10330+ | C | histidinol phosphate aminotransferase# (His biosynthesis) | (M) | [ | – | P |
| At5g14760 | C | L-asp oxidase (NAD biosynthesis) | (M) | [ | [ | [ |
| At5g50210+ | C | ouinolinate synthase (NAD biosynthesis) | (M) | [ | [ | [ |
| At2g01350 | C | ouinolinic acid phosphoribosyl transferase (NAD biosynthesis) | (M) | [ | – | [ |
| At2g28880 | NC | 4-amino-4-deoxychorismate synthase (folate biosynthesis) | (M) | S | [ | [ |
| At3g54660 | C2 | glutathione reductase# | (M) | S | [ | [ |
| At4g26500 | C | activator of plastidic and mitochondrial desulfurases (AtSufE) | (M) | [ | [ | [ |
| At1g08840 | C2 | DNA replication helicase# | (PGME) | S | – | T |
| At5g24120 | C | RNA polymerase sigma subunit SigE# | (PGME) | [ | – | [ |
| At3g02660 | C | tyrosyl-tRNA synthetase# | (PGME) | [ | – | [ |
| At3g18290 | C2’ | zinc finger protein-related# | (PGME) | S | – | T |
| At5g17710 | C | co-chaperone GrpE family protein# | (PH) | S | – | P |
| At3g46740+ | C | precursor protein import channel (Toc75) | (PT) | [ | [ | [ |
| At4g39120* | C | histidinol-phosphate phosphatase (His biosynthesis) | (M) | [ | [ | [ |
| At2g01860 | C | pentatricopeptide (PPR) repeat-containing protein# | (PGME) | S | – | T |
| At5g03800 | C | similar to pentatricopeptide (PPR) repeat-containing protein# | (PGME) | [ | – | T |
| At1g30610+ | C | pentatricopeptide (PPR) repeat-containing protein# | (PGME) | [ | – | T |
| At1g79490 | NC | pentatricopeptide (PPR) repeat-containing protein# | (PGME) | S | – | T |
| At4g39620 | C | pentatricopeptide (PPR) repeat-containing protein# | (PGME) | [ | – | T |
| At5g50280 | NC | pentatricopeptide (PPR) repeat-containing protein# | (PGME) | S | – | T |
| At3g49240 | C | pentatricopeptide (PPR) repeat-containing protein# | (PGME) | [ | – | P |
| At2g38770 | C | U5 associated protein | (PGME) | S | – | P |
| At5g26742 | NC | DEAD/DEAH box RNA helicase# | (PGME) | S | – | P |
| At3g18390+ | NC | chloroplast splicing factor (CRS1) | (PGME) | S | – | P |
| At4g26300 | NC | arginyl-tRNA synthetase# | (PGME) | [ | – | [ |
| At1g05190+ | NC | ribosomal protein L6 family protein# | (PGME) | S | – | P |
| At1g78630 | NC | ribosomal protein L13 family protein# | (PGME) | S | – | P |
| At4g04350 | C | leucyl-tRNA synthetase# | (PGME) | [ | – | [ |
| At1g62750+ | C | elongation factor Tu family protein# | (PGME) | [ | – | [ |
| At2g04842 | NC | threoninyl-tRNA synthetase# | (PGME) | [ | – | [ |
| At5g16715 | C | valyl-tRNA synthetase# | (PGME) | [ | – | [ |
| At4g29060 | C | elongation factor Ts family protein# | (PGME) | S | – | P |
| At3g48110+ | C2 | glycyl-tRNA synthetase# | (PGME) | [ | – | [ |
| At2g04530* | NC | RNase Z# | (PGME) | [ | – | [ |
| At5g18570*+ | C | GTP1/OBG family protein# | (PGME) | [ | – | [ |
| At3g10670 | C | plastidic SufC-like protein (Fe-S cluster biogenesis) | (PH) | [ | [ | [ |
| At3g04340+ | NC | FtsH protease family protein# | (PH) | S | – | P |
| At5g18820 | NC | RuBisCO subunit binding-protein alpha subunit (Cpn60a)# | (PH) | S | – | T |
| At1g02560* | C | ATP-dependent Clp protease proteolytic subunit (ClpP5)# | (PH) | [ | [ | [ |
| At1g06950+ | C | chloroplast protein import (Tic110) | (PT) | [ | [ | [ |
| At2g31530 | C | secY family protein# | (PT) | S | – | T |
| At4g32400* | C | nucleotide export | (T) | [ | [ | [ |
| At5g19620* | C | OEP80 | (U) | [ | – | [ |
| At5g66055+ | C | ankyrin repeat protein (AKRP) # | (U) | [ | – | [ |
| At1g10510 | C | leucine-rich repeat family protein, similar to ribonuclease inhibitor# | (U) | S | – | P |
| At3g12080 | C | GTP-binding family protein# | (U) | S | – | P |
| At5g63420 | C | metallo-beta-lactamase family protein# | (U) | S | – | P |
| At3g24560 | C | ATP binding | (U) | [ | – | T |
| At5g40160 | C | ankyrin repeat protein# | (U) | [ | [ | [ |
| At2g25660 | C | unknown | (U) | S | – | T |
| At5g57930 | C | Fe-S cluster related | (U) | S | – | T |
| At3g25860 | C | dihydrolipoamide S-acetyltransferase# | (M) | S | [ | [ |
| At5g16390 | C | biotin carboxyl carrier protein of acetyl-CoA carboxylase# | (M) | S | [ | [ |
| At3g20440 | C2 | 1,4-alpha-glucan branching enzyme# (starch biosynthesis) | (M) | S | – | P |
| At5g67570 | C4 | pentatricopeptide (PPR) repeat-containing protein# | (PGME) | S | – | [ |
| At3g29290 | NC | pentatricopeptide (PPR) repeat-containing protein# | (PGME) | S | – | T |
| At1g75350 | NC | ribosomal protein L31 family protein# | (PGME) | S | – | P |
| At5g22800 | C2 | aminoacyl-tRNA synthetase# | (PGME) | [ | – | [ |
| At1g23400 | C2 | chloroplast intron splicing factor | (PGME) | [ | [ | P |
| At2g04030+ | C | heat shock protein (Hsp90) | (PH) | S | [ | [ |
| At1g79560 | C2’ | AAA and metalloprotease (FtsH12) | (PH) | S | – | [ |
| At3g16290 | C2 | FtsH protease (AAA ATPase) | (PH) | S | – | P |
| At4g23430 | C | subunit of Tic complex (Tic32), short chain dehydrogenase | (PT) | [ | – | [ |
| At5g62990 | C | unknown | (U) | S | – | T |
| At3g61780 | C2 | unknown | (U) | S | – | T |
| At2g37920 | NC | copper transporter related | (U) | S | [ | T |
| At4g30580 | C | 2-acylglycerophosphoethanolamine acyltransferase | (M) | [ | [ | [ |
| At1g48850 | NC | chorismate synthase/5-enolpyruvylshikimate-3-phosphate phospholyase# | (M) | S | – | P |
| At3g55610 | C | delta 1-pyrroline-5-carboxylate synthetase B# (Pro synthesis) | (M) | [ | – | [ |
| At5g61410 | C | ribulose-5-phosphate-3-epimerase# | (M) | S | – | P |
| At3g06350 | C | dehydroquinate dehydratase; shikimate dehydrogenase# | (M) | S | [ | P |
| At1g08510 | C4 | acyl-acyl carrier protein thioesterase# | (M) | [ | – | T |
| At4g23100 | C | gamma-glutamylcysteine synthetase# | (M) | [ | [ | [ |
| At5g24400 | C | 6-phosphogluconolactonase | (M) | [ | – | [ |
| At5g52920 | C4 | similar to pyruvate kinase isozyme G# | (M) | [ | – | [ |
| At2g19450 | C4 | diacylglycerol O-acyltransferase / acyl CoA:diacylglycerol acyltransferase# | (M) | [ | [ | [ |
| At1g78580 | C | trehalose-6-phosphate synthase 1# | (M) | [ | [ | T |
| At3g10690+ | C3 | DNA gyrase subunit A family protein# | (PGME) | [ | – | [ |
| At3g06430+ | NC | pentatricopeptide (PPR) repeat-containing protein# | (PGME) | S | – | T |
| At3g18110 | C | pentatricopeptide (PPR) repeat-containing protein# | (PGME) | [ | – | T |
| At3g49170 | C | pentatricopeptide (PPR) repeat-containing protein# | (PGME) | [ | – | T |
| At4g20090 | C | pentatricopeptide (PPR) repeat-containing protein# | (PGME) | [ | – | T |
| At5g27270 | NC | pentatricopeptide (PPR) repeat-containing protein# | (PGME) | S | – | T |
| At1g06145 | UC | similar to pentatricopeptide (PPR) repeat-containing protein# | (PGME) | S | – | T |
| At1g79350 | NC | DNA-binding protein | (PGME) | S | – | T |
| At1g70070+ | C3 | DEAD/DEAH box helicase# | (PGME) | [ | – | P |
| At1g74970 | NC | Ribosomal protein S9# | (PGME) | S | – | [ |
| At1g14610 | C | valyl-tRNA synthetase# | (PGME) | [ | – | [ |
| At5g02250 | C4 | ribonuclease II family protein# | (PGME) | [ | [ | [ |
| At2g28000 | C3 | RuBisCO subunit binding-protein alpha subunit (Cpn60a) # | (PH) | [ | [ | P |
| At1g19800 | C4 | Permease (TGD1, trigalactosyldiacylglycerol 1) | (T) | [ | [ | [ |
| At4g33460 | NC | ABC-type transport protein# | (T) | S | – | P |
| At2g01735 | C | zinc finger (C3HC4-type RING finger) family protein# | (U) | S | – | T |
| At5g22640 | C | MORN (Membrane Occupation and Recognition Nexus) repeat-containing protein# | (U) | S | – | P |
| At3g07430 | C | YGGT family protein# | (U) | S | – | P |
| At1g58210 | C | kinase interacting family protein, similar to kinase interacting protein 1# | (U) | S | – | T |
| At4g28210 | C | Unknown | (U) | S | – | T |
| At1g21390 | NC | Unknown | (U) | S | – | T |
| At1g56200 | C3 | Unknown | (U) | [ | – | [ |
| At5g53860 | C | Unknown | (U) | S | – | P |
| At1g49510 | C2 | Unknown | (U) | S | – | T |
| At2g03050* | C | similar to the mitochondrial transcription termination factor | (PGME) | [ | – | [ |
Genes not listed in the SeedGenes database but reported in individual literatures are indicated with an asterisk (*), and those that give mutants with no viable homozygotes as reported by Myouga et al. [42] with a plus symbol (+).
ID indicates identity confidence as defined in SeedGenes database. C, confirmed by the presence of multiple alleles causing an embryo arrest or by the genetic complementation assay; C2, having multiple null-lines with insertions in different portions of exons showing different terminal phenotypes; C2’, having multiple alleles including the ones with 5’UTR insertion causing a phenotype different from those with coding region insertions; C3, having null-mutant seeds that can germinate and develop into seedlings but not beyond; C4, having null-mutant seeds that can germinate and develop into mature plants; NC, not confirmed (only a single mutant allele with sequence information is available); UC, uncertain (insertion or mutation site not within coding region or 5' UTR and either downstream of stop codon or more than 250 bp upstream of start codon. The information of identity confidence extracted from SeedGenes database has been further updated with recent reports.
Function is assigned based on annotation in public database (GreenPhylDB http://greenphyl.cirad.fr/cgi-bin/greenphyl.cgi as indicated with a number sign #) or individual publications. Cat, functional categories: M, metabolism; PGME, plastid gene maintenance and expression; PH, protein homeostasis; PT, protein trafficking; T, transport; U, unknown.
References are listed for EMB (embryo deficiency), F (function), and L (localization). S, embryo-defective mutants were reported only by SeedGenes database; P, subcellular localization was confirmed only by proteomic research (compliled by PPDB [40]) but not other means; T, subcellular localization was predicted by TargetP [41] but has not been confirmed by experiments.
Unambiguous Gene Expression Data Available in GeneChip for Essential Plastid-Targeted Protein-Encoding Genes
| Terminal phenotype | I [Preglobular] | II [Globular] | III [Transition] | IV [Cotyledon] | Unknown |
|---|---|---|---|---|---|
| Total | 19 | 38 | 15 | 35 | 1 |
| Expression analyses available | 18 | 33 | 13 | 30 | 1 |
| Expressed at all stages of embryo | 11 | 21 | 7 | 17 | 0 |
| Not detected in any stage of embryo | 2 | 2 | 3 | 6 | 1 |
| Not detected in preglobular stage | 4 | 6 | 3 | 8 | 1 |
| Not detected in globular stage | 6 | 5 | 3 | 9 | 1 |
| Not detected in heart stage | 3 | 2 | 3 | 8 | 1 |
| Not detected in linear cotyledon stage | 4 | 7 | 3 | 7 | 1 |
| Not detected in mature green stage | 5 | 11 | 6 | 11 | 1 |
Different compartments of Arabidopsis seeds were collected at different developmental stages and gene expression profile of these compartments were analyzed. For the 108 embryogenesis-essential, plastidic protein-encoding genes, 95 of them have unambiguous probe sets on Arabidopsis whole genome ATH1 GeneChip.
The five stages at which embryo samples were taken for analyses (Fig. ).
Genes Encoding Plastid Proteins Required for Embryogenesis whose Expression Data are Unavailable or Ambiguous on the Arabidopsis whole Genome ATH1 GeneChip
| At2g04842 |
| At3g06430 |
| At5g22800 |
| At5g26742 |
| At2g31530 |
| At3g55610[ |
| At4g23430 |
| At5g10330 |
| At5g63420 |
| At1g06145 |
| At1g21390 |
| At3g49170 |
| At5g03800 |
Expression in embryo was reported in the reference [86].
Genes Encoding Plastid Proteins Required for Embryogenesis that are Not Expressed in Embryos but in Other Seed Compartments
| Gene | Terminal Phenotype | Expression in non-embryo compartment(s) |
|---|---|---|
| At1g19800 | IV | Peripheral endosperm; Chalazal seed coat; Seed coat |
| At2g28880 | I | Micropylar endosperm; Peripheral endosperm; Chalazal endosperm; Chalazal seed coat |
| At2g37920 | III | Micropylar endosperm; Peripheral endosperm; Chalazal endosperm |
| At3g20440 | III | Peripheral endosperm |
| At4g30580 | IV | Peripheral endosperm |
| At4g33460 | IV | Peripheral endosperm; Seed coat |
Expression data for individual seed compartments is available at http://seedgenenetwork.net.