| Literature DB >> 25881190 |
Hyun Uk Kim1, Grace Qianhong Chen2.
Abstract
BACKGROUND:Entities:
Mesh:
Substances:
Year: 2015 PMID: 25881190 PMCID: PMC4381405 DOI: 10.1186/s12864-015-1413-8
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary of sequencing reads
| Number of reads | 1,568,943 |
| Number of bases | 651,314,783 |
| Assembled reads | 496,246 |
| Partially assembled reads | 109,151 |
| Singleton | 35,573 |
| Repeat | 901,748 |
| Outlier+ | 15,725 |
| Reads too short to assemble | 10,500 |
+The read was identified by the GS De Novo Assembler as problematic.
Summary of assembly
|
|
|
| |
|---|---|---|---|
| Number | 26,995 | 38,002 | 21,912 |
| Average contig count | 1.7 | 2.132 | |
| Largest contig count | 2,257 | 16 | |
| Number with one contig | 22,649 | 22,994 | |
| Average isotig count | 1.4 | ||
| Largest isotig count | 97 | ||
| Number with one isotig | 22,698 | ||
| Number of bases (nt) | 37,527,552 | ||
| Average isotig size (bp) | 987.515 | ||
| N50# isotig size (bp) | 1,310 | ||
| Largest isotig size (bp) | 11,074 |
┼The number of valid singleton after SeqClean and Lucy.
#The half size of all bases reside in isotigs.
Figure 1Gene ontology (GO) annotation of all detected isotigs. (A) Overall distribution of 38,002 isotigs into major GO categories. (B) The histogram shows the percentage (x axis) of isotigs within functional subdivisions (y axis) of each of the three major GO categories.
Figure 2Gene ontology (GO) annotation of all detected singletons. (A) Overall distribution of 21,912 singletons into major GO categories. (B) The histogram shows the percentage (x axis) of isotigs within functional subdivisions (y axis) of each of the three major GO categories.
List of gene products for the 50 most abundant isotigs
|
|
|
|
|
|---|---|---|---|
| isogroup00026 | 97 | HSD5 (hydroxysteroid dehydrogenase 5) | AT4G10020 |
| isogroup00055 | 64 | ATAILP1 | AT5G19140 |
| isogroup00045 | 55 | OLEOSIN 1 | AT4G25140 |
| isogroup00062 | 49 | GAMMA-VPE, vacuolar-processing | AT4G32940 |
| isogroup00056 | 46 | FDH (formate dehydrogenase) | AT5G14780 |
| isogroup00010 | 44 | FAD3 | AT2G29980 |
| isogroup00025 | 41 |
| AAK62348.1 |
| isogroup00041 | 37 | PRXR1 | AT4G21960 |
| isogroup00046 | 36 | HSC70-1 | AT5G02500 |
| isogroup00048 | 36 | OLEOSIN 4 | AT3G27660 |
| isogroup00071 | 36 | unknown protein | AT1G62220 |
| isogroup00037 | 35 | CESA3 (cellulose synthase 3) | AT5G05170 |
| isogroup00047 | 35 | Class-II DAHP synthetase family | AT1G22410 |
| isogroup00072 | 35 | UAP56B (homolog of human UAP56 B) | AT5G11200 |
| isogroup00073 | 35 | UBQ1 (ubiquitin extension protein 1) | AT3G52590 |
| isogroup00042 | 34 | ATNTT2 | AT1G15500 |
| isogroup00079 | 34 | splicing factor PWI domain-containing protein | AT1G60200 |
| isogroup00001 | 32 | 12S seed storage protein CRU4 | AT5G44120 |
| isogroup00049 | 32 | FDM1 (factor of DNA methylation 1) | AT1G15910 |
| isogroup00050 | 32 | XYP1 (xylogen protein 1) | AT5G64080 |
| isogroup00043 | 31 | HAI1 (highly ABA-induced PP2C gene 1) | AT5G59220 |
| isogroup00063 | 31 | RING-type Zinc finger protein | AT5G25560 |
| isogroup00023 | 30 | ALPHA-TIP (alpha-tonoplast intrinsic protein) | AT1G73190 |
| isogroup00057 | 30 | ACO2 (ACC oxidase 2) | AT1G62380 |
| isogroup00086 | 30 | CESA1 (cellulose synthase 1) | AT4G32410 |
| isogroup00080 | 26 | BBD2 (bifuctional nuclease in basal defense response 2) | AT1G19660 |
| isogroup00027 | 25 | Heat shock protein 81-2, | AT5G56030 |
| isogroup00030 | 25 | BGLU37 (beta glucosidase 37) | AT5G25980 |
| isogroup00044 | 24 | NLP4 (nin-like protein 4) | AT1G20640 |
| isogroup00058 | 24 | TUA6 (tubulin alpha-6) | AT4G14960 |
| isogroup00074 | 23 | Pyruvate kinase family | AT5G08570 |
| isogroup00038 | 22 | UBQ1 (ubiquitin extension protein 1) | AT3G52590 |
| isogroup00087 | 22 | PEPR1 (PEP1 receptor 1) | AT1G73080 |
| isogroup00088 | 22 | Methionine synthesis 1 | AT5G17920 |
| isogroup00033 | 21 | GTP binding Elongation factor Tu family | AT5G60390 |
| isogroup00075 | 21 | TCTP (translationally controlled tumor protein) | AT3G16640 |
| isogroup00081 | 20 | DRP2B (dynamin related protein 2B) | AT1G59610 |
| isogroup00089 | 20 | AOAT2 (alanine-2-oxoglutarate aminotransferase 2) | AT1G70580 |
| Isogroup00011 | 19 | 2S seed storage protein 3 | AT4g27160 |
| isogroup00090 | 19 | Papain family cysteine protease | AT4G16190 |
| isogroup00064 | 17 | Phosphoinositide phosphatase family | AT1G17340 |
| isogroup00065 | 17 | ALAAT1(alanine aminotransferase 1) | AT1G17290 |
| isogroup00040 | 16 | GTP binding Elongation factor Tu family protein | AT5G60390 |
| isogroup00066 | 16 | Leucine-rich repeat (LRR) family protein | AT5G45510 |
| isogroup00083 | 16 | HSP70 (heat shock protein 70) | AT3G12580 |
| isogroup00067 | 15 | BGLU37 ( beta glucosidase 37) , | AT5G25980 |
| isogroup00068 | 15 | GTP binding Elongation factor Tu family | AT5G60390 |
| isogroup00078 | 15 | F-box/RNI-like superfamily protein | AT4G15475 |
| isogroup00053 | 14 | Copper amine oxidase family | AT4G12290 |
| isogroup00061 | 13 | AAE17(acyl-activating enzyme 17) | AT5G23050 |
Number of genes and transcripts involved in acyl-lipid metabolism
|
|
|
|
|
|
|---|---|---|---|---|
| Fatty acid synthesis | 43 | 29 | 106 | 4 |
| Fatty acid elongation, Desaturation & export from plastid | 27 | 22 | 62 | 3 |
| Prokaryotic galactolipid, Sulfolipid, & Phospholipid synthesis | 61 | 39 | 91 | 2 |
| Eukaryotic galactolipid & Sulfolipid synthesis | 36 | 27 | 62 | 2 |
| Triacylglycerol biosynthesis | 87 | 59 | 483 | 8 |
| Eukaryotic phospholipid synthesis & editing | 75 | 53 | 122 | 2 |
| Triacylglycerol & fatty acid degradation | 61 | 41 | 119 | 3 |
| Fatty acid elongation & wax biosynthesis | 246 | 102 | 313 | 3 |
| Sphingolipid biosynthesis | 37 | 30 | 67 | 2 |
| Mitochondria fatty acid & lipoic acid synthesis | 18 | 11 | 33 | 3 |
| Mitochondria phospholipid synthesis | 24 | 17 | 34 | 2 |
| Lipid trafficking | 6 | 5 | 9 | 2 |
| Cutin synthesis & transport | 112 | 49 | 152 | 3 |
| Suberin synthesis & transport | 39 | 25 | 57 | 2 |
| Oxylipin metabolism | 69 | 36 | 66 | 2 |
| Phospholipid signaling | 125 | 70 | 213 | 3 |
| Total | 1066 | 615 | 1989 | 3 |
*Number of genes obtained from Arabidopsis acyl-lipid metabolism database [58].
List of expressed genes involved in fatty acid and TAG biosynthesis in lesquerella seed
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|
|
| ||||||
|
|
| At5g16390 | isotig19563 | 88 | 7 | P |
|
| At5g15530 | isotig07553 | 89 | 3 | F | |
|
| At2g38040 | isotig03259 | 90 | 17 | F | |
|
| At2g30200 | isotig17972 | 93 | 5 | P | |
|
| At5g27200 | isotig07684 | 89 | 6 | F | |
|
| At1g74960 | isotig15566 | 91 | 6 | F | |
|
| At1g24360 | isotig08993 | 91 | 5 | F | |
|
| At5g10160 | isotig23452 | 88 | 9 | P | |
|
| At2g05990 | isotig17121 | 93 | 4 | F | |
|
| At2g43710 | isotig11458 | 91 | 6 | F | |
|
| At3g25110 | isotig17885 | 89 | 1 | F | |
|
| At3g25110 | isotig17382 | 90 | 1 | F | |
|
| At1g08510 | isotig09645 | 88 | 4 | P | |
|
| At2g04350 | isotig15424 | 90 | 1 | F | |
|
| At1g77590 | isotig15369 | 89 | 1 | P | |
| Endoplasmic reticulum-hydroxylase, desaturase, and elongase |
| AF016103 | isotig27455 | 95 | 7 | F |
|
| At3g12120 | isotig17487 | 88 | 2 | F | |
|
| At2g29980 | isotig17127 | 92 | 1 | F | |
|
| At2g29980 | isotig00061 | 84 | 58 | F | |
|
| AF367052 | isotig02742 | 97 | 28 | F | |
| Acyl-CoA- dependent TAG synthesis in Kennedy pathway |
| At5g60620 | isotig18564 | 94 | 2 | P |
|
| At3g57650 | isotig03872 | 92 | 2 | F | |
|
| At2g19450 | isotig11157 | 91 | 2 | P | |
|
| At2g19450 | isotig11156 | 90 | 1 | P | |
|
| At3g51520 | isotig19956 | 87 | 1 | F | |
|
| At1g48300 | isotig08903 | 91 | 3 | F | |
| PC-mediated TAG synthesis |
| At1g12640 | isotig03769 | 91 | 10 | F |
|
| At1g63050 | isotig16468 | 92 | 1 | P | |
|
| At5g13640 | isotig08780 | 92 | 1 | F | |
|
| At5g13640 | isotig08781 | 90 | 15 | P | |
|
| At3g44830 | isotig02595 | 86 | 12 | P | |
|
| At3g15820 | isotig25038 | 89 | 1 | P |
Figure 3Predicted fatty acid and TAG biosynthetic pathways in seeds. Numbers after the gene are the numbers of isotigs for the corresponding genes in the 454 analysis of the seed cDNA library. The dashed arrows in the plastid fatty acid biosynthetic pathway indicate one or more cycles of acyl-chain elongation that is initiated by 3-ketoacyl-ACP synthase (KAS) I or II. The yellow circle delineates reactions in acyl-CoA pools. The bright green rectangle delineates reactions involving desaturation or hydroxylation of PC. The acyl-CoA dependent Kennedy pathway is indicated with red arrows. PC-mediated TAG synthesis pathways are indicated with green (via PDAT), blue (via PDCT) and orange (via LPCAT) arrows. Abbreviations: BCCP, biotin carboxyl carrier protein subunit of acetyl-CoA carboxylase; BC, biotin carboxylase subunit of acetyl-CoA carboxylase; α-CT, α-carboxyltransferase subunit of acetyl-CoA carboxylase; ACP, acyl carrier protein; MCMT, malonly-CoA ACP transferase; KAR, 3-ketoacyl-ACP reductase; HAD, 3-hydroxyacyl-ACP dyhydratase; ER, 2-enoyl-ACP reductase; FAB2/SAD, stearoyl-ACP desaturase; FatA, acyl-ACP thioesterase A; FatB, acyl-ACP thioesterase B; LACS, long-chain acyl-CoA synthase; FAH12, Δ12 oleic acid hydroxylase; FAD2, Δ12 oleic acid desaturase; FAD3, Δ15 (ω-3) linoleic acid desaturase; KCS3, 3-ketoacyl-CoA synthase 3; GPAT9, glycerol 3-phosphate acyltransferase 9 ; LPAT2, lysophosphatidic acid acyltransferase 2; PAPase: phosphatidic acid phosphatase; LPCAT, lysophosphatidylcholine acyltransferase 1 and 2; DGAT, diacylglycerol acyltransferase1 and 2; PDAT, phospholipid:diacylglycerol acyltransferase; G3P, glycerol-3-phosphate; LPA, lysophosphatidic acid; PA, phosphatidic acid; LPC, lysophosphatidylcholine; PC, phosphatidylcholine; DAG, diacylglycerol; TAG, triacylglycerol.
Figure 4Expression of (A), (B), (C) and (D) in major organs and developing seeds determined by qPCR. Abbreviated names for the genes are defined in Figure 3. Each data point represents the mean ± SD of three replicates.
Figure 5Characterization of two . (A) Amino acid sequences alignment among Arabidopsis FAD3 (AtFAD3:At2g29980), PfFAD3-1 (isotig17127) and PfFAD3-2 (isotig00061). Black shading indicates identical amino acids. No shading indicates dissimilar amino acids. Dashes indicate gaps in alignment. (B) Phylogenetic tree showing relationships among FAD3 and FAD7/8 desaturases. Lesquerella sequences are shaded. This tree was constructed by DNASTAR MegAlign program using the ClustalW method. Bootstrap used trials = 1000, seed = 111. At, Arabidopsis, Bn, Brassica napus FAD3a (AFJ19040), FAD3b (AFJ19037) and FAD3c (AFJ19034). Arabidopsis FAD7 (At3g11170) and FAD8 (At5g05580) used for outgroup. Expression of PfFAD3-1 (C) and PfFAD3-2 (D) in developing seeds and major organs determined by qPCR. Abbreviated names for the genes are described in Figure 3. Each data point represents the mean ± SD of three replicates. ND = not detected.
Figure 6Characterization of three types . (A) Phylogenetic tree showing relationships among DGAT1, DGAT2, and DGAT3 from Physaria fendleri (Pf), Arabidopsis (At), Brassica napus (Bn), castor bean (Rc), peanut (Ah). PfDGAT1-1 (isotig11157), PfDGAT1-2 (isotig11156), PfDGAT2 (isotig19956), PfDGAT3 (isotig08903), AtDGAT1 (At2g19450), AtDGAT2 (At3g51520), AtDGAT3 (At1g48300), BnDGAT1-1 (AIA77019), BnDGAT1-2 (AIA67020), BnDGAT1-3 (AFM31260), BnDGAT1-4 (AAF64065), RcDGAT1 (XP_002514132), RcDGAT2 (XP_002528531), RcDGAT3 (XP_002519339), AhDGAT3 (AAX62735). The tree was constructed as described in Figure 5B. (B-E) Expression of PfDGAT1-1, PfDGAT1-2, PfDGAT2, and PfDGAT3 in developing seeds and major organs determined by qPCR. Abbreviated names for the genes are described in Figure 3. Each data point represents the mean ± SD of three replicates.
Figure 7Characterization of two (A) Phylogenetic tree showing relationships among LPCAT1 and LPCAT2 from Physaria fendleri (Pf), Arabidopsis (At), Bernardia pulchella (Bp), castor bean (Rc), Hiptage benghalensis (Hb), Momordica charantia (Mc), Carthamus tinctorius (Ct). PfLPCAT1 (isotig03769), PfLPCAT2 (isotig16468), BpLPCAT1 (AHE80984), BpLPCAT2 (AHE80985), RcLPCAT (AHF20951), HpLPCAT1a (AHE80986), HpLPCAT2 (AHE80988), McLPCAT (AGL81301), CtLPCAT (AHE80989). The tree was constructed as described in Figure 5B. (B, C) Expression of PfLPCAT1 and PfLPCAT2 in developing seeds and major organs determined by qPCR. Abbreviated names for the genes are described in Figure 3. Each data point represents the mean ± SD of three replicates.
Figure 8Characterization of three (A) Phylogenetic tree of PDATs from Physaria fendleri (Pf), Arabidopsis (At), castor bean (Rc), Saccharomyces cerevisiae (Sc). PfPDAT1-1, PfPDAT1-2, PfPDAT2, AtPDAT1(At5g13640), AtPDAT2(At3g44830), RcPDAT1-1 (HM807520), RcPDAT1-2 (HM807521), RcPDAT2 (HM807522), ScPDAT (NM_001183185). The tree was constructed as described in Figure 5B. (B-D) Expression of PfPDAT1-1, PfPDAT1-2, and PfPDAT2 in developing seeds and major organs determined by qPCR.Abbreviated names for the genes are described in Figure 3. Each data point represents the mean ± SD of three replicates.