Vasileios L Zogopoulos1, Apostolos Malatras2, Ioannis Michalopoulos1.
Abstract
Coexpressed genes tend to participate in related biological processes. Gene coexpression analysis allows the discovery of functional gene partners or the assignment of biological roles to genes of unknown function. In this protocol, we describe the steps necessary to create a gene coexpression tree for Arabidopsis thaliana, using publicly available Affymetrix CEL microarray data. Because the computational analysis described here is highly dependent on sample quality, we detail an automatic quality control approach. For complete details on the use and execution of this protocol, please refer to Zogopoulos et al. (2021).
Keywords: Bioinformatics; Genomics; Model Organisms; Plant sciences; Systems biology
Year: 2022 PMID: 35243384 PMCID: PMC8885756 DOI: 10.1016/j.xpro.2022.101208
Source DB: PubMed Journal: STAR Protoc ISSN: 2666-1667
Figure 1. Entity-relation diagram (ERD) in crow’s foot notation of the local MySQL database
Figure 2. RLE plots of two ArrayExpress studies
(A) All samples of the E-ATMX-1 study pass the quality control.
(B) The AtD3.CEL sample of the E-ATMX-19 study does not pass the quality control because its IQR > 0.4, so it must be removed from the pool of samples.
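The RLE criterion in the caption above can be illustrated with a minimal sketch. It assumes the usual definition of relative log expression (each log2 expression value minus the median of that probeset across all arrays) and applies the IQR > 0.4 cutoff stated in the caption. The sample names and expression values are invented toy data, and the function names are hypothetical. In the protocol these statistics would presumably come from an R package such as affyPLM, which is listed in the key resources table, so the Python below only illustrates the arithmetic.

```python
from statistics import median, quantiles

RLE_IQR_CUTOFF = 0.4  # threshold quoted in the Figure 2 caption


def rle_iqr(log_expr):
    """Return the IQR of the RLE values of each sample.

    log_expr maps a sample name to its list of log2 expression values,
    one per probeset, in the same probeset order for every sample.
    """
    samples = list(log_expr)
    n_probesets = len(log_expr[samples[0]])
    # Median of each probeset across all arrays (the "reference" array).
    probe_medians = [
        median(log_expr[s][i] for s in samples) for i in range(n_probesets)
    ]
    iqr = {}
    for s in samples:
        rle = [value - ref for value, ref in zip(log_expr[s], probe_medians)]
        q1, _, q3 = quantiles(rle, n=4, method="inclusive")
        iqr[s] = q3 - q1
    return iqr


# Toy data (invented): three arrays, four probesets.
toy = {
    "array_1.CEL": [8.0, 9.0, 10.0, 11.0],
    "array_2.CEL": [8.1, 9.0, 9.9, 11.0],
    "array_3.CEL": [7.0, 9.0, 11.0, 11.0],
}
iqr_by_sample = rle_iqr(toy)
failed_rle = [s for s, v in iqr_by_sample.items() if v > RLE_IQR_CUTOFF]
print(failed_rle)
```

With these toy values only `array_3.CEL` exceeds the cutoff and would be dropped.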
Figure 3. NUSE plots of two ArrayExpress studies
(A) All samples of the E-ATMX-7 study pass the quality control.
(B) The CHIP_313_B.CEL and CHIP_317_B.CEL samples of the E-ATMX-24 study do not pass the quality control because their median > 1.1, so they should be removed from the pool of samples.
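The NUSE criterion can be sketched in the same spirit. The example assumes the common definition of normalized unscaled standard error: the standard error of each probeset estimate on an array, divided by the median standard error of that probeset across all arrays. It then applies the median > 1.1 cutoff from the caption. The standard errors are invented toy numbers and the helper name is hypothetical. In practice they would come from a probe-level model fit, for example with the affyPLM package named in the key resources table.

```python
from statistics import median

NUSE_MEDIAN_CUTOFF = 1.1  # threshold quoted in the Figure 3 caption


def nuse_median(std_err):
    """Return the median NUSE value of each sample.

    std_err maps a sample name to its list of probeset standard errors,
    in the same probeset order for every sample.
    """
    samples = list(std_err)
    n_probesets = len(std_err[samples[0]])
    # Per-probeset median standard error across arrays.
    ref = [median(std_err[s][i] for s in samples) for i in range(n_probesets)]
    return {
        s: median(se / r for se, r in zip(std_err[s], ref)) for s in samples
    }


# Toy standard errors (invented): three arrays, three probesets.
toy_se = {
    "chip_1.CEL": [0.10, 0.20, 0.10],
    "chip_2.CEL": [0.10, 0.20, 0.10],
    "chip_3.CEL": [0.15, 0.30, 0.12],
}
nuse_by_sample = nuse_median(toy_se)
failed_nuse = [s for s, v in nuse_by_sample.items() if v > NUSE_MEDIAN_CUTOFF]
print(failed_nuse)
```

Here `chip_3.CEL` has a median NUSE of about 1.5 and is the only array flagged for removal.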
Selected fields of the metadata of the samples of series E-ATMX-31
| Source name | Characteristics [organism] | Characteristics [OrganismPart] |
|---|---|---|
| Shoot 3 | Arabidopsis thaliana | Shoot |
| Shoot 2 | Arabidopsis thaliana | Shoot |
| Shoot 1 | Arabidopsis thaliana | shoot |
| Cell culture 3 | Arabidopsis thaliana | cultured callus |
| Root 3 | Arabidopsis thaliana | root |
| Cell culture 2 | Arabidopsis thaliana | cultured callus |
| Root 2 | Arabidopsis thaliana | root |
| Cell culture 1 | Arabidopsis thaliana | cultured callus |
| Root 1 | Arabidopsis thaliana | root |
“Cell culture 1”, “Cell culture 2” and “Cell culture 3” samples should be deleted, as they are from cell cultures, while the rest of the samples should be accepted as they are based on single tissues (“shoot” or “root”).
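The tissue rule for E-ATMX-31 can be expressed as a small filter. This is only a sketch: the rows are transcribed from the table above, but the keyword list used to recognize cell cultures (`CULTURE_TERMS`) and the function name are assumptions made for illustration, not part of the protocol. Matching is case-insensitive because the metadata mixes "Shoot" and "shoot".

```python
# (Source name, Characteristics [OrganismPart]) pairs from the E-ATMX-31 table.
samples = [
    ("Shoot 3", "Shoot"),
    ("Shoot 2", "Shoot"),
    ("Shoot 1", "shoot"),
    ("Cell culture 3", "cultured callus"),
    ("Root 3", "root"),
    ("Cell culture 2", "cultured callus"),
    ("Root 2", "root"),
    ("Cell culture 1", "cultured callus"),
    ("Root 1", "root"),
]

# Hypothetical keywords that mark a sample as a cell culture.
CULTURE_TERMS = ("cultur", "callus")


def is_single_tissue(organism_part):
    """True when the organism-part annotation does not look like a cell culture."""
    part = organism_part.strip().lower()
    return not any(term in part for term in CULTURE_TERMS)


kept = [name for name, part in samples if is_single_tissue(part)]
removed = [name for name, part in samples if not is_single_tissue(part)]
print("kept:", kept)
print("removed:", removed)
```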
Selected fields of the metadata of the samples of series E-ATMX-20
| Source name | Characteristics [organism] | Characteristics [organism part] | Characteristics [genotype] |
|---|---|---|---|
| Zat10-OE-2 | Arabidopsis thaliana | Leaf | 35S:ZAT10 |
| wildtype-2 | Arabidopsis thaliana | leaf | wild type |
| wildtype-1 | Arabidopsis thaliana | leaf | wild type |
| wildtype-3 | Arabidopsis thaliana | leaf | wild type |
| Zat10-OE-1 | Arabidopsis thaliana | leaf | 35S:ZAT10 |
| Zat10-OE-3 | Arabidopsis thaliana | leaf | 35S:ZAT10 |
“Zat10-OE-1”, “Zat10-OE-2” and “Zat10-OE-3” samples should be deleted, as they come from mutated plants, while “wildtype-1”, “wildtype-2” and “wildtype-3” samples should be accepted, as they are based on wild-type plants.
Selected fields of the metadata of the samples of series E-GEOD-50526
| Source name | Characteristics [organism] | Characteristics [organism part] | Assay name | FactorValue [genotype] | FactorValue [infect] |
|---|---|---|---|---|---|
| | Arabidopsis thaliana | leaf | | dde2-2 | Alternaria brassicicola |
| | Arabidopsis thaliana | leaf | | dde2-2 | Alternaria brassicicola |
| | Arabidopsis thaliana | leaf | | dde2-2 | Alternaria brassicicola |
| | Arabidopsis thaliana | leaf | | dde2-2 | Alternaria brassicicola |
| | Arabidopsis thaliana | leaf | | dde2-2 | Alternaria brassicicola |
| | Arabidopsis thaliana | leaf | | dde2-2 | Alternaria brassicicola |
| | Arabidopsis thaliana | leaf | | ein2-1 | Alternaria brassicicola |
| | Arabidopsis thaliana | leaf | | ein2-1 | Alternaria brassicicola |
| | Arabidopsis thaliana | leaf | | ein2-1 | Alternaria brassicicola |
| | Arabidopsis thaliana | leaf | | ein2-1 | Alternaria brassicicola |
| | Arabidopsis thaliana | leaf | | ein2-1 | Alternaria brassicicola |
| | Arabidopsis thaliana | leaf | | ein2-1 | Alternaria brassicicola |
| | Arabidopsis thaliana | leaf | | sid2-2 | Alternaria brassicicola |
| | Arabidopsis thaliana | leaf | | sid2-2 | Alternaria brassicicola |
| | Arabidopsis thaliana | leaf | | sid2-2 | Alternaria brassicicola |
| | Arabidopsis thaliana | leaf | | sid2-2 | Alternaria brassicicola |
| | Arabidopsis thaliana | leaf | | sid2-2 | Alternaria brassicicola |
| | Arabidopsis thaliana | leaf | | sid2-2 | Alternaria brassicicola |
| | Arabidopsis thaliana | leaf | | wild type | Alternaria brassicicola |
| | Arabidopsis thaliana | leaf | | wild type | Alternaria brassicicola |
| | Arabidopsis thaliana | leaf | | wild type | mock |
| | Arabidopsis thaliana | leaf | | wild type | mock |
| | Arabidopsis thaliana | leaf | | wild type | mock |
| | Arabidopsis thaliana | leaf | | wild type | Alternaria brassicicola |
| | Arabidopsis thaliana | leaf | | wild type | Alternaria brassicicola |
| | Arabidopsis thaliana | leaf | | wild type | Alternaria brassicicola |
| | Arabidopsis thaliana | leaf | | wild type | mock |
| | Arabidopsis thaliana | leaf | | wild type | mock |
| | Arabidopsis thaliana | leaf | | wild type | mock |
All samples except “GSM1220702”, “GSM1220707”, “GSM1220712”, “GSM1220717”, “GSM1220722” and “GSM1220727” should be deleted, as they come from infected plants.
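As a rough check on the E-GEOD-50526 selection, the sketch below filters on the two factor-value columns together. It assumes that a sample is kept only when its genotype is "wild type" and its infection factor is "mock", which is an interpretation that combines this rule with the wild-type rule applied to E-ATMX-20. The rows reproduce the genotype and infection values of the table above by count, and the function name is hypothetical.

```python
# (FactorValue [genotype], FactorValue [infect]) pairs, counted from the table.
INFECTED = "Alternaria brassicicola"
rows = (
    [("dde2-2", INFECTED)] * 6
    + [("ein2-1", INFECTED)] * 6
    + [("sid2-2", INFECTED)] * 6
    + [("wild type", INFECTED)] * 5
    + [("wild type", "mock")] * 6
)


def keep_sample(genotype, infect):
    """Keep only untreated (mock) samples from wild-type plants (assumed rule)."""
    return (
        genotype.strip().lower() == "wild type"
        and infect.strip().lower() == "mock"
    )


kept_rows = [row for row in rows if keep_sample(*row)]
print(len(rows), "samples in the series,", len(kept_rows), "kept")
```

Six rows survive the filter, which matches the six sample accessions listed as exceptions above.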
Figure 4. Arabidopsis thaliana gene coexpression trees as viewed by Dendroscope
(A and B) Node labels are the AGI codes of the Arabidopsis thaliana genes. (A) Coexpression tree containing all Arabidopsis thaliana genes. The leaves highlighted in red denote the region where the CTL2 (AT3G16920) coexpression subtree is located. (B) Gene coexpression subtree containing CTL2 and its coexpressed genes.
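To give a sense of how a coexpression tree of this kind can be produced, here is a minimal sketch. It assumes that coexpression is measured by the Pearson correlation of expression profiles, that the distance between two genes is 1 minus that correlation, and that the tree is built by UPGMA (average-linkage) clustering and written in Newick format, which viewers such as Dendroscope read. None of these choices is stated in the text above; the gene labels and expression values are toy data, and the cubic-time loop is for illustration only, not for a whole genome.

```python
from itertools import combinations
from math import sqrt
from statistics import mean


def pearson(x, y):
    """Pearson correlation coefficient of two equally long profiles."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def upgma_newick(expr):
    """Cluster genes by UPGMA on 1 - Pearson r and return a Newick string."""
    genes = sorted(expr)
    size = {g: 1 for g in genes}        # number of leaves under each cluster
    height = {g: 0.0 for g in genes}    # height of each cluster's root node
    dist = {
        frozenset((a, b)): max(0.0, 1.0 - pearson(expr[a], expr[b]))
        for a, b in combinations(genes, 2)
    }
    while len(size) > 1:
        # Closest pair of clusters; names break ties deterministically.
        pair = min(dist, key=lambda k: (dist[k], sorted(k)))
        a, b = sorted(pair)
        h = dist[pair] / 2.0
        merged = f"({a}:{h - height[a]:.3f},{b}:{h - height[b]:.3f})"
        # Size-weighted average distance from the new cluster to the others.
        for c in size:
            if c not in pair:
                dist[frozenset((merged, c))] = (
                    size[a] * dist[frozenset((a, c))]
                    + size[b] * dist[frozenset((b, c))]
                ) / (size[a] + size[b])
        dist = {k: v for k, v in dist.items() if a not in k and b not in k}
        size[merged] = size[a] + size[b]
        height[merged] = h
        del size[a], size[b]
    return next(iter(size)) + ";"


# Toy expression profiles (invented) across four samples.
toy_expr = {
    "geneA": [1.0, 2.0, 3.0, 4.0],
    "geneB": [2.0, 4.0, 6.0, 8.0],
    "geneC": [4.0, 3.0, 2.0, 1.0],
}
tree = upgma_newick(toy_expr)
print(tree)
```

In the toy data `geneA` and `geneB` are perfectly correlated, so they are joined first and appear as siblings, while the anticorrelated `geneC` attaches at the root.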
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| ( | ||
| ( | ||
| ( | ||
| Full list of Microarray Samples used | ( | |
| MySQL Workbench | ( | |
| Single Channel Array Normalization (SCAN) | ( | |
| Brainarray Custom CDF | ( | |
| Array Power Tools | ( | |
| InterMineR | ( | |
| simpleaffy | ( | |
| affyQCReport | ( | |
| affyPLM | ( | |
| oligo | ( | |
| Phangorn | ( | |
| Newick Utilities | ( | |
| Dendroscope | ( | |
| Thalemine | ( | |
| String | ( | |
| WebGestalt | ( | |
| Intel(R) Core(TM) i7-8700K CPU @ 3.7 GHz (6 cores × 2 threads) | Intel Corporation | SR3QR |
| 4 TB WD Purple Surveillance Hard Drive | Western Digital | WD40PURZ |
| 2 × VENGEANCE® LPX 32 GB (2 × 16 GB) DDR4 DRAM 2400MHz C14 Memory Kits | Corsair | CMK32GX4M2A2400C14 |