| Literature DB >> 26504573 |
Dongliang Du1, Nidhi Rawat2, Zhanao Deng2, Fred G Gmitter1.
Abstract
After the sequencing of citrus genomes, gene function annotation is becoming a new challenge. Gene coexpression analysis can be employed for function annotation using publicly available microarray data sets. In this study, 230 sweet orange (Citrus sinensis) microarrays were used to construct seven coexpression networks, including one condition-independent and six condition-dependent (Citrus canker, Huanglongbing, leaves, flavedo, albedo, and flesh) networks. In total, these networks contain 37 633 edges among 6256 nodes (genes), which accounts for 52.11% measurable genes of the citrus microarray. Then, these networks were partitioned into functional modules using the Markov Cluster Algorithm. Significantly enriched Gene Ontology biological process terms and KEGG pathway terms were detected for 343 and 60 modules, respectively. Finally, independent verification of these networks was performed using another expression data of 371 genes. This study provides new targets for further functional analyses in citrus.Entities:
Year: 2015 PMID: 26504573 PMCID: PMC4595991 DOI: 10.1038/hortres.2015.26
Source DB: PubMed Journal: Hortic Res ISSN: 2052-7276 Impact factor: 6.793
Figure 1Work flow used for networks construction and clustering in the present study.
Composition of the 230 microarrays according to the experiment conditions and organs.
| Epicotyls | Root | Stem | Leaves | Flower | Peel | Flavedo | Albedo | Vascular core | Flesh | Seed | Total | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Citrus canker | 0 | 0 | 0 | 30 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 30 |
| Huanglongbing | 0 | 3 | 3 | 12 | 0 | 0 | 12 | 0 | 12 | 12 | 4 | 58 |
| Other treatments | 12 | 0 | 0 | 2 | 0 | 0 | 6 | 0 | 0 | 0 | 0 | 20 |
| Control | 0 | 3 | 3 | 19 | 9 | 12 | 22 | 12 | 7 | 31 | 4 | 122 |
| Total | 12 | 6 | 6 | 63 | 9 | 12 | 40 | 12 | 19 | 43 | 8 | 230 |
Topological characteristics of seven coexpression networks
| Arrays | RT | Nodes | Edges | AD | ND | NCC | NBC | EBC | APL | DE | CC | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| All data | 230 | 0.882 | 1391 | 10 220 | 14.69 | 0.011 | 104 | 1046 | 9878 | 7.62 | 1.13 | 0.40 |
| HLB | 36 | 0.968 | 1400 | 5036 | 7.19 | 0.005 | 91 | 1040 | 4379 | 6.76 | 1.58 | 0.29 |
| Citrus canker | 30 | 0.938 | 1841 | 5565 | 6.05 | 0.003 | 211 | 1265 | 5174 | 8.79 | 1.50 | 0.23 |
| Leaves | 63 | 0.911 | 2263 | 11 535 | 10.19 | 0.005 | 158 | 1862 | 11 247 | 10.75 | 1.47 | 0.33 |
| Flavedo | 40 | 0.964 | 1425 | 2467 | 3.46 | 0.002 | 123 | 1133 | 2289 | 9.63 | 2.10 | 0.20 |
| Albedo | 31 | 0.947 | 1592 | 3725 | 4.68 | 0.003 | 162 | 996 | 2644 | 6.66 | 1.80 | 0.20 |
| Flesh | 43 | 0.948 | 1137 | 2431 | 4.28 | 0.004 | 90 | 911 | 2290 | 7.59 | 1.86 | 0.20 |
| Total | 6256 | 37 633 |
RT, RMT threshold; AD, average degree; ND, network density; NCC, number of connected components; NBC, nodes of biggest components; EBC, edges of biggest components; APL, average path length; DE, degree exponent; CC, clustering coefficient.
Figure 2Layout of the citrus “all data” coexpression network. The most overrepresented GO terms were shown for the 12 largest color-coded modules.
Intersection between edges/nodes (upper/lower triangular) of networks
| All data | HLB | Citrus canker | Leaves | Flavedo | Albedo | Flesh | |
|---|---|---|---|---|---|---|---|
| All data | 206 (1.35%) | 574 (3.64%) | 1305 (6%) | 158 (1.25%) | 45 (0.32%) | 28 (0.22%) | |
| HLB | 334 (11.97%) | 26 (0.25%) | 29 (0.18%) | 14 (0.19%) | 28 (0.32%) | 15 (0.2%) | |
| Citrus canker | 482 (14.91%) | 308 (9.5%) | 1271 (7.43%) | 15 (0.19%) | 10 (0.11%) | 3 (0.04%) | |
| Leaves | 665 (18.2%) | 274 (7.48%) | 925 (22.54%) | 59 (0.42%) | 21 (0.14%) | 9 (0.06%) | |
| Flavedo | 278 (9.87%) | 257 (9.1%) | 276 (8.45%) | 361 (9.79%) | 70 (1.13%) | 32 (0.65%) | |
| Albedo | 262 (8.78%) | 267 (8.92%) | 269 (7.84%) | 317 (8.22%) | 565 (18.73%) | 40 (0.65%) | |
| Flesh | 144 (5.7%) | 238 (9.38%) | 178 (5.98%) | 217 (6.38%) | 410 (16%) | 382 (14%) |
Network clustering and functional enrichment of modules
| Inflation | Efficiency | Mass fraction (%) | Area fraction (%) | Modules | M5 | SBM | GO | KEGG | |
| All data | 2 | 0.52 | 83.63 | 2.85 | 285 | 55 | 200 | 39 | 7 |
| HLB | 1.8 | 0.45 | 80.93 | 1.38 | 267 | 72 | 90 | 47 | 9 |
| Citrus canker | 1.8 | 0.56 | 85.90 | 0.81 | 447 | 88 | 103 | 58 | 9 |
| Leaves | 1.8 | 0.46 | 82.00 | 1.40 | 432 | 87 | 144 | 57 | 17 |
| Flavedo | 1.6 | 0.48 | 86.92 | 0.75 | 287 | 86 | 47 | 58 | 12 |
| Albedo | 1.8 | 0.54 | 83.03 | 0.84 | 402 | 73 | 102 | 37 | 3 |
| Flesh | 1.6 | 0.44 | 84.90 | 1.67 | 218 | 64 | 108 | 47 | 3 |
| Total | 2338 | 525 | 343 | 60 |
M5, number of modules containing more than five nodes; SBM, size of biggest module; GO, KEGG: number of modules with significantly enriched GO biological process terms, KEGG pathways.
Figure 3Graph showing coexpressed genes of the C. clementina homolog of citrus LOB1 (Ciclev10033956m) and SWEET1 (Ciclev10002276m) in canker-module 1.
Figure 4Genes and edges in canker-module 25.
Figure 5Genes and edges in flesh-module 19.
Figure 6Genes and edges in HLB-module 6.
Figure 7Distribution of absolute value of correlation coefficients.