| Literature DB >> 32426691 |
Jean-Michel Michno1,2, Junqi Liu2, Joseph R Jeffers3, Robert M Stupar1,2, Chad L Myers1,3.
Abstract
Genome-wide association studies (GWAS) have proven to be a valuable approach for identifying genetic intervals associated with phenotypic variation in Medicago truncatula. These intervals can vary in size, depending on the historical local recombination. Typically, significant intervals span numerous gene models, limiting the ability to resolve high-confidence candidate genes underlying the trait of interest. Additional genomic data, including gene co-expression networks, can be combined with the genetic mapping information to successfully identify candidate genes. Co-expression network analysis provides information about the functional relationships of each gene through its similarity of expression patterns to other well-defined clusters of genes. In this study, we integrated data from GWAS and co-expression networks to pinpoint candidate genes that may be associated with nodule-related phenotypes in M. truncatula. We further investigated a subset of these genes and confirmed that several had existing evidence linking them nodulation, including MEDTR2G101090 (PEN3-like), a previously validated gene associated with nodule number.Entities:
Keywords: Medicago; bioinformatics; co‐expression; genome‐wide association studies; nodulation
Year: 2020 PMID: 32426691 PMCID: PMC7229696 DOI: 10.1002/pld3.220
Source DB: PubMed Journal: Plant Direct ISSN: 2475-4455
FIGURE 1GWAS and co‐expression pipeline using Camoco. (a) Manhattan plot represents DNA markers used as input for Camoco, bold black circles represent a subset of markers used for illustrative purposes. (b) Regions along a chromosome from previously selected markers are represented as gray bars, genes are represented as black rectangles. Black circles represent a SNP from (a). (c) Genes from previously identified intervals are then selected from the co‐expression network for per‐gene network density measurements. Colored lines represent the strength of co‐expression between two genes in a co‐expression network. Wider lines represent gene pairs that are more strongly co‐expressed. The red box represents the current gene being measured for density. (d) Per‐gene density measurement of random subnetworks equal in size to the testing set. (e) Other GWAS traits and networks used for analysis
FIGURE 2Nodule_A discoverable genes in the Nodule‐D2 network. Chromosome‐centric diagram of the connectivity of discoverable genes (FDR < 0.35), focused on co‐expression neighbors of the candidate, MEDTR2G101090, within the Nodule‐D2 network for the Nod_A trait. Gray circles represent GWAS markers, colored circles represent genes, with MEDTR2G101090 in red, its first neighbors in orange, and other discoverable genes in purple. Gray lines represent co‐expression between genes (minimum Z‐Score of 2.5); the wider the line, the stronger the co‐expression between genes. Yellow lines represent the connections of MEDTR2G101090 to its first neighbors
FIGURE 3Co‐expression/GWAS discoverable gene summary. Number of discoverable genes (FDR < 0.35) obtained from co‐expression/GWAS integration. Colors represent the window size parameters used for our analysis. Each square in the grid represents a different co‐expression network (horizontal axis) trait (vertical axis) combination
List of genes that were discoverable across all six parameters (10, 20, and 50 kb and 1, 2, and 5 flanking genes) for the Nod_A phenotype using the Nodule‐D2 network
| Gene | Number of connections ( | SNP_position | GWAS ‐log10( | Rank (out of 292) | Annotation |
|---|---|---|---|---|---|
| MEDTR2G101090 | 8 | chr2:43448968 | 7.591607 | 1 | Drug resistance transporter‐like ABC domain protein |
| MEDTR8G074920 | 4 | chr8:31665171 | 6.753532 | 11 | Receptor‐like kinase theseus protein |
| MEDTR2G100280 | 4 | chr2:43061039 | 6.743592 | 12 | RNA exonuclease‐like protein |
| MEDTR4G018770 | 4 | chr4:5776217 | 6.509395 | 19 | GDP‐mannose transporter GONST3 |
| MEDTR3G026650 | 6 | chr3:8183997 | 6.177657 | 53 | GDP‐fucose protein O‐fucosyltransferase |
| MEDTR4G059870 | 4 | chr4:22091245 | 5.827601 | 114 | C2H2 and C2HC zinc finger protein, putative |
| MEDTR4G019910 | 4 | chr4:6362962 | 5.7494 | 139 | SnoaL‐like domain protein |
| MEDTR5G076270 | 1 | chr5:32504251 | 5.707181 | 156 | Auxin response factor 2 |
| MEDTR6G084440 | 2 | chr6:31605458 | 5.678609 | 161 | DUF1666 family protein |
| MEDTR2G090960 | 9 | chr2:39088095 | 5.657328 | 171 | TCP family transcription factor |
| MEDTR4G104350 | 2 | chr4:43099392 | 5.512627 | 210 | DNA polymerase III subunit gamma/tau |
| MEDTR7G102310 | 6 | chr7:41285876 | 5.493289 | 220 | Rhodanese/cell cycle control phosphatase superfamily protein |
| MEDTR5G093580 | 5 | chr5:40860194 | 5.415629 | 252 | Co‐factor for nitrate, reductase and xanthine dehydrogenase |
| MEDTR3G019490 | 5 | chr3:5482913 | 5.410043 | 257 | S‐locus lectin kinase family protein |
| MEDTR7G109130 | 16 | chr7:44591633 | 5.381151 | 270 | P‐loop nucleoside triphosphate hydrolase superfamily protein |
| MEDTR8G027385 | 1 | chr8:9668134 | 5.239786 | 350 | Endomembrane Family Protein |
| MEDTR4G126160 | 11 | chr4:52449376 | 5.231223 | 358 | Cytokinin oxidase/dehydrogenase‐like protein |
| MEDTR7G076250 | 5 | chr7:28686036 | 5.221541 | 366 | Zinc finger, C3HC4 type (RING finger) protein |
| MEDTR4G058970 | 10 | chr4:21744831 | 5.102555 | 448 | Homeodomain leucine zipper protein |
| MEDTR7G075580 | 13 | chr7:28296141 | 5.067043 | 470 | Cytochrome P450 family protein |
| MEDTR1G075610 | 5 | chr1:33462984 | 5.06158 | 474 | Cyclin‐dependent kinase |
| MEDTR2G096950 | 8 | chr2:41430755 | 5.050944 | 485 | Kinase 1B |
| MEDTR1G070455 | 9 | chr1:31235133 | 5.044264 | 491 | WRKY transcription factor |
| MEDTR3G111650 | 10 | chr3:52196531 | 5.019337 | 507 | Hypothetical protein |
| MEDTR1G080690 | 0 | chr1:35874811 | 5.009149 | 517 | TPX2 (targeting protein for Xklp2) family protein |
FIGURE 4Overlap of Nod_A candidates in the Nodule‐D2 network. Candidate genes for the Nodule‐D2 network for the Nod_A trait. Purple circles represent genes, and gray lines represent co‐expression between genes (minimum Z‐Score of 2.5). The larger the circle, the more connections it has with other genes. The wider the line, the stronger the co‐expression between genes
List of genes that were discoverable for at least five different parameters across all networks for the Nod_B trait
| Gene | Number of hits across parameters and terms | Annotation |
|---|---|---|
| MEDTR4G027195 | 10 | N/A |
| MEDTR4G035980 | 10 | Pectinesterase/pectinesterase inhibitor |
| MEDTR1G012530 | 9 | TPX2 (targeting protein for Xklp2) family protein |
| MEDTR4G073400 | 9 | Synaptotagmin‐1‐related |
| MEDTR2G073540 | 8 | Cysteine‐rich RLK (receptor‐like kinase) protein |
| MEDTR1G028960 | 6 | Glycolipid transfer protein (GLTP) family protein |
| MEDTR1G037520 | 5 | N/A |
| MEDTR1G040105 | 5 | Methylenetetrahydrofolate reductase |
| MEDTR2G048855 | 5 | Pentatricopeptide (PPR) repeat protein |
| MEDTR2G090960 | 5 | TCP family transcription factor |
| MEDTR2G450720 | 5 | SAM domain (sterile alpha motif) protein, putative |
| MEDTR3G088820 | 5 | PPR containing plant‐like protein |
| MEDTR4G087510 | 5 | O‐acetylserine (thiol) lyase |
| MEDTR5G053950 | 5 | Allene oxide cyclase |
| MEDTR5G065080 | 5 | PURINE permease |
| MEDTR5G094290 | 5 | Tubulin‐folding cofactor A |
| MEDTR6G023600 | 5 | SHORT‐chain dehydrogenase/reductase |
| MEDTR6G048290 | 5 | PPPDE thiol peptidase family protein, putative |
| MEDTR7G039370 | 5 | Origin recognition complex subunit 6 |
| MEDTR8G432620 | 5 | Methyltransferase |