| Literature DB >> 33522607 |
Rajiv K Tripathi1, Olivia Wilkins1.
Abstract
Global warming poses major challenges for plant survival and agricultural productivity. Thus, efforts to enhance stress resilience in plants are key strategies for protecting food security. Gene regulatory networks (GRNs) are a critical mechanism conferring stress resilience. Until recently, predicting GRNs of the individual cells that make up plants and other multicellular organisms was impeded by aggregate population scale measurements of transcriptome and other genome-scale features. With the advancement of high-throughput single cell RNA-seq and other single cell assays, learning GRNs for individual cells is now possible, in principle. In this article, we report on recent advances in experimental and analytical methodologies for single cell sequencing assays especially as they have been applied to the study of plants. We highlight recent advances and ongoing challenges for scGRN prediction, and finally, we highlight the opportunity to use scGRN discovery for studying and ultimately enhancing abiotic stress resilience in plants.Entities:
Keywords: abiotic stress; climate change; gene regulatory network; heat stress; high throughput sequencing; resilience; single cell; transcription
Mesh:
Year: 2021 PMID: 33522607 PMCID: PMC8359182 DOI: 10.1111/pce.14012
Source DB: PubMed Journal: Plant Cell Environ ISSN: 0140-7791 Impact factor: 7.228
FIGURE 1General workflow for single cell sequencing assays. (a) Tissues or organs are dissociated into individual cells through the isolation of protoplasts (small green circles); (b) the protoplasts are loaded into a microfluidics system that encapsulates individual protoplasts (small green circles) with reagents for labelling transcripts with distinct barcodes (larger multi‐coloured circles) which identify the cell from which the transcript originated, other barcodes such as UMIs may be added through this process as well; (c) the barcoded transcripts are then pooled and sequenced using a short read technology; (d) sequencing reads are then processed to assign each transcript to a cell of origin based on the barcode sequence added during library preparation; (e) the transcriptomes of all cells undergo dimension reduction (e.g., tSNE or UMAP) whereby cells with similar transcriptome profiles will be plotted closer together in two‐dimensional space while those with less similar transcriptomes will be plotted farther apart, and clusters of cells with similar transcriptomes can be identified algorithmically. In this example, each point on the plot represents a single cell and the colour of the point represents the cluster to which that cell has been assigned. (f) Clusters of cells may be characterized as a known cell types based on the abundance of known marker genes or on overall similarity to the transcriptomes of established cell types; cell clusters may also be described as unknown or novel if no known markers match the observed transcriptome profiles. In this example, cells in the reconstructed tissue are coloured to reflect the hypothetical transcriptome clusters identified in panel (e) [Colour figure can be viewed at wileyonlinelibrary.com]
Summary of high throughput scRNA‐seq assays of Arabidopsis roots
| Number of scRNA Libraries | Median Number of Transcripts/Cell | Median Number of Genes/Cell | Total Number of Genes | Clusters | |
|---|---|---|---|---|---|
| Ryu et al. ( | 7,522 | ~24,000 | ~5,000 | >22,000 | 9 |
| Jean‐Baptiste et al. ( | 3,121 | 6,152 | 2,445 | 22,419 | 11 |
| Denyer et al. ( | 4,727 | 14,758 | 4,276 | 16,975 | 15 |
| Shulse et al. ( | 12,198 | 2,291 | 1,216 | 25,324 | 17 |
| Zhang et al. ( | 7,695 | 4,556 | 1,875 | 23,161 | 24 |
| Wendrich et al. ( | 5,145 | – | 6,781 | 21,492 | 14 |
| Farmer, Thibivilliers, Ryu, Schiefelbein, and Libault ( | 10,608 nuclei | 1,384 | 1,126 | 24,740 | 21 |
The number of scRNA‐libraries is variously reported as the number of transcriptomes, the number of single cells, and the number of STAMPs. In all cases this is taken to mean the number of single cells for which high quality sequencing data were obtained.
Farmer et al. used snucRNA‐seq in this project. All data in this row relate to single nuclei.
Summary of experimental considerations for sgGRN prediction
| Tissue Selection | Stress Selection |
|---|---|
|
Does the tissue include diverse cell types or cell states? Can enough cells or nuclei be harvested sufficiently quickly to perform the assay? Is the tissue sensitive to the proposed treatment? |
How severe and long lasting will the stress be? When will the stress be applied in development? In the circadian period? Will the stress be applied in isolation, in combination or in series with other relevant stressors? |
|
|
|
|
Can protoplasts or nuclei be quickly and efficiently isolated from the tissue? Do isolated protoplasts or nuclei represent the full diversity of cells in the tissue? Is knowledge of the spatial arrangement of cells important for the analysis? |
Is multimodal single cell sequencing data available either in data repositories or in the proposed experiment? Does the proposed experiment incorporate time series sampling? What complementary data exists for the tissue and/or treatment? |
FIGURE 2Comparison of SCENIC and the Inferelator, two scGRN prediction algorithms. The SCENIC and Inferelator algorithms both use scRNA‐seq data as their primary input. SCENIC uses a random forest clustering algorithm to identify target genes that are co‐expressed with transcription factors. It then filters the putative regulatory clusters to retain only those whose targets are enriched for the occurrence of a priori known cisregulatory elements for the relevant transcription factor. Inferelator uses a multi‐task learning algorithm to learn scGRNs from transcriptome and complementary data types that are used to estimate the activity of transcription factors. The Inferelator can accept a wide variety of complementary inputs including Chromatin Immunoprecipitation ‐ sequencing (ChIP‐seq), protein–protein interaction and can explicitly use time series data. Both algorithms produce a matrix or graph of transcription factor target interactions [Colour figure can be viewed at wileyonlinelibrary.com]
FIGURE 3General workflow for using scGRNs for enhancing crop resilience. (a) The predicted scGRN will be a network of directed edges connecting transcription factors to the target genes they regulate. (b) post hoc assessment of the scGRN is used to prioritize regulatory targets for experimental characterization. This may include the identification of subnetworks that enriched in a cell type of interest; identification of co‐regulated genes that are enriched for biological processes of interest using Gene Set Enrichment Analysis or which are enriched for known cis‐regulatory elements; identification of regulatory interactions with corroborating experimental data, for example ChIP or yeast 1‐hybrid data; identification of co‐regulated genes that are strongly differentially expressed in response to the stress treatment; and characterization of transcription factors (TFs) that regulate many target genes. (c) Experimental characterization of prioritized components of the scGRN can be undertaken using genome editing approaches such as CRISPR. The coloured bars indicate different genomic regions that are targeted for editing. We anticipate that editing different interactions in the scGRN will influence plant resilience to varying degrees. (d) The most promising genome edited lines can then be tested in the field to determine the full effects of the modified scGRNs on stress resilience [Colour figure can be viewed at wileyonlinelibrary.com]