Literature DB >> 35924094

Gene expression profiling of NUAK kinase overexpression in Drosophila larval muscle development.

David Brooks1, Erika R Geisbrecht1.   

Abstract

Signal transduction pathways mediated by kinases control diverse biological outputs at the level of cells and tissues to regulate a diverse array of biological and developmental events. To gain insight into how muscle expression of the evolutionarily conserved NUAK kinase regulates the transcriptional landscape during Drosophila melanogaster development, we performed high-throughput sequencing of RNA from either whole larvae or dissected muscle fillets at the end of larval development. Raw data was generated using the Illumina HiSeq 4000 platform. After trimming and mapping to the Drosophila reference genome, differential gene expression and GO enrichment analysis were completed. Raw data are deposited in the NCBI Gene Expression Ominbus (GEO) repository under GEO accession GSE204894.
© 2022 The Author(s).

Entities:  

Keywords:  Drosophila melanogaster; NUAK; RNA-seq; kinase; larvae

Year:  2022        PMID: 35924094      PMCID: PMC9340529          DOI: 10.1016/j.dib.2022.108482

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table

Value of the Data

These data provide a comprehensive overview of transcriptomic changes that occur as a result of NUAK kinase overexpression in muscle tissue. This dataset and accompanying analysis will be useful to Drosophila researchers who study kinases and/or the intersection between muscle tissue and larval development. Results in these datasets can be further mined to directly identify individual genes and/or pathways that are misregulated in response to aberrant kinase signaling. Since data outputs are either from muscle tissue or whole larvae, the associated gene expression changes may provide information about inter-tissue communication during larval development.

Data Description

To investigate transcriptional changes that contribute to a reduction in larval and pupal body size upon expression of the serine/threonine NUAK kinase in muscle tissue [1], we used RNA-seq (Fig. 1). The bipartite UAS/Gal4 system [2] was used to express two independent NUAK transgenes (UAS-NUAK 548 or UAS-NUAK 550) or a control transgene (UAS-lacZ) in developing muscle with mef2-Gal4. RNA-seq was performed on either wandering whole larvae or dissected muscle fillets with three biological replicates of each genotype. RNA library preparation with polyA selection and Illumina HiSeq 2 × 150bp sequencing was performed. The total library size and mapping statistics are provided in Table 1, with more than 94% of total reads uniquely mapped.
Fig. 1

Schematic overview of experimental design. Either UAS-lacZ, UAS-NUAK 548, or UAS-NUAK 550 were expressed in muscle tissue under control of the mef2 promoter. Total RNA isolated from either whole larvae or dissected muscle carcasses were polyA-selected and used to prepare libraries for Illumina HiSeq sequencing. Mapping of trimmed reads was followed by analysis of relative transcript changes across different genotypes.

Table 1

RNA-Seq library size and mapping statistics.

Sample nameNumber ofraw readsNumber of uniquelymapped reads% Uniquemapped reads
mef2>lacZ whole larvae replicate 1236179832253780695.4
mef2>lacZ whole larvae replicate 2194234071856199595.5
mef2>lacZ whole larvae replicate 3207474621991542095.9
mef2>NUAK548 whole larvae replicate 1221366062120969695.8
mef2>NUAK548 whole larvae replicate 2208803471999357895.7
mef2>NUAK548 whole larvae replicate 3190262881832388596.3
mef2>NUAK550 whole larvae replicate 1173226521650406895.2
mef2>NUAK550 whole larvae replicate 2208067081974204594.9
mef2>NUAK550 whole larvae replicate 3257605072451419495.1
mef2>lacZ muscle carcass replicate 1198378271889900895.2
mef2>lacZ muscle carcass replicate 2229433062197586795.7
mef2>lacZ muscle carcass replicate 3199589011913908295.7
mef2>NUAK548 muscle carcass replicate 1206771371992700796.3
mef2>NUAK548 muscle carcass replicate 2208530042007639896.2
mef2>NUAK548 muscle carcass replicate 3191082871836823396.1
mef2>NUAK550 muscle carcass replicate 1215569842073796496.2
mef2>NUAK550 muscle carcass replicate 2259200572478024095.6
mef2>NUAK550 muscle carcass replicate 3210551212002099995.0
Schematic overview of experimental design. Either UAS-lacZ, UAS-NUAK 548, or UAS-NUAK 550 were expressed in muscle tissue under control of the mef2 promoter. Total RNA isolated from either whole larvae or dissected muscle carcasses were polyA-selected and used to prepare libraries for Illumina HiSeq sequencing. Mapping of trimmed reads was followed by analysis of relative transcript changes across different genotypes. RNA-Seq library size and mapping statistics.

The obtained RNA-seq data are appropriate for differential gene expression analysis

PCA analysis of log transformed counts from each genotype generally revealed the expected groupings among replicates within samples and across different sample groups (Fig. 2A). Overexpression of NUAK compared to control samples was confirmed by assessing the relative abundance of transcripts in Reads Per Kilobase Million (RPKM) (Fig. 2B) and verifies the suitability of this data for differential expression analysis. Differentially expressed genes were determined using DESeq2 and the Wald test was used to define significance as p-value < 0.05 and absolute log2 fold change > 1. Volcano plots confirm global transcriptional changes between control and NUAK overexpression conditions in both whole larvae and muscle samples (Fig. 2C).
Fig. 2

Assessment of inter- and intragroup variability for gene expression analysis. (A-C) Whole larvae samples. (D-F) Muscle carcass samples. (A,D) Principal component analysis (PCA) for three replicates of control (mef2>lacZ) or three replicates for each NUAK transgene (mef2>NUAK 548 or mef2>NUAK 550). X and Y axes correspond to the first two principle components. (B,E) Barplots depicting NUAK 548 or NUAK 550 transcript expression counts as Reads Per Kilobase Million (RPKM). (C,F) Volcano plots of differential gene expression analysis for the indicated genotypes. Each data point in the scatter plot represents a gene. The log2 fold change of each gene is represented on the y-axis and the log10 of its adjusted p-value is on the x-axis. Numbers in grey above or below the black dotted line correspond to the number of target genes upregulated or downregulated, respectively. Orange circle corresponds to NUAK.

Assessment of inter- and intragroup variability for gene expression analysis. (A-C) Whole larvae samples. (D-F) Muscle carcass samples. (A,D) Principal component analysis (PCA) for three replicates of control (mef2>lacZ) or three replicates for each NUAK transgene (mef2>NUAK 548 or mef2>NUAK 550). X and Y axes correspond to the first two principle components. (B,E) Barplots depicting NUAK 548 or NUAK 550 transcript expression counts as Reads Per Kilobase Million (RPKM). (C,F) Volcano plots of differential gene expression analysis for the indicated genotypes. Each data point in the scatter plot represents a gene. The log2 fold change of each gene is represented on the y-axis and the log10 of its adjusted p-value is on the x-axis. Numbers in grey above or below the black dotted line correspond to the number of target genes upregulated or downregulated, respectively. Orange circle corresponds to NUAK.

GO enrichment

Significant differentially expressed genes were further analyzed for GO classifications using the GOrilla online analysis software. GO terms featuring biological processes are shown in Table 2 for whole larvae and Table 3 for muscle carcass samples.
Table 2

Enriched GO terms categorized by biological process in whole larvae RNA seq samples.

GenotypeIDGO termCountP value
Whole larvae

mef2>NUAK 548 vs mef2>lacZGO:0042398cellular modified amino acid biosynthetic process46.32E-06
GO:0005975carbohydrate metabolic process126.38E-06
GO:0006575cellular modified amino acid metabolic process73.03E-05
GO:0009066aspartate family amino acid metabolic process42.29E-04
GO:0006528asparagine metabolic process27.04E-04
GO:0009109coenzyme catabolic process27.04E-04

mef2>NUAK 550 vs mef2>lacZGO:0003341cilium movement223.82E-13
GO:0044782cilium organization365.25E-08
GO:0007283spermatogenesis441.07E-06
GO:0035082axoneme assembly161.75E-06
GO:0070286axonemal dynein complex assembly122.91E-06
GO:0000003reproduction418.79E-06
GO:0120031plasma membrane bounded cell projection assembly332.15E-05
GO:0006936muscle contraction101.03E-04
GO:0072522purine-containing compound biosynthetic process401.14E-04
GO:0060285cilium-dependent cell motility72.93E-04
GO:0070585protein localization to mitochondrion225.97E-04
GO:0043648dicarboxylic acid metabolic process148.02E-04
GO:0044281small molecule metabolic process1798.59E-04
GO:0003012muscle system process218.81E-04
GO:0006096glycolytic process248.82E-04
Table 3

Enriched GO terms categorized by biological process in muscle carcass RNA seq samples.

GenotypeIDGO termCountP value
Muscle carcass

mef2>NUAK 548 vs mef2>lacZGO:0007594puparial adhesion78.73E-10
GO:1901605alpha-amino acid metabolic process162.02E-07
GO:0006566threonine metabolic process42.11E-06
GO:0009066aspartate family amino acid metabolic process86.70E-06
GO:0006520cellular amino acid metabolic process161.08E-04
GO:0044282small molecule catabolic process151.91E-04
GO:0019752carboxylic acid metabolic process253.44E-04
GO:1901607alpha-amino acid biosynthetic process74.30E-04
GO:0009081branched-chain amino acid metabolic process45.61E-04
GO:0006082organic acid metabolic process256.34E-04

mef2>NUAK 550 vs mef2>lacZGO:0042335cuticle development254.25E-09
GO:0030497fatty acid elongation82.99E-06
GO:0032504multicellular organism reproduction158.55E-06
GO:0006959humoral immune response93.27E-05
GO:0003012muscle system process75.30E-05
GO:0050830defense response to Gram-positive bacterium95.55E-05
GO:0006633fatty acid biosynthetic process117.99E-05
GO:0030148sphingolipid biosynthetic process99.05E-05
GO:0072330monocarboxylic acid biosynthetic process123.64E-04
GO:0009074aromatic amino acid family catabolic process44.90E-04
GO:0009617response to bacterium206.80E-04
GO:0044281small molecule metabolic process529.77E-04
Enriched GO terms categorized by biological process in whole larvae RNA seq samples. Enriched GO terms categorized by biological process in muscle carcass RNA seq samples.

Experimental Design, Materials and Methods

Drosophila genetics

Flies were maintained on standard cornmeal-yeast-agar medium at 25°C. mef2-Gal4 females from the Bloomington Stock Center (BL27390) were mated to males of the following genotypes: UAS-lacZ (BL3956), UAS-NUAK 548 [3], or UAS-NUAK 550 [3] for RNA isolation.

RNA isolation and library preparation

Both male and female third instar larvae (L3) were combined for total RNA extraction according to manufacturer protocol using the RNeasy Mini Kit (Qiagen, Germnay). For each genotype (mef2>lacZ, mef2>NUAK 548, or mef2>NUAK 550), RNA was isolated from either three whole larvae or ten dissected muscle fillets for each biological replicate. Muscle fillets were prepared by pinning L3 larvae on Sylgard plates followed by the removal of fat body and other internal organs in Phosphate Buffered Saline (PBS). The quality of RNA prepared from three individual biological replicates of each genotype was analyzed using the Nanodrop 2000 spectrophotometer (Thermo Fisher Scientific, USA). High quality RNA (≥2 µg; OD260/280 – 1.8-2.2) was sent to Genewiz for further quality assessment using the Qubit RNA Assay to measure RNA concentration (≥50 ng/µl) and the Agilent Bioanalyzer to assess RNA quality (RIN≥6.0). The NEBNext Ultra II RNA Library Prep Kit was used to prepare the RNA-seq according to standard Illumina polyA selection protocols.

Sequencing, raw data processing, and data analysis

The RNA-seq libraries were sequenced using the Illumina HiSeq 4000 (2 × 150 bp sequencing). Poor quality regions and adapter sequences were trimmed with Trimmomatic v.0.36 [4]. The trimmed reads were aligned with the Drosophila melanogaster BDGP6 reference genome available on ENSEMBL using the STAR aligner v.2.5.2b [5] to generate .bam files. PCA plots were generated using log transformed counts imported into the Clustvis software (https://biit.cs.ut.ee/clustvis/) [6]. For either whole larvae or muscle carcass plots, unit variance scaling was applied to rows and SVD with imputation was used to calculate principal components. The X and Y axes show principal component 1 and principal component 2, with the total variance listed in the axes. N = 9 data points for each.

Differential gene expression and Gene ontology (GO) analysis

Unique gene hit counts, calculated using featureCounts from the Subread package v.1.5.2, were used for downstream differential expression analysis. DESeq2 was used to compare gene expression between experimental and control samples. P-values and log2 fold changes were calculated using the Wald test. Adjusted p-value < 0.05 and absolute log2 fold change > 1 were used as cut-offs for differentially expressed genes. Volcano plots were generated in GraphPad Prism 9.2.0 using genes designated as significant after differential gene expression analysis. Gene Ontology enRIchment anaLysis and visuaLizAtion tool (GOrilla) (http://cbl-gorilla.cs.technion.ac.il) was used to identify significantly enriched GO terms featuring biological processes with P-values less than 0.05 [7].

Ethics Statements

This work does not contain any experiments with humans, animals, or social media platforms.

CRediT authorship contribution statement

David Brooks: Conceptualization, Methodology, Investigation, Writing – review & editing. Erika R Geisbrecht: Conceptualization, Software, Data curation, Supervision, Writing – original draft, Funding acquisition.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
SubjectBiological Sciences
Specific subject areaDevelopmental Biology; Molecular Biology; Transcriptomics
Type of dataRNA-seq data, Tables, Figures
How the data were acquiredRNA sequencing by Illumina HiSeq 4000Software: Trimmomatic v.0.36, STAR aligner v.2.5.2b, featureCounts from Subread v.1.5.2, DESeq2, Gorilla
Data formatRawAnalyzed
Description of data collectionTotal RNA was isolated from either wandering L3 whole larvae or filleted muscle carcasses from control (mef2>lacZ) or NUAK overexpression (mef2>NUAK 548 or mef2>NUAK 550) samples. Three biological replicates were prepared for each genotype. After assessment of RNA quality, libraries were constructed using standard Illumina protocols, and sequenced.
Data source locationKansas State University, Manhattan, KS
Data accessibilityRaw and analyzed RNA-Seq data were deposited in the NCBI GEO database under GEO accession GSE204894.(https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?&acc=GSE204894)
Related research articleD. Brooks, S. Bawa, A. Bontrager, M. Stetsiv, Y. Guo, E.R. Geisbrecht, Independent pathways control muscle tissue size and sarcomere remodeling, Dev Biol 490 (2022) 1-12. https://doi.org/10.1016/j.ydbio.2022.06.014.
  7 in total

1.  STAR: ultrafast universal RNA-seq aligner.

Authors:  Alexander Dobin; Carrie A Davis; Felix Schlesinger; Jorg Drenkow; Chris Zaleski; Sonali Jha; Philippe Batut; Mark Chaisson; Thomas R Gingeras
Journal:  Bioinformatics       Date:  2012-10-25       Impact factor: 6.937

2.  Independent pathways control muscle tissue size and sarcomere remodeling.

Authors:  David Brooks; Simranjot Bawa; Alexandria Bontrager; Marta Stetsiv; Yungui Guo; Erika R Geisbrecht
Journal:  Dev Biol       Date:  2022-06-24       Impact factor: 3.148

3.  ClustVis: a web tool for visualizing clustering of multivariate data using Principal Component Analysis and heatmap.

Authors:  Tauno Metsalu; Jaak Vilo
Journal:  Nucleic Acids Res       Date:  2015-05-12       Impact factor: 16.971

4.  Drosophila NUAK functions with Starvin/BAG3 in autophagic protein turnover.

Authors:  David Brooks; Fawwaz Naeem; Marta Stetsiv; Samantha C Goetting; Simranjot Bawa; Nicole Green; Cheryl Clark; Arash Bashirullah; Erika R Geisbrecht
Journal:  PLoS Genet       Date:  2020-04-22       Impact factor: 5.917

5.  GOrilla: a tool for discovery and visualization of enriched GO terms in ranked gene lists.

Authors:  Eran Eden; Roy Navon; Israel Steinfeld; Doron Lipson; Zohar Yakhini
Journal:  BMC Bioinformatics       Date:  2009-02-03       Impact factor: 3.169

6.  Targeted gene expression as a means of altering cell fates and generating dominant phenotypes.

Authors:  A H Brand; N Perrimon
Journal:  Development       Date:  1993-06       Impact factor: 6.868

7.  Trimmomatic: a flexible trimmer for Illumina sequence data.

Authors:  Anthony M Bolger; Marc Lohse; Bjoern Usadel
Journal:  Bioinformatics       Date:  2014-04-01       Impact factor: 6.937

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.