Literature DB >> 30483741

Establishment of microRNA, transcript and protein regulatory networks in Alport syndrome induced pluripotent stem cells.

Wenbiao Chen¹, Donge Tang², Yong Dai², Hongyan Diao¹.

Abstract

Alport syndrome (AS) is an inherited progressive disease caused by mutations in genes encoding for the α3, α4 and α5 chains, which are an essential component of type IV collagen and are required for formation of the glomerular basement membrane. However, the underlying etiology of AS remains largely unknown, and the aim of the present study was to examine the genetic mechanisms in AS. Induced pluripotent stem cells (iPSCs) were generated from renal tubular cells. The Illumina HiSeq™ 2000 system and iTRAQ‑coupled 2D liquid chromatography‑tandem mass spectrometry were used to generate the sequences of microRNAs (miRNAs), transcripts and proteins from AS‑iPSCs. Integration of miRNA, transcript and protein expression data was used to construct regulatory networks and identify specific miRNA targets amongst the transcripts and proteins. Relative quantitative proteomics using iTRAQ technology revealed 383 differentially abundant proteins, and high‑throughput sequencing identified 155 differentially expressed miRNAs and 1,168 differentially expressed transcripts. Potential miRNA targets were predicted using miRanda, TargetScan and Pictar. All target proteins and transcripts were subjected to network analysis with miRNAs. Gene ontology analysis of the miRNAs and their targets revealed functional information on the iPSCs, including biological process and cell signaling. Kyoto Encyclopedia of Genes and Genomes pathways analysis revealed that the transcripts and proteins were primarily enriched in metabolic and cell adhesion molecule pathways. In addition, the network maps identified hsa‑miRNA (miR)‑4775 as a prominent miRNA that was associated with a number of targets. Similarly, the prominent ELV‑like protein 1‑A and epidermal growth factor receptor (EGFR)‑associated transcripts were identified. Reverse transcription‑quantitative polymerase chain reaction analysis was used to confirm the upregulation of hsa‑miR‑4775 and EGFR. The integrated approach used in the present study provided a comprehensive molecular characterization of AS. The results may also further understanding of the genetic pathogenesis of AS and facilitate the identification of candidate biomarkers for AS.

Entities: CellLine Chemical Disease Gene Mutation Species

Mesh：

Substances：
MicroRNAs
Proteome

Year: 2018 PMID： 30483741 PMCID： PMC6297794 DOI： 10.3892/mmr.2018.9672

Source DB: PubMed Journal: Mol Med Rep ISSN： 1791-2997 Impact factor: 2.952

Introduction

Alport syndrome (AS) is a hereditary glomerulonephritis, which is variably associated with neural hearing loss and ocular abnormalities. It is widely accepted that AS is a genetically heterogeneous disease, which is caused by mutations in the collagen type IV α5 chain (COL4A5) gene located at Xq22.3 and encoding the α5 chain of type IV collagen (1,2). The primary pathological event appears to be the loss of the type IV collagen network, consisting of α3, α4 and α5 chains, which is an important component of the glomerular basement membrane (GBM). This results in the characteristic thinning, thickening and splitting of the GBM (3,4). Until now, a definitive diagnosis of AS was only possible through renal biopsy (5), which is primarily performed late following the onset of the profound clinical symptoms of this progressive renal disease. Therefore, early diagnosis and preemptive treatments have become increasingly important (6). Progress has been made in understanding the genetic basis of AS (7,8) and improving treatment, although patients suffering from AS inevitably develop end-stage renal disease. A study demonstrated that the secretion of α3-α4-α5 (IV) heterotrimers by podocytes into a preformed, abnormal, filtering GBM is effective at restoring the missing collagen IV network, slowing kidney disease progression and extending life span (9). Another study identified a missense mutation, c.368G>A (p.Gly123Glu), in the COL4A5 gene, which was reported to be the genetic cause of AS (10). However, the exact mechanisms remain poorly understood and novel treatment strategies are still required. Thus, novel analysis methods are required to reveal the genetic basis and mechanisms underlying AS. MicroRNAs (miRNAs) regulate gene expression and modulate crucial biological processes, including differentiation, proliferation and apoptosis. They function through various mechanisms, including targeted miRNA degradation and translational repression (11,12). Aberrant miRNA expression is associated with kidney disease. Emerging evidence from clinical and animal studies has indicated a critical role for miRNAs in renal pathophysiology (13). In previous studies, miRNAs were studied in the context of nephropathy, including membranous nephropathy, immunoglobulin (Ig)A nephropathy, lupus nephritis and renal transplantation. Collectively, the results demonstrated that miRNAs may act as biomarkers for the early diagnosis of kidney disease (14–16). Transcript sequencing is a useful alternative to whole genome sequencing as it avoids non-coding and repetitive sequences that make up the majority of eukaryotic genomes. Transcriptomic data offers an opportunity to deliver fast, inexpensive and accurate genome information (17). Transcriptome analysis of diabetic kidney disease identified multiple genes and pathways that may serve a role in the pathogenesis of the disease or act as biomarkers (18). A previous study demonstrated that injury-associated transcriptome alterations occur in kidney parenchymal cells in response to stresses of transplantation (19). Transcriptome analysis appears to be an effective and important tool for hypothesis-driven investigation of kidney disease. In addition, proteins are useful for gaining insight into the functional network of gene expression. With the advancement of 2D gel electrophoresis and isobaric tagging for relative and absolute quantification technologies, protein analysis has become an important tool for investigating disease (20). Exosomal proteomics has emerged as a powerful tool for understanding the molecular composition of exosomes and has the potential to accelerate biomarker discovery in diabetic nephropathy (21). By comparing IgA nephropathy with a normal control, Park et al (22) established a urinary proteomic map of IgA nephropathy and identified 216 protein spots that were differentially expressed. Therefore, studying proteins may aid in understanding the molecular physiology of nephropathy and identifying early diagnostic markers. Induced pluripotent stem cells (iPSCs) may be used to generate pluripotent patient-specific cell lines that are beneficial for studying the pathogenesis of model human diseases, and may also be used to determine genetic information associated with disease pathogenesis, particularly for diseases resulting from abnormal embryonic development (23,24). Extensive research, including system-wide genomic analyses and comparisons of transcripts expression profiles, has been conducted to identify iPSCs (25). The molecular mechanisms that govern the induction, maintenance and directed differentiation of pluripotency are mediated by transcription factors and non-coding RNAs, including miRNAs and long non-coding RNAs (26,27). Systematic deciphering of genetic or epigenetic alterations may help identify genomic hotspots in iPSCs with their consistent genetic background. However, these finding have also left notable gaps in the knowledge regarding the development and progression of disease. In the present study, iPSCs from renal tubular cells of patients with AS (AS-iPSCs) and normal controls (NC-iPSCs) were successfully generated (28). Subsequently, miRNA, transcript and protein regulation in iPSCs was studied with the aim of better understanding the genetic mechanism of AS. Furthermore, the identified proteins, in addition to miRNAs and their target transcripts, were subjected to gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) functional analyses. The resulting comprehensive network maps of miRNAs, transcripts and proteins may help with understanding the pathogenesis of AS at the genetic level (Fig. 1).

Figure 1.

Schematic representation of the protocols of the present study. AS, Alport syndrome; GO, Gene Ontology; iPSC, induced pluripotent stem cell; KEGG, Kyoto Encyclopedia of Genes and Genomes; LC-MS/MS, liquid chromatography-tandem mass spectrometry; miRNA, microRNA; NC, normal control.

Materials and methods

Patients and controls

A family with three generations of patients suffering from AS was identified (Fig. 2). The proband (III2) was a 26-year-old woman, who presented clinically with gross hematuria and albuminuria. The patient was diagnosed with AS at the Second Clinical Medical College of Jinan University (Shenzhen, China) in 2013. The proband was confirmed to have AS by pathological examination. The grandmother of the proband (I2) was also diagnosed with AS, although this individual succumbed to kidney failure. The mother of the proband (II3) exhibited AS symptoms, including kidney failure, gross hematuria, albuminuria, sensorineural hearing loss and pathognomonic ocular lesions. The sister of the proband (III3) presented with mild clinical symptoms of gross hematuria and albuminuria. The study protocol and consent forms were approved by the Ethics Committee of Jinan University and adhered to the guidelines set forth in the Declaration of Helsinki. Healthy participants and those with AS provided written informed consent.

Figure 2.

AS family pedigree chart. Males are represented by squares and females by circles. Black symbols indicate patients with AS. Slashes indicate deceased family members. The proband is indicated by the arrow. AS, Alport syndrome.

A total of six members of the family were selected for further research. The proband (III2), her mother (II3) and a sister (III3) formed the experimental AS group, whereas a sister (III4), brother (III1) and her father (II4) served as the normal controls (NC) (Table I).

Table I.

Participant characteristics and blood biochemical test results.

Individual	Sex	Age, years	BLO	PRO	Urea nitrogen, mmol/l	Inosine, µmol/l	TP, g/l	ALB, g/l	CHOL, mmol/l	Hearing loss	Vision loss (L/R)
III2	Female	26	++	+	6.6	121	57	35	5.76	+	0.2/1.0
III3	Female	23	+	+	3.9	110	71	45	4.45	+	0.4/1.0
II3	Female	51	++	++	3.5	231	70	43	4.68	+	0.2/0.8
II4	Male	53	−	−	3.0	55	75	52	3.81	−	1.0/1.0
III1	Male	21	−	−	3.2	56	66	45	4.61	−	1.5/1.0
III4	Female	17	−	−	3.2	50	75	43	4.98	−	1.2/1.1

The vision test was based on the Eggers visual chart. ALB, albumin; BLO, urine occult blood; CHOL, cholesterol; L, left; PRO, urine protein; R, right; TP, total protein.

Renal tubular cell isolation and iPSC generation

iPSCs were successfully generated from renal tubular cells, as previously described (28). iPSC formation was confirmed by comparatively analyzing human embryonic stem cell (hESC) markers via colony morphology, immunohistochemistry, reverse transcription-quantitative polymerase chain reaction (RT-qPCR), flow cytometry, gene expression profiling of the three germ layers and karyotyping. The results demonstrated that iPSCs were similar to hESCs with regards to morphology, proliferation, hESC-specific surface marker expression and differentiation into the cell types of the three germ layers. Therefore, iPSCs have similar characteristics to hESCs and are suitable for AS research (28). Total RNA from iPSCs was extracted using TRIzol® (Invitrogen; Thermo Fisher Scientific, Inc., Waltham, MA, USA), according to the manufacturer's protocol. RNA concentration was measured using a NanoDrop™ 2000 spectrophotometer (Thermo Fisher Scientific, Inc.) and stored at −80°C. All materials and solutions were handled under RNase-free conditions, and all solutions were prepared with RNase-free water. Small RNAs (sRNAs) from the AS and NC groups were sequenced by BGI (Shenzhen, China) using Solexa high-throughput sequencing technology. A total of ~2 µg RNA/sample was used to prepare an sRNA library using the TruSeq sRNA sample preparation kit, according to the manufacturer's protocol (Illumina, Inc., San Diego, CA, USA). sRNA libraries were sequenced in two batches (three samples per sequencing run) using a next sequencing apparatus (Illumina HiSeq™ 2000), to generate ~16 million single-end 75-bp reads per sample. To compare the estimated miRNA expression, the miRNA levels were first normalized. Subsequently, the fold change between AS and NC was calculated as follows: Fold change = log2 (AS/NC). A rigorous significance test developed by Audic and Claverie (29) was used to identify differentially expressed genes with minimal statistical error. A fold change ≥1 and P≤0.01 were considered to indicate statistically significant differential expression between the two groups. Following total RNA extraction, a cDNA library was constructed using the First Strand cDNA Synthesis kit (Thermo Fisher Scientific, Inc.) containing random oligonucleotides and a recombinant Moloney murine leukemia virus reverse transcriptase. The cDNA library consisted of 200-bp fragments. The cDNA whole transcriptome library was sequenced on the Illumina HiSeq™ 2000 sequencing platform (Illumina, Inc.). Differentially expressed transcripts were detected by IDEG6 software (http://telethon.bio.unipd.it/bioinfo/IDEG6_form/) using a general χ2 test based on the reads per kilobase of transcript per million mapped reads (RPKM) values. Multiple testing correction via false discovery rate (FDR) was performed. An FDR<0.001, fold change ≥1, and P≤0.05 were used as thresholds to judge the significance of differential gene expression.

Proteome identification and comparative analysis

iPSCs were washed five times in ice-cold PBS and lysed using enhanced radioimmunoprecipitation assay buffer (Beyotime Institute of Biotechnology, Haimen, China) containing protease and phosphatase inhibitors for 30 min on ice. Subsequently, the samples were sonicated at 4°C for 10 min at 40,000 × g, with ten cycles of 5-sec bursts followed by 30-sec cooling intervals. Cell debris was removed by centrifugation at 12,000 × g for 30 min at 4°C and the supernatants were collected. Protein concentration was measured using a bicinchoninic acid assay kit (Pierce; Thermo Fisher Scientific, Inc.). A total of 100 µg protein per AS or NC sample was labeled with iTRAQ® reagents (Applied Biosystems; Thermo Fisher Scientific, Inc.). Proteins with ≥3 peptides were considered valid for subsequent analysis. For each peptide, the ratio between the iTRAQ® label peak value and the sum of the intensities was calculated. The ratio was normalized to 1 and divided by the median of the ratio. Relative quantification of proteins was based on the ratio of the peak areas of the mass-to-charge ratio of 114 (AS) and 115 (NC) tandem mass spectrometry (MS/MS) spectra. Proteins with a fold change of >1.2 in peptide abundance (>1.2 increased or <0.83 decreased) were considered to be significantly differentially abundant (P<0.05).

miRNA target prediction

Three different software packages were used to predict the miRNA target genes, including PicTar (http://pictar.mdc-berlin.de/), miRanda v5 (http://www.ebi.ac.uk/enright%20srv/microcosm/htdocs/targets/v5/) and TargetScan Human 7.2 (http://www.targetscan.org/vert/). Genes identified by at least two of the three platforms were selected as ultimate target genes.

Data analysis and network visualization

For functional analysis of miRNAs, differentially abundant proteins, differentially expressed transcripts and miRNAs were combined with miRNA target predictions to form integrated regulatory networks. Statistical analyses were performed using R software (http://www.r-project.org/) with the appropriate KEGGSOAP packages (http://www.bioconductor.org/packages/2.4/bioc/html/KEGGSOAP.html). The network was visualized using Cytoscape 3.0 software (http://cytoscape.org).

Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis

KEGG pathways of the differentially expressed transcripts and differentially abundant proteins were annotated using the Database for Annotation, Visualization and Integrated Discovery gene annotation tool 6.8 (http://david.abcc.ncifcrf.gov). KEGG pathways were defined as significantly enriched in target candidates at a corrected P≤0.01. This analysis tool was also able to predict the principal biological functions and pathways in which the candidate targets were involved.

Gene Ontology (GO) analysis

Target transcripts, proteins and miRNAs were subjected to GO analysis. The whole GO database was used as the background network to calculate the number of genes in each node. The hypergeometric distribution was used to test the enrichment of genes in each GO node. GO terms were classified as biological process, cellular component and molecular function, whereby P<0.05 was considered statistically significant.

Validation using RT-qPCR

RT-qPCR was used to validate the results of deep sequencing analysis for miRNAs and transcripts. U6 and GAPDH were selected as the internal controls for miRNAs and transcripts, respectively. A total of 2 µg total RNA from each sample was extracted using a Qiagen RNeasy Mini Extraction kit and reverse transcribed to cDNA using miScript Reverse Transcription kit (both from Qiagen GmbH, Hilden, Germany) according to the manufacturer's protocol. Gene-specific primers were listed in Table II. The reverse transcription products were amplified using the following PCR program with SYBR green reagent (Roche Diagnostics, Basel, Switzerland) on a ABI QuantStudio™ 6 Flex Real-Time PCR system (Thermo Fisher Scientific, Inc.): Polymerase activation at 95°C for 2 min, followed by 40 cycles of 94°C for 10 sec, 59°C for 10 sec and 72°C for 40 sec. All reactions were run in triplicate. miRNA and transcript expression levels were normalized to the reference genes. Fold changes were determined and the relative quantification of gene expression data was performed using the 2−ΔΔCq method (30).

Table II.

RT-qPCR primer used in the validation assays.

Primer name	Primer sequence (5′→3′)	Tm
mir4461-R	GTCGTATCCAGTGCGTGTCGTGGAGTCGGCAATTGCACTGGATACGACTATGGC
mir4461-F	TATGTACGTAGTCTAGGCC	55
mir-4775-R	GTCGTATCCAGTGCGTGTCGTGGAGTCGGCAATTGCACTGGATACGACACCAAC
mir-4775-F	AAGCATTCTTTCATTGGTTGG	55
mir-4651-R	GTCGTATCCAGTGCGTGTCGTGGAGTCGGCAATTGCACTGGATACGACCCCACC
mir-4651-F	CGGCGACGGCGGGGT	55
U6-R	GTCGTATCCAGTGCGTGTCGTGGAGTCGGCAATTGCACTGGATACGACAAAATATG
U6-F	GATTAGCATGGCCCCTGC	55
EGFR-F	AAAGTGGCCGCCATTTTAGA	57
EGFR-R	CAACAATCACGCAAAGCTCC
CX3CL1-F	CGTGCAGCAAGATGACATCA	59
CX3CL1-R	TCCTTGACCCATTGCTCCTT
LRRC55-F	AATGGACACCCGAAACCTCA	57
LRRC55-R	TGGCACATGGCTGAAATTGT
FAM18B1-F	AATGGTTGGCCTACGTTGGT	59
FAM18B1-R	TGGACAGGCAATAAGTCCCA
AURKC-F	AGCGCACAGCCACGATAATA	59
AURKC-R	TGCACAGACCAGCCAAAATC
RPS4Y1-F	ATGGCAAGGTTCGAGTGGAT	59
RPS4Y1-R	GATGCGGTGAACAGCAAAAC
GAPDH-F	ACCACAGTCCATGCCATCAC	59
GAPDH-R	TCCACCACCCTGTTGCTGTA

Tm, primer melting temperature; F, forward; R, reverse.

Results

Sequence profiling of miRNAs, transcripts and proteins

To build an integrated network of miRNAs, transcripts and proteins for AS-iPSCs, miRNA, transcript and protein expression level data for AS-iPSCs and NC-iPSCs were compared. The comparison of AS-iPSC and NC-iPSC sRNA libraries revealed 830 differentially expressed miRNAs, of which 155 were significantly differentially expressed, including 79 upregulated and 76 downregulated miRNAs (data not shown). The miRNA sequences were submitted to the National Center for Biotechnology Information (NCBI) and assigned the accession no. SRP041435. Following quality control and filtering, 26,021,874 and 27,551,343 clean reads were obtained from the AS-iPSC and NC-iPSC transcript libraries, respectively. Following application of the thresholds for significance, 1,168 significantly differentially expressed genes were identified, including 786 that were upregulated and 382 that were downregulated (data not shown). The raw transcript sequence data were deposited at the NCBI under the accession no. SRP041474. For the proteomic analysis, a total of 15,553 iTRAQ- labeled peptides that mapped to a total of 3,431 proteins were identified and quantified. Among them, 899 proteins were predicted as differentially abundant between the AS-iPSC and NC-iPSC samples. Of these, 383 were significantly differentially abundant proteins (data not shown), including 227 upregulated and 156 downregulated proteins.

miRNA-target transcript regulatory network

To establish the miRNA-target transcript network, a systematic investigation was conducted using miRanda, TargetScan and Pictar to predict potential miRNA target transcripts. A transcript was classed as a potential target if two out of three platforms identified it as a miRNA target. The network map, which consisted of miRNAs and their target transcripts, was constructed using Cytoscape (Fig. 3). The results indicated that 156 miRNAs were associated with 382 target transcripts. It was observed that miRNAs and their target transcripts were mutually cross-regulated. Among them, the most prominent miRNA was hsa-miRNA (miR)-4775, which was associated with 27 transcripts.

Figure 3.

miRNA-target transcript regulatory network. A total of 156 miRNAs were associated with 382 target transcripts. Red circles represent upregulated miRNAs and green circles represent downregulated target transcripts. The lines signify coherent miRNA-target transcript pairs. miRNA, microRNA.

To evaluate the functions of the target transcripts and miRNAs, GO functional enrichment analysis was conducted. A total of three ontologies, including molecular function (MF), cellular component (CC) and biological process (BP) were enriched by features of the transcript and miRNA datasets. In total, miRNAs were mainly involved in 9 BP, 7 CC and 4 MF terms (Fig. 4A), and the target transcripts were mainly involved in five BP, 5 CC and five MF terms (Fig. 4B). Notably, certain terms involving miRNAs were also associated with the target transcripts. BP terms included ‘cellular process’ and ‘biological regulation’. CC terms included ‘cell physiology’ and ‘membrane part’. MF terms included ‘cell activity’ and ‘cellular molecular binding’.

Figure 4.

Enrichment of GO terms for miRNAs, target transcripts and target proteins. GO enrichments of (A) miRNAs, (B) target transcripts and (C) target proteins. The X-axis indicates the GO ontology terms according to molecular function, cellular component and biological process. The Y-axis indicates the percentage of genes in each category. GO, gene ontology; miRNA, microRNA.

miRNA-target protein regulatory network

GO enrichment analysis of the miRNAs and target proteins was performed. Target proteins were mainly enriched in five BP, five CC and five MF terms (Fig. 4C). Compared with the miRNA GO enrichment analysis, certain GO terms for the target proteins were the same or similar to those of the miRNAs. For example, BP included ‘cellular physiological process’ and ‘intracellular signal’, and MF included ‘cellular activity’ and ‘cellular molecular binding’. However, the CC terms of the target proteins and miRNAs were different. The CC term ‘intracellular biological process’ was associated with the target proteins, whereas ‘extracellular biological process’ was associated with the miRNAs. To investigate the potential functional association between miRNAs and target proteins, the differentially expressed miRNAs and target proteins were used to build a regulatory network. Cytoscape used to visualize the miRNA-target protein regulatory network and the results demonstrated that 26 miRNAs were associated with 25 target proteins (Fig. 5). In addition, the results indicated that the miRNAs and target proteins were mutually cross-regulated.

Figure 5.

miRNA-target protein regulatory network. A total of 26 miRNAs were associated with 25 target proteins. Red circles represent upregulated miRNAs and green circles represent downregulated target proteins. The lines signify coherent miRNA-target protein pairs. miRNA/miR, microRNA.

Transcript-protein regulatory network

A total of 1,168 differentially expressed transcripts and 383 differentially abundant proteins were used to build a regulatory network. The results demonstrated that 237 transcripts were associated with 189 proteins (Fig. 6). ELAV-like RNA binding protein 1 (ELAVL1) was the most prominent protein, which was associated with 41 transcripts, and epidermal growth factor receptor (EGFR) was the most prominent transcript, which was associated with 19 proteins.

Figure 6.

Transcript-protein regulatory network. A total of 237 transcripts were associated with 189 proteins. Circles represent transcripts and squares represent proteins. Red indicates upregulation and blue indicates downregulation. The lines signify coherent transcript-protein pairs.

KEGG was used to analyze pathways enriched for significantly differentially expressed transcripts between the AS-iPSC and NC-iPSC libraries. Among them, 1,168 differentially expressed genes were involved in 15 pathways (data not shown) and 383 differentially abundant proteins were involved in 11 pathways (data not shown). Notably, there were several pathways that the proteins were involved in that were similar to those of the transcripts, including cell signaling pathways, cell metabolism pathways and cell interaction pathways. Transcripts and proteins were also mapped to KEGG pathways using GenMAPP to identify common pathways. The results indicated that the transcripts and proteins were primarily enriched in 262 common pathways, which included ‘carbon metabolism’ and ‘cell adhesion molecules’ (data not shown). Numerous studies have demonstrated a substantial role for post-transcriptional, post-translational and protein degradation regulation in cellular biology (31). However, it is generally accepted to use transcript concentrations as proxies for the concentrations and activities of the corresponding proteins. On the basis of transcript-protein association, nine proteins and nine transcripts were identified to have a consistent expression pattern, including three upregulated and six downregulated pairs (Fig. 7).

Figure 7.

A total of nine proteins and their corresponding transcripts, identified to have a consistent expression pattern. The X-axis indicates transcripts and proteins. The Y-axis indicates upregulation or downregulation of expression. CAV1, caveolin 1; CPT1A, carnitine palmitoyltransferase 1A; CRYZ, crystallin-ζ; L1CAM, L1 cell adhesion molecule; NLGN4X, neuroligin 4 X-linked; RCN3, reticulocalbin 3; STOM, stomatin; UTF1, undifferentiated embryonic cell transcription factor 1; WDR72, WD repeat domain 72.

miRNA-protein-transcript network

To integrate the miRNA, transcript and protein data, Cytoscape was used to construct a network comprising 156 miRNAs, 26 proteins and 381 transcripts (Fig. 8). The network consisted of miRNA-protein and miRNA-transcript interaction pairs, whereby a single miRNA was able to be connected to multiple transcripts or proteins, and a single protein or transcript was able to be targeted by multiple miRNAs. In the network, the most prominent miRNA was hsa-miR-4775, which was associated with 30 transcripts and proteins.

Figure 8.

miRNA-protein-transcript regulatory network. The regulatory network comprises of 156 miRNAs, 26 proteins and 381 transcripts. The red circles indicate upregulated miRNAs, the blue circles represent downregulated transcripts and the blue squares indicate downregulated proteins. The lines signify coherent miRNA-transcript-protein-associated pairs. miRNA, microRNA.

Expression profiling by RT-qPCR

A total of three differentially expressed miRNAs and six differentially expressed transcripts were selected for RT-qPCR analysis. These miRNAs and transcripts exhibited high abundance and were differentially expressed between the AS and NC groups. The expression levels determined by RT-qPCR were consistent with the read abundance of deep sequencing, indicating the validity of the microarray-based miRNA and transcript quantification. RT-qPCR confirmed that hsa-mir-4651, hsa-mir-4461 and hsa-miR-4775 were upregulated in AS compared with NC (data not shown). In addition, EGFR, LRRC55, and CX3CLI were upregulated; and AURKC, RPS4Y1, and FAM18BI were downregulated in AS, compared with NC (data not shown).

Discussion

It is considered that IPSCs may revolutionize stem cell biology and regenerative medicine research by providing unprecedented opportunities to research human diseases and to overcome study limitations associated with safety, efficiency and ethics (32). IPSCs are an ideal platform for medical research as they carry the identical genetic anomalies associated with genetically heritable diseases, including Down's syndrome and type 1 diabetes. Various diseases have been modeled using iPSCs to better understand their etiology, which may aid in the development of novel treatments (33–35). In addition, iPSC technology has been employed in various diseases for disease modeling and gene therapy (36). Zou et al (37) demonstrated that single nucleotide substitution in human iPSCs was feasible, which may provide a novel strategy for gene therapy for monogenic diseases, including sickle cell disease. Another study used zinc finger nucleases and PiggyBac technology in human iPSCs to achieve biallelic correction of a point mutation (Glu342Lys) in the serpin family A member 1 gene, which is responsible for α1-antitrypsin deficiency (38). In addition, a previous study demonstrated successful in situ gene corrections in iPSCs derived from patients with pyruvate kinase deficiency (39). In the present study, iPSCs were used to study the pathogenic mechanisms of AS, since it is a hereditary disease caused by genetic mutations. Increasing research has revealed that miRNAs, transcripts and proteins regulate iPSCs, which may be important for their pluripotency (40,41). The specific regulatory roles of miRNAs in controlling the self-renewal and pluripotency of iPSCs are becoming increasingly evident. Studies have suggested a critical role for miRNAs in reprogramming somatic cells into pluripotent cells (42,43). Another study revealed that cells undergoing iPSC reprogramming exhibited marked transcriptomic alterations associated with cell signaling pathways (44). Protein studies of iPSCs have also proven to have the potential to identify molecules and pathways that are important for iPSC biology (45,46). Therefore, the present study made use of miRNA, transcript and protein data to study AS in an iPSC disease model, which will aid in a better understanding of AS pathogenesis. Systems biological approaches provide tools to investigate the interactions between candidate genes by integrating miRNA, transcript and protein data. Network-based approaches are ideally suited to study the implications of functional genes in disease (47). In the present study, multiple expression profiling and bioinformatics analyses were integrated. The constructed databases were subjected to network analysis. To the best of our knowledge, the present study is the first of its kind to attempt investigating miRNA-target transcript, miRNA-target protein and transcript-protein networks in the context of an iPSC AS disease model. The results may be used for further research in the area of AS. In the present study, significantly differentially expressed miRNAs, transcripts and proteins were identified, and the corresponding targets of the miRNAs were predicted. Target transcripts, proteins and miRNAs were subjected to GO enrichment analysis. A systematic comparison of the results revealed similar GO enrichments terms for AS-iPSCs and NC-iPSCs. Cellular process in BP, membrane part in CC, and cell binding in MF were identified as common GO terms for miRNAs and their target transcripts. AS is an inherited disease caused by gene mutations that result in absence of the collagen IV network from basement membranes (9). The glomerular basement membrane (GBM) is the extracellular matrix component of the glomerular filtration barrier that lies between podocytes and endothelial cells. Pathological alterations in the GBM are specific characteristics of AS (5,9). Notably, the results demonstrated that the miRNAs and their target transcripts were enriched in the ‘membrane part’ CC category. Studies have also revealed that podocyte dysfunction and the development of fibrosis are pathological alterations characteristic of AS (2,48). Podocytes may bind to various immune deposits, thereby contributing to the development of fibrosis (49). This is consistent with the present results, which identified that the miRNAs and their target transcripts were enriched in ‘binding’ in the MF category. In the GO analysis of the miRNAs and their target proteins, ‘binding’ was also a significant term in the MF category. The common KEGG pathways for the miRNAs and proteins were those involving carbon metabolism and cell adhesion molecules. Research has demonstrated that in addition to the inherited AS factors, acquired external factors serve an important role in the progression of the disease (50,51). Metabolic abnormalities are an example of these acquired external factors (52,53). During the development of kidney failure, the majority of patients with AS suffer from metabolic abnormalities, including high blood pressure, hyperlipidemia and high proteinuria (54,55). Metabolic abnormalities contribute to the progression of kidney disease and subsequent kidney failure (56); however, the correction of metabolic disorders may considerably delay renal replacement therapy and accelerate the onset of end-stage kidney disease (57). Cell adhesion molecules are a principal mechanism of cell-cell and cell-matrix interactions. The growth, differentiation and organization of cells require cell-cell interactions (58). A previous study revealed that cadherin, a cell adhesion molecule, serves an important role in animal morphogenesis (59). Another study revealed that neural cell adhesion molecules were already present on the cells of the uninduced nephrogenic mesenchyme at the onset of kidney development (60). Therefore, it was hypothesized that morphological alterations occur in AS during the developmental process due to abnormalities in cell adhesion molecules, which in turn results in a thin and fractured GBM. Cellular reproduction is controlled by protein-coding genes and noncoding regions, including those encoding for sRNAs. The expression of coding and noncoding genes may be markedly influenced by the structural features of the corresponding sRNA transcripts (61). miRNAs, a diverse class of highly conserved sRNAs, serve a critical role in fundamental biological processes, including cellular differentiation, apoptosis, cell proliferation and development (62,63). The comprehensive characterization of miRNAs and transcripts in cell lines provides insight into the miRNA regulation of transcription, and enables the miRNA regulation of the corresponding target genes to be evaluated (64). In the present study, the regulatory network of miRNA-target transcripts revealed that they were mutually cross-regulated. Similarly, miRNA-protein interaction networks have long been studied in detail in various diseases. Numerous diseases have been linked to the misregulation or malfunction of proteins that interact with RNA (65). Thus, deciphering RNA-protein interactions at the molecular and cellular level is essential for understanding human physiology and disease (66). RNA-protein interactions are critical for the regulation of gene expression. Research over the last decade has demonstrated that RNA is invariably bound and frequently altered by proteins in cells, and that in biological environments, RNAs generally function together with proteins as RNA-protein complexes (67,68). In the present study, it was identified that certain miRNAs and proteins were closely associated. An in-depth study of this interaction network may aid in understanding the pathogenesis of AS. The aim of the present study was to identify potential important targets through regulatory network analysis of miRNAs, transcripts and proteins. Notably, hsa-miR-4775 had the greatest number of associations with proteins and target transcripts. In addition, hsa-miR-4775 was upregulated in AS-iPSCs, which was also verified using RT-qPCR. Recently a study revealed that hsa-miR-4775 promotes colorectal cancer invasion and metastasis (69). It was also reported that hsa-miR-4775 binds to the mRNAs of genes involved in the core apoptosis pathway, resulting in alterations which may lead to cancer, nephropathy and other diseases (70,71). The EGFR family is a central element for cellular signal transduction and diversification (72). Upregulated EGFR was observed to be the most relevant transcript in the interaction analysis in the present study. EGFR is known to have a key role in chronic renal failure (73). Transgenic mice harboring the negative form of EGFR are resistant to the progression of renal lesions (74). Dawson et al (75) revealed that EGFR was overexpressed in 75–90% of renal cell carcinomas and served an important role in tumor suppression or progression. Downregulated ELAVL1 was the most prominent protein in the network in the present study. This protein is a member of the ELAV family of RNA binding proteins that contain a number of RNA recognition motifs. ELAVL1 has been identified as a key post-transcriptional regulator and is implicated in cancer progression, particularly human renal cell carcinoma. ELAVL1 knockdown mice exhibited 60% inhibition of cell growth through inhibition of cell proliferation in addition to induction of cellular apoptosis (76). Therefore, it may be hypothesized that upregulated EGFR was involved in chronic renal failure of AS. In addition, it was theorized that there was an association between EGFR and hsa-miR-4775 expression, which may have accelerated the core apoptosis of GBM cells. However, further studies are required to confirm this hypothesis. In conclusion, 155 differentially expressed miRNAs, 1,168 differentially expressed transcripts and 383 differentially abundant proteins were identified in AS-iPSCs compared with NC-iPSCs. Using computational and systems techniques, integrated analysis of miRNA, transcript and protein expression data was performed. In addition, regulatory network maps were constructed to determine miRNA interactions with transcripts and proteins. To investigate the influence of miRNAs on biological processes, potential targets were predicted using miRanda, TargetScan and Pictar. The results demonstrated that hsa-miR-4775, ELAVL1 and EGFR were the most prominent miRNA, protein and transcript, respectively. RT-qPCR confirmed the upregulation of hsa-miR-4775 and EGFR. Target transcripts, proteins and miRNAs were used for GO enrichment analysis. ‘Cellular process’ in BP, ‘membrane part’ in CC and ‘cell binding’ in MF were common GO terms for miRNAs and target transcripts. ‘Cell binding’ in MF was also a common GO term for miRNAs and target proteins. GenMAPP identified common KEGG pathways of miRNAs and proteins, which included ‘carbon metabolism’ and ‘cell adhesion molecules’. Integration of multiple profiling datasets provides a novel way of examining gene regulation by miRNAs in conjunction with proteins and transcripts. This approach may enhance the understanding of the pathogenesis of AS and provide novel diagnostic and treatment strategies for AS.

76 in total

Review 1. Genomics of microRNA.

Authors: V Narry Kim; Jin-Wu Nam
Journal: Trends Genet Date: 2006-01-30 Impact factor: 11.639

2. Association between the metabolic syndrome and chronic kidney disease in Chinese adults.

Authors: Jing Chen; Dongfeng Gu; Chung-Shiuan Chen; Xigui Wu; L Lee Hamm; Paul Muntner; Vecihi Batuman; Chien-Hung Lee; Paul K Whelton; Jiang He
Journal: Nephrol Dial Transplant Date: 2007-02-01 Impact factor: 5.992

3. Genome-wide analysis of microRNAs expression profiling in patients with primary IgA nephropathy.

Authors: Kuibi Tan; Jing Chen; Wuxian Li; Yuyu Chen; Weiguo Sui; Yang Zhang; Yong Dai
Journal: Genome Date: 2013-03-11 Impact factor: 2.166

4. MicroRNAs in regulation of pluripotency and somatic cell reprogramming: small molecule with big impact.

Authors: Tian Wang; San-bao Shi; Hong-ying Sha
Journal: RNA Biol Date: 2013-07-23 Impact factor: 4.652

Review 5. Podocyte injury and its consequences.

Authors: Michio Nagata
Journal: Kidney Int Date: 2016-03-19 Impact factor: 10.612

Review 6. Cell adhesion: the molecular basis of tissue architecture and morphogenesis.

Authors: B M Gumbiner
Journal: Cell Date: 1996-02-09 Impact factor: 41.582

7. Production of Gene-Corrected Adult Beta Globin Protein in Human Erythrocytes Differentiated from Patient iPSCs After Genome Editing of the Sickle Point Mutation.

Authors: Xiaosong Huang; Ying Wang; Wei Yan; Cory Smith; Zhaohui Ye; Jing Wang; Yongxing Gao; Laurel Mendelsohn; Linzhao Cheng
Journal: Stem Cells Date: 2015-05 Impact factor: 6.277

8. Collagen type IV nephropathy: genetic heterogeneity examinations in affected Hungarian families.

Authors: Emoke Endreffy; Zoltán Ondrik; Béla Iványi; Zoltán Maróti; Csaba Bereczki; Ibolya Haszon; Zsuzsanna Györke; Endre Worum; Krisztina Németh; Csaba Rikker; Zsuzsanna Ökrös; Sándor Túri
Journal: Mol Cell Probes Date: 2010-10-14 Impact factor: 2.365

Review 9. Specificity and nonspecificity in RNA-protein interactions.

Authors: Eckhard Jankowsky; Michael E Harris
Journal: Nat Rev Mol Cell Biol Date: 2015-08-19 Impact factor: 94.444

Review 10. MicroRNA and nephropathy: emerging concepts.

Authors: Arthur Ck Chung; Xueqing Yu; Hui Y Lan
Journal: Int J Nephrol Renovasc Dis Date: 2013-09-25

3 in total

1. HMGB-1 and TGFβ-1 highlight immuno-inflammatory and fibrotic processes before proteinuria onset in pediatric patients with Alport syndrome.

Authors: R Chimenz; V Chirico; P Basile; A Carcione; G Conti; P Monardo; A Lacquaniti
Journal: J Nephrol Date: 2021-03-24 Impact factor: 3.902

2. Deciphering the link between Diabetes mellitus and SARS-CoV-2 infection through differential targeting of microRNAs in the human pancreas.

Authors: E Pathak; R Mishra
Journal: J Endocrinol Invest Date: 2021-10-20 Impact factor: 5.467

Review 3. Non-Coding RNAs in Hereditary Kidney Disorders.

Authors: Julie Xia Zhou; Xiaogang Li
Journal: Int J Mol Sci Date: 2021-03-16 Impact factor: 5.923

3 in total