| Literature DB >> 34746868 |
Vijit Saini1,2, Mugdha V Joglekar1, Wilson K M Wong1, Guozhi Jiang3, Najah T Nassif2, Ann M Simpson2, Ronald C W Ma3, Louise T Dalgaard4, Anandwardhan A Hardikar1,4.
Abstract
MicroRNAs (miRNAs) are elements of the gene regulatory network and manipulating their abundance is essential toward elucidating their role in patho-physiological conditions. We present a detailed workflow that identifies important miRNAs using a machine learning algorithm. We then provide optimized techniques to validate the identified miRNAs through over-expression/loss-of-function studies. Overall, these protocols apply to any field in biology where high-dimensional data are produced. For complete details on the use and execution of this protocol, please refer to Wong et al. (2021a).Entities:
Keywords: Bioinformatics; Cell culture; Computer sciences; Gene Expression; Molecular Biology
Mesh:
Substances:
Year: 2021 PMID: 34746868 PMCID: PMC8554629 DOI: 10.1016/j.xpro.2021.100910
Source DB: PubMed Journal: STAR Protoc ISSN: 2666-1667
Figure 1Representative image of data arrangement prior to analysis
An example of how your dataset should be arranged; with the samples in rows and variables in columns. The (Biological) Group Name, Sample Name and dependent variable (dv) are in columns A, B and C respectively. This is followed by the independent variables (miRNAs) (v4…v(n)) starting from column D. Cycle threshold (Ct)-values for each miRNA (v4-v(n)) are presented in this example.
Figure 2Representative image of the output result
Figure presents the frequency table produced by our penalized logistic regression with bootstrapping. The frequency table can be used to identify the most important discriminatory miRNAs. For example, v490 appears with >96% frequency, suggesting it is one of the most important miRNAs in discriminating dv “0” and dv “1”.
Figure 3Optimization for puromycin concentration
Untransduced hIPCs with increasing puromycin concentrations (0–10 μg/mL) in the serum-containing medium. Cells were observed every day and imaged on d7. Images of cells with concentrations of >1 μg/mL are not shown since all cells were killed off. Scale bar 100μm.
Figure 4Transduced human islet-derived progenitor cells
hIPCs transduced with doxycycline-inducible lentiviral vectors expressing green fluorescence protein (TurboGFP) in serum-containing medium with either no doxycycline (A and C) or 200 ng/mL doxycycline (B and D). Scale bar 200μm.
Figure 5Images of two output results produced on the same dataset without the same number in set.seed()
Figure 6Error message if the directory/file name is incorrect
Figure 7Error message if the output directory location is incorrect
Figure 8Error message if dataset contains non-numeric values
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Human islet cells | Derived from human islets obtained from islet isolation centres | ( |
| PANC1 cells | Sigma-Aldrich | 87092802 |
| Potassium chloride (KCl) | VWR | 26760.295 |
| Potassium dihydrogen phosphate (KH2PO4) | VWR | 26925.295 |
| Sodium bicarbonate (NaHCO3) | Sigma-Aldrich | 56014 |
| Sodium chloride (NaCl) | Sigma-Aldrich | 59888 |
| Disodium hydrogen phosphate (Na2HPO4) | VWR | 28026.292 |
| D-(+)-Glucose | Sigma-Aldrich | G7021 |
| EDTA | VWR | VWRC20301.290 |
| Trypsin (1:250), powder | Thermo Fisher Scientific | 27250018 |
| Medium (CMRL-1066) | Thermo Fisher Scientific | 11530037 |
| Medium (DMEM, high glucose) | Thermo Fisher Scientific | 11965092 |
| Fetal bovine serum, certified, United States | Thermo Fisher Scientific | 16000044 |
| GlutaMAX Supplement | Thermo Fisher Scientific | 35050061 |
| Penicillin-Streptomycin (5,000 U/mL) | Thermo Fisher Scientific | 15070063 |
| hEGF recombinant, expressed in E. coli, lyophilized powder, suitable for cell culture | Sigma-Aldrich | E9644 |
| Polybrene (Hexadimethrine bromide) | Sigma-Aldrich | H9268 |
| Doxycycline | Sigma-Aldrich | D9891 |
| Puromycin Dihydrochloride | Thermo Fisher Scientific | A1113803 |
| SMARTvector inducible third-generation lentiviral vectors (Non-Target Control) | Dharmacon | VSC6570 |
| SMARTvector inducible third-generation lentiviral vectors (microRNA) | Dharmacon | VSH6906 |
| TRIzol reagent | Thermo Fisher Scientific | 15596026 |
| miRCURY LNA microRNA Power Inhibitor | QIAGEN | 339131 |
| Cumate-inducible promoters (alternate to doxycycline-inducible SMARTvector™) | System Bioscience | |
| BLOCK-iT™ Inducible H1 RNAi Entry Vector Kit (Do-It-Yourself shRNA-inducible vector as an alternate to doxycycline-inducible SMARTvector™) | Thermo Fisher Scientific | |
| R software (The R project) | ver. 3.6.2 | |
| Rstudio software | Ver 1.3.1093 (used here. Most recent version of Rstudio should also work with R ver 3.6.2). | |
| R script (codes) | n/a | |
| 24-well tissue culture plates (Falcon) | Falcon | 353047 |
| 12-well tissue culture plates (Falcon) | Falcon | 353043 |
| 6-well tissue culture plates (Falcon) | Falcon | 353046 |
| 6-well suspension plates (Greiner) | Greiner | M9062 |
| T25 tissue culture flasks (Falcon) | Falcon | 352097 |
| T75 tissue culture flasks (Falcon) | Falcon | 353136 |
Trypsin: Once prepared, trypsin should be filtered using 0.2 μm filtration system and then aliquoted as 10–12 mL aliquots into 15 mL conical tubes and stored at −20°C (for long term storage).
| Reagent | Final concentration (mM) | Amount (for 500 mL) |
|---|---|---|
| Potassium chloride (KCl) | 5.365 | 0.2 g |
| Monopotassium phosphate (KH2PO4) | 0.441 | 0.03 g |
| Sodium bicarbonate (NaHCO3) | 4.285 | 0.18 g |
| Sodium chloride (NaCl) | 136.893 | 4.0 g |
| Disodium phosphate (Na2HPO4) | 0.704 | 0.05 g |
| D-glucose | 5.551 | 0.5 g |
| EDTA | 1.300 | 0.19 g |
| Trypsin (1:250) | n/a | 1.25 g |
| Autoclave water | n/a | 500 mL |
Medium for PANC1 or hIPCs culture: Once prepared, cell culture media are stored at 4°C (up to three months).
| Reagent | Final concentration | Amount |
|---|---|---|
| Medium | 1 | 450 mL |
| Fetal bovine serum | 10% | 50 mL |
| GlutaMAX™ | 1% | 5 mL |
| Penicillin-Streptomycin (stock 5,000 U/mL) | 1% | 5 mL |
| hEGF (stock 0.1 mg/mL) | 10 ng/mL | 50 μL |
High glucose DMEM medium is for PANC1 cells and CMRL (1066) medium is for islet cells.
hEGF is only included in CMRL medium (i.e., for islet cells).