| Literature DB >> 35163149 |
Kseniya Khamina1, Andreas B Diendorfer1, Susanna Skalicky1, Moritz Weigl1, Marianne Pultar1, Teresa L Krammer1, Catharine Aquino Fournier2, Amy L Schofield3, Carolin Otto4, Aaron Thomas Smith5, Nina Buchtele6,7, Christian Schoergenhofer6, Bernd Jilma6, Bernhard J H Frank8, Jochen G Hofstaetter8,9, Regina Grillari10,11,12, Johannes Grillari10,12,13, Klemens Ruprecht4, Christopher E Goldring3, Hubert Rehrauer2, Warren E Glaab14, Matthias Hackl1.
Abstract
The plasma levels of tissue-specific microRNAs can be used as diagnostic, disease severity and prognostic biomarkers for chronic and acute diseases and drug-induced injury. Thereby, the combination of diverse microRNAs into biomarker signatures using multivariate statistics seems especially powerful from the perspective of tissue and condition specific microRNA shedding into the plasma. Although next-generation sequencing (NGS) technology enables one to analyse circulating microRNAs on a genome-scale level, it suffers from potential biases (e.g., adapter ligation bias) and lacks absolute transcript quantitation as well as tailor-made quality controls. In order to develop a robust NGS discovery assay for genome-scale quantitation of circulating microRNAs, we first evaluated the sensitivity, repeatability and ligation bias of four commercially available small RNA library preparation protocols. The protocol from RealSeq Biosciences was selected based on its performance and usability and coupled with a novel panel of exogenous small RNA spike-in controls to enable quality control and absolute quantitation, thus ensuring comparability of data across independent NGS experiments. The established microRNA Next-Generation-Sequencing Discovery Assay (miND) was validated for its relative accuracy, precision, analytical measurement range and sequencing bias and was considered fit-for-purpose for microRNA biomarker discovery. Summarized, all these criteria were met, and thus, our analytical platform is considered fit-for-purpose for microRNA biomarker discovery from biofluids in the setting of any diagnostic, prognostic or patient stratification need. The established miND assay was tested on serum, cerebrospinal fluid (CSF), synovial fluid (SF) and extracellular vesicles (EV) extracted from cell culture medium of primary cells and proved its potential to be used across different sample types.Entities:
Keywords: biomarkers; drug safety; microRNA; next-generation sequencing; small RNA-sequencing; spike-in; toxicology
Mesh:
Substances:
Year: 2022 PMID: 35163149 PMCID: PMC8835905 DOI: 10.3390/ijms23031226
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Study design. (A) Four library preparation protocols were selected in order to evaluate their performance and usability. Six technical replicates from a pool of plasma samples and aliquots of miRXplore Universal Reference were analysed. (B) The NGS assay utilizing mind spike-ins was designed using the following steps: (1) RNA extraction; (2) adding of the miND spike-in—a pool of 7 oligonucleotides, each containing a 13 nt core sequence flanked by a set of 4 randomized nucleotides on the 5′ and 3′ ends, that were mixed in the defined ratio; (3) preparation of the NGS libraries according to the selected protocols; (4) data analyses and data normalisation based on the miND spike-in concentration range.
Figure 2A systematic comparison of the selected small RNA library preparation protocols. (A) Unsupervised clustering analyses of plasma and miRXplore samples based on RPM data from 739 microRNAs. (B) Sequencing bias. RPM values were averaged across six technical replicates per protocol. The fold change (FC) compared to the median RPM was calculated for each microRNA and sample, and is presented on a log10 scale. Medians with interquartile range are indicated. The percentage of microRNAs within 0.1- to 10-fold from the median (=1) was calculated and depicted above the x-axis. (C) Pearson correlation coefficient matrix was generated from the data shown in (B). (D) Venn diagram representing the overlap between distinct microRNAs detected with four selected protocols in the miRXplore samples. (E) Venn diagram representing the overlap between distinct microRNAs detected with four selected protocols in the plasma samples.
The miND spike-in sequences. A 13-nucleotide core sequence is flanked by four randomized nucleotides the 5′ and 3′ ends.
| Oligo | Sequence (5′–3′) | Molar Amount (amol) |
|---|---|---|
| I | (N)(N)(N)(N)ACGAUCGGCUCUA(N)(N)(N)(N) | 50 |
| K | (N)(N)(N)(N)UGAACGUCCGUAC(N)(N)(N)(N) | 10 |
| M | (N)(N)(N)(N)UCUCGCGCGCGUU(N)(N)(N)(N) | 2.5 |
| N | (N)(N)(N)(N)CGAGUAAUGAACG(N)(N)(N)(N) | 1.5 |
| H | (N)(N)(N)(N)GCUACACACGUCG(N)(N)(N)(N) | 0.1 |
| C | (N)(N)(N)(N)UAUUCGCGGUGAC(N)(N)(N)(N) | 0.01 |
| E | (N)(N)(N)(N)ACCUCCGUUUACG(N)(N)(N)(N) | 0.005 |
Figure 3Testing of the miND spike-in. Aliquots of a plasma pool were processed either with Qiagen or Promega RNA extraction protocols in triplicates. The generated RNA samples were analysed with the miND assay. (A) Scatter plot of relative miND spike-in (RC) compared to absolute miND spike-in levels (molecules detected per microliter of RNA) in Promega-extracted sample (Replicate 01). Pearson correlation coefficient r as well as the line that represents a linear model derived from the plotted values were calculated. (B) Distribution of RPM values of the endogenous microRNAs in the Qiagen- and Promega-extracted samples (3 replicates per protocol). The miND spike-ins with the highest (I, 50 amol) and the lowest (E, 0.005 amol) concentrations were indicated with red and green lines on the plot. (C) Distribution of absolute concentrations (molecules per microliter of RNA) of the endogenous microRNAs in the Qiagen- and Promega-extracted samples (3 replicates per protocol). The average values of molecules/µL for each sample were calculated, and an unpaired t-test comparison for Qiagen- and Promega-extracted samples was performed (p-value < 0.0001, p-value summary ****).
Figure 4Observed vs. expected absolute concentrations in (A) amol and (B) RPM from 37 microRNAs that were significantly up-regulated in APAP compared to NHV samples, presented on log10 axes. The Pearson correlation coefficient of dilution series was calculated for 37 microRNAs and transformed to z-score using Fisher transformation. (C) The coefficient of variation for each microRNA from 9 technical replicates was calculated and plotted against the absolute concentrations (amol). The red line indicates the fraction of microRNAs with less than 50% CV. (D) Distribution of RPM values of the endogenous microRNAs for 9 technical replicates (RPM > 0). The miND spike-in with the highest (I, 50 amol) and the lowest (E, 0.005 amol) concentrations indicated with red and green lines. The number of detected microRNAs is indicated above each sample. (E) Observed vs. expected increase in absolute concentrations (amol) for hsa-miR-137-3p (under-represented), hsa-miR-520e-3p (normal representation) and hsa-miR-630 (over-represented). The synthetic microRNAs were spiked in the RNA isolated from a pool of NHV samples at 5, 20 and 80 amol concentrations.
Figure 5Distribution or read count, RPM values and concentrations for endogenous microRNA detected in 3 samples for plasma, serum, cerebrospinal fluid (CSF), synovial fluid (SF) and extracellular vesicles (EV) extracted from cell culture medium of primary human cells are presented on log10 scale. (A) microRNA RC values are demonstrated. The miND spike-ins with the highest (I, 50 amol) and the lowest (E, 0.005 amol) concentrations are indicated with red and green lines in the plot. (B) microRNA RPM values are shown together with median (dark blue line) and interquartile range (light blue lines). (C) microRNA copy numbers per µL of input RNA are shown together with median (dark blue line) and interquartile range (light blue lines). (D) microRNA copy numbers per µL of input RNA for a CNS enriched miRNA, miR-124-3p.
The median (Q3–Q1) and inter-quartile range (IQR) of microRNA concentrations (molecules/µL) in 1 µL of analysed biofluids. The number of detected microRNAs and the 5 most abundant microRNAs on average for 3 samples per biofluid are indicated.
| Biofluid | Sample | Median (Q3–Q1) | IQR | Number of Detected MicroRNAs | The 5 Most Abundant MicroRNAs on Average |
|---|---|---|---|---|---|
| Plasma | 1 | 6 (41.3–1.7) | 39.6 | 668 | miR-451a, miR-16-5p, miR-486-5p, miR-92a-3p, miR-103a-3p |
| 2 | 4.1 (33.6–0.8) | 32.8 | 707 | ||
| 3 | 3.8 (31.2–0.9) | 30.3 | 658 | ||
| Serum | 1 | 10.2 (76.7–2.6) | 74.1 | 900 | miR-451a, miR-16-5p, miR-92a-3p, miR-486-5p, miR-19b-3p |
| 2 | 14.1 (134.6–3.2) | 131.4 | 925 | ||
| 3 | 21.5 (131.1–7.2) | 123.9 | 600 | ||
| Synovial Fluid | 1 | 9.9 (30.0–1.2) | 57.6 | 548 | miR-21-5p, miR-23a-3p, miR-451a, miR-221-3p, miR-223-3p |
| 2 | 14.4 (45.3–1.8) | 87.0 | 530 | ||
| 3 | 6.0 (19.5–0.8) | 37.5 | 552 | ||
| Cerebrospinal Fluid | 1 | 2.7 (12.2–0.9) | 11.3 | 387 | miR-21-5p, miR-204-5p, miR-145-5p, miR-99a-5p, miR-221-3p |
| 2 | 3.8 (16.4–0.9) | 15.5 | 388 | ||
| 3 | 4.5 (23.1–1.5) | 21.6 | 467 |