| Literature DB >> 29089558 |
Maria G Amorim1, Renan Valieris2, Rodrigo D Drummond2, Melissa P Pizzi1, Vanessa M Freitas3, Rita Sinigaglia-Coimbra4, George A Calin5, Renata Pasqualini6, Wadih Arap6, Israel T Silva2,7, Emmanuel Dias-Neto8,9, Diana N Nunes10.
Abstract
Extracellular vesicles (EVs) are key mediators of intercellular communication. Part of their biological effects can be attributed to the transfer of cargos of diverse types of RNAs, which are promising diagnostic and prognostic biomarkers. EVs found in human biofluids are a valuable source for the development of minimally invasive assays. However, the total transcriptional landscape of EVs is still largely unknown. Here we develop a new method for total transcriptome profiling of plasma-derived EVs by next generation sequencing (NGS) from limited quantities of patient-derived clinical samples, which enables the unbiased characterization of the complete RNA cargo, including both small- and long-RNAs, in a single library preparation step. This approach was applied to RNA extracted from EVs isolated by ultracentrifugation from the plasma of five healthy volunteers. Among the most abundant RNAs identified we found small RNAs such as tRNAs, miRNAs and miscellaneous RNAs, which have largely unknown functions. We also identified protein-coding and long noncoding transcripts, as well as circular RNA species that were also experimentally validated. This method enables, for the first time, the full spectrum of transcriptome data to be obtained from minute patient-derived samples, and will therefore potentially allow the identification of cell-to-cell communication mechanisms and biomarkers.Entities:
Mesh:
Substances:
Year: 2017 PMID: 29089558 PMCID: PMC5663969 DOI: 10.1038/s41598-017-14264-5
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Summary of the recent reports employing RNA sequencing analysis of EVs.
| Reference | Sample type | Number of EVs samples | EVs isolation methodology | rRNA depletion | Input | RNA-Seq methodology | Raw reads (sample) | Mapped reads (sample) | rRNA | # Molecules identified | Major categories of molecules identified | Technical validation |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Nolte-’t Hoen | CCM from mouse DC-T cell co-culture | 1 | Ultracentrifugation | No | 450 ml CCM | Small RNA (15–70 nt) size-selection on native 6% gradient PAGE gel Sequencing on Illumina (single-end 35 cycles) | 28.8 M | 27.5 M | ~30% | NA | tRNA repeat, Simple repeat LINE, rRNA, vRNA, Protein coding, SRP-RNA, Y-RNA | qPCR for miR-29a, miR-155, miR-191, Y-RNA, SRP-RNA |
| Huang | Human plasma | 3 = 14 libraries | ExoQuick | No | 250 μl plasma/2ng RNA | Small RNA (20–40 nt) size-selection on native 5% acrylamide gel Sequencing on Illumina HiSeq. 2000 (single-end) | 7.27 M | 4.04 M | 9.16% | 593 miRNAs | miRNA (76.20% of all mappable reads), ribosomal RNA (9.16%), long noncoding RNA (3.36%), piwi-interacting RNA (1.31%), transfer RNA (1.24%), small nuclear RNA (0.18%), and small nucleolar RNA (0.01%); fragments of coding sequence (1.36%), 5′ untranslated region (0.21%), and 3′ untranslated region (0.54%) | qPCR for miR-92a-3p, miR-191-3p, miR-26b-5p |
| Jenjaroenpun | CCM from MDA-MB-231 and MDA-MB-436, human metastatic breast cancer cell lines | 2 | Ultracentrifugation | RiboMinus (Thermo) did not work, authors argue due to fragmented rRNA | 48 h CCM/ 200ng RNA | Whole transcriptome library with RNAse III fragmentation Considered only reads ≥ 20 bp Sequencing on Ion Torrent PGM | 3.3 M | 3 M | 97% | 16,086 transcripts (RPKM ≥ 1) | small nucleolar RNA, small nuclear RNA, Mt_tRNA, microRNA | qPCR for GAPDH, EEF1A1, FTH1, FTL, RAB13, RPPH1, RPL28 |
| Schageman | CCM from HeLa cells and human serum | 3 = 7 libraries | Total exosome RNA and protein isolation kit (Thermo Fisher) | No | 4 ml serum/~2ng RNA | Small RNA bead-based Sequencing on Ion Torrent PGM Considered only reads > 17 bp | 5–6 M | 90–98% | ~15–40% | NA | mRNA, rRNA, tRNA, miRNA, ncRNA | serum EVs - qPCR for has-mir-1281, has-mir-4257, has-mir-451 |
| Li | Human serum and urine | 4 = 8 libraries | Total exosome RNA and protein isolation kit (Thermo Fisher) | No | 4 ml serum/10 ml urine/ ~2ng RNA | Small RNA bead-based Sequencing on Ion Torrent PGM | 5–6 M | 90–98% | serum: 5–30% urine: 30–60% | NA | miRNA, rRNA, tRNA, mRNA, piRNA | NA |
| Yuan | Human plasma | 192 | ExoQuick | No | 2–10ng RNA | Small RNA (20–40 nt) size-selection on native 5% acrylamide gel Sequencing on Illumina HiSeq. 2000 | 12.6 M | 5.4 M | 0.7% | 3,387 RNAs | miRNAs (~40.4%), piwiRNAs (~40.0%), pseudo-genes (~3.7%), lncRNAs (~2.4%), tRNAs (~2.1%), and mRNAs (~2.1%) | No |
| Lefebvre | CCM from A431 epidermoid carcinoma and HepG2 hepatocellular carcinoma cell lines | 2 | Ultracentrifugation | No | 80 ml CCM/ 50ng RNA | No size selection was performed - only analysed RNAs > 50 nt Sequencing on Illumina HiSeq. 2000 | NA | 6.88 M | 91.19% | 7,361 RNAs (FPKM ≥ 5) | miscRNA, mRNA, lincRNA, snRNA, snoRNA, antisense | qPCR for TPT1, PABPC1, ATF4, PTBP1, HDGF, G3BP1,BRAF |
| San Lucas | Human pleural effusion and plasma | 3 | Ultracentrifugation | No | 800 ml pleural effusion 15 ml plasma | mRNA amplification using oligo dT primers Sequencing on Illumina HiSeq. 2500 | NA | 498 M | NA | NA | Protein coding | No |
| Quek | CCM from GT1–7 mouse hypothalamic neuronal cell lines | 5 = 23 libraries (4 gradient fractions, 1 UCexo) | Ultracentrifugation and Optiprep gradient | No | 96 h CCM/ 20ng RNA | Small RNA bead-based Sequencing on Ion Torrent PGM | NA | 1 M | 0.54% | 515 small RNAs | fragments of tRNA (range 41.6–67.0%), fragments of RNA repeat elements (~42.26%), miRNA (range 0.48–2.11%), protein-coding mRNA (range 0.49–1.82%), piRNA (range 0.46–1.71%), snRNA (range 0.11–0.23.%), snoRNA (range 0.04–0.10%), rRNA (range 0.20–0.88%) | qPCR and dPCR for let-7b and miR-342-3p |
M: million, nt: nucleotides, bp: base pairs, CCM: cell conditioned medium, EVs: extracellular vesicles, NA: information not available, RPKM: reads per kilobase per million mapped reads, FPKM: fragments per kilobase of transcript per million mapped reads, RPM: reads per million, qPCR: quantitative real time PCR, UCexo: exosomes from ultracentrifugation, dPCR: digital PCR.
Figure 1Scheme of the bioinformatics analysis pipeline.
Figure 2Characterization of EVs isolated from plasma by ultracentrifugation. (A) Whole-mount transmission electron micrograph of EVs displaying the characteristic “cup-shaped” morphology. (B) NanoSight quantification plot of EVs concentration in function of size shows that the majority of EVs are less than 250 nm in the healthy control samples (n = 5). (C) Western blot of the vesicle-associated markers CD63, FLOTILLIN, HSP70 and RAB27B from a pool of plasma samples (n = 4) and corresponding Ponceau staining of membrane before blocking.
Figure 3Bioanalyzer electropherogram analysis of fragmentation time-points of leukocyte RNA. Arbitrary fluorescence units (FU) are plotted as a function of RNA size in nucleotides (nt). (A) Analysis of fragmentation time-points 15–180 min with Agilent RNA 6000 Pico Kit shows the expected ribosomal RNA peaks in non-fragmented sample (in red), whereas after all fragmentation time-points the majority of RNAs are below 200 nt in size. (B) Analysis of fragmentation time-points from 45 to 420 min with Agilent Small RNA Kit shows that the majority of RNAs are smaller than 40 nt in size for all time-points, and that a plateau is reached after approximately 180 min, as no further reduction in size is observed with longer fragmentation times. For the longer fragmentation periods, reduced amounts of RNA (marked with a star) were used to better simulate the enzymatic kinetics in the presence of less (but still detectable) RNA, and the plateau region is still the same.
Summary of sequencing analysis of the total transcriptome of EVs isolated from the plasma of healthy controls.
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 |
|---|---|---|---|---|---|---|---|---|
| Sample | Raw reads | Reads after FastQ screen filter | % of filtered reads | % of rRNA filtered reads | Reads 12–25 nt used for miRNA analysis (%) | miRBase mapped reads (%) | Reads used for other RNA analysis (%) | Ensembl mapped reads (%) |
| C1 | 18,085,235 | 1,653,832 | 90.9 | 75.9 | 458,186 (27.7) | 33,157 (7.2) | 1,620,675 (98.0) | 576,866 (35.6) |
| C3 | 7,843,315 | 727,633 | 90.7 | 73.4 | 221,985 (30.5) | 23,594 (10.6) | 704,039 (96.8) | 170,592 (24.2) |
| C13 | 13,830,831 | 1,148,352 | 91.7 | 74.8 | 291,302 (25.4) | 14,734 (5.1) | 1,133,618 (98.7) | 358,268 (31.6) |
| C15 | 10,111,670 | 1,023,795 | 89.9 | 73.8 | 300,569 (29.4) | 25,465 (8.5) | 998,330 (97.5) | 232,061 (23.2) |
| C16 | 10,936,417 | 1,509,424 | 86.2 | 69.3 | 390,705 (25.9) | 40,801 (10.4) | 1,468,623 (97.3) | 526,181(35.8) |
| Average | 12,161,494 | 1,212,607 | 89.9 | 73.4 | 332,549 (27.8) | 27,550 (8.4) | 1,185,057 (97.7) | 372,793 (30.1) |
The % in columns 4 and 5 are relative to column 2; the % in columns 6 and 8 are relative to column 3; the % in column 7 is relative to column 6, and the % in column 9 is relative to column 8.
Summary of the total number of distinct transcripts identified for different RNA categories in EVs isolated from the plasma of healthy controls.
| Sample | Protein coding | Pseudogene | Long noncoding | Short noncoding | |||
|---|---|---|---|---|---|---|---|
| miRNA mature (miRBase) | tRNA (GtRNAdb) | misc_RNA | other* | ||||
| C1 | 7,824 | 91 | 220 | 765 | |||
| 181 | 220 | 218 | 146 | ||||
| C3 | 5,461 | 34 | 101 | 554 | |||
| 163 | 172 | 110 | 109 | ||||
| C13 | 7,937 | 83 | 207 | 738 | |||
| 157 | 259 | 189 | 133 | ||||
| C15 | 6,134 | 44 | 131 | 626 | |||
| 162 | 217 | 133 | 114 | ||||
| C16 | 8,660 | 171 | 211 | 887 | |||
| 216 | 235 | 254 | 182 | ||||
| Average | 7,203 | 85 | 174 | 714 | |||
| 176 | 221 | 181 | 137 | ||||
*The category ‘other’ includes: miRNA precursors (miRBase), miRNA (Ensembl), Mt_rRNA, Mt_tRNA, rRNA, snoRNA, snRNA.
Figure 4Distribution of the RNA classes identified in the sequencing analysis of the total transcriptome of EVs isolated from the plasma of healthy controls (average values for five samples). Pie charts show the distribution of mapped reads according to gene biotype, as defined by Ensembl, and sequencing coverage. Distribution of reads is shown for the four major Ensembl biotypes (A,D); the short noncoding biotype (B,E) and the misc_RNA biotype (C,F). Genes considered were covered by at least 2 reads (A–C), or by a minimum of 10 reads (D–F).
Figure 5Quantitative RT-PCR analysis of two miRNAs (miR-223-3p and let-7g-5p) identified by sequencing. The treatment of intact EVs with RNase before RNA extraction minimally altered Ct values of both miRNAs, strongly suggesting that these miRNAs were derived from the EVs-cargo and thereby protected from digestion by the membrane bilayer, whereas there was a large increase in Ct values when lysed EVs were similarly treated. Undetermined values (no detection by qRT-PCR) were assigned as Ct = 35. (A,B) Bar-graphs for miR-223-3p and let-7g-5p, showing Ct values for each treatment per sample; (C,D) Box-plots for miR-223-3p and let-7g-5p, showing Ct values per treatment (dots are results from each one of the five samples).