| Literature DB >> 24816817 |
Kevin C Miranda1, Daniel T Bond1, Joshua Z Levin2, Xian Adiconis2, Andrey Sivachenko2, Carsten Russ2, Dennis Brown1, Chad Nusbaum2, Leileata M Russo1.
Abstract
Intact RNA from exosomes/microvesicles (collectively referred to as microvesicles) has sparked much interest as potential biomarkers for the non-invasive analysis of disease. Here we use the Illumina Genome Analyzer to determine the comprehensive array of nucleic acid reads present in urinary microvesicles. Extraneous nucleic acids were digested using RNase and DNase treatment and the microvesicle inner nucleic acid cargo was analyzed with and without DNase digestion to examine both DNA and RNA sequences contained in microvesicles. Results revealed that a substantial proportion (∼87%) of reads aligned to ribosomal RNA. Of the non-ribosomal RNA sequences, ∼60% aligned to non-coding RNA and repeat sequences including LINE, SINE, satellite repeats, and RNA repeats (tRNA, snRNA, scRNA and srpRNA). The remaining ∼40% of non-ribosomal RNA reads aligned to protein coding genes and splice sites encompassing approximately 13,500 of the known 21,892 protein coding genes of the human genome. Analysis of protein coding genes specific to the renal and genitourinary tract revealed that complete segments of the renal nephron and collecting duct as well as genes indicative of the bladder and prostate could be identified. This study reveals that the entire genitourinary system may be mapped using microvesicle transcript analysis and that the majority of non-ribosomal RNA sequences contained in microvesicles is potentially functional non-coding RNA, which play an emerging role in cell regulation.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24816817 PMCID: PMC4015934 DOI: 10.1371/journal.pone.0096094
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Urinary microvesicle RNA integrity and alignment to the genome.
a) RNA isolated from urinary microvesicles was shown to be of high integrity with prominent 18S and 28S rRNA peaks when analyzed using the Agilent Bioanalyzer. Red trace 1.7 ng RNA with DNase, Blue trace 2.2 ng RNA without DNase. b) Flow chart outlining sample processing. An initial RNase and DNase digestion was carried out to remove extraneous nucleic acids co-isolating with the microvesicle pellet. To determine the proportion of potential DNA inside the microvesicles the extracted RNA was divided into two groups; No DNase digestion (-DNase), which yields RNA+DNA and DNase digested (+DNase) which yields RNA. c) Both the -DNase and the +DNase samples showed a similar trend in read distribution with ∼88% of reads mapping to rRNA, ∼4% mapping to genes and ∼6% mapping to ncRNA. A smaller proportion of reads (<0.1%) mapped to mitochondrial genome. Approximately 2% of reads failed to hit the human genome. (mito – mitochondrial).
Alignment of microvesicle non-coding RNA to repeat regions.
| +DNase treated | -DNase treated | ||||
| Repeat Class | # known | Hits | % hits | Hits | % hits |
| SINE | 49 | 36 | 73% | 40 | 82% |
| LINE | 146 | 102 | 70% | 114 | 78% |
| LTR | 504 | 200 | 40% | 264 | 52% |
| DNA | 204 | 81 | 39% | 104 | 51% |
| Simple_repeat | 311 | 84 | 27% | 105 | 34% |
| Low_complexity | 10 | 10 | 10% | 10 | 10% |
| Satellite | 24 | 8 | 33% | 14 | 58% |
| RNA | 84 | 16 | 19% | 27 | 32% |
| RC | 10 | 6 | 60% | 8 | 80% |
| Unknown | 29 | 2 | 7% | 4 | 14% |
Repeat class was defined by the UCSC repeatMasker dataset [12] and the number of known repeats listed under ‘# known’. All repeat classes were found in microvesicles and the number of hits and percentage hits are shown for each sample, with DNase treatment (+DNase) (RNA sample) and without DNase treatment (-DNase) (RNA+DNA sample). SINE-Short interspersed nuclear elements (which include ALUs), LINE-Long interspersed nuclear elements, LTR-Long terminal repeat elements (which include retrotransposons), DNA repeat elements, Simple repeats (micro-satellites), Low complexity repeats, Satellite repeats, RNA repeats (which includes RNA, rRNA – ribosomal RNA, scRNA – small cytoplasmic RNA, snRNA – small nuclear RNA, srpRNA – signal recognition particle RNA, tRNA – transfer RNA), Other repeats (including Rolling Circle (RC)) were detected within the non-ORF reads of microvesicles and presented as hits and % hits detected.
Top 10 expressed ncRNA in microvesicles (+DNase).
| No. Reads aligning | RNAdb ID | RNAdb definition | Accession ID |
| 855908 | LIT1626 | Homo sapiens non-coding chimeric transcript hc9(5)h-2-1/4 (UM 9(5)) | AY072609 |
| 69064 | LIT1868 | Human 7S L gene, complete | M20910 |
| 2722 | LIT3330 | Homo sapiens ret finger protein-like 3 antisense (RFPL3S) on chromosome 22. | NR_001450 |
| 2505 | LIT3569 | Homo sapiens mRNA sequence | AY927568 |
| 2484 | LIT3497 | Homo sapiens mRNA sequence | AY927481 |
| 1668 | LIT1867 | Homo sapiens 7S RNA | V00477 |
| 900 | LIT2062 | Homo sapiens ribosomal protein, large, P0 pseudogene 2 (RPLP0P2), misc RNA | XR_000076 |
| 621 | LIT1834 | Homo sapiens non-small cell lung carcinoma noncoding RNA, partial sequence | AY166681 |
| 520 | LIT2056 | Homo sapiens EMX2OS mRNA, complete sequence. | AY117413 |
| 505 | LIT2059 | Homo sapiens general transcription factor II, i, pseudogene 1 (GTF2IP1), misc RNA | XR_000139 |
The most abundantly expressed ncRNA as defined by the RNAdb [14] are listed by number of reads present in the microvesicle population, the RNAdb ID, ncRNA name and accession ID number as identified in the DNase treated sample.
Top 10 expressed ncRNA in microvesicles (-DNase).
| No. Reads aligning | RNAdb ID | RNAdb definition | Accession ID |
| 914237 | LIT1626 | Homo sapiens non-coding chimeric transcript hc9(5)h-2-1/4 (UM 9(5)) | AY072609 |
| 52467 | LIT1868 | Human 7S L gene, complete | M20910 |
| 2972 | LIT3330 | Homo sapiens ret finger protein-like 3 antisense (RFPL3S) on chromosome 22. | NR_001450 |
| 2073 | LIT3569 | Homo sapiens mRNA sequence | AY927568 |
| 1620 | LIT3497 | Homo sapiens mRNA sequence | AY927481 |
| 1608 | LIT2062 | Homo sapiens ribosomal protein, large, P0 pseudogene 2 (RPLP0P2), misc RNA | XR_000076 |
| 1393 | LIT1867 | Homo sapiens 7S RNA | V00477 |
| 1033 | LIT1834 | Homo sapiens non-small cell lung carcinoma noncoding RNA, partial sequence | AY166681 |
| 890 | LIT2059 | Homo sapiens general transcription factor II, i, pseudogene 1 (GTF2IP1), misc RNA | XR_000139 |
| 843 | LIT2056 | Homo sapiens EMX2OS mRNA, complete sequence. | AY117413 |
The most abundantly expressed ncRNA as defined by the RNAdb [14] are listed by number of reads present in the microvesicle population, the RNAdb ID, ncRNA name and accession ID number as identified in the sample without DNase treatment.
Figure 2Chromosomal alignment of genes.
a) Reads aligning to coding genes were mapped back to the human chromosomes and represented as the % reads aligning to each chromosome. All chromosomes of the human genome are represented within the urinary microvesicles including the mitochondrial chromosome (M). Solid bars – +DNase sample, lined bars – -DNase sample. b) Correlation of reads aligning to coding genes in the +DNase and -DNase samples suggests little if any DNA was present (R2 = 0.9664).
Figure 3Mapping of coding genes to the genitourinary epithelium.
a) Cartoon of the genitourinary system, highlighting specialized regions including the glomerulus (1), proximal tubules (2), medullary thick ascending limb (3), distal convoluted tubule (4), collecting duct (5), bladder (6) and prostate (7). b) The number of deep sequencing reads normalized to gene length were graphed to produce a transcriptional profile for each of the sub-regions of the genitourinary tract (solid bars – +DNase, lined bars – -DNase).
Figure 4Analysis of the top 50 genes found in urinary exosomes.
The top 50 most highly expressed genes were determined and grouped in terms of function or name. a) In the +DNase sample 76% of genes were related to ribosomal proteins and a further 6% related to translation regulation. Other genes related to ferritin, prostate specific genes, cell regulation and novel genes were also featured. A similar distribution was also seen for the -DNase sample (b). In both instances the TPT1 gene was the most abundantly expressed gene.