| Literature DB >> 30159428 |
Rashmi Tripathi1, Pavan Chakraborty2, Pritish Kumar Varadwaj1.
Abstract
Extensive genome-wide transcriptome study mediated by high throughput sequencing technique has revolutionized the study of genetics and epigenetic at unprecedented resolution. The research has revealed that besides protein-coding RNAs, large proportions of mammalian transcriptome includes a heap of regulatory non protein-coding RNAs, the number encoded within human genome is enigmatic. Many taboos developed in the past categorized these non-coding RNAs as ''dark matter" and "junks". Breaking the myth, RNA-seq-- a recently developed experimental technique is widely being used for studying non-coding RNAs which has acquired the limelight due to their physiological and pathological significance. The longest member of the ncRNA family-- long non-coding RNAs, acts as stable and functional part of a genome, guiding towards the important clues about the varied biological events like cellular-, structural- processes governing the complexity of an organism. Here, we review the most recent and influential computational approach developed to identify and quantify the long non-coding RNAs serving as an assistant for the users to choose appropriate tools for their specific research.Entities:
Keywords: Genetic and epigenetic; High throughput sequencing; Long non-coding RNA; RNA-seq; RNA-sequencing; Transcriptome
Year: 2017 PMID: 30159428 PMCID: PMC6096414 DOI: 10.1016/j.ncrna.2017.06.003
Source DB: PubMed Journal: Noncoding RNA Res ISSN: 2468-0540
Comparison between Next Generation Sequencing technique and Microarray technique.
| Next generation sequencing | Microarray |
|---|---|
| Advantages | |
| Species- or transcript-specific probes are not required in the case of NGS technology. | Specific probes are required in the case of microarray technologies. |
| NGS technology computes the sequencing read counts, analyzing the result for studying gene expression. | Gene expression measurement based on array hybridization technology is restricted by background and signal saturation noise. |
| NGS shows increased specificity and sensitivity for wide range of applications. | Specificity and sensitivity is low as compared to NGS for identifying differentially expressed genes. |
| Sequencing coverage depth is high in NGS technology facilitating the detection of rare or single transcripts per cell as well as in identifying weakly expressed genes. | Rare and low-abundance transcripts cannot be easily detected and are lost using microarray technology. |
| NGS technology is able to detect multiple splice sites and novel isoforms. | Microarray technologies cannot detect multiple splice sites and novel isoforms. |
| NGS technology is able to do | Reference genome is required for the analysis of sample. |
| Disadvantages | |
| NGS based techniques are very expensive. | Microarrays are cheaper in comparison to NGS. |
| Accuracy and longevity of this approach remains questionable. | Microarray is more reliable methods in long run. |
| Low yield of high-quality sequences are obtained using NGS techniques. | Comparatively high yield of high-quality sequences is obtained using microarray technologies. |
| NGS technologies have a drawback of generating shorter sequences with more noise. | Microarray offers lesser errors and is more accurate. |
| NGS assembly algorithms show poor performance in presence of identical repeats. | Homologous repeats are identified using microarray technologies. |
| Annotation is challenging when considering complex genomes with higher repeat and duplication content. | Microarray technologies are more successful when considering complex genomes with higher repeat and duplication content. |
Fig. 1The progressive and substantial research on long non-coding RNAs is rising. Cumulative plot of the total number of publication entries in PubMed related to non-coding RNAs is represented in green line and of entries related to long non-coding RNAs is represented in red line and axis.
Fig. 2The RNA sequencing (RNA-seq) process commences with the input of sequences (in fasta format) generated using sequencers. Further the process requires pre-processing events involving the filtering and mapping of the input sequences (reads) followed by gene quantification and topological analysis.