| Literature DB >> 32128162 |
Bahlibi Weldegebriall Sahlu1,2, Shanjiang Zhao1, Xiuge Wang3, Saqib Umer1, Huiying Zou1, Jinming Huang3, Huabin Zhu1.
Abstract
Spermatogenesis is a complex differentiating developmental process in which undifferentiated spermatogonial germ cells differentiate into spermatocytes, spermatids, and finally, to mature spermatozoa. This multistage developmental process of spermatogenesis involves the expression of many male germ cell-specific long noncoding RNAs (lncRNAs) and highly regulated and specific gene expression. LncRNAs are a recently discovered large class of noncoding cellular transcripts that are still relatively unexplored. Only a few of them have post-meiotic; however, lncRNAs are involved in many cellular biological processes. The expression of lncRNAs is biologically relevant in the highly dynamic and complex program of spermatogenesis and has become a research focus in recent genome studies. This review considers the important roles and novel regulatory functions whereby lncRNAs modulate mammalian spermatogenesis.Entities:
Keywords: Germ cell; Long noncoding RNA; Mammalian spermatogenesis; Regulatory pathways; Spermatocytes
Year: 2020 PMID: 32128162 PMCID: PMC7047388 DOI: 10.1186/s40104-019-0424-8
Source DB: PubMed Journal: J Anim Sci Biotechnol ISSN: 1674-9782
Fig. 1Schematic representation of the developmental processing of spermatogenesis
Fig. 2The origin of lncRNA with relative to protein-coding genes structure in different genomic regions. The schematic sketch represents a and e for Noncoding regions; b and d for Protein coding genes; c for Intronic region and f for Cases of intronic retention region
Fig. 3The location of lncRNA in relative to their nearby genes encoding protein
Approaches for lncRNA function prediction
| Approaches | Description | References |
|---|---|---|
| Comparative genomics | The lncRNA transcripts that are conserved in both human and mouse, and located within or close to a coding gene in < 1 kb distance assumed to have close functional relationship with the neighboring gene. This method of predicting lncRNAs is may not be widely applied to genomic level due to the low conservation potential of lncRNAs. | [ |
| Coexpression of lncRNA and coding genes | The lncRNAs and protein coding genes coexpressed in specific biological process regulation. The method is well practiced to predict and identify the enriched function of the lncRNAs at the genomic level. | [ |
| Interaction with miRNAs and proteins | The lncRNAs may involve in regulatory network by coordinating with target sites of miRNA. Scientists developed different methods to determine the target sites of miRNA in lncRNAs such as miRcode interface helping the microRNA-lncRNA interaction study across the GENCODE annotated transcriptome, Validated RNA mediated interactions in genome-wide networks in determining the mediatory roles of lncRNA and its interaction and correlation in miRNA, mRNA and proteins which may help in predicting the lncRNA function. This approach is successful for those lncRNAs with known mechanism of interaction between miRNA and protein. | [ |
Fig. 4Schematic drawings of the lncRNA archetypes (Adapted from [13, 14])
Descriptions of widely used lncRNA databases
| Tools | Web address | Description |
|---|---|---|
| EVlncRNA, [ | It provides 77 species of functionally annotated long noncoding RNAs validated for disease specific roles through low put experiments and deals with evolution and disease association of lncRNAs. | |
| LNCipedia [ | https://lncipedia.org/ | It is one of the lncRNA reference gene database containing a total of 127802 unique transcripts, 56946 unique genes, 2482 lncRNA articles and 1555 lncRNA annotated genes including their functional information. |
| NONCODE [ | www.noncode.org/ | It is one of the comprehensively compiled databases which is integrated database with complete collection and annotation of lncRNAs but doesn’t include tRNAs and rRNAs. This database contains 17 species and 73,372 lncRNAs as well as literatures and public data bases. |
| LncRNADisease [ | It provides a systematic collection of lncRNA and circular RNA disease association, transcriptional regulation of lncRNAs, miRNAs and mRNAs. It is one of the important database used in the clinical application for lncRNAs related studies. | |
| LncRBase [ | http://bicresources.jcBose.ac.in/zhumur/ | It is enriched resource of lncRNA transcripts of human (133,361 entries) and mouse (83,201 entries) of 14 distinct subtypes with 8507of mouse and 14,813 of human newly annotated noncoding RNAs, piwi interacting RNAs and microRNAs including their regulation and association with many other genomic elements. |
| LncRNAdb [ | It is a database with comprehensive collection of lncRNAs with their information of biological functions in eukaryotes and regulatory role of messenger RNA. It contains most of the relevant information about the RNA, annotation, target tissues of expression and associated diseases. | |
| LncCeRBase [ | http:// | This database is developed for the competing endogenous RNAs (ceRNAs) encompassing 432 interactions of lncRNA-miRNA-mRNA with 130, 214 and 245 of lncRNA, miRNA, and genes respectively. It also have interactions of lncRNA-miRNA-mRNA associated with signaling pathways to explore the lncRNA regulatory mechanisms. It is designed specifically to studied and validated human ceRNAs. |
| RNAcentral [ | http:// | It is an integrated database that compiled sequences of noncoding RNA of all RNA types and organisms serving a single entry for all RNA sequence searches. This comprehensive database has 22 collaborating databases from specialized non-coding RNA resources. |
| LncRNome [ | Provides an outlook on annotation of more than 18000 transcripts of the different classes of lncRNAs including the intergenic lncRNAs, antisense lncRNAs, intronic lncRNAs, overlapping lncRNAs and processed pseudogenes. It is designed to provide relevant information for human lncRNAs and their respective function. This database focuses on compiled information about sequence, structure, genomic loci, motifs, expression and associated diseases of lncRNAs. | |
| NRED [ | It provides relevant information on gene expression of lncRNA in mammals especially in human and mouse. This database consists of microarray data and in situ hybridization data and serves as an important resource to the scientific community to study and understand the lncRNAs. |
Descriptions of other methods to identify lncRNAs
| Type | Methods [Ref] | Description |
|---|---|---|
| Experimental methods of lncRNA identification | Microarray [ | It uses Computational and annotation pipeline to determine the expression and regulation potential of lncRNA transcripts. It has higher efficiency in high put lncRNA analysis. However, its detection potential is low due to its low sensitivity and low expression level of the lncRNA |
| SAGE [ | The serial analysis of gene expression (SAGE) is a technology which identifies the lncRNA known and unknown transcripts by producing short sequence tags and is the highly effective method to study lncRNAs but, it is expensive and not applicable in large scale researches. | |
| EST [ | Expressed sequence tag (EST) is a cDNAs short subsequence generated from cDNA clone by one shot sequencing to discover novel and functional transcripts of lncRNA in mammalian. This public database helps to search the transcripts in the intergenic regions of genes and reconstruct lncRNA transcript assemblies. | |
| RNA-Seq [ | It is a shotgun sequencing of whole transcriptome in the next generation sequencing technologies and is used to identify novel lncRNA transcripts and gene expression analysis. | |
| RNA-IP [ | RNA-immunoprecipitation one of the latest techniques that used antibodies of protein to discover and isolate the lncRNA that interacts with protein complexes or specific proteins by constructing cDNA library and deep sequencing of lncRNAs. | |
| Chromatin Signature Based Approach [ | Is a method that do not target directly on the RNA transcripts but directly involves in the identification of lncRNA expression regulation mechanisms using Chromatin signatures and their regulation factors. | |
| Computational methods of lncRNA identification | ORF Length Strategy [ | This strategy is a method used to differentiate the lncRNA from the mRNA by the Open Reading Frame (ORF) length cutoff based on codons length. |
| Sequence and Secondary Structure Conservation Strategy [ | This strategy is used to differentiate the non-coding genes from the protein coding genes by using different methods and strategies such as conservation potential, measure of coding potential, codon substitution frequency scores, reading frame conservation and PhyloCSF. The other methods that are used to explore the RNA secondary structure conservation include the programs of QRNA, EvoFOLD and RNAz. | |
| Machine Learning Strategies [ | Due to the complexity of lncRNAs, a new machine learning systems have been increasingly developed such as SVM (support vector machine) based machine learning technique like CONC (coding or non-coding), and other models to integrate and utilize various protein features to distinguish the lncRNAs from mRNAs. |
The expression of certain long noncoding RNAs during the progression of mammalian spermatogenesis
| LncRNA name | Locationa/Length | Expression level [Ref] |
|---|---|---|
| Gm11837 | Chr.4: 14929908–14953030 (23122 bp) | Highly expressed in adult mouse testis [ |
| LncRNA-Gm2044 | Chr.7: 139957005–139958039 (1034 bp) | Highly expressed in spermatocytes [ |
| LncRNA H19 | Chr.7:142575529–142578143 (2.6 kb) | Highly expressed in testis [ |
| Meiotic recombination hot spot1 locus (Mrhl) | Chr.8:85994245–85996642 (∼2.4 kb) | Expressed in spermatogonial Gc1-Spg (derived from type B spermatogonia) cells [ |
| AK007004 | Chr.12:82932520–82939155 | Low expressions in male germ cells [ |
| AK015322 | Chr.12:26814371–26834873 | Highly expressed in spermatogonial stem cells [ |
| Chr.12: 7372039–7380330 (8291 bp) | Highly expressed in testicular germ cells in the meiotic prophase and round spermatocyte stage of spermatogenesis [ | |
| lncRNA4667 | Chr.13:23396074–23397418 (1.3 kb) | Highly expressed in round spermatids [ |
| Chr.14:122229905–122233638 (3733 bp) | Specifically expressed in testis and highly expressed in testicular germ cells during testicular spermatogenesis [ | |
| LncRNA HOTAIR | Chr.15:102944062–102947730 (∼3.7 kb) | Low processes in the spermatozoa of patients with asthenozoospermia and oligoasthenozoospermia [ |
| Metastasis-associated lung adenocarcinoma transcript 1 ( | Chr.19:5795690–5802671 (∼7 kb) | Expressed in sperm cells and intestinal cells of testis [ |
| LncRNA-Tsx | Chr.X | Expressed in meiotic germ cells [ |
| LncRNA-Xist (X-inactive specific transcripts) | Chr.X:103460373–103483233 | Highly expressed in male germ cells [ |
| Chr.X:62510539–62527011 (∼16.5 kb) | Highly expressed in pachytene spermatocytes [ | |
| 4435 bp | Highly expressed in the nuclei of pachytene spermatocytes [ | |
| LncRNA | 2.4 kb | Expressed in mouse male germ cells [ |
| LncRNA033862 | ∼6.4 kb | Highly expressed in mouse spermat- ogonial stem cells especially in early spermatogonia [ |
aThe chromosomal location is in the mouse genome
List of certain long non-coding RNAs, and their role in mammalian spermatogenesis
| LncRNA name | Length | Chromosomal location* | Function [Ref] |
|---|---|---|---|
| HongrES2 | 1.588 kb | Chr. 5 | Responsible for normal sperm capacitation in the epididymis [ |
| LncRNA-Dmrt1 | – | Chr.5 | Possibly involved in the switching between mitosis and meiosis of developing germ cell [ |
| LncRNA H19 | 2.6 kb | Chr.7 | Affects the expression of IGF-1R by regulating the IGF-1 pathway [ |
| LncRNA Gm2044 | 1034 bp | Chr. 7 | Inhibits Utf1 mRNA translation and plays a potential role in spermatogenesis [ |
| Meiotic recombination hot spot1 locus (Mrhl) | ∼2.4 kb | Chr. 8 | Inhibits the Wnt signaling pathway during spermatogenesis by interacting with p68 in spermatogonia [ |
| 4435 bp | Chr.9 | Activation of | |
| LncRNA-HSVIII | – | Chr.9 | Participate in the activation of the |
| LncRNA033862 | 6384 bp | – | Regulates |
| LncRNA-Tcam1 | 2.4 kb | Chr.11 | Important for the immune response during spermatogenesis [ |
| Malat1 | ∼7 kb | Chr.11 | Plays a potential |
| AK015322 | ∼20.5 kb | Chr.12 | Promotes the proliferation of spermatogonial stem cells [ |
| LncRNA4667 | 1.3 kb | Chr.13 | A marker for round spermatids identification in mice [ |
| LncRNA HOTAIR | ∼3.7 kb | Chr.15 | Regulates HoxD genes expression in |
| LncRNA Neat1 | 3.2 kb | Chr.19 | Plays an important role in the maintenance and assembly of nuclear speckles of mammalian cells [ |
| LncRNA Jpx | – | Chr.X | Regulates the X-chromosome inactivation (Xi) switch [ |
| LncRNA Tsix | – | Chr.X | Regulates the X-chromosome activation (Xa) switch [ |
| LncRNA Tsx | – | Chr.X | Involved in apoptosis in meiotic division during spermatogenesis [ |
| LncRNA-Xist | ⁓17 kb | Chr.X | Involved in X chromosome inactivation [ |
| LncRNA NLC1- C | – | – | Has a potential role in male fertility and regulates miRNA expression [ |
| LncRNA Spga | – | – | Involved in spermatogonial differentiation |
* The chromosomal location is in the mouse genome