| Literature DB >> 27620863 |
David A duVerle1,2, Sohiya Yotsukura3, Seitaro Nomura4, Hiroyuki Aburatani4, Koji Tsuda5,6,7.
Abstract
BACKGROUND: Single-cell RNA sequencing is fast becoming one the standard method for gene expression measurement, providing unique insights into cellular processes. A number of methods, based on general dimensionality reduction techniques, have been suggested to help infer and visualise the underlying structure of cell populations from single-cell expression levels, yet their models generally lack proper biological grounding and struggle at identifying complex differentiation paths.Entities:
Keywords: Cell differentiation; Cell heterogeneity; Human stem cell; Single-cell RNA-seq
Mesh:
Substances:
Year: 2016 PMID: 27620863 PMCID: PMC5020541 DOI: 10.1186/s12859-016-1175-6
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Comparison of single-cell gene expression ordering tools, using mouse embryo data
| Tool | Naive | Monocle | TSCAN | cellTree |
|---|---|---|---|---|
| Method | TSP | ICA | PCA | LDA |
| Input size after thresholding | 20,214 | 10,452 | 225 | 12,903 |
| Accuracy in % | 90.6 | 94.8 | 95.3 | 96.5 |
| CPU time in seconds | 0.015 | 5308.6 | 0.08 | 18.5 |
Fig. 1Myoblast Backbone Tree. Backbone tree obtained from the dataset of differentiating myoblasts with cellTree. Larger nodes indicate backbone cells (selected representatives), whereas smaller nodes represent cells that only show slight variations from the backbone cell they are attached to. Cell nodes have been coloured according to a their sampling time (in hours from the start of the differentiation process) and b their distribution over topics
Fig. 2Myoblast MST. Comparison of minimum spanning trees obtained from the dataset of differentiating myoblasts by cellTree (a) and Monocle (b). Cell nodes have been coloured according to their sampling time (in hours). c shows the final tree structure obtained directly by Monocle
List of biological process go terms significantly enriched and uniquely appearing in each topic for myoblast differentiation
| GO.ID | Term |
|
|---|---|---|
| Topic 1 | ||
| GO:0051301 | cell division | 4.7e-12 |
| GO:0007067 | mitotic nuclear division | 1.4e-10 |
| GO:0019083 | viral transcription | 1.7e-10 |
| GO:0006283 | transcription-coupled nucleotide-excision repair | 5.7e-10 |
| GO:0007264 | small GTPase mediated signal transduction | 1.7e-09 |
| GO:0007077 | mitotic nuclear envelop disassembly | 7.8e-09 |
| GO:0008380 | RNA splicing | 1.0e-08 |
| GO:0016925 | protein SUMOylation | 1.1e-08 |
| G O:0000086 | G2/M transition of mitotic cell cycle | 2.7e-08 |
| GO:0007059 | chromosome segregation | 4.5e-08 |
| GO:0010827 | regulation of glucose transport | 4.6e-08 |
| GO:0000082 | G1/S transition of mitotic cell cycle | 6.0e-08 |
| GO:0006369 | termination of RNA polymerase II transcription | 1.1e-07 |
| GO:0042769 | DNA damage response, detection of DNA damage | 1.3e-07 |
| GO:1900034 | regulation of cellular response to heat | 1.4e-07 |
| GO:0006271 | DNA strand elongation involved in DNA replication | 1.6e-06 |
| GO:0006626 | protein targeting to mitochondrion | 1.7e-06 |
| Topic 2 | ||
| GO:0006367 | transcription initiation from RNA polymerase II promoter | 9.9e-08 |
| GO:0006376 | mRNA splice site selection | 1.1e-06 |
| Topic 3 | ||
| GO:0030049 | muscle filament sliding | 2.5e-08 |
| GO:0017148 | negative regulation of translation | 1.1e-06 |
| GO:0000186 | activation of MAPKK activity | 1.8e-06 |
| Topic 4 | ||
| GO:0006886 | intracellular protein transport | 9.6e-07 |
| Topic 5 | ||
| GO:0050434 | positive regulation of viral transcription | 3.0e-07 |
| GO:0006370 | 7-methylguanosine mRNA capping | 4.3e-07 |
| GO:0044267 | cellular protein metabolic process | 4.7e-07 |
| GO:0006457 | protein folding | 1.6e-06 |
List of cellular components go terms significantly enriched and uniquely appearing in each topic for myoblast differentiation
| GO.ID | Term |
|
|---|---|---|
| Topic 1 | ||
| GO:0000777 | condensed chromosome kinetochore | 1.1e-10 |
| GO:0005813 | centrosome | 1.4e-06 |
| GO:0000922 | spindle pole | 2.2e-06 |
| GO:0005876 | spindle microtubule | 3.4e-06 |
| GO:0005689 | U12-type spliceosomal complex | 4.8e-06 |
| GO:0005688 | U6 snRNP | 5.7e-06 |
| GO:0000940 | condensed chromosome outer kinetochore | 9.5e-06 |
| GO:0005759 | mitochondrial matrix | 1.1e-05 |
| GO:0000784 | nuclear chromosome, telomeric region | 1.2e-05 |
| GO:0046540 | U4/U6 x U5 tri-snRNP complex | 1.3e-05 |
| Topic 2 | ||
| GO:0005665 | DNA-directed RNA polymerase II, core complex | 5.1e-06 |
| Topic 3 | ||
| GO:0001725 | stress fiber | 1.7e-07 |
| GO:0030018 | Z disc | 3.1e-07 |
| GO:0098800 | inner mitochondrial membrane protein complex | 1.3e-06 |
| GO:0000932 | cytoplasmic mRNA processing body | 2.9e-06 |
| Topic 4 | ||
| GO:0005604 | basement membrane | 1.0e-05 |
| Topic 5 | ||
| GO:0005789 | endoplasmic reticulum membrane | 2.4e-06 |
| GO:0005885 | Arp2/3 protein complex | 1.3e-05 |
Fig. 3Cellular Component GO Terms for Myoblasts. Subgraph of the Cellular Component GO terms DAG showing terms significantly enriched for each topic. Darker colours indicated lower p-values and terms shared between topics use a combination of the topics colours (terms that are enriched in more than half the topics have been removed)
Fig. 4Embryonic cell differentiation. Backbone trees obtained by cellTree for hESC (a and b) and mouse embryo cell differentiation (c and d). Both trees are shown annotated with cell type (a and c) and topic distributions (b and d)
List of biological process GO terms significantly enriched and uniquely appearing in each topic for hESC differentiation
| GO.ID | Term |
|
|---|---|---|
| Topic 1 | ||
| GO:0042769 | DNA damage response, detection of DNA | 3.5e-08 |
| damage | ||
| GO:0090263 | positive regulation of canonical Wnt | 3.0e-07 |
| signaling pathway | ||
| GO:0045814 | negative regulation of gene expression, | 4.1e-07 |
| epigenetic | ||
| GO:0051084 |
| 1.3e-06 |
| Topic 2 | ||
| GO:1900034 | regulation of cellular response to heat | 7.0e-08 |
| GO:0051301 | cell division | 1.3e-07 |
| GO:0044743 | intracellular protein transmembrane import | 2.1e-07 |
| GO:0007077 | mitotic nuclear envelope disassembly | 7.2e-07 |
| GO:1990542 | mitochondrial transmembrane transport | 9.7e-07 |
| Topic 3 | ||
| GO:0000281 | mitotic cytokinesis | 1.3e-07 |
| Topic 4 | ||
| GO:0010501 | RNA secondary structure unwinding | 1.5e-09 |
| GO:0045596 | negative regulation of cell differentiation | 9.5e-09 |
| GO:0048387 | negative regulation of retinoic acid receptor | 1.9e-08 |
| signaling pathway | ||
| GO:0043066 | negative regulation of apoptotic process | 1.4e-07 |
| GO:0006368 | transcription elongation from RNA | 2.0e-07 |
| polymerase II promoter | ||
List of cellular components GO terms significantly enriched and uniquely appearing in each topic for embryonic mouse cell differentiation
| GO.ID | Term |
|
|---|---|---|
| Topic 1 | ||
| GO:0033178 | proton-transporting two-sector ATPase complex, catalytic domain | 7.8e-07 |
| GO:0031597 | cytosolic proteasome complex | 2.0e-06 |
| GO:0005680 | anaphase-promoting complex | 2.5e-06 |
| GO:0008540 | proteasome regulatory particle, base subcomplex | 3.7e-06 |
| Topic 2 | ||
| GO:0005736 | DNA-directed RNA polymerase I complex | 4.8e-07 |
| GO:0005665 | DNA-directed RNA polymerase II, core complex | 5.0e-07 |
| GO:0005844 | polysome | 1.2e-06 |
| GO:0005689 | U12-type spliceosomal complex | 1.8e-06 |
| GO:0005838 | proteasome regulatory particle | 3.1e-06 |
| GO:0008023 | transcription elongation factor complex | 8.1e-06 |
| GO:0000346 | transcription export complex | 9.5e-06 |
| GO:0015030 | Cajal body | 1.5e-05 |
| Topic 3 | ||
| GO:0005813 | centrosome | 4.1e-08 |
| GO:0000932 | cytoplasmic mRNA processing body | 3.1e-06 |
| GO:0016592 | mediator complex | 6.2e-06 |
| Topic 4 | ||
| GO:0072562 | blood microparticle | 7.4e-29 |
| GO:0005783 | endoplasmic reticulum | 9.3e-27 |
| GO:0005615 | extracellular space | 2.5e-25 |
| GO:0005743 | mitochondrial inner membrane | 3.7e-22 |
| GO:0005829 | cytosol | 2.2e-17 |
| GO:0005759 | mitochondrial matrix | 2.0e-11 |
| GO:0016021 | integral component of membrane | 3.2e-10 |
| GO:0034364 | high-density lipoprotein particle | 3.7e-09 |
| GO:0031012 | extracellular matrix | 4.1e-09 |
| GO:0030176 | integral component of endoplasmic reticulum membrane | 4.2e-09 |
| GO:0005777 | peroxisome | 3.3e-08 |
| GO:0005789 | endoplasmic reticulum membrane | 5.4e-08 |
| GO:0005788 | endoplasmic reticulum lumen | 6.5e-08 |
| GO:0034361 | very-low-density lipoprotein particle | 1.3e-07 |
| GO:0005764 | lysosome | 1.4e-07 |
| GO:0005791 | rough endoplasmic reticulum | 3.6e-07 |
| GO:0005778 | peroxisomal membrane | 1.2e-06 |
| GO:0005793 | endoplasmic reticulum-Golgi intermediate compartment | 1.3e-06 |
| GO:0070069 | cytochrome complex | 3.8e-06 |
| GO:0009986 | cell surface | 5.1e-06 |
| GO:0030867 | rough endoplasmic reticulum membrane | 8.9e-06 |
| GO:0005782 | peroxisomal matrix | 9.8e-06 |
| GO:0009897 | external side of plasma membrane | 1.0e-05 |
| GO:0005790 | smooth endoplasmic reticulum | 1.1e-05 |
| GO:0005765 | lysosomal membrane | 1.1e-05 |
List of cellular components GO terms significantly enriched and uniquely appearing in each topic for hESC differentiation
| GO.ID | Term |
|
|---|---|---|
| Topic 1 | ||
| GO:0044297 | cell body | 2.7e-06 |
| GO:0045263 | proton-transporting ATP synthase complex, coupling factor F(o) | 4.2e-06 |
| GO:0005819 | spindle | 1.3e-05 |
| Topic 2 | ||
| GO:0005789 | endoplasmic reticulum membrane | 7.6e-08 |
| GO:0098796 | membrane protein complex | 3.6e-07 |
| GO:0005746 | mitochondrial respiratory chain | 5.6e-07 |
| GO:0000785 | chromatin | 5.5e-06 |
| GO:0042645 | mitochondrial nucleoid | 1.2e-05 |
| Topic 3 | ||
| GO:0005813 | centrosome | 2.8e-06 |
| Topic 4 | ||
| GO:0005634 | nucleus | 3.9e-15 |
| GO:0071339 | MLL1 complex | 4.5e-06 |
| GO:0005689 | U12-type spliceosomal complex | 9.1e-06 |
| GO:0005732 | small nucleolar ribonucleoprotein complex | 1.1e-05 |
List of molecular function GO terms significantly enriched and uniquely appearing in each topic for hESC differentiation
| GO.ID | Term |
|
|---|---|---|
| Topic 1 | ||
| GO:0004129 | cytochrome-c oxidase activity | 7.5e-06 |
| Topic 2 | ||
| GO:0008536 | Ran GTPase binding | 3.2e-06 |
| Topic 4 | ||
| GO:0003677 | DNA binding | 4.4e-09 |
| GO:0030515 | snoRNA binding | 1.5e-07 |
| GO:0004004 | ATP-dependent RNA helicase activity | 3.0e-07 |
| GO:0042974 | retinoic acid receptor binding | 1.1e-06 |
| GO:0004402 | histone acetyltransferase activity | 6.6e-06 |
| GO:0043022 | ribosome binding | 6.9e-06 |
List of biological process GO terms significantly enriched and uniquely appearing in each topic for embryonic mouse cell differentiation
| GO.ID | Term |
|
|---|---|---|
| Topic 1 | ||
| GO:0032543 | mitochondrial translation | 2.2e-07 |
| GO:0015031 | protein transport | 2.7e-07 |
| GO:0015986 | ATP synthesis coupled proton transport | 1.8e-06 |
| Topic 2 | ||
| GO:0042254 | ribosome biogenesis | 7.1e-10 |
| GO:0000462 | maturation of SSU-rRNA from tricistronic rRNA transcript (SSU-rRNA, 5.8S rRNA, LSU-rRNA) | 7.2e-10 |
| GO:0042273 | ribosomal large subunit biogenesis | 4.1e-08 |
| Topic 3 | ||
| GO:0007049 | cell cycle | 1.6e-09 |
| GO:0045893 | positive regulation of transcription, DNA-templated | 1.5e-08 |
| GO:0006355 | regulation of transcription, DNA-templated | 1.2e-07 |
| GO:0000122 | negative regulation of transcription from RNA polymerase II promoter | 3.2e-07 |
| GO:0043161 | proteasome-mediated ubiquitin-dependent protein catabolic process | 5.5e-07 |
| GO:0000281 | mitotic cytokinesis | 1.5e-06 |
| Topic 4 | ||
| GO:0055114 | oxidation-reduction process | < 1e-30 |
| GO:0006805 | xenobiotic metabolic process | 3.5e-13 |
| GO:0042738 | exogenous drug catabolic process | 1.9e-12 |
| GO:0006749 | glutathione metabolic process | 4.3e-12 |
| GO:1901606 | alpha-amino acid catabolic process | 6.9e-10 |
| GO:0046700 | heterocycle catabolic process | 9.8e-10 |
| GO:0008203 | cholesterol metabolic process | 1.0e-09 |
| GO:0019373 | epoxygenase P450 pathway | 7.6e-09 |
| GO:0044270 | cellular nitrogen compound catabolic process | 1.0e-08 |
| GO:0006958 | complement activation, classical pathway | 1.2e-08 |
| GO:0006641 | triglyceride metabolic process | 4.2e-08 |
| GO:0045454 | cell redox homeostasis | 4.4e-08 |
| GO:0019439 | aromatic compound catabolic process | 6.0e-08 |
| GO:1901361 | organic cyclic compound catabolic process | 7.3e-08 |
| GO:0030433 | ER-associated ubiquitin-dependent protein catabolic process | 1.2e-07 |
| GO:0006953 | acute-phase response | 1.4e-07 |
| GO:0010951 | negative regulation of endopeptidase activity | 2.7e-07 |
| GO:0042493 | response to drug | 5.4e-07 |
| GO:0006103 | 2-oxoglutarate metabolic process | 5.8e-07 |
| GO:0042537 | benzene-containing compound metabolic process | 6.2e-07 |
| GO:0010466 | negative regulation of peptidase activity | 9.0e-07 |
| GO:0019748 | secondary metabolic process | 9.8e-07 |
| GO:0006120 | mitochondrial electron transport, NADH to ubiquinone | 1.0e-06 |
| GO:0042744 | hydrogen peroxide catabolic process | 1.0e-06 |
| GO:0009813 | flavonoid biosynthetic process | 1.4e-06 |
| GO:0052696 | flavonoid glucuronidation | 1.4e-06 |
| GO:0046688 | response to copper ion | 1.5e-06 |
Fig. 5Enriched GO terms subgraph for hESC. Subgraph of the GO DAG showing terms significantly enriched for each topic in hESC differentiation: a using BP GO terms. b using CC GO terms
Fig. 6Mouse cortical cell subtypes. Backbone trees obtained by cellTree for mouse cortical cell subtypes annotated with cell subtype (a) and topic distributions (b)
List of molecular function GO terms significantly enriched and uniquely appearing in each topic for embryonic mouse cell differentiation
| GO.ID | Term |
|
|---|---|---|
| Topic 2 | ||
| GO:0001054 | RNA polymerase I activity | 2.5e-07 |
| GO:0043022 | ribosome binding | 4.8e-06 |
| GO:0030515 | snoRNA binding | 5.0e-06 |
| GO:0005524 | ATP binding | 6.1e-06 |
| Topic 3 | ||
| GO:0003713 | transcription coactivator activity | 7.6e-07 |
| GO:0003730 | mRNA 3’-UTR binding | 4.6e-06 |
| GO:0004843 | ubiquitin-specific protease activity | 7.7e-06 |
| Topic 4 | ||
| GO:0020037 | heme binding | 2.2e-19 |
| O:0008395 | steroid hydroxylase activity | 5.9e-18 |
| O:0019825 | oxygen binding | 6.5e-14 |
| O:0005506 | iron ion binding | 9.2e-13 |
| O:0004364 | glutathione transferase activity | 1.5e-12 |
| O:0008392 | arachidonic acid epoxygenase activity | 1.9e-12 |
| O:0005102 | receptor binding | 4.2e-11 |
| GO:0004867 | serine-type endopeptidase inhibitor activity | 1.1e-10 |
| GO:0009055 | electron carrier activity | 1.3e-10 |
| GO:0070330 | aromatase activity | 2.6e-10 |
| GO:0042803 | protein homodimerization activity | 7.4e-10 |
| GO:0008201 | heparin binding | 1.5e-09 |
| GO:0002020 | protease binding | 2.5e-08 |
| GO:0051087 | chaperone binding | 6.0e-07 |
| GO:0003988 | acetyl-CoA C-acyltransferase activity | 8.4e-07 |
| GO:0016836 | hydro-lyase activity | 1.0e-06 |
| GO:0004602 | glutathione peroxidase activity | 1.0e-06 |
| GO:0004601 | peroxidase activity | 1.3e-06 |
| GO:0030170 | pyridoxal phosphate binding | 1.6e-06 |
| GO:0004029 | aldehyde dehydrogenase (NAD) activity | 2.4e-06 |
| GO:0001848 | complement binding | 3.1e-06 |
| GO:0016616 | oxidoreductase activity, acting on the CH-OH group of donors, NAD or NADP as acceptor | 3.2e-06 |
| GO:0050660 | flavin adenine dinucleotide binding | 3.8e-06 |
| GO:0005507 | copper ion binding | 4.1e-06 |
| GO:0016709 | oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, NAD(P)H as one donor, and incorporation of one atom of oxygen | 5.3e-06 |
| GO:0032403 | protein complex binding | 5.7e-06 |
| GO:0051537 | 2 iron, 2 sulfur cluster binding | 7.8e-06 |
List of molecular function GO terms significantly enriched and uniquely appearing in each topic for mouse cortical cell subtypes
| GO.ID | Term |
|
|---|---|---|
| Topic 2 | ||
| GO:0005524 | ATP binding | 4.7e-12 |
| GO:0005516 | calmodulin binding | 4.6e-09 |
| GO:0019901 | protein kinase binding | 2.0e-08 |
| GO:0044325 | ion channel binding | 3.3e-08 |
| GO:0005515 | protein binding | 2.7e-07 |
| GO:0005509 | calcium ion binding | 3.6e-07 |
| GO:0017075 | syntaxin-1 binding | 1.0e-05 |
| Topic 3 | ||
| GO:0003677 | DNA binding | 1.1e-07 |
| GO:0032403 | protein complex binding | 3.7e-06 |
| GO:0019843 | rRNA binding | 6.1e-06 |
| Topic 4 | ||
| GO:0003924 | GTPase activity | 2.7e-07 |
List of biological process GO terms significantly enriched and uniquely appearing in each topic for mouse cortical cell subtypes
| GO.ID | Term |
|
|---|---|---|
| Topic 1 | ||
| GO:0006457 | protein folding | 1.1e-06 |
| Topic 2 | ||
| GO:0007165 | signal transduction | 1.8e-06 |
| GO:0016079 | synaptic vesicle exocytosis | 2.9e-06 |
| Topic 3 | ||
| GO:0015991 | ATP hydrolysis coupled proton transport | 3.0e-07 |
| Topic 4 | ||
| GO:0046916 | cellular transition metal ion homeostasis | 1.4e-06 |
| GO:0016525 | negative regulation of angiogenesis | 2.7e-06 |
| Topic 5 | ||
| GO:0006633 | fatty acid biosynthetic process | 1.2e-05 |