| Literature DB >> 31896769 |
Jinling Liao1,2,3, Zhenyuan Yu1,2,3,4,5, Yang Chen1,2,3,4,5, Mengying Bao1,2,3, Chunlin Zou6,7, Haiying Zhang1,2,3, Deyun Liu4,5, Tianyu Li4,5, Qingyun Zhang1,2,3,4,8, Jiaping Li9,10,11, Jiwen Cheng12,13,14,15,16, Zengnan Mo17,18,19,20,21.
Abstract
A comprehensive cellular anatomy of normal human kidney is crucial to address the cellular origins of renal disease and renal cancer. Some kidney diseases may be cell type-specific, especially renal tubular cells. To investigate the classification and transcriptomic information of the human kidney, we rapidly obtained a single-cell suspension of the kidney and conducted single-cell RNA sequencing (scRNA-seq). Here, we present the scRNA-seq data of 23,366 high-quality cells from the kidneys of three human donors. In this dataset, we show 10 clusters of normal human renal cells. Due to the high quality of single-cell transcriptomic information, proximal tubule (PT) cells were classified into three subtypes and collecting ducts cells into two subtypes. Collectively, our data provide a reliable reference for studies on renal cell biology and kidney disease.Entities:
Mesh:
Year: 2020 PMID: 31896769 PMCID: PMC6940381 DOI: 10.1038/s41597-019-0351-8
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Fig. 1scRNA-seq reveals the cell populations of the human kidney. (a) Overview of the scRNA-seq process using human kidney tissue samples. (b) Uniform manifold approximation and projection (UMAP) plot showing the unbiased classification of renal cells. (c) Pie chart showed the proportion of each kidney cell type. (d) Heat map showing the marker genes of each cluster, highlighting the selected marker genes for each cluster.
Detailed QC of FASTQ files.
| Sample ID | Cellranger version and seq strategy | Q30 Bases in Barcode | Q30 Bases in RNA Read | Q30 Bases in UMI | Reads Mapped Confidently to Genome | Reads Mapped Confidently to Exonic Regions | Reads Mapped Confidently to Transcriptome |
|---|---|---|---|---|---|---|---|
| kidney1 | Cellranger 2.1, 150 bp × 2 | 95.7% | 89.6% | 95.1% | 92.8% | 82.8% | 79.4% |
| kidney1 | Cellranger 3.0, 150 bp × 2 | 95.7% | 86.6% | 95.1% | 91.3% | 81% | 76.5% |
| kidney1 | Cellranger 3.0, 26 bp × 98 bp, 10x seq protocol | 95.7% | 89.6% | 95.1% | 92.9% | 82.9% | 79.3% |
| kidney2 | Cellranger 2.2, 150 bp × 2 | 98.3% | 94.1% | 98.4% | 96.1% | 90.3% | 85.6% |
| kidney2 | Cellranger 3.0, 150 bp × 2 | 98.3% | 94.2% | 98.5% | 96.1% | 90.1% | 85% |
| kidney2 | Cellranger 3.0, 26 bp × 98 bp, 10x seq protocol | 98.3% | 96.9% | 98.5% | 87.4% | 83.3% | 79.6% |
| kidney3 | Cellranger 2.2, 150 bp × 2 | 98.3% | 93.7% | 98.4% | 96.1% | 87.9% | 82.8% |
| kidney3 | Cellranger 3.0, 150 bp × 2 | 98.3% | 93.7% | 98.4% | 96.3% | 88% | 82.6% |
| kidney3 | Cellranger 3.0, 26 bp × 98 bp, 10x seq protocol | 98.3% | 94.6% | 98.4% | 96.5% | 88.5% | 83.2% |
Information and sequencing statistics of three samples and comparison between different sequencing strategies. NA, after pre-processing the data, some of them didn’t use for downstream analysis.
| Sample ID | Cellranger version and seq strategy | Number of cells | Median number of detected genes | Sequencing saturation (%) | Mean reads per cell | Number of cells post filtering |
|---|---|---|---|---|---|---|
| kidney1 | Cellranger 2.1, 150 bp × 2 | 8,091 | 856 | 85.9 | 42,485 | NA. |
| Kidney1 | Cellranger 3.0, 150 bp × 2 | 8,161 | 825 | 85.5 | 42,121 | NA. |
| Kidney1 | Cellranger 3.0, 26 bp × 98 bp, 10X seq protocol | 8,164 | 837 | 85.7 | 42,105 | 7,221 |
| kidney2 | Cellranger 2.2, 150 bp × 2 | 6,472 | 960 | 85.6 | 37,763 | NA. |
| kidney2 | Cellranger 3.0, 150 bp × 2 | 6,493 | 910 | 80.2% | 25,539 | NA. |
| kidney2 | Cellranger 3.0, 26 bp × 98 bp, 10X seq protocol | 6,499 | 887 | 79.9% | 25,515 | 5,543 |
| kidney3 | Cellranger 2.2, 150 bp × 2 | 10,705 | 692 | 85.7 | 21,064 | NA. |
| Kidney3 | Cellranger 3.0, 150 bp × 2 | 10,741 | 683 | 85.6 | 20,993 | NA. |
| Kidney3 | Cellranger 3.0, 26 bp × 98 bp, 10X seq protocol | 10,741 | 684 | 85.6 | 20,993 | 10,602 |
Fig. 2Quality control (QC) of human kidney single cell data. (a) Scatterplot illustrating the number of genes, unique molecular identifiers (UMIs) and the percentage of mitochondrial genes in each cell of three kidney samples. (b) The relationship between the percentage of mitochondrial genes and the mRNA reads, together with the relationship between the amount of mRNA and the reads of mRNA. (c) We detected the batch effect between three different kidney samples. (d) UMAP plot showing the cell cycle status of each cell. (e) Violin plot illustrating the number of genes, unique molecular identifiers (UMIs) and the percentage of mitochondrial genes in previous kidney single-cell data from GSE107585[1].
Cell type assignment based on the marker genes reported in previous studies.
| Gene | Marker of | Reference |
|---|---|---|
| Proximal tubule |
[ | |
| Proximal tubule |
[ | |
| Proximal tubule |
[ | |
| Proximal tubule |
[ | |
| Proximal convoluted tubule |
[ | |
| Proximal straight tubule |
[ | |
| Glomerular parietal epithelial cells |
[ | |
| Glomerular parietal epithelial cells |
[ | |
| Glomerular parietal epithelial cells |
[ | |
| Glomerular parietal epithelial cells |
[ | |
| Monocytes |
[ | |
| Monocytes |
[ | |
| NK cells |
[ | |
| NK cells |
[ | |
| T cells |
[ | |
| T cells |
[ | |
| T cells |
[ | |
| B cells |
[ | |
| B cells |
[ | |
| Distal tubule |
[ | |
| Distal tubule, Collecting duct |
[ | |
| Collecting duct |
[ | |
| Collecting duct principal cells |
[ | |
| Collecting duct intercalated cells |
[ | |
| Collecting duct intercalated cells |
[ | |
| Collecting duct intercalated cells |
[ |
Fig. 3Subpopulations of PT cells and reconstructing the developmental trajectory of PT cells. (a) Violin plots representing the expression of marker genes in PT cells. Clusters 1, 2 and 3 refer to the proximal convoluted tubule, proximal tubule and proximal straight tubule, respectively. (b) Monocle2-generated pseudotemporal trajectory of three PT cell types (n = 20,308); imported Seurat data are coloured according to the cell name designation. (c) Pseudotime was coloured in a gradient from dark to light blue, and the start of pseudotime is dark. (d) The pseudotime trajectory was divided into three different states by Monocle2. (e) The trajectory showing the distribution of cells from three samples. (f) The top six genes influencing fate decisions are shown as line plots displayed as the expression level over pseudotime by Monocle2. (g) Heat map for clustering the top 50 genes that affected cell fate decisions. These 50 genes were divided into three clusters (cluster 1, cluster 2 and cluster 3), showing genes at the beginning stage, the transitory stage and the end stage of the developmental trajectory, respectively.
Fig. 4Detailed classification of collecting duct cells and NK-T cells by scRNA-seq. (a) Violin plot showing the expression of the collecting duct principal cell marker AQP2 and the collecting duct intercalated cell markers ATP6V1B1, ATP6V0D2 and ATP6V1G3. Clusters 8 and 10 are collecting duct principal cells and collecting duct intercalated cells, respectively. (b) UMAP plot showing the spatial location of NKT cells and T cells after dimensionality reduction. The red dots represent NKT cells and the green dots represent T cells. (c–g) Violin plots showing the expression of the NK cells markers GNLY and NKG7. Violin plots showing the expression of the T cell markers CD3D, CD3E and IL7R.
| Measurement(s) | transcriptome • RNA |
| Technology Type(s) | RNA sequencing |
| Factor Type(s) | type of kidney cell |
| Sample Characteristic - Organism | Homo sapiens |