| Literature DB >> 34606616 |
Lize Wu1,2, Ziwei Xue3, Siqian Jin3, Jinchun Zhang3, Yixin Guo3, Yadan Bai3, Xuexiao Jin1, Chaochen Wang3, Lie Wang4, Zuozhu Liu5, James Q Wang3, Linrong Lu1,2,3,6, Wanlu Liu2,3,6,7,8.
Abstract
T-cell receptors (TCRs) and B-cell receptors (BCRs) are critical in recognizing antigens and activating the adaptive immune response. Stochastic V(D)J recombination generates massive TCR/BCR repertoire diversity. Single-cell immune profiling with transcriptome analysis allows the high-throughput study of individual TCR/BCR clonotypes and functions under both normal and pathological settings. However, a comprehensive database linking these data is not yet readily available. Here, we present the human Antigen Receptor database (huARdb), a large-scale human single-cell immune profiling database that contains 444 794 high confidence T or B cells (hcT/B cells) with full-length TCR/BCR sequence and transcriptomes from 215 datasets. All datasets were processed in a uniform workflow, including sequence alignment, cell subtype prediction, unsupervised cell clustering, and clonotype definition. We also developed a multi-functional and user-friendly web interface that provides interactive visualization modules for biologists to analyze the transcriptome and TCR/BCR features at the single-cell level. HuARdb is freely available at https://huarc.net/database with functions for data querying, browsing, downloading, and depositing. In conclusion, huARdb is a comprehensive and multi-perspective atlas for human antigen receptors.Entities:
Mesh:
Substances:
Year: 2022 PMID: 34606616 PMCID: PMC8728177 DOI: 10.1093/nar/gkab857
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Overview of huARdb workflow and modules. All single-cell immune profiling datasets were retrieved from public databases (SRA, GEO, and GSA). All datasets were uniformly processed with our workflow. All processed data were stored in PostgreSQL database and HDF5 files, using sample ID as primary keys. For the web user interface, various interactive visualization and data analysis modules were provided for analyzing transcriptome and clonotype features.
Figure 2.The transcriptome module of huARdb. (A) Screenshot for the huARdb home page. Users can select samples from various tissues, diseases and papers. (B) The tSNE visualization module for predicted cell subtype. Users can select a cell subtype on the side menu and obtain a zoomed-in view. (C) The bar plot shows the number of predicted cell subtype in the dataset. (D, E) The gene expression modules. The web interface provides a feature plot (D) and violin plot (E) for visualizing individual gene expression level. The red bar in the violin plot represents the average expression level for certain predicted cell subtype. IFNG and PDCD1 were used as example genes for gene expression modules. (F) Heatmap of marker genes for predicted cell types. The average expression values are normalized between 0 to 1. Sample ‘CPIc_C2’ from Luoma et al. was used as examples in this figure (35). All panels were exported and downloaded from the web interface.
Figure 3.The clonotype module of huARdb. (A) The web interface provides clonotype network plot of the top 10 expanded-clonotypes. (B) Summary table of single cell information. (C) Bar plots showing number of cells of the top 10 expanded-clonotypes. After clicking the bar in the left panel, the bar will be highlighted, and the tSNE plot (D) and violin plot (E) exhibit the cells in the selected clonotype. (F) Bar plots showing cell subtype composition of the top 10 expanded-clonotypes. After clicking the bar, the tSNE plot (G) and violin plot (H) exhibit effector memory CD8+ T cells in all top 10 expanded-clonotypes. (I, J) After clicking a node in (A), the clonotype information including predicted cell subtype (I, upper panel), V(D)J gene usage (I, lower panel), amino acid/nucleotide sequences of CDR3 (J), and full-length V(D)J sequence (J) of the selected cell will be displayed in the menu bar on the web interface (I). Sample ‘CPIc_C2’ from Luoma et al. was used as examples in this figure (35). IFNG were used as example genes for gene expression modules. All panels were exported and downloaded from the web interface.
Figure 4.The transcriptome-clonotype analysis module of huARdb. (A) The correlation heatmap of the transcriptome for different predicted cell types in the top 10 expanded-clonotypes. Pearson correlation coefficient are used to reflect the similarity of the transcriptome. (B) The differentially expressed genes for any two clonotypes among the top 10 expanded-clonotypes. Users may select any two clonotypes in the web interface for comparison. (C) Circos plot for V gene usage for all hcT/B cells in the sample. Sample ‘CPIc_C2’ from Luoma et al. was used as examples in this figure (35). All panels were exported and downloaded from the web interface.