| Literature DB >> 33884369 |
Alexander Yermanos1, Andreas Agrafiotis1, Raphael Kuhn1, Damiano Robbiani1, Josephine Yates1, Chrysa Papadopoulou1, Jiami Han1, Ioana Sandu2, Cédric Weber1, Florian Bieberich1, Rodrigo Vazquez-Lombardi1, Andreas Dounas3, Daniel Neumeier1, Annette Oxenius2, Sai T Reddy1.
Abstract
High-throughput single-cell sequencing (scSeq) technologies are revolutionizing the ability to molecularly profile B and T lymphocytes by offering the opportunity to simultaneously obtain information on adaptive immune receptor repertoires (VDJ repertoires) and transcriptomes. An integrated quantification of immune repertoire parameters, such as germline gene usage, clonal expansion, somatic hypermutation and transcriptional states opens up new possibilities for the high-resolution analysis of lymphocytes and the inference of antigen-specificity. While multiple tools now exist to investigate gene expression profiles from scSeq of transcriptomes, there is a lack of software dedicated to single-cell immune repertoires. Here, we present Platypus, an open-source software platform providing a user-friendly interface to investigate B-cell receptor and T-cell receptor repertoires from scSeq experiments. Platypus provides a framework to automate and ease the analysis of single-cell immune repertoires while also incorporating transcriptional information involving unsupervised clustering, gene expression and gene ontology. To showcase the capabilities of Platypus, we use it to analyze and visualize single-cell immune repertoires and transcriptomes from B and T cells from convalescent COVID-19 patients, revealing unique insight into the repertoire features and transcriptional profiles of clonally expanded lymphocytes. Platypus will expedite progress by facilitating the analysis of single-cell immune repertoire and transcriptome sequencing.Entities:
Year: 2021 PMID: 33884369 PMCID: PMC8046018 DOI: 10.1093/nargab/lqab023
Source DB: PubMed Journal: NAR Genom Bioinform ISSN: 2631-9268
Figure 1.Flowchart demonstrating the workflow of Platypus. A selection of functions internal to Platypus and their respective relationships are depicted. Each node in the flow chart indicates a process in the workflow requiring just a single line of code with user-definable parameters.
Figure 2.Extracting and visualizing clonal information of PBMCs from patients recently infected with SARS-CoV-2 using Platypus. (A) Experimental overview for single-cell immune repertoire of B and T cells in two patients previously infected with SARS-CoV-2. (B) Multiple B-cell clonotyping strategies involving CDR3 sequence identity, germline gene usage and sequence homology thresholds from the two patients previously infected with SARS-CoV-2. (C) Clonal expansion profiles of the T cells from the blood repertoires of one individual recently infected with SARS-CoV-2. Clone is defined as unique CDRβ3 + CDRα3 nucleotide sequence. (D) Clonal expansion profiles of the B cells from the blood repertoires of one individual recently infected with SARS-CoV-2. Clone is defined by unique CDRH3 + CDRL3 nucleotide sequence. Color depicts isotype on the cell level within each clone determined by the VDJ repertoire sequencing libraries. Plots produced using VDJ_isotypes_per_clone in Platypus. (E) Length distribution of the paired CDRH3 + CDRL3 amino acid sequences from the B-cell clones of a single patient. (F) Sequence logo plots of those paired CDRH3 + CDRL3 amino acid sequences with a combined sequence length of 29 in two patients. Colors correspond to biochemical properties: red = acidic, blue = basic, black = hydrophobic, purple = neutral, green = polar. Top logo plot corresponds to patient 1 and bottom logo plot corresponds to patient 2.
Figure 3.Analyzing and visualizing germline gene usage from B and T cell repertoires of two patients recently infected with SARS-CoV-2. (A andB) Circos plots for the ten most expanded B- and T-cell clones for each patient. Each line indicates the V and J gene usage for either the heavy or light chain (alpha/beta in the case of T cells) for an individual cell. The inner track number and the corresponding thickness of the bar indicates the number of cells using a given germline gene. Color corresponds to germline gene. Plots were produced using VDJ_VJ_usage_circos in Platypus. (C) VH gene usage for the ten most used VH genes in each repertoire. Clone is defined as unique CDRH3 + CDRL3 nucleotide sequence for B cells and unique CDRβ3 + CDRα3 nucleotide sequence for T cells. Plots were produced using VDJ_Vgene_usage in Platypus. (D) Stacked bar plot comparing the ten most used VH genes across both patients. Plots produced using VDJ_Vgene_usage_stacked_barplot in Platypus.
Figure 4.Integration of gene expression (GEX) and repertoire (VDJ) sequencing datasets from two patients recently infected with SARS-CoV-2. (A) Uniform manifold approximation projection (UMAP) of gene expression data from both COVID-19 patients. Cluster corresponds to the transcriptional clustering performed on the GEX datasets after excluding TCR and BCR receptor genes. Each point corresponds to a cell in one of the two patients. (B) Distribution of the fraction of cells located in each transcriptional cluster for all cells of each patient. Produced using the GEX_cluster_membership function in Platypus. (C) Distribution of the fraction of cells located in each transcriptional cluster for the top 10 most expanded T-cell clones of a single COVID-19 patient. Only those cells found in both GEX and VDJ sequencing datasets were included in the quantification. T-cell clone was defined by unique CDRβ3 + CDRα3 nucleotide sequence. (D) UMAP displaying those cells from both patients corresponding to memory-like or effector-like phenotypes labeled using the GEX_phenotype function from Platypus. (E) Fraction of cells from the top five most expanded T-cell clones from patient 2 matching either of the phenotypes in (D). Unclassified indicates lack of CD8, CCL5, IL7R and TCF7 expression. Plot was produced with GEX_phenotype_per_clone in Platypus. (F) The 10 most expanded T-cell clones defined by unique CDRβ3 + CDRα3 nucleotide sequence from a single COVID-19 patient are highlighted on the UMAP containing all cells from both patients. Each point corresponds to a unique cell barcode. Only those cells found in both GEX and VDJ sequencing datasets were included in the quantification. Plot was produced using GEX_visualize_clones in Platypus.
Figure 5.Functions from Platypus extract parameters relevant to the discovery of antigen-specific B- and T-cell clones. (A) Mean nucleotide somatic hypermutation for the 30 more expanded B-cell clones found in the blood repertoire following SARS-CoV-2 infection. Somatic hypermutation was quantified in the V and J segments for both the heavy and light chains for each cell by comparing misalignments to the reference germline segments. (B) Phylogenetic tree rooted by germline reference sequence. The reference germline as determined by cellranger was set as the root. The number of cells in the tip label corresponds to the number of unique nucleotide variants producing the exact, full-length antibody sequence. Each tip label represents a single unique nucleotide paired VH + VL sequence. (C andD) Similarity networks depicting the B- and T-cell clones that are separated by either <10 or 14 amino acid mutations in either CDRH3 or CDRL3 sequence (CDRβ3 + CDRα3 for T cells). Vertices represent unique clones defined as those cells containing identical CDHR3 + CDRL3 sequence (CDRβ3 + CDRα3 for T cells). Vertex colors represent each patient (patient 1 = orange, patient 2 = blue). Vertex size corresponds to the relative number of cells supporting each individual clone. Edges represent those clones that are separated by an edit distance of less than either 10 or 14 amino acid mutations in their CDR3 regions. Produced using VDJ_network in Platypus, which uses functions in igraph.
Figure 6.(A) Heatmap depicting normalized gene expression for the most expanded T-cell clones that were found in both the VDJ and gene expression (GEX) sequencing libraries from the blood repertoires of COVID-19 patients. The color of each column corresponds to an individual clonal family and the width of the bar corresponds to the number of cells found in the GEX library for that clone. Clone is defined as cells containing identical CDRβ3 + CDRα3 nucleotide sequences. (B) Gene ontology (GO) term enrichment for the top 10 most upregulated genes in the expanded clones relative to the unexpanded clones by average log fold-change. The color of each dot corresponds to adjusted P-value. The size of the dot corresponds to the number of genes of the particular GO term. Ratio corresponds to the number of differentially expressed genes relative to the number of total genes corresponding to each GO term. Produced using the GEX_GOterm function in Platypus. (C) Gene set enrichment (GSEA) plots based on the C7 immunological signatures from the Broad institute. The top 10 most upregulated (top panel) and the top 10 most downregulated (bottom panel) genes between expanded and unexpanded T-cell clones were produced using the GEX_GSEA function in Platypus. The enrichment plot of the highest scoring gene set is displayed. (D) Heatmap depicting normalized gene expression for the most expanded B cells found in both VDJ and GEX sequencing libraries. Clone is defined as those cells containing identical CDHR3 + CDRL3 sequence.