| Literature DB >> 36126074 |
Eric Waltari1, Saba Nafees1, Krista M McCutcheon1, Joan Wong1, John E Pak1.
Abstract
The sequencing of antibody repertoires of B-cells at increasing coverage and depth has led to the identification of vast numbers of immunoglobulin heavy and light chains. However, the size and complexity of these Adaptive Immune Receptor Repertoire sequencing (AIRR-seq) datasets makes it difficult to perform exploratory analyses. To aid in data exploration, we have developed AIRRscape, an R Shiny-based interactive web browser application that enables B-cell receptor (BCR) and antibody feature discovery through comparisons among multiple repertoires. Using AIRR-seq data as input, AIRRscape starts by aggregating and sorting repertoires into interactive and explorable bins of germline V-gene, germline J-gene, and CDR3 length, providing a high-level view of the entire repertoire. Interesting subsets of repertoires can be quickly identified and selected, and then network topologies of CDR3 motifs can be generated for further exploration. Here we demonstrate AIRRscape using patient BCR repertoires and sequences of published monoclonal antibodies to investigate patterns of humoral immunity to three viral pathogens: SARS-CoV-2, HIV-1, and DENV (dengue virus). AIRRscape reveals convergent antibody sequences among datasets for all three pathogens, although HIV-1 antibody datasets display limited convergence and idiosyncratic responses. We have made AIRRscape available as a web-based Shiny application, along with code on GitHub to encourage its open development and use by immuno-informaticians, virologists, immunologists, vaccine developers, and other scientists that are interested in exploring and comparing multiple immune receptor repertoires.Entities:
Mesh:
Substances:
Year: 2022 PMID: 36126074 PMCID: PMC9524643 DOI: 10.1371/journal.pcbi.1010052
Source DB: PubMed Journal: PLoS Comput Biol ISSN: 1553-734X Impact factor: 4.779
Open-source tools for comparing and visualizing BCR repertoires.
| Tool | Summary | Features | Reference |
|---|---|---|---|
| AIRRscape | Web-based interactive tool for exploring B-cell receptor repertoires | Enables | This study |
| AncesTree | Interactive immunoglobulin lineage tree visualizer | Enables exploration of antibody clonal lineages processed following AIRR Community standards using a Java-based GUI | 10 |
| ASAP | Web server for AIRR-seq analysis pipeline | Processes and visualizes multiple repertoires starting from paired fastq files; outputs plots of somatic hypermutation, VDJ gene usage, & clonal expansion | 11 |
| BRrepertoire | Web server for large-scale statistical analyses of repertoire data | Accepts web-based inputs from IMGT output; focuses on plots of physico-chemical properties | 12 |
| immunarch | R package for analysis of T-cell receptor and B-cell receptor repertoires | Accepts multiple input data types; generates many exploratory plot types using R commands | 13 |
| immuneREF | R package for analysis of repertoire similarity on a one-to-one, one-to-many, and many-to-many scale | Compares multiple repertoires processed via AIRR Community standards; visualizations include repertoire clustering by similarity and comparison of CDR3 amino acid occurrence and VJ usage among repertoires | 14 |
| Olmsted | Dockerized application for B-cell repertoire and clonal family tree exploration | Visualizes clonal lineages after clustering and processing of AIRR-seq data in JSON format; enables interactive exploration of clonotype phylogenies and amino acid changes | 15 |
| PASA | Web server for analysis and integration of data obtained from proteomics of serum antibodies | Enables exploration of proteomics data obtained via raw mass spectrometry data files from LC-MS/MS | 16 |
| sumrep | R package for immune receptor repertoire comparison and model validation | Compares multiple repertoires and outputs multiple similarity indices; creates plots of similarity distributions | 17 |
| VDJbase | AIRR-seq genotype and haplotype database | Has interactive modules for analysis of published AIRR-seq data including haplotype, gene, and allele usage; produces reports | 18 |
| VDJServer | Free, scalable web-based pipeline for immune repertoire analysis | Processes and visualizes repertoires starting from fastq files; outputs plots of somatic hypermutation, gene usage, clonal abundance, and diversity & selection measures | 19 |
| VDJtools | Software suite for analysis of T-cell receptor repertoires | Processes and visualizes T-cell receptor repertoire datasets | 20 |
| VDJviz | Web tool for browsing and analyzing B-cell and T-cell receptor repertoires | Uses VDJtools API; has interactive plots of VJ usage, clonal expansion, & rarefaction curves; online demo allows for analysis of up to 25 samples of 10,000 clonotypes each | 21 |
Datasets used in AIRRscape.
Datasets in italics are collections of antibodies.
| Dataset | Data type | Sample | Reference | Source | Number of sequences |
|---|---|---|---|---|---|
|
|
|
| 2 | CoV-AbDab database (2) | |
|
| patient bulk repertoire | p11 | 23 | iReceptor (34) | 9,385 |
|
| patient bulk repertoire | 7450 | 38 | iReceptor | 15,645 |
|
| patient bulk repertoire | galson-01 | 26 | iReceptor | 29,795 |
|
| patient bulk repertoire | M5 | 37 | iReceptor | 18,711 |
|
|
| - | 31 | (31) |
|
|
| patient bulk repertoire | d13 | 29 | SRA | 32,495 |
|
| patient bulk repertoires | 45 patients | 30 | Observed Antibody Space (35) | 198,119 |
|
| bulk repertoire | BX | 40 | SRA | 50,942 |
|
|
| - | 43 | IEDB (43) |
|
|
|
| - | 44 | CATNAP (44) |
|
|
|
| - | 45 | (45) |
|
|
| patient bulk repertoire | NIH45 | 41 | iReceptor | 14,644 |
|
| patient bulk repertoire | MT1214 | 39 | iReceptor | 33,855 |
|
| patient bulk repertoires | 6 CAPRISA patients | 7 | Observed Antibody Space | 184,294 |
Fig 1Workflow of repertoire data retrieval and processing.
Fig 2AIRRscape visualization of immune repertoires.
Heatmaps comparing characteristics of (A) separate and (B) combined datasets for anti-SARS-CoV-2 and anti-HIV-1 antibodies.
Fig 3AIRRscape interface showing antibodies and CDR3 amino acid topology of selected antibody bin.
Selected bin is highlighted in the red box of Fig 2B. SHM values in blue are calculated after an additional tblastn search of the NCBI nr/nt database.
Fig 4AIRRscape heatmaps comparing anti-SARS-CoV-2 antibodies, bulk BCR repertoires of four COVID-19 patients, and a healthy control bulk BCR repertoire.
Fig 5SARS-CoV-2 convergent clonotypes to mAb DH1149 in the 3_6_14 bin.
An 80% identity threshold is used to calculate convergence. Tips are colored by dataset source. Purple tips are published anti-COVID-19 antibodies from 7 different studies, dark gray tips are antibody sequences from a healthy donor BCR repertoire, and orange through brown shaded tips are antibody sequences from COVID-19 patient BCR repertoires.
Fig 6HIV-1 convergent clonotypes to antibodies from Setliff et al. (2018; Fig 4).
(A) Convergent clonotypes to mAb 02-o in the 1_4_13 bin. (B) Convergent clonotypes to mAb 02-s in the 1_4_14 bin. (C) Convergent clonotypes to mAb HK20 in the 1_3_15 bin. A 70% identity threshold is used to calculate convergence. Tips are colored by dataset source. Purple tips are published anti-COVID-19 antibodies, and green shaded tips are antibody sequences from HIV-1 patient BCR repertoires.
Dengue convergence to plasmablast clonal families.
| Clonal family | V_J_CDR3 length | Plasmablast donor | Colombian p13 Bulk matches? | Nicaraguan cohort matches? | Nicaraguan sequence matches | Nicaraguan donors with match |
|---|---|---|---|---|---|---|
|
| 4_5_10 | p13, p20 | yes | yes | 41 |
|
|
| 1_4_15 | p13 | yes | no | ||
|
| 1_5_16 | p13 | yes | no | ||
|
| 1_4_16 | p20 | no | no | ||
|
| 1_3_17 | p13 | yes | no | ||
|
| 1_4_13 | p20 |
| yes | 2 |
|
|
| 1_5_16 | p13 | yes | yes | 24 |
|
|
| 1_5_17 | p13 | yes | yes | 1 |
|
|
| 1_5_20 | p13 | yes | yes | 7 |
|
|
| 3_5_14 | p13 | yes | no | ||
|
| 3_6_23 | p13 | yes | no | ||
|
| 3_4_17 | p20 | no | no | ||
|
| 4_5_18 | p13 | yes | no | ||
|
| 4_4_15 | p20 | no | no | ||
|
| 4_5_23 | p13 | yes | no |
Fig 7Dengue convergent clonotypes to CF1 (Zanini et al. 2018).
An 80% identity threshold is used to calculate convergence. Tips are colored by dataset source. Purple tips are plasmablast sequences reported by Zanini et al. (2018) isolated from two Colombian patients (d13 and d20), blue tips are antibody sequences from the BCR repertoire of patient d13, and gold tips are antibody sequences from a cohort of Nicaraguan patient BCR repertoires.
Fig 8SARS-CoV-2, HIV-1, & dengue convergent clonotypes to anti-SARS-CoV-2 mAb DH1149 in the 3_6_14 bin.
An 80% identity threshold is used to calculate convergence. Tips are colored by dataset source. Purple tips are published anti-COVID-19 antibodies from 7 different studies, dark gray tips are antibody sequences from a healthy donor BCR repertoire, and orange through brown shaded tips are antibody sequences from COVID-19 patient BCR repertoires. Green shaded tips are antibody sequences from HIV-1 patient BCR repertoires. Gold tips are antibody sequences from a cohort of Nicaraguan dengue patient BCR repertoires.