| Literature DB >> 34655133 |
Tobias H Olsen1, Fergus Boyles1, Charlotte M Deane1.
Abstract
The antibody repertoires of individuals and groups have been used to explore disease states, understand vaccine responses, and drive therapeutic development. The arrival of B-cell receptor repertoire sequencing has enabled researchers to get a snapshot of these antibody repertoires, and as more data are generated, increasingly in-depth studies are possible. However, most publicly available data only exist as raw FASTQ files, making the data hard to access, process, and compare. The Observed Antibody Space (OAS) database was created in 2018 to offer clean, annotated, and translated repertoire data. In this paper, we describe an update to OAS that has been driven by the increasing volume of data and the appearance of paired (VH/VL) sequence data. OAS is now accessible via a new web server, with standardized search parameters and a new sequence-based search option. The new database provides both nucleotides and amino acids for every sequence, with additional sequence annotations to make the data Minimal Information about Adaptive Immune Receptor Repertoire compliant, and comments on potential problems with the sequence. OAS now contains 25 new studies, including severe acute respiratory syndrome coronavirus 2 data and paired sequencing data. The new database is accessible at http://opig.stats.ox.ac.uk/webapps/oas/, and all data are freely available for download.Entities:
Keywords: BCR-seq; Observed Antibody Space (OAS); annotated antibody sequences; antibody database; antibody repertoire; antibody sequence
Mesh:
Substances:
Year: 2021 PMID: 34655133 PMCID: PMC8740823 DOI: 10.1002/pro.4205
Source DB: PubMed Journal: Protein Sci ISSN: 0961-8368 Impact factor: 6.725
FIGURE 1Overview of the structure of an antibody and the sequence of its variable regions. An antibody contains two heavy (blue) and two light (red) chains, with each chain separable into one or more conserved (C) and one variable (V) region. The paired heavy and light V regions, annotated as VH and VL respectively, contain the binding site
FIGURE 2Downloading from OAS. (a) The sequence search tab for unpaired sequences, with the search options filled for heavy chain sequences from SARS‐CoV‐2 infected patients (shown with red arrows). (b) The search result, with each data unit matching the search and a downloadable link containing the links for the relevant data units (with a red arrow)