| Literature DB >> 27297497 |
Dmitriy V Bagaev1, Ivan V Zvyagin1,2,3, Ekaterina V Putintseva1,3, Mark Izraelson1,2,3, Olga V Britanova1,2,3, Dmitriy M Chudakov4,5,6, Mikhail Shugay7,8.
Abstract
BACKGROUND: The repertoire of T- and B-cell receptor sequences encodes the antigen specificity of adaptive immunity system, determines its present state and guides its ability to mount effective response against encountered antigens in future. High throughput sequencing of immune repertoires (Rep-Seq) is a promising technique that allows to profile millions of antigen receptors of an individual in a single experiment. While a substantial number of tools for mapping and assembling Rep-Seq data were published recently, the field still lacks an intuitive and flexible tool that can be used by researchers with little or no computational background for in-depth analysis of immune repertoire profiles.Entities:
Keywords: B-cell; Browser; High-throughput sequencing; Immunology; Repertoire sequencing; T-cell
Mesh:
Substances:
Year: 2016 PMID: 27297497 PMCID: PMC4907000 DOI: 10.1186/s12864-016-2799-7
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Spectratypes of 6 and 64 years old donors. Each bin of the histograms corresponds to the length of CDR3 nucleotide sequence, clonotypes were weighted by their frequency and the fraction of top 10 most abundant clonotypes is shown with colored bars. Note that bars at CDR3 lengths that are not a multiple of 3 represent out-of-frame clonotypes
Fig. 2Variable segment spectratype and Variable-Joining segment usage chord diagram. a Distribution of CDR3 nucleotide sequence lengths weighted by clonotype frequency. Most enriched Variable segments are explicitly shown. b Chord diagram of Variable-Joining junction abundance. Segment lengths are scaled according to the abundance of a specific segment, arc widths scaled by the abundance of corresponding Variable-Joining junctions
Fig. 3VDJviz clonality plot, a nested pie chart divided into the following regions: singletons (clonotypes represented by a single read), doubletons (2 reads), high order (3 and more reads). High order clonotypes are divided into five quantiles (top 20 % of unique high order clonotypes and so on). Top ten clonotypes of the first quantile are explicitly shown. Size of each segment is the cumulative frequency of all clonotypes that fall into corresponding frequency category
Fig. 4VDJviz interactive rarefaction plot (diversity vs sampling depth) for T-cell repertoires from two replicate PBMC samples processed with various error correction strategies including quality filtering (q20 and q35 thresholds), elimination of clonotypes encountered only in one of the samples (“intersection”) and frequency-based error correction (“freq”). Solid lines show rarefaction curves computed using observed clonotype frequencies, dashed lines represent their extrapolations. Note that the expected sample diversity is ~90 clonotypes according to UMI-corrected data
Fig. 5VDJviz clonotype browser interface snapshots showing clonotypes matching a given CDR3 amino acid sequence in a single sample (a) and across multiple samples (b). a A trace of erroneous variants for one of the top clonotypes from sample2_q35 dataset described in example#4. b Matching the CDR3 nucleotide sequence of a cancer clonotype in post-treatment samples. The panel shows presence of minimal residual disease in corresponding patient (D29_17), as well as cross-sample contamination in two other patients
Fig. 6A snapshot of clonotype sharing (Join sample tab of VDJviz) across multiple samples. Clonotypes of 41 healthy donors of various ages were matched by their CDR3 amino acid sequence selecting the ones that were present in at least 10 repertoires