| Literature DB >> 27559159 |
Lisle E Mose1, Sara R Selitsky1, Lisa M Bixby1,2, David L Marron1, Michael D Iglesia3, Jonathan S Serody1,2,4, Charles M Perou1,5, Benjamin G Vincent1,2, Joel S Parker1,6.
Abstract
MOTIVATION: B-cell receptor (BCR) repertoire profiling is an important tool for understanding the biology of diverse immunologic processes. Current methods for analyzing adaptive immune receptor repertoires depend upon PCR amplification of VDJ rearrangements followed by long read amplicon sequencing spanning the VDJ junctions. While this approach has proven to be effective, it is frequently not feasible due to cost or limited sample material. Additionally, there are many existing datasets where short-read RNA sequencing data are available but PCR amplified BCR data are not.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27559159 PMCID: PMC5167060 DOI: 10.1093/bioinformatics/btw526
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.V’DJer features. (a) BCR light and heavy chains can be assembled from a single assay. (b) The isotype of an assembled heavy chain can be identified using the assembled constant region sequence. (c) Relative clone abundance can be accurately measured using reads mapped to assembled clones. (d) Nucleotide resolution assembly provides the ability to perform mutation specific analyses including somatic hypermutation assessment and clonal diversity of the sample
Fig. 2.V’DJer workflow. V’DJer accepts a mapped mRNA-seq BAM file as input. Reads mapping to or having homology with Ig chain specific loci or sequence are extracted along with all unmapped reads and are used to construct a deBruijn graph. The graph is traversed producing putative contigs which are filtered based upon the presence of sequence having homology with anchors arising from germline V and J segments as well as conserved amino acids and read coverage. The final set of assembled contigs spanning most of the V(D)J region and a portion of the constant region is output along with a SAM file of reads mapped to the assembled contigs
Fig. 3.Performance characteristics. (a) Evaluation of ability to detect simulated IgH sequences by depth of sequencing. (b) Quantification results from simulated data show that relative abundance measured by RSEM for clones of varying depths closely matches expectation. (c) Assembled contigs validated by MiSeq sequencing sorted by relative abundance. All contigs comprising at least 1% of the IgH repertoire for a given sample are shown. (d) Assembled IgH contigs validated by MiSeq sequencing for Trinity and V’DJer
Fig. 4.TCGA melanoma results. (a) Total V’DJer abundance measured against reads mapped to IgH constant regions. (b) V’DJer heavy chain abundance is associated with V’Djer light chain abundance. (c) Number of assembled clones per sample. (d) CDR3 length distributions for all assembled contigs (inclusive of conserved Cys and Trp/Phe). (e) Relative abundance of isotype assignments. (f) Isotype specific mutational loads
Fig. 5.Impact of BCR abundance and diversity on survival. (a) Examples of low evenness and high evenness in clone abundance. Larger nodes indicate higher clone abundance. Edges were drawn if Hamming distance is <30% between two sequences. (b) Kaplan–Meier survival curves for the TCGA Melanoma cohort stratified by BCR abundance (high: count >1000, low: count ≤1000) and clone evenness (high: >0.8, low: ≤0.8) into three groups: low abundance, high abundance/high evenness and high abundance/low evenness