Literature DB >> 23303508

Decombinator: a tool for fast, efficient gene assignment in T-cell receptor sequences using a finite state machine.

Niclas Thomas1, James Heather, Wilfred Ndifon, John Shawe-Taylor, Benjamin Chain.   

Abstract

SUMMARY: High-throughput sequencing provides an opportunity to analyse the repertoire of antigen-specific receptors with an unprecedented breadth and depth. However, the quantity of raw data produced by this technology requires efficient ways to categorize and store the output for subsequent analysis. To this end, we have defined a simple five-item identifier that uniquely and unambiguously defines each TcR sequence. We then describe a novel application of finite-state automaton to map Illumina short-read sequence data for individual TcRs to their respective identifier. An extension of the standard algorithm is also described, which allows for the presence of single-base pair mismatches arising from sequencing error. The software package, named Decombinator, is tested first on a set of artificial in silico sequences and then on a set of published human TcR-β sequences. Decombinator assigned sequences at a rate more than two orders of magnitude faster than that achieved by classical pairwise alignment algorithms, and with a high degree of accuracy (>88%), even after introducing up to 1% error rates in the in silico sequences. Analysis of the published sequence dataset highlighted the strong V and J usage bias observed in the human peripheral blood repertoire, which seems to be unconnected to antigen exposure. The analysis also highlighted the enormous size of the available repertoire and the challenge of obtaining a comprehensive description for it. The Decombinator package will be a valuable tool for further in-depth analysis of the T-cell repertoire.
AVAILABILITY AND IMPLEMENTATION: The Decombinator package is implemented in Python (v2.6) and is freely available at https://github.com/uclinfectionimmunity/Decombinator along with full documentation and examples of typical usage.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 23303508     DOI: 10.1093/bioinformatics/btt004

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  58 in total

1.  TCRβ repertoire of CD4+ and CD8+ T cells is distinct in richness, distribution, and CDR3 amino acid composition.

Authors:  Hoi Ming Li; Toyoko Hiroi; Yongqing Zhang; Alvin Shi; Guobing Chen; Supriyo De; E Jeffrey Metter; William H Wood; Alexei Sharov; Joshua D Milner; Kevin G Becker; Ming Zhan; Nan-ping Weng
Journal:  J Leukoc Biol       Date:  2015-09-22       Impact factor: 4.962

2.  IMonitor: A Robust Pipeline for TCR and BCR Repertoire Analysis.

Authors:  Wei Zhang; Yuanping Du; Zheng Su; Changxi Wang; Xiaojing Zeng; Ruifang Zhang; Xueyu Hong; Chao Nie; Jinghua Wu; Hongzhi Cao; Xun Xu; Xiao Liu
Journal:  Genetics       Date:  2015-08-21       Impact factor: 4.562

3.  CapTCR-seq: hybrid capture for T-cell receptor repertoire profiling.

Authors:  David T Mulder; Etienne R Mahé; Mark Dowar; Youstina Hanna; Tiantian Li; Linh T Nguyen; Marcus O Butler; Naoto Hirano; Jan Delabie; Pamela S Ohashi; Trevor J Pugh
Journal:  Blood Adv       Date:  2018-12-11

4.  MiXCR: software for comprehensive adaptive immunity profiling.

Authors:  Dmitriy A Bolotin; Stanislav Poslavsky; Igor Mitrophanov; Mikhail Shugay; Ilgar Z Mamedov; Ekaterina V Putintseva; Dmitriy M Chudakov
Journal:  Nat Methods       Date:  2015-05       Impact factor: 28.547

5.  pRESTO: a toolkit for processing high-throughput sequencing raw reads of lymphocyte receptor repertoires.

Authors:  Jason A Vander Heiden; Gur Yaari; Mohamed Uduman; Joel N H Stern; Kevin C O'Connor; David A Hafler; Francois Vigneault; Steven H Kleinstein
Journal:  Bioinformatics       Date:  2014-03-10       Impact factor: 6.937

Review 6.  Unifying immunology with informatics and multiscale biology.

Authors:  Brian A Kidd; Lauren A Peters; Eric E Schadt; Joel T Dudley
Journal:  Nat Immunol       Date:  2014-02       Impact factor: 25.606

7.  Benchmarking of T cell receptor repertoire profiling methods reveals large systematic biases.

Authors:  Pierre Barennes; Valentin Quiniou; Mikhail Shugay; Evgeniy S Egorov; Alexey N Davydov; Dmitriy M Chudakov; Imran Uddin; Mazlina Ismail; Theres Oakes; Benny Chain; Anne Eugster; Karl Kashofer; Peter P Rainer; Samuel Darko; Amy Ransier; Daniel C Douek; David Klatzmann; Encarnita Mariotti-Ferrandiz
Journal:  Nat Biotechnol       Date:  2020-09-07       Impact factor: 54.908

8.  repgenHMM: a dynamic programming tool to infer the rules of immune receptor generation from sequence data.

Authors:  Yuval Elhanati; Quentin Marcou; Thierry Mora; Aleksandra M Walczak
Journal:  Bioinformatics       Date:  2016-02-26       Impact factor: 6.937

Review 9.  The era of immunogenomics/immunopharmacogenomics.

Authors:  Makda Zewde; Kazuma Kiyotani; Jae-Hyun Park; Hua Fang; Kai Lee Yap; Poh Yin Yew; Houda Alachkar; Taigo Kato; Tu H Mai; Yuji Ikeda; Tatsuo Matsuda; Xiao Liu; Lili Ren; Boya Deng; Makiko Harada; Yusuke Nakamura
Journal:  J Hum Genet       Date:  2018-05-21       Impact factor: 3.172

10.  Ultrasensitive detection of TCR hypervariable-region sequences in solid-tissue RNA-seq data.

Authors:  Bo Li; Taiwen Li; Binbin Wang; Ruoxu Dou; Jian Zhang; Jun S Liu; X Shirley Liu
Journal:  Nat Genet       Date:  2017-03-30       Impact factor: 38.330

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.