| Literature DB >> 25404364 |
Xi Yang1, Di Liu2, Na Lv3, Fangqing Zhao4, Fei Liu3, Jing Zou3, Yan Chen3, Xue Xiao3, Jun Wu3, Peipei Liu3, Jing Gao3, Yongfei Hu5, Yi Shi4, Jun Liu6, Ruifen Zhang3, Chen Chen7, Juncai Ma8, George F Gao9, Baoli Zhu10.
Abstract
The next-generation sequencing technology has promoted the study on human TCR repertoire, which is essential for the adaptive immunity. To decipher the complexity of TCR repertoire, we developed an integrated pipeline, TCRklass, using K-string-based algorithm that has significantly improved the accuracy and performance over existing tools. We tested TCRklass using manually curated short read datasets in comparison with in silico datasets; it showed higher precision and recall rates on CDR3 identification. We applied TCRklass on large datasets of two human and three mouse TCR repertoires; it demonstrated higher reliability on CDR3 identification and much less biased V/J profiling, which are the two components contributing the diversity of the repertoire. Because of the sequencing cost, short paired-end reads generated by next-generation sequencing technology are and will remain the main source of data, and we believe that the TCRklass is a useful and reliable toolkit for TCR repertoire analysis.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25404364 DOI: 10.4049/jimmunol.1400711
Source DB: PubMed Journal: J Immunol ISSN: 0022-1767 Impact factor: 5.422