| Literature DB >> 23977981 |
Miri Michaeli1, Michal Barak, Lena Hazanov, Hila Noga, Ramit Mehr.
Abstract
BACKGROUND: Immunoglobulin (that is, antibody) and T cell receptor genes are created through somatic gene rearrangement from gene segment libraries. Immunoglobulin genes are further diversified by somatic hypermutation and selection during the immune response. Studying the repertoires of these genes yields valuable insights into immune system function in infections, aging, autoimmune diseases and cancers. The introduction of high throughput sequencing has generated unprecedented amounts of repertoire and mutation data from immunoglobulin genes. However, common analysis programs are not appropriate for pre-processing and analyzing these data due to the lack of a template or reference for the whole gene.Entities:
Year: 2013 PMID: 23977981 PMCID: PMC3846367 DOI: 10.1186/2043-9113-3-15
Source DB: PubMed Journal: J Clin Bioinforma ISSN: 2043-9113
Figure 1A scheme of the cleaning and analysis pipeline of high throughput Ig gene sequences. The process starts with raw data reads from high throughput sequencing (right column). Each rectangle in the right column represents an independent module or program that is used in the analysis pipeline, and thus can be skipped through. The Automation program is represented in the right column as a single step, and the lines coming out of the “Automation” rectangles lead to the left column which presents the Automation program flow, which is constructed of several steps as detailed in the manuscript. The Automation program receives as input a file of sequences, and creates metadata and alignment files for each clone for further analyses. The arrows in the right column represent the recommended flow of the cleaning and analysis process as created and done by the authors. The arrows in the left column represent the flow of the Automation program, as discussed in the manuscript.
Figure 2Average percentages of clones in each VH-JH combination, in human LNs. This graph represents one of the possible outcomes of our proposed pipeline, which, in this case, contributes to Ig gene repertoire diversity analyses.