| Literature DB >> 29867956 |
Scott Christley1, Walter Scarborough2, Eddie Salinas1, William H Rounds1, Inimary T Toby1, John M Fonner2, Mikhail K Levin3, Min Kim1, Stephen A Mock2, Christopher Jordan2, Jared Ostmeyer1, Adam Buntzman4, Florian Rubelt5, Marco L Davila6, Nancy L Monson7,8, Richard H Scheuermann9,10,11, Lindsay G Cowell1.
Abstract
Background: Recent technological advances in immune repertoire sequencing have created tremendous potential for advancing our understanding of adaptive immune response dynamics in various states of health and disease. Immune repertoire sequencing produces large, highly complex data sets, however, which require specialized methods and software tools for their effective analysis and interpretation.Entities:
Keywords: B-cell receptor; Rep-seq; T cell receptor; bioinformatics; cloud computing; immune repertoire
Mesh:
Year: 2018 PMID: 29867956 PMCID: PMC5953328 DOI: 10.3389/fimmu.2018.00976
Source DB: PubMed Journal: Front Immunol ISSN: 1664-3224 Impact factor: 7.561
Figure 1The number of newly published repertoire sequencing papers appearing in PubMed each year over the last 15 years. For 2002, the number was obtained by the query ((repertoire sequencing) AND (2002 [dp])). The number for 2017 is a projection based on the number appearing between January and December 2017. The red arrow indicates the year in which publications demonstrated the feasibility of applying high-throughput sequencing to immune repertoires.
VDJServer release 1.0 API.
| Endpoint | Method | Description |
|---|---|---|
| / | GET | Status of API service |
| /feedback | POST | User feedback |
| /feedback/public | POST | Public feedback |
| /jobs/queue/pending | GET | Pending jobs for project |
| /jobs/queue | POST | Submit job |
| /jobs/archive/:id | POST | Archive job |
| /jobs/unarchive/:id | POST | Unarchive job |
| /notifications/files/import | POST | Agave notifications for file import |
| /notifications/jobs/:id | POST | Agave notifications for job |
| /permissions/metadata | POST | Update user permissions for metadata |
| /permissions/username | POST | Add user permissions on project data |
| /permissions/username | DELETE | Remove user permissions from project data |
| /projects | POST | Create project |
| /projects/:id/metadata/export | GET | Export metadata from project into tab-separated values file |
| /projects/:id/metadata/import | POST | Import metadata into project |
| /public | GET | Query public data |
| /telemetry | POST | Error logging |
| /token | POST | Request an Agave authentication token |
| /token | PUT | Refresh Agave authentication token |
| /user | POST | Create user account |
| /user/change-password | POST | User change password |
| /user/reset-password | POST | Initiate password reset |
| /user/reset-password/verify | POST | Verify password reset |
| /user/:username/verify/email | POST | Send user verification email |
| /user/verify/:id | POST | Verify user |
Figure 2Web browser interface for Project Data Management. Along the top of the screen are tabs for Community Data, which displays public projects, Documentation, which opens up the documentation webpage in a new browser window, and Feedback, which allows you to send a message or ask a question. The left panel provides the list of user projects and the button to add a new project. When a project is selected, a submenu is shown to navigate the primary views: Project Settings, Upload and Browse Project Data, Metadata Entry, Link.fasta/.qual Files, Link Paired Read Files, and View Analyses and Results. The current view shown is Upload and Browse Project Data, which shows the list of user-uploaded files and some output files from a VDJPipe preprocessing job. The top of the view has project information, buttons for uploading files and running analysis jobs, and a search window to narrow the list of files displayed.
Figure 3VDJServer analysis workflow overview.
Figure 4Preprocessing visualizations. All of the charts provide pre- and post-filtering statistics for side-by-side comparison. Full size versions of the figures are provided in Supplementary Material. (A) Nucleotide composition for each read position. Each colored line represents the percentage of a nucleotide or ambiguous base call (N) at each read position. The legend functions as a toggle to set which lines are shown. This figure is currently showing the composition pre-filtering for A, C, G, and T nucleotides. (B) GC content histogram. The graph shows the number of reads along the Y axis and GC percentage along the X axis. The red curve indicates the number for pre-filter reads, and the blue curve represents the number for post-filter reads. (C) Sequence length histogram. The graph shows the number of reads along the Y axis and sequence length along the X axis. The red curve shows the number of sequences of each length for pre-filter reads, the blue curve shows the number for post-filter reads, and the magenta curve shows the pre/post difference. The graph clearly shows that a length filter of 200 was used. (D) Mean quality score histogram. The graph shows the number of reads along the Y axis and the average quality score along the X axis. The red curve shows the number of sequences for pre-filter reads, and the blue curve shows the number for post-filter reads. The magenta and black vertical lines show the median score (36) for the pre- and post-filter reads, respectively. (E) Quality score distribution for each read position. Quality score is shown on the Y axis, and read position is shown on the X axis. At each position, the box-and-whiskers plot shows the median quality score, inter-quartile range, and the 10th and 90th quantiles. The X axis has zoom control, which is set to positions 125–250 in this figure.
Figure 5Repertoire characterization and comparison visualizations. For simplicity, the interface buttons for selecting which samples and sample groups to display are not shown. Full size versions of the figures are provided in Supplementary Material. (A) Gene segment usage histogram. The graph shows IGH V gene segments along the X axis and the percentage of reads assigned to each V gene segment along the Y axis. The percentage is currently being displayed for two samples, as indicated by the blue and black bars. (B) CDR3 length histogram. CDR3 length is shown on the X axis, and the percentage of reads with that CDR3 length is shown on the Y axis. The percentage is currently shown for two sample groups as indicated by the blue and black bars. When the data for sample groups, rather than samples, are displayed, the bar height represents the average percentage across all samples in the group, and the error bars indicate the SD. (C) Ranked clonal abundance percentage. Each colored line represents a sample with the clones ranked from highest abundance (rank 1) to lowest abundance along the X axis and the corresponding percentage of reads for each clone along the Y axis. (D) Cumulative clonal abundance. Each colored line represents a sample with clones ranked from highest abundance (rank 1) to lowest abundance along the X axis and the cumulative percentage along the Y axis. (E) Diversity profile. Each colored line represents a sample where clonal diversity along the Y axis is calculated across a sweep of the ordering parameter (Q) along the X axis. A point of Q = 1 corresponds to the Shannon entropy. (F) Quantification of selection pressure for CDR and framework regions. Region is shown on the X axis, and the value of the selection parameter is shown on the Y axis. Negative bar values indicate negative selection, and positive bar values indicate positive selection. The error bars show the 95% confidence interval of the selection parameter along with a p-value for significance. Two B cell samples are currently displayed as indicated by the blue and black bars.