Päivi Saavalainen1,2, Dario Greco3,4,5, Dawit A Yohannes1,2, Katri Kaukinen6, Kalle Kurppa7. 1. Research Programs Unit, Translational Immunology, University of Helsinki, Helsinki, Finland. 2. Department of Medical and Clinical Genetics, University of Helsinki, Helsinki, Finland. 3. Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland. dario.greco@tuni.fi. 4. BioMediTech Institute, Tampere University, Tampere, Finland. dario.greco@tuni.fi. 5. Institute of Biotechnology, University of Helsinki, Helsinki, Finland. dario.greco@tuni.fi. 6. Department of Internal Medicine, Faculty of Medicine and Health Technology, Tampere University Hospital, Tampere University, Tampere, Finland. 7. Department of Pediatrics, Tampere University Hospital and Center for Child Health Research, Tampere University, Tampere, Finland.
Abstract
BACKGROUND: Deep immune receptor sequencing, RepSeq, provides unprecedented opportunities for identifying and studying condition-associated T-cell clonotypes, represented by T-cell receptor (TCR) CDR3 sequences. However, due to the immense diversity of the immune repertoire, identification of condition relevant TCR CDR3s from total repertoires has mostly been limited to either "public" CDR3 sequences or to comparisons of CDR3 frequencies observed in a single individual. A methodology for the identification of condition-associated TCR CDR3s by direct population level comparison of RepSeq samples is currently lacking. RESULTS: We present a method for direct population level comparison of RepSeq samples using immune repertoire sub-units (or sub-repertoires) that are shared across individuals. The method first performs unsupervised clustering of CDR3s within each sample. It then finds matching clusters across samples, called immune sub-repertoires, and performs statistical differential abundance testing at the level of the identified sub-repertoires. It finally ranks CDR3s in differentially abundant sub-repertoires for relevance to the condition. We applied the method on total TCR CDR3β RepSeq datasets of celiac disease patients, as well as on public datasets of yellow fever vaccination. The method successfully identified celiac disease associated CDR3β sequences, as evidenced by considerable agreement of TRBV-gene and positional amino acid usage patterns in the detected CDR3β sequences with previously known CDR3βs specific to gluten in celiac disease. It also successfully recovered significantly high numbers of previously known CDR3β sequences relevant to each condition than would be expected by chance. CONCLUSION: We conclude that immune sub-repertoires of similar immuno-genomic features shared across unrelated individuals can serve as viable units of immune repertoire comparison, serving as proxy for identification of condition-associated CDR3s.
BACKGROUND: Deep immune receptor sequencing, RepSeq, provides unprecedented opportunities for identifying and studying condition-associated T-cell clonotypes, represented by T-cell receptor (TCR) CDR3 sequences. However, due to the immense diversity of the immune repertoire, identification of condition relevant TCRCDR3s from total repertoires has mostly been limited to either "public" CDR3 sequences or to comparisons of CDR3 frequencies observed in a single individual. A methodology for the identification of condition-associated TCRCDR3s by direct population level comparison of RepSeq samples is currently lacking. RESULTS: We present a method for direct population level comparison of RepSeq samples using immune repertoire sub-units (or sub-repertoires) that are shared across individuals. The method first performs unsupervised clustering of CDR3s within each sample. It then finds matching clusters across samples, called immune sub-repertoires, and performs statistical differential abundance testing at the level of the identified sub-repertoires. It finally ranks CDR3s in differentially abundant sub-repertoires for relevance to the condition. We applied the method on total TCRCDR3β RepSeq datasets of celiac diseasepatients, as well as on public datasets of yellow fever vaccination. The method successfully identified celiac disease associated CDR3β sequences, as evidenced by considerable agreement of TRBV-gene and positional amino acid usage patterns in the detected CDR3β sequences with previously known CDR3βs specific to gluten in celiac disease. It also successfully recovered significantly high numbers of previously known CDR3β sequences relevant to each condition than would be expected by chance. CONCLUSION: We conclude that immune sub-repertoires of similar immuno-genomic features shared across unrelated individuals can serve as viable units of immune repertoire comparison, serving as proxy for identification of condition-associated CDR3s.
Authors: Jacob Glanville; Huang Huang; Allison Nau; Olivia Hatton; Lisa E Wagar; Florian Rubelt; Xuhuai Ji; Arnold Han; Sheri M Krams; Christina Pettus; Nikhil Haas; Cecilia S Lindestam Arlehamn; Alessandro Sette; Scott D Boyd; Thomas J Scriba; Olivia M Martinez; Mark M Davis Journal: Nature Date: 2017-06-21 Impact factor: 49.962
Authors: Harlan S Robins; Paulo V Campregher; Santosh K Srivastava; Abigail Wacher; Cameron J Turtle; Orsalem Kahsai; Stanley R Riddell; Edus H Warren; Christopher S Carlson Journal: Blood Date: 2009-08-25 Impact factor: 22.113
Authors: Rama S Akondy; Nathan D Monson; Joseph D Miller; Srilatha Edupuganti; Dirk Teuwen; Hong Wu; Farah Quyyumi; Seema Garg; John D Altman; Carlos Del Rio; Harry L Keyserling; Alexander Ploss; Charles M Rice; Walter A Orenstein; Mark J Mulligan; Rafi Ahmed Journal: J Immunol Date: 2009-12-15 Impact factor: 5.422
Authors: Bhavya Papudeshi; J Matthew Haggerty; Michael Doane; Megan M Morris; Kevin Walsh; Douglas T Beattie; Dnyanada Pande; Parisa Zaeri; Genivaldo G Z Silva; Fabiano Thompson; Robert A Edwards; Elizabeth A Dinsdale Journal: BMC Genomics Date: 2017-11-28 Impact factor: 3.969
Authors: Mikhail V Pogorelyy; Anastasia A Minervina; Maximilian Puelma Touzel; Anastasiia L Sycheva; Ekaterina A Komech; Elena I Kovalenko; Galina G Karganova; Evgeniy S Egorov; Alexander Yu Komkov; Dmitriy M Chudakov; Ilgar Z Mamedov; Thierry Mora; Aleksandra M Walczak; Yuri B Lebedev Journal: Proc Natl Acad Sci U S A Date: 2018-11-20 Impact factor: 11.205
Authors: Koshlan Mayer-Blackwell; Stefan Schattgen; Liel Cohen-Lavi; Jeremy C Crawford; Aisha Souquette; Jessica A Gaevert; Tomer Hertz; Paul G Thomas; Philip Bradley; Andrew Fiore-Gartland Journal: Elife Date: 2021-11-30 Impact factor: 8.140
Authors: Hannah Kockelbergh; Shelley Evans; Tong Deng; Ella Clyne; Anna Kyriakidou; Andreas Economou; Kim Ngan Luu Hoang; Stephen Woodmansey; Andrew Foers; Anna Fowler; Elizabeth J Soilleux Journal: Diagnostics (Basel) Date: 2022-05-13
Authors: Cédric R Weber; Teresa Rubio; Longlong Wang; Wei Zhang; Philippe A Robert; Rahmad Akbar; Igor Snapkov; Jinghua Wu; Marieke L Kuijjer; Sonia Tarazona; Ana Conesa; Geir K Sandve; Xiao Liu; Sai T Reddy; Victor Greiff Journal: Cell Rep Methods Date: 2022-08-22