AUTHOR=Grazioli Filippo , Mösch Anja , Machart Pierre , Li Kai , Alqassem Israa , O’Donnell Timothy J. , Min Martin Renqiang TITLE=On TCR binding predictors failing to generalize to unseen peptides JOURNAL=Frontiers in Immunology VOLUME=13 YEAR=2022 URL=https://www.frontiersin.org/journals/immunology/articles/10.3389/fimmu.2022.1014256 DOI=10.3389/fimmu.2022.1014256 ISSN=1664-3224 ABSTRACT=
Several recent studies investigate TCR-peptide/-pMHC binding prediction using machine learning or deep learning approaches. Many of these methods achieve impressive results on test sets, which include peptide sequences that are also included in the training set. In this work, we investigate how state-of-the-art deep learning models for TCR-peptide/-pMHC binding prediction generalize to unseen peptides. We create a dataset including positive samples from IEDB, VDJdb, McPAS-TCR, and the MIRA set, as well as negative samples from both randomization and 10X Genomics assays. We name this collection of samples