AUTHOR=McGill Joseph R. , Yogurtcu Osman N. , Verthelyi Daniela , Yang Hong , Sauna Zuben E. TITLE=SampPick: Selection of a Cohort of Subjects Matching a Population HLA Distribution JOURNAL=Frontiers in Immunology VOLUME=10 YEAR=2019 URL=https://www.frontiersin.org/journals/immunology/articles/10.3389/fimmu.2019.02894 DOI=10.3389/fimmu.2019.02894 ISSN=1664-3224 ABSTRACT=

Immune responses to therapeutic proteins and peptides can adversely affect their safety and efficacy; consequently, immunogenicity risk-assessments are part of the development, licensure and clinical use of these products. In most cases the development of anti-drug antibodies is mediated by T cells which requires antigen presentation by Major Histocompatibility Complex Class II (MHCII) molecules (also called Human Leucocyte Antigen, HLA in humans). Immune responses to many protein therapeutics are thus HLA-restricted and it is important that the distribution of HLA variants used in the immunogenicity assessments provides adequate coverage of the target population. Due to biases inherent to the collection of samples in a blood bank or donor pool, simple random sampling will not achieve a truly representative sample of the population of interest. To help select a donor cohort we introduce SampPick, an implementation of simulated annealing which optimizes cohort selection to closely match the frequency distribution of a target population or subpopulation. With inputs of a target background frequency distribution for a population and a set of available, HLA-typed donors, the algorithm will iteratively create a cohort of donors of a user selected size that will closely match the target population rather than a random sample. In addition to optimizing the HLA types of donor cohorts, the software presented can be used to optimize donor cohorts for any other biallelic or monoallelic trait.