H3N2 influenza hemagglutination inhibition method qualification with data driven statistical methods for human clinical trials

Sawant, Sheetal; Gurley, Sarah Anne; Overman, R. Glenn; Sharak, Angelina; Mudrak, Sarah V.; Oguin, Thomas; Sempowski, Gregory D.; Sarzotti-Kelsoe, Marcella; Walter, Emmanuel B.; Xie, Hang; Pasetti, Marcela F.; Moody, M. Anthony; Tomaras, Georgia D.

doi:10.3389/fimmu.2023.1155880

ORIGINAL RESEARCH article

Front. Immunol., 06 April 2023

Sec. Vaccines and Molecular Therapeutics

Volume 14 - 2023 | https://doi.org/10.3389/fimmu.2023.1155880

This article is part of the Research TopicDevelopment and standardization of assays to assess immunogenicity and correlates of protection of vaccines against respiratory viral infectionsView all 14 articles

H3N2 influenza hemagglutination inhibition method qualification with data driven statistical methods for human clinical trials

Sheetal Sawant^1,2,3‡

Sarah Anne Gurley^1,2,3‡

R. Glenn Overman^1,2,3

Angelina Sharak^1,2,3

Sarah V. Mudrak^1,2,3

Thomas Oguin III²

Gregory D. Sempowski^2†

Marcella Sarzotti-Kelsoe^1,2,3

Emmanuel B. Walter^2,4,5

Hang Xie⁶

Marcela F. Pasetti^7,8

M. Anthony Moody^2,3,4

Georgia D. Tomaras^1,2,3,5*

¹Center for Human Systems Immunology, Department of Surgery, Duke University, Durham, NC, United States
²Duke Human Vaccine Institute, Duke University, Durham, NC, United States
³Department of Immunology, Duke University, Durham, NC, United States
⁴Department of Pediatrics, Duke University, Durham, NC, United States
⁵Duke Global Health Institute, Duke University, Durham, NC, United States
⁶Division of Viral Products, Office of Vaccines Research and Review, Center for Biologics Evaluation and Research, U.S. Food and Drug Administration, Silver Spring, MD, United States
⁷Department of Pediatrics, University of Maryland School of Medicine, Baltimore, MD, United States
⁸Center for Vaccine Development, University of Maryland School of Medicine, Baltimore, MD, United States

Introduction: Hemagglutination inhibition (HAI) antibody titers to seasonal influenza strains are important surrogates for vaccine-elicited protection. However, HAI assays can be variable across labs, with low sensitivity across diverse viruses due to lack of standardization. Performing qualification of these assays on a strain specific level enables the precise and accurate quantification of HAI titers. Influenza A (H3N2) continues to be a predominant circulating subtype in most countries in Europe and North America since 1968 and is thus a focus of influenza vaccine research.

Methods: As a part of the National Institutes of Health (NIH)-funded Collaborative Influenza Vaccine Innovation Centers (CIVICs) program, we report on the identification of a robust assay design, rigorous statistical analysis, and complete qualification of an HAI assay using A/Texas/71/2017 as a representative H3N2 strain and guinea pig red blood cells and neuraminidase (NA) inhibitor oseltamivir to prevent NA-mediated agglutination.

Results: This qualified HAI assay is precise (calculated by the geometric coefficient of variation (GCV)) for intermediate precision and intra-operator variability, accurate calculated by relative error, perfectly linear (slope of -1, R-Square 1), robust (<25% GCV) and depicts high specificity and sensitivity. This HAI method was successfully qualified for another H3N2 influenza strain A/Singapore/INFIMH-16-0019/2016, meeting all pre-specified acceptance criteria.

Discussion: These results demonstrate that HAI qualification and data generation for new influenza strains can be achieved efficiently with minimal extra testing and development. We report on a qualified and adaptable influenza serology method and analysis strategy to measure quantifiable HAI titers to define correlates of vaccine mediated protection in human clinical trials.

Introduction

Influenza A (H3N2) has become a predominant circulating subtype post the 2009 H1N1 pandemic and is therefore a focus of influenza vaccine research (1). Seasonal influenza vaccines led to decreases in infection but the elderly, immunocompromised individuals, and individuals with chronic illnesses remain at risk for severe infection and young children and adults without pre-existing immunity could also be vulnerable to novel influenza strains (e.g., the 2009 H1N1 pandemic) (2, 3). The emergence of highly pathogenic avian influenza and other zoonotic influenza viruses poses a continued threat to the public health. Therefore, improved prevention and management approaches for seasonal and pandemic influenza across all populations are urgently needed, including the development of new universal vaccines that offer broad and durable protection (4).

The National Institute of Allergy and Infectious Diseases (NIAID) established the Collaborative Influenza Vaccine Innovation Centers (CIVICs) program. The CIVICs program is a network of research centers and cores that work together to advance the production and clinical testing of improved seasonal and universal influenza vaccines. The most promising vaccine candidates are advanced into Phase I and II clinical trials. The Duke Center for Human Systems Immunology (CHSI) as part of the CIVICs program standardizes and qualifies endpoint serology assays such as the hemagglutination inhibition (HAI) assay for evaluating influenza specific antibodies in CIVICs clinical trials and controlled human challenge studies.

The HAI assay is the most utilized canonical method to quantify influenza specific antibodies for influenza vaccination clinical studies. The principle of the HAI method is to take advantage of the ability of the influenza HA proteins to bind to, or agglutinate, the sialic acids on avian or mammalian RBCs. The agglutination by the HA proteins holds RBCs into a lattice formation and prevents their precipitation. The HA-guinea pig RBC lattice appears as a cloudy pink haze in the microtiter plate well as opposed to the halo morphology typical of precipitated un-agglutinated guinea pig RBCs. HA-specific antibodies can block the formation of HA/RBC lattice resulting in the precipitation of un-agglutinated guinea pig RBCs. The antibody titer corresponds to the inverse of the serum dilution of the last well that contains an RBC precipitate similar in size and morphology to the RBC control wells. The accurate and precise measure of antibody titers through the HAI assay contributes to advancing vaccine research by enabling a quantitative comparison of antibodies elicited by different vaccine regimens. For example, HAI antibody titers to strains predicted to be predominant in the next flu season can be used to estimate vaccine efficacy in simulated vaccine trials (5). Moreover, HAI antibody titers have been used in clinical investigations to evaluate vaccine immunogenicity and predict the proportion of vaccine induced protection (3, 6). It is accepted by regulatory agencies for vaccine licensure.

As the HAI assay must be standardized for each specific influenza virus strain, controlling for technical variables such as type and concentration of red blood cells, incubation times, positive and negative controls and the specialized expertise required to determine HAI antibody titers by visual observation, comparison of results across laboratories can be difficult. Method qualification or validation, which includes development of standardized protocols and establishes the parameters of the assay, can be used to ensure reliable and reproduceable results across laboratories (7, 8). Multiple groups (7, 9–11) have reported on the standardization, qualification, validation and optimization of an HAI assay, and we provide here a concise tabulated summary comparing assay designs and analysis methods made available by these research groups, as well as those presented by our team (Table 1).

TABLE 1

Table 1 Table comparing the previous literature on HAI assay development efforts in terms of assay design, data processing and statistical analysis details.

Here, we report on the qualification of an HAI assay for two representative H3N2 influenza strains, A/Texas/71/2017 and A/Singapore/INFIMH-16-0019/2016. to support work of CIVICs human influenza challenge study ClinicalTrials.gov Identifier: NCT04978454. A/Singapore/INFIMH-16-0019/2016 chosen to support work of CIVICs influenza vaccine study ClinicalTrials.gov Identifier: NCT04960397. Quantification of antibody titers by the HAI assay is dependent on the process of hemagglutination, or the binding of hemagglutinin glycoproteins on the surface of influenza virus to sialic acid receptors on red blood cells (RBCs). The determination of precise and accurate HAI titers can prove challenging due to the inability of modern H3N2 influenza strains to agglutinate avian RBCs and the acquired ability of these strains to agglutinate RBCs through neuraminidase (NA) activity (13). Here we have leveraged the use of guinea pig RBCs and the inclusion of the neuraminidase inhibitor Oseltamivir to prevent NA-mediated agglutination (13). Qualification of the HAI assays was performed in accordance with the United States Food and Drug Administration (FDA) Guidance for Industry: Bioanalytical Method Validation (May 2018) (14), the International Council for Harmonization [ICH] Tripartite Guideline (15) and guidance from NIAID/DMID, as relevant. System suitability criteria was evaluated to ensure the data was suitable for inclusion in the qualification parameter analysis and can serve an important first step for labs attempting to qualify new strains. Parameters tested for this assay qualification include matrix effect, linearity, precision including intermediate precision, accuracy, range, limits of detection and quantitation, specificity, and robustness. We additionally report here on improved methods of data processing, documentation, traceability, and analysis to ensure efficient and accurate data interpretation.

Materials and methods

Influenza strains

A/Texas/71/2017 (H3N2, International Reagent Resource Cat# FR-1622) was propagated in Madin-Darby Canine Kidney cells (MDCK) cells (ATCC, Cat# CCL-34) and used for the initial qualification. A/Singapore/INFIMH-16-0019/2016 (H3N2, International Reagent Resource, Cat# FR-1590) was also propagated in MDCK cells and used in the extended partial qualification. Two lots were tested during robustness analysis.

Guinea pig red blood cells

Guinea pig RBCs (100%, Innovative Research, Cat# IGPRBC10ML, Lot # 34252-01, 34252-02, 35043, 37615) were prepared at 0.75% in phosphate buffered saline (PBS, pH 7.4 Ca2+ & Mg2+ free, Gibco, Cat # 10010-023) for qualification analysis. Two lots of RBCs were tested during robustness analysis.

Oseltamivir phosphate

The neuraminidase inhibitor, Oseltamivir (Selleckchem, Cat # S2597, Lot# S259705), was added into the 0.75% guinea pig RBC solution at a final concentration of 20nM. Oseltamivir was also added into the PBS used to dilute the serum samples at a final concentration of 20nM.

Receptor Destroying Enzyme II

All sera and plasma samples were treated with Receptor Destroying Enzyme type II (RDE II) prior to use in the HAI assay. The lyophilized receptor destroying enzyme II (Hardy Cat # 370013, Lot# 600092, 631082) was reconstituted using phosphate buffered saline (PBS, pH 7.2 Ca2+ & Mg2+ free, Gibco Cat # 20012-027). The samples were treated with RDE II at a 3:1 ratio at 37°C in a heating block for 18 hours. The RDE II was then heat inactivated at 56°C in a heating block for 45 minutes. Following heat inactivation, the RDE II treated samples were diluted with PBS (pH 7.4 Ca2+ & Mg2+ free, Gibco Cat # 10010-023) to bring the samples to a final dilution of 1:10. Diluted samples were aliquoted if needed for multiple tests and stored at -20°C.

Positive and negative controls

A panel of H3 reactive monoclonal antibodies was evaluated, leading to the selection of Ab2210 IgG1 as the positive control (Lot # 99BMH, 33JWM). Ab2210 IgG1 binds to the apex of the HA protein (16) and has an HAI titer of 640 against A/Texas/71/2017 and 160 against A/Singapore/INFIMH-16-0019/2016 when used at 100 µg/ml starting concentration, (1:10 dilution of 1mg/ml stock). The HIV specific monoclonal antibody 7B2 IgG1 mAb (Lot # 180615PPF) (17) and Anti-West Nile Virus-E protein (WNV-E, Clone MGAWN1 reference lot 1-FIN-1027 humanized IgG1, BEI Resources, Cat # NR-31082, Lot# 61277164) were used as negative assay controls at 10 µg/ml starting concentration, prepared as a 1:10 dilution of 0.1 mg/ml stock. These controls were used in determining the system suitability criteria and throughout the HAI assay qualification experiments.

Test samples - A/Texas/71/2017 (H3N2) qualification

•Linearity, precision, accuracy, range, LOD, LOQ and robustness testing: There are currently no commercially available reference standards with known HAI titers against influenza A/Texas/71/2017. A small volume of human pooled convalescent serum to A/Texas/71/2017 was provided to Duke University from the Centers for Disease Control and Prevention (CDC). Due to the limited amount of the CDC antiserum, clinical plasma samples from patients vaccinated with the 2019-2020 seasonal influenza vaccine were profiled for HAI titers against A/Texas/71/2017 to create a panel of sixteen samples with a range of antibody levels. Two of these plasma samples with HAI titers ranging from 320 to 1,280 were pooled to serve as an in-house reference standard (IHRS).Serum samples from this vaccinated cohort were not available. All clinical samples were used with IRB approval.

•Matrix effect testing: Normal human serum (Sigma-Aldrich, Cat # H4522, Lot # SLBX6353) and influenza negative human plasma (BioIVT, Cat # HMPLCPD-RPP1, Lot # HMN410054/00002) with HAI titers ≤ 20 were used as the base for matrix effect testing. The normal human serum was also included in specificity testing.

•Specificity testing: World Health Organization (WHO) serum and supplemental antiserum were obtained through the International Reagent Resource (IRR), Influenza Division, WHO Collaborating Center for Surveillance, Epidemiology and Control of Influenza, Centers for Disease Control and Prevention, Atlanta, GA, USA.

◦Influenza Normal Control Goat Serum (IRR Cat # FR-1377, Lot # 63461731) and Influenza A(H7N9) Reference Ferret Antiserum (IRR Cat # FR-1250, Lot # 61982458) have HAI titers ≤20 and were considered negative when used during specificity testing.

◦The 2014-2015 WHO Antiserum, Influenza A(H3) Reference Goat Antiserum (IRR Cat # FR-1351, Lot # 1415H3AS) and 2019-2020 WHO Antiserum, Influenza A(H3) Reference Goat Antiserum (IRR Cat # FR-1683, Lot # 1920H3AS) have HAI titers ≥ 40 and were considered positive when used during specificity testing.

•Sensitivity testing: A panel of twenty serum samples purported to have limited or no cross reactivity to influenza A/Texas/71/2017 (provided courtesy of NIH/NIAID Division of Microbiology and Infectious Diseases, DMID 10-0016 ClinicalTrials.gov Identifier: NCT01317745, DMID 05-0130 ClinicalTrials.gov Identifier: NCT00311675) were used for sensitivity testing during qualification.

•Seroprevalence survey: Ten HIV seronegative human serum samples with unknown influenza status (BioreclamationIVT/BioIVT) were used for a preliminary determination of the seroprevalence of HAI titers against A/Texas/71/2017.

Test samples - A/Singapore/INFIMH-16-0019/2016 (H3N2) qualification

•Linearity, precision, accuracy, range, LOD and LOQ testing: A panel of twenty human serum samples consisting of 13 unique subject IDs and sample days 8, 36, 57 and 209 with known positive HAI titers (FluGen H3N2-V003 ClinicalTrials.gov Identifier: NCT03999554) was profiled for HAI titers against A/Singapore/INFIMH-16-0019/2016. Five of these serum samples with HAI titers ranging from 640 to 1,280 were pooled to serve as an in-house reference standard (IHRS)

•Influenza A(H3N2)v Reference Ferret Antiserum (IRR Cat # FR-1000, Lot # 60711729); 2019-2020 WHO Antiserum, Influenza A(H1N1)pdm09 Reference Goat Antiserum (IRR Cat # FR-1682, Lot # 1920H1AS); 2018-2019 WHO Antiserum, Influenza B Reference Goat Antiserum, B/Victoria Lineage (IRR Cat # FR-1613, Lot # 1819BVAS); 2019-2020 WHO Antiserum, Influenza B Reference Goat Antiserum, B/Yamagata Lineage (IRR Cat # FR-1685, Lot # 1920BYAS); Influenza A(H7N9) Reference Ferret Antiserum (IRR Cat # FR-1250, Lot # 61982458) have HAI titers ≤10 and were considered negative when used during specificity testing. Normal Goat Serum (MP Biomedicals Cat# 2939149, Lot # S1608) was used as a negative sample in specificity testing.

•The 2016-2017 WHO Antiserum, Influenza A(H3) Reference Goat Antiserum (IRR Cat # FR-1487, Lot # 1617H3AS); 2017-2018 WHO Antiserum, Influenza A(H3) Reference Goat Antiserum (IRR Cat # FR-1562, Lot # 1718H3AS); 2018-2019 WHO Antiserum, Influenza A(H3) Reference Goat Antiserum (IRR Cat # FR-1612, Lot # 1819H3AS); and 2019-2020 WHO Antiserum, Influenza A(H3) Reference Goat Antiserum (IRR Cat # FR-1683, Lot # 1920H3AS) have HAI titers ≥ 640 and were considered positive when used during specificity testing.

Hemagglutination inhibition assay

Using an established HAI protocol as the template (18), the HAI assay was performed by adding 25 µl of PBS (pH 7.4) to column 1 and to columns 3-11 of the 96 well U bottom plate for serum dilutions. For the back titration control, 50 µls of PBS was added to all wells of the row. A red blood cell control was included in column 12, 50 µl of PBS was added to this column. The RBC control contained only RBCs without sample or virus. A serum control was included in column 1, 25 µl of diluted RDE treated serum samples were added to this column to monitor non-specific agglutination in the individual serum samples. To perform serum dilutions, 50 µl of diluted RDE treated serum samples were added to column 2 of the plate. 50 µl of the positive and negative controls were also added into column 2 of their respective control rows. Two-fold serial dilutions were performed, discarding pipet tips after each mixing step. The dilution series was continued from column 3 to column 11, discarding the remaining 25 µl from column 11. The influenza virus was removed from the -80°C freezer and thawed at room temperature immediately before use. The final HA unit of the virus was adjusted to 8 HA units with PBS (pH 7.4) and 25 µl of diluted virus was added to columns 2 - 11, except for the back titration control row. To perform the back titration control, 50 µls of stock virus was added into column 2 of the back titration control row and mixed several times to perform an initial 1:2 dilution. Two-fold serial dilutions of the back titration control row were performed. Plates were tapped gently to mix the serum and virus then incubated for 30 minutes at room temperature. Immediately prior to adding RBCs, they were inverted several times to ensure cells were fully resuspended and 50 µl of diluted RBCs were added to all wells of the plate and incubated for 60 minutes at room temperature to allow RBCs to precipitate. All material in contact with influenza virus stocks was decontaminated with freshly prepared 10% bleach. This includes all vials, tubes, reservoirs, and pipet tips. To determine antibody titers, plates were scored for the presence of hemagglutination using the CypherOne HAI plate reader [InDevR, software version 4.0.0.19 (19)]. The CypherOne software was used to build a plate template to document the location of assay controls position of test samples and the orientation of the dilution series within the plates. A plate list was used to document the specific location of samples and details of the dilution series and to standardize the data analysis parameters used to make the titer determinations including the instrument calibration factor and transition point applied across all plates within an assay. The antibody titer corresponded to the inverse of the serum dilution of the last well that contained an RBC precipitate. Results were exported as both CSV files and annotated images. These files were saved in a secure network drive for data processing and analysis. Geometric mean titers (GMTs) were determined for each set of sample replicates either within an individual assay plate or across multiple plates within an assay (depending on the experimental design). Although the individual replicate titers values can only be the inverse of a value in the dilution series, GMT values other than the inverse of a dilution can occur due to the allowable two-fold variation between duplicates. This occurs when one replicate has a titer value one dilution higher or lower than the other replicate.

Qualification parameters, study designs and acceptance criteria

An assay qualification plan including recommended acceptance criteria was prepared and approved before the conduction of qualification experiments.

System suitability criteria

The system suitability criteria were established based on the performance of the positive control, negative control, red blood cell control and back titration controls. Red blood cell controls were included on each assay plate. One plate within the assay contained the positive control, negative control and back titration controls. For acceptance, the positive control titer must fall within 2-fold of the expected titer value, and the negative control must have a titer<20. The red blood cell controls must all be fully precipitated. The back titration control titer must be within 2-fold of the expected titer value. If any of these criteria were not met, the assay was to be considered as failed and a repeat was performed.

Matrix effect

Due to the lack of matched serum and plasma pairs, matrix effect was evaluated by spiking in the positive control antibody Ab2210 IgG1 into RDE II treated normal human serum and negative human plasma samples diluted 1:20 in PBS, both with a HAI titer against A/Texas/71/2017 ≤ 20. Ab2210 IgG1 was diluted 2-fold beginning at a concentration of 400 µg/ml to match the dilution series performed in PBS for assay linearity testing. Each assay plate was tested in duplicate by two scientists for a sample size of 4 replicates, two per matrix type, within each of the two assays. Four replicates for each of the eight titers in the dilution series, from two assays, resulted in 64 replicate data points, 32 GMTs, 16 for each sample type, which were used in correlation analysis. Matrix effect was determined by comparing the titer values obtained with Ab2210 IgG1 spiked into negative serum versus antibody titers obtained with Ab2210 IgG1 spiked into negative plasma and demonstrating correlation with expected correlation coefficients ≥ 0.9. The percent linearity for each antibody dilution in plasma or serum was also be calculated to aid in the determination of assay linearity. Acceptable dilutional percent linearity was defined as dilution corrected antibody titers that varied no more than 50% to 200% between doubling dilutions. Values had to be 50% to 200% to allow for 2-fold variability in titer values.

Precision and accuracy

The precision of the assay was determined by performing dilutions of the pooled plasma IHRS at 1:20, 1:80, 1:320 and 1:640 in PBS to create samples with high, medium, low, and near LLOQ response levels. The exact dilution scheme was determined from results of the linearity testing with the very low response dilution corresponding to the last dilution to retain acceptable percent linearity. The two-fold serial dilutions were starting at a 1:20 dilution and continued to a 1:5120 dilution. The corresponding titer of each of these antibody dilutions was determined. To evaluate intra-assay repeatability these dilution series were performed in duplicate on a plate with a single scientist testing 5 assay plates on day one for a total of 10 replicates. To evaluate intermediate, inter-assay precision, the above assay was performed with a second scientist testing two additional plates, increasing the total replicate count to 14. Intermediate precision continued to be assessed by repeating the assay on a second day with two scientists testing two plates each containing duplicate dilution series. This resulted in additional 8 replicates to the existing 14 replicates for a final total of 22 replicate titer values, which yielded 11 GMTs, per dilution. Mixed models’ analysis was used to calculate the %GCV for intermediate precision and repeatability, by dilution. The recommended acceptance criterion for the evaluation of repeatability and intermediate precision (% CV) for high, medium, and low response levels was ≤ 20%, and for the near LLOQ level was ≤ 25%. Relative accuracy (mean bias) was calculated for each of the four levels of testing. It was expected that the relative error (%RE) for high, medium, and low response levels would be ≤ 20%. The expected relative error (%RE) for the near LLOQ value was expected to be ≤ 25%. For accuracy, the acceptable level of variability in the assay was 2-fold variation in titer values, so values of 50% to 200% to allow for 2-fold variability in titer values were also acceptable.

Assay linearity

Assay linearity was determined by performing a two-fold dilution of the pooled plasma IHRS for 8 serial dilutions from 1:10 - 1:1280 and determining the corresponding GMT value. Two scientists contributed to the linearity analysis and generated data for two replicate curves in each assay. Each assay plate was tested in duplicate by two scientists for 4 replicates. GMT was calculated by dilution, within assay replicates. Two replicates over eight dilutions yielded 16 GMTs. Linearity was first evaluated by visually assessing the titer versus antibody dilution in the CypherOne instrument graphics. Linearity was also evaluated through regression analysis and plots of titer versus sample dilution. Linearity results were described by correlation coefficient (R), slope, 95% confidence interval of the slope of the least squares regression line, and the coefficient of determination (R²). The expected coefficient of determination, R² was ≥ 0.9. The percent linearity for each sample dilution was also calculated. Acceptable percent linearity was defined as dilution corrected antibody titers that varied no more than 50% to 200% between doubling dilutions. Values had to be between 50% to 200% to allow for 2-fold variability in titer values. Assay linearity was also evaluated through regression analysis, as described above, of plots of titer versus sample dilution obtained during matrix effect evaluation.

Range, limits of detection and quantitation

Range, LOD and LOQ were determined by evaluating the sample dilutions with acceptable assay linearity, as well as precision and accuracy, from data generated in the assay linearity experiments. Antibody dilutions that lacked acceptable percent linearity, precision and accuracy were used to inform the range of the assay and the limits of detection and quantitation. The limit of detection was pre-defined as<10 based on the lowest titer tested in the linearity experiments.

Robustness

The robustness of the assay was demonstrated by evaluating deviations in incubation times and temperatures as well as the impact of changing lots of red blood cells and virus stock. Robustness testing was conducted by performing dilutions of the pooled plasma IHRS at 1:20, 1:80, 1:320 and 1:640 in PBS to create samples with high, medium, low, and very low (near LLOQ) response levels. The exact dilution scheme was determined from results of the linearity testing with the very low response dilution corresponding to the last dilution to retain acceptable percent linearity. These dilutions were performed in duplicate on 6 plates. To assess the impact of variations in erythrocyte incubation length on assay robustness, each of three plates containing the pooled plasma IHRS dilution series, as described above, were incubated for either 45 minutes, 1 hour or 1 hour and 15 minutes during the red blood cell incubation. These three incubations were performed at room temperature (22°C +/- 2°C). To assess the impact of temperature fluctuations during the assay, a fourth assay plate was tested using red blood cells at 4°C that were not equilibrated to room temperature prior to use. This red blood cell incubation was performed for the standard 1-hour timeframe. To assess the impact of changing reagent lots, a fifth and sixth assay plate was tested using a new lot of red blood cells and a new lot of A/Texas/71/2017, respectively. These plates were tested using standard incubation times and temperatures. The conditions were tested independently on the same day but with use of shared prepared reagents when available and appropriate. The corresponding titer of each of these antibody dilutions at each of these conditions was determined and precision and accuracy analysis were performed to calculate the % GCV and % relative error. To be considered robust, each assay condition tested was expected to retain precision and accuracy when compared to the standard assay conditions. The expected values for acceptable precision (% GCV), and accuracy (% RE), for high, medium, and low response levels was ≤ 20%, and for very low/near – LLOQ level, ≤ 25%.

Specificity

The specificity of the assay was evaluated by testing the CDC pooled convalescent serum specific to the A/Texas/71/2017 influenza strain and H3 reference goat antiserum, along with a panel of normal serum from different species and ferret antiserum to a heterologous strain. This experiment was performed on two separate occasions using first a 1:20 starting dilution and then a 1:10 starting dilution of serum samples. Each assay plate was tested in duplicate by two scientists for a total of 4 replicates. Acceptable assay specificity was determined by the ability of the assay to correctly identify three homologous strains and three heterologous strains. A titer of ≥1:40 was considered positive.

Sensitivity and seroprevalence

Sensitivity is a measure of the ability of the assay to detect titers near the lower limits of detection and quantitation. The seroprevalence of an antigen is the frequency that an antibody response to that antigen is detected within a population. Thirty samples were tested by each scientist, and percentage of samples below the LOD, were assessed. There were no pre-specified acceptance criteria for these parameters.

Extended qualification

To eliminate the need to fully re-qualify the HAI assay for each new strain of virus in evaluation, the original qualification using the A/Texas/71/2017 strain was extended to qualify the H3N2 influenza strain A/Singapore/INFIMH-16-0019/2016. An extended partial qualification is performed when an assay has previously been fully qualified for a similar antigen. This extended partial qualification evaluates assay parameters that have the potential to be impacted when a new antigen is used in the assay and will rely on the results obtained in the original assay qualification for parameters that are not expected to be impacted by the addition of a new antigen. The partial qualification utilized the same critical assay reagents and controls as those used in the original qualification for the HAI assay. The extended qualification served a dual purpose. Firstly, and most importantly, it helped to evaluate if the assay design and analysis methods can be effortlessly transferred in testing and qualification of another virus strain. Secondly, the extended qualification was used as an opportunity to improve upon any potential gaps in study design that were identified during the initial qualification analysis and to develop a custom data pipeline for HAI data processing. The parameters tested for the extended qualification include system suitability criteria, precision, accuracy, linearity, range, LOD, LOQ, and specificity. During the original qualification analysis, eight two-fold dilutions of the IHRS beginning at a 1:10 dilution were used to generate a dataset for linearity, which was in turn used for range, LOD and LOQ determination. Precision and accuracy were confirmed on this linearity dataset again to determine range, apart from the original dataset generated for precision. For the extended qualification, rather than generating datasets by parameter being tested, when the same sample (titrated IHRS) was used for data generation, linearity, precision, accuracy, range, LOD and LLOQ, were all determined using one dataset. Eight, two-fold dilutions of the IHRS beginning at a 1:10 dilution, were generated, and the corresponding GMTs were calculated. Each assay tested duplicate sample dilutions on duplicate plates resulting in 4 individual replicates and 2 GMT values, from each assay. The assay was performed by two scientists, and each scientist generated data from three assays, to provide 12 GMTs and 24 individual replicates in total. One set of curves failed quality control criteria due to experimental error, and was excluded from analysis, resulting in 11 GMTs, at each of the eight dilutions, and a total of 88 observations for final analysis (Supplementary Table 5). This improvement helped generate a more balanced dataset, with the same number of replicates at all eight dilutions. Also, this increased the number of observations used for statistical analysis and helped avoid repeated % GCV and % RE analysis on a separate dataset when determining range. Thus, the additional datapoints enabled the more efficient analysis of linearity and range and in turn added more variance to the models. Furthermore, we also developed a standardized data pipeline and custom HAI Module hosted on LabKey (20) infrastructure using the test files generated during qualification. The data processed through this portal were used during extended qualification to ensure data integrity and automation of processing and tracking of controls.

Quality control and data processing

For qualification of the H3N2 influenza strain A/Texas/71/2017, raw data were exported from the CypherOne (19) software as csv files using the standardized template. A standardized data pipeline was developed for HAI assay. Raw data generated by the CypherOne are passed through a series of processing steps within this pipeline. These processing steps are highly structured to ensure adherence to assay protocols and expectations of the data quality and integrity. A custom HAI Module was developed in house for compiling all data pipeline steps and executing them in an internal database, termed the Portal database [LabKey Server, software version 21.3 (20)]. R programming language was used for scripting and incorporating processing in infrastructure provided by LabKey (20). The HAI Module automated data processing steps and aggregated data in a standard format in one database, to document any variations in the data generation that resulted from multiple scientists performing the assay and minimizing potential human error. Data was stored on the Portal to prevent any manipulations or modifications to the original assay data, regardless of the access level, and logged history of any action performed. A system of error and warning messages was designed as part of this data pipeline to communicate any irregularities with the data that should be addressed by the operator prior to data advancing in the pipeline. Metadata was provided by the operator during initial upload steps to the Portal in a highly restricted and structured manner and was parsed and assigned to the raw data. This allowed data to be stored in standardized format without any data points or associated metadata missing. As part of the processing, summary data was generated: geometric mean and %CV of each replicate titer value was calculated to show variability across replicates.

A Quality Control (QC) process was incorporated in the data pipeline and scripted to execute automatically during data upload to the Portal. Quality of the data was determined using the following criteria: RBCs were expected to be fully precipitated (well value measurements above 1000), serum control had no detected non-specific agglutination (well value measurements above 1000), back titration titer value was within two-fold of documented virus titer value, positive control titer value was within two-fold of documented positive control titer value, negative control wells were fully agglutinated (well value measurements below 1000). These QC flags were aggregated into reports that facilitated streamlined data review. Additional QC metrics were applied as part of the data quality and integrity check, which allowed data review on a summary level and tracked historical performance. These additional QC metrics included checks that all titer values were of base 10, replicate titer values were within acceptable range of variability (two-fold for duplicates, four-fold for triplicates, etc.) and back titration titer value is of expected format. Historical performance for positive controls, negative controls, and back titration were tracked and viewed as non-editable graphs, which allowed performance review of the controls on specific study or virus levels.

All raw data, processed data, and quality control data were available for view and access after successful upload onto the Portal (Figure 1). All reports, views, and graphs that are part of this HAI Module could be generated any time after data becomes available, which ensures flexibility to access data and its supplemental materials from one database where all information is linked together. All data pipeline steps are stored in the background of the server which requires a specific access level and have version control implemented, which makes this data pipeline secure and highly structured, preventing any unauthorized changes or updates to any of its steps or components.

FIGURE 1

Figure 1 HAI assay design data pipeline. This figure shows the data flow from raw data generated by CypherOne Software to processed data used for analysis and data sharing. Light grey arrows indicate decision making steps: generated reports and visualization assists with determining the quality of the data and analysis readiness. Dark grey arrow indicates main processing steps performed during data upload. Black arrows indicate data pipeline steps which are scripted and secured.

Data processing, analysis and plot of data was generated using Statistical Analysis Software (SAS) and R statistical software. R version 4.2.2 (2022) The R Foundation for Statistical Computing, Platform: x86_64-w64-mingw32/x64 (64-bit), ggplot package, was used for generating certain plots. Statistical analysis was performed using SAS (r) Proprietary Software 9.4 (TS1M7; Copyright (c) 2016 by SAS Institute Inc., Cary, NC, USA), licensed to DUKE UNIVERSITY - T&R - SFA - NCICU GRANT, Site 70082794. Development of the standardized data pipeline was initiated with qualification experiments for H3N2 influenza strain A/Texas/71/2017 and fully implemented for extended qualification of the H3N2 influenza strain A/Singapore/INFIMH-16-0019/2016.

Analysis

Titers of<20 or<10 dilution were converted to half of the lowest dilution tested in assay, in this case 10 or 5, respectively, to enable statistical analysis. GMTs were determined for each set of sample replicates within assay or assay plate, as applicable, and log 10 transformed for statistical analysis. These GMTs were used to determine the %GCV and relative error. The formulas used are included below:

S t a n d a r d D e v i a t i o n I n t e r m e d i a t e P r e c i s i o n = \sqrt{I n t r a a s s a y V a r i a n c e + I n t e r a s s a y V a r i a n c e}

% G e o m e t r i c C o e f f i c i e n t o f V a r i a t i o n = \sqrt{e^{({[S D_{l o g 10} \times \ln (10)]}^{2})} - 1} \times 100

\begin{array}{l} H e r e S D_{l o g 10} & = standard deviation (SD) of log10 transformed data. \end{array}

% R e l a t i v e E r r o r = (\frac{m e a n o b s e r v e d v a l u e - e x p e c t e d v a l u e}{e x p e c t e d v a l u e}) \times 100

% R e l a t i v e E r r o r = (10^{(m e a n l o g 10 o b s e r v e d v a l u e - l o g 10 e x p e c t e d v a l u e)} - 1) \times 100

% L i n e a r i t y = \frac{(T i t e r \times D i l u t i o n f a c t o r)}{(P r e v i o u s T i t e r \times P r e v i o u s D i l u t i o n f a c t o r)} \times 100

SAS GEOMEAN function was used for calculation of GMTs. For linear regression analysis, for linearity and matrix effect testing, SAS PROC GLM was used, and the results were confirmed using PROC REG. Model statements were log10_Geomean_titer = log10_dilution; log10_Geomean_titer = log10_expected_titer, for the linear regression analysis, as applicable. PROC CORR was used for correlation analysis for matrix effect data. Wherever needed, the 95% confidence intervals were calculated using options SOLUTION and CLPARM, in model statement of PROC GLM. An alpha of 0.05 was used to generate the confidence intervals. Mixed model analysis was used to assess precision on log10 GMTs. The sum of within and between assay variance was used to calculate the %GCV, for precision. PROC MIXED with a random effects model was used to calculate %GCV IP [random effect used in model was assay id]; and %CV intra-operator repeatability [random effect used in model was plate_number], to model the log 10 GMT, as applicable by study design. When there is no variance in this dataset, for cases when all GMTs were the same number, the program would give an error ‘An infinite likelihood is assumed in iteration 0 because of a nonpositive residual variance estimate.’ %CV was set to 0% for such cases since all titer values were the same.

Results

The HAI assay was qualified using predefined acceptance criteria for each parameter evaluated as a set of metrics that determined the success of the qualification process (Figure 2). Parameters tested for the A/Texas/71/2017 HAI assay qualification included matrix effect, precision, accuracy, linearity, range, limits of detection and quantitation, robustness, specificity, sensitivity, and seroprevalence.

FIGURE 2

Figure 2 HAI assay qualification for H3N2 influenza strain A/Texas/71/2017, overview of qualification plan and parameters tested during qualification, recommended acceptance criteria, and results obtained.