- 1Oxford Protein Informatics Group, Department of Statistics, University of Oxford, Oxford, United Kingdom
- 2Department of Biochemistry, University of Oxford, Oxford, United Kingdom
- 3Kavli Institute for Nanoscience Discovery, University of Oxford, Oxford, United Kingdom
Antibodies, through their ability to target virtually any epitope, play a key role in driving the adaptive immune response in jawed vertebrates. The binding domains of standard antibodies are their variable light (VL) and heavy (VH) domains, both of which present analogous complementarity-determining region (CDR) loops. It has long been known that the VH CDRs contribute more heavily to the antigen-binding surface (paratope), with the CDR-H3 loop providing a major modality for the generation of diverse paratopes. Here, we provide evidence for an additional role of the VL domain as a modulator of CDR-H3 structure, using a diverse set of antibody crystal structures and a large set of molecular dynamics simulations. We show that specific attributes of the VL domain such as subtypes, CDR canonical forms and genes can influence the structural diversity of the CDR-H3 loop, and provide a physical model for how this effect occurs through inter-loop contacts and packing of CDRs against each other. Our results indicate that the rigid minor loops fine-tune the structure of CDR-H3, thereby contributing to the generation of surfaces complementary to the vast number of possible epitope topologies, and provide insights into the interdependent nature of CDR conformations, an understanding of which is important for the rational antibody design process.
Introduction
A key part of the adaptive immune response of jawed vertebrates is the use of antibodies to recognize and bind to extracellular antigens, thereby driving part of the downstream specific immune response. Standard antibodies are composed of two light and two heavy chains, forming a heterotetrameric complex, with two identical FV regions, which contain the antigen-binding site. This region is made up of the variable heavy (VH) and light (VL) domains, each containing three loops referred to as complementarity-determining regions (CDR1-3) that make up the majority of the antigen-binding residues called the paratope (Akbar et al. (1)). The remainder of the variable domains of both VH and VL is referred to as the framework region. Through two major processes, namely V(D)J recombination and affinity maturation, the structure of the paratope is fine-tuned to form a perfect complementary surface to virtually any non-self epitope (Schroeder and Cavacini (2)).
While the VH and VL domains are largely structurally symmetrical in their framework regions, the degree of symmetry is much lower in the CDR loops. Analogous CDRs of the VL domain present distinct conformations when compared to those of the VH (Chothia and Lesk (3); North et al. (4); Nowak et al. (5)). Most significant is the conformational space occupied by CDR-H3. While all other loops can be classed into distinct canonical forms (Chothia and Lesk (3); North et al. (4); Nowak et al. (5)), the CDR-H3 loop is highly variable and does not lend itself to structural clustering (Regep et al. (6)). This is due to the much larger sequence space explored by this loop as a result of V(D)J recombination, as the CDR-H3 loop has sequence contributions from the V, D, and J genes, with additional diversity stemming from nontemplated nucleotide addition at each junction during the V(D)J recombination event (Alt and Baltimore (7)). The analogous CDR-L3 loop shows much lower sequence diversity due to the lack of the D segment in the light chain. Here, we use the term structural variability to refer to the diversity of CDR loop conformations found in solved crystal structures and the term flexibility to refer to the ability and likelihood of an individual CDR loop to adopt multiple different conformations.
The dissimilarities between VH and VL CDRs is also reflected in their contributions to the paratope, with the CDR-H3 loop generally dominating the paratope, and VL CDRs making smaller contributions than their VH counterparts, thereby leading to imbalanced roles of the two variable domains in antigen binding (Akbar et al. (1)), even though both CDR-H3 and CDR-L3 are poised for interaction with the antigen by virtue of occupying the center of the CDR loops (Figure 1G). Indeed, there are many cases where the VL domain makes no contacts with the antigen at all (Akbar et al. (1); Phillips et al. (8)). Further, the presence of single-domain antibodies in chondrichthyes (IgNARs) (Khalid et al. (9)) and camelids (VHHs) (Mitchell and Colwell (10)) shows that the presence of the VL domain is not necessary for antigen recognition. These observations have raised questions on the precise role of the VL domain and how the presence and sequence of this region influences the VH domain.
Figure 1 Radial distribution functions of CDR loops in the crystal data set. (A–F) We generated histograms for all pairwise CDR loop heavy atom distances using a bin width of 0.1Å, where (A–F) correspond to CDR-H1, -H2, -H3, -L1, -L2, and -L3, respectively. Each bin was then normalized by dividing by the product of the volume of a spherical shell centered at 0 and located on the edges of the bin and the number of possible pairs of heavy atoms. Shaded areas are 95% confidence intervals. (G) shows an example antibody structure, with CDR loops labelled according to the IMGT definition.
Due to the spatial proximity of the CDR-H3 loop and the VL CDRs, it has long been hypothesized that one possible role of the VL domain, in addition to antigen binding, might be modulating the CDR-H3 structure. This has been confirmed in specific cases, showing that the crystal structure (Teplyakov et al. (11)) and conformational ensemble (Fernandez-Quintero et al. (12)) of the CDR-H3 loop are indeed influenced by different pairings of VH and VL domains.
In this study, we show that specific attributes of the VL domain influence both the structure and structural variability of CDR-H3. We first show that CDR loops are tightly packed against each other in a distance-dependent manner in antibody crystal structures and that VL loops are more heavily compacted than their VH counterparts. Further, we show that the CDR-H3 loop is unique in its packing, making interactions with more loops than the analogous CDR-L3 loop.
We next examine these observations in a more detailed way using a set of molecular dynamics simulations of eight antibody structures. We find that the observed packing patterns are recapitulated in simulations and that the VL loops are all more rigid than their VH counterparts, noting that this effect would be expected given the greater compaction of the VL loops. We also show that the CDR-H3 loop explores a significantly greater conformational space than other CDRs and makes contacts with all of these, suggesting that the remaining CDR loops (referred to as the minor loops) act as a “cage” around the CDR-H3, constraining the number of conformations that it is able to adopt.
Using a length-independent structural similarity calculation, we show that different VL genes, subtypes, and CDR canonical forms have a significant effect on the structural variability of CDR-H3, with shorter CDR-L1 and longer CDR-L3 loops increasing conformational diversity of the CDR-H3 loop and rationalize this observation in the context of our constrained CDR-H3 model. We provide a case study that shows the expected effects of VL domain attributes on CDR-H3 conformation, and confirm using a structural data set that pairing the same VH domain with different VL sequences leads to a change in CDR-H3 conformation, with this change being correlated with the diversity of the VL sequences.
Methods
Crystal data set
We mined SAbDab (Dunbar et al. (13)) on 09.08.2022 and filtered based on a 95% sequence identity threshold across the FV domain. We further filtered this data set to exclude structures that did not have paired heavy and light chains, scFvs, structures not solved using X-ray crystallography and those with a resolution greater than 3Å. Lastly, we filtered the data set to remove structures with unresolved loops. Only the first FV in the asymmetric unit was included in the final set. This procedure yielded 2357 non-redundant FV structures.
Region definitions
ANARCI (Dunbar and Deane (14)) was used to number sequences using the IMGT numbering scheme (Lefranc et al. (15)). We chose this numbering scheme to enable comparison across the VH and VL domains (CDR1: IMGT residues 27-38, CDR2: IMGT residues 56-65, CDR3: IMGT residues 105-117, Framework: IMGT residues 1-128 and not within CDR definitions).
Crystal radial distribution functions
Radial distribution functions for each loop pair were calculated by using a modified version of the mdtraj (McGibbon et al. (16)) implementation that does not normalize by unit cell volume. Namely, we generated histograms of pairwise heavy atom distances (up to 10Å) using a bin width of 0.1Å. We then normalized the densities by dividing the counts at each bin by the volume of the spherical shell located at that bin. This results in comparable trends but not absolute frequencies.
Molecular dynamics simulations
The simulated systems were the first Fab domains in the asymmetric units of PDB entries 3gjf, 4dkf, 1e6j, 3l5x, 2cmr, 4zs7, 3l5w and 5bk1. We chose this set of antibodies as they provided a representative sample in terms of inter-loop distances (Figures 2A–J) and a range of CDR loop lengths (CDR-H1: 8-10 residues, CDR-H2: 7-8 residues, CDR-H3: 11-19 residues, CDR-L1: 5-9 residues, CDR-L2: 3 residues, CDR-L3: 8-11 residues), sequence diversity, and span a diverse set of IGHV and IG(K/L)V genes.
Figure 2 Conformations explored by CDR-H3. (A-J) show minimum heavy atom distances between pairs of loops. Green histograms correspond the crystal structures, blue histograms refer to simulation data, and black histograms refer to crystal structures of systems used to perform simulations. (K, L) show the conformations of CDR-H3 loops in cluster representatives extracted from simulation data clustered on CDR RMSD after alignment on the framework regions. CDR-H3 and CDR-L3 are shown in cyan and red, respectively. CDR-H1 and CDR-H2 are shown in dark and light blue, respectively. CDR-L1 and CDR-L2 are shown in magenta and pink, respectively. (K) shows results for 1e6j, whereas (L) shows results for 3l5x.
Simulation protocol
All systems were prepared and simulations performed using OpenMM v7.7 (Eastman et al. (17)). We capped C-termini using N-methyl groups. Next, we protonated the models at a pH of 7.5, soaked them in truncated octahedral water boxes with a padding distance of 1 nm, and added sodium or chloride counterions to neutralize charges and then NaCl to an ionic strength of 150 mM. We parameterized the systems using the Amber14-SB force field (Maier et al. (18)) and modelled water molecules using the TIP3P-FB model (Neria et al. (19)). Non-bonded interactions were calculated using the particle mesh Ewald method (Darden et al. (20)) using a cut-off of distance of 0.9 nm, with an error tolerance of 0.0005. Water molecules and heavy atom-hydrogen bonds were rigidified using the SETTLE (Miyamoto and Kollman (21)) and SHAKE (Ryckaert et al. (22)) algorithms, respectively. We used hydrogen mass repartitioning (Hopkins et al. (23)) to allow for 4 fs time steps. Simulations were run using the mixed-precision CUDA platform in OpenMM using the Middle Langevin Integrator with a friction coefficient of 1 ps-1 and the Monte-Carlo Barostat set to 1 atm. We equilibrated systems using a multi-step protocol: (i) energy minimization over 10,000 steps, (ii) heating of the NVT ensembles from 100 K to 300 K over 200 ps, (iii) 200 ps simulation of the NPT ensembles at 300 K, (iii) cooling of the NVT ensembles from 300 K to 100 K over 200 ps, (iv) energy minimization over 10,000 steps, (v) heating of the NVT ensembles from 100 K to 300 K over 200 ps, and (vi) 5 ns simulation of the NPT ensembles at 300 K.
We then initialized well-tempered metadynamics (Barducci et al. (24)) using a bias height of 10 kJ/mol with width 0.3 rad and a bias factor of 10, depositing biases every 500 steps. We biased a linear combination of sine values of CDR-H3 and CDR-L3 ψ angles in a manner similar to (Fernandez-Quintero et al. (25)), resulting in two collective variables. We performed triplicate simulations of 1µs duration and assessed convergence using block analysis of free energy estimates. To recover unbiased ensembles, we calculated weights (w) for each configuration (s) using the time-independent reweighting scheme (Branduardi et al. (26)):
where is the bias at configuration and is the product of the Boltzmann constant and temperature.
Analysis
Analysis of MD trajectories was performed using mdtraj v1.9.6 (McGibbon et al. (16)) and scikit-learn v1.0.2 (Pedregosa et al. (27)). Minimum heavy atom distances of crystal data and MD trajectories were calculated using mdtraj. MD frames were clustered using average linkage hierarchical clustering (cutoff=1.25Å) based on CDR RMSD after alignment on framework regions excluding 2 residues on either N-terminus. We chose to exclude these residues as they showed considerably higher flexibility than the rest of the framework regions. To calculate RMSD and RMSF values (defined as the time average of RMSD), we used the conformation at the beginning of the simulation as the reference.
Dynamic time warping
To compare CDR-H3 loops of differing lengths, we used dynamic time warping (DTW) (Nowak et al. (5)) after superposing CDR-H3 loops on anchoring residues, defined as IMGT positions 100-104 and 118-122. DTW is analogous to the Needleman-Wunsch algorithm (Needleman and Wunsch (28)), using dynamic programming to find the optimal path in a backbone atom RMSD matrix. When two loops of the same length are compared, the algorithm reduces down to a conventional RMSD calculation. When the two loops are of differing length, the optimally aligning residues are included in the comparison.
We used DTW on our crystal data set to generate a pairwise comparison matrix of all CDR-H3 loops. We then stratified the data set based on antibody subtype, FV genes, and minor loop canonical forms (calculated using SCALOP (Wong et al. (29))) and compared the average distance in the stratified data to the overall average distance. To test for significance, we used a bootstrapping procedure to randomly sample as many points from the whole data set as there were in each stratum, repeated the comparison 10,000 times and calculated a p-value based on how many times an effect of greater size was observed.
VH - VL pairing analysis
To search for antibodies with identical VH but dissimilar VL sequences, we extracted all VH sequences from SAbDab (Dunbar et al. (13)) on 10.12.2022 where the structure was solved using X-ray crystallography and retained only entries where FV backbones were resolved completely. We clustered these antibodies on VH sequences using global alignment at a clustering threshold of 1.0 using cd-hit (Li and Godzik (30)). We then grouped resultant non-singleton antibodies together and clustered them again, this time using a threshold of 0.4 and clustering on the VL sequences. We extracted pairs from within each subcluster if the VL sequence identity was below 99%. This yielded a set of 86 antibody pairs. Every pair obtained contained at least one different residue in the VL CDR loops without our method specifically selecting for pairs with CDR loop sequence identity. For each pair, we aligned the VH structures on the framework regions using Biopython v1.78 (Hamelryck and Manderick (31)) and calculated backbone RMSD values. Lastly, we calculated Pearson correlation coefficients using scipy v1.8.1 (Virtanen et al. (32)).
Data visualization
We generated plots using seaborn v0.11.2 (Waskom (33))and matplotlib v3.2.2 (Hunter (34)) and visualized protein structures using UCSF ChimeraX v1.5 (Pettersen et al. (35)).
Results
Antibody VH and VL CDR loops are characterized by non-equivalent inter-loop contacts in crystal structures
To analyze the structural relationships between antibody CDR loops, we began by examining a data set of antibody crystal structures. As described in the Methods section, we mined SAbDab for a set of antibodies. This yielded a set of 2357 non-redundant structures where all CDR loops were resolved. We then examined the relative packing of loops against each other by calculating radial distribution functions (RDF). For each loop, we selected heavy atoms and tallied the number of heavy atoms belonging to other loops at a given distance interval and normalized this value by dividing by the number of possible pairs. Our results are generally in line with what would be expected – loops that are close to each other in the three-dimensional antibody structure tend to make more contacts with each other and the trends are largely mirrored for the VH and VL loops, owing to the symmetry that marks antibody structures (Figure 1G).
We found that loops sandwiched between other loops make contacts on either side. The CDR-1 loops make contacts with the CDR-2 and CDR-3 loops of the same chain, and the CDR-2 loops, owing to their positions on the edges of the VH-VL complex, only make contacts with their neighboring CDR-1 loops (Figure 1). We did; however, find two major differences between the VH and VL loops.
First, that the CDR-L1/2 loops show tighter packing than the corresponding CDR-H1/2 loops. The interactions between CDR-L1 and CDR-L2 show a larger peak compared to those between CDR-H1 and CDR-H2 (Figures 1A, D). Based on the number of residues, the CDR-L2 loop is considerably shorter ( residues, ) than the CDR-H2 ( residues, ) loop while the CDR-L1 ( residues, ) and CDR-H1 ( residues, ) loops are more comparable in their lengths. This suggests tighter packing of the CDR-L2 loop against the CDR-L1 loop than is observed with CDR-H2 and CDR-H1 based on the RDF trace, despite the shorter average length of CDR-L2.
Second, that the pattern of contacts made by the CDR-H3 loops differs significantly from that of the CDR-L3 loop. The CDR-H3 loop makes significant contacts with CDR-H1, CDR-L3, and CDR-L2 loops, with small contributions from the CDR-H1 loop (Figure 1C), whereas the CDR-L3 loop appears to make contacts mainly with the CDR-L1 and CDR-H3 loops, with a small number of contacts made with CDR-L2 (Figure 1F). While CDR-H3 packs less densely against CDR-H1 than CDR-L3 with CDR-L1, it packs more densely against all other loops than CDR-L3 does against its analogous loops. This is in line with the larger structural variability of CDR-H3. Thought largely to be due to their length and sequence diversity as a result of V(D)J-recombination (Regep et al. (6)), CDR-H3 loops are known to adopt multiple different conformations and can pack against minor loops in different ways. Conversely, owing to the limited conformational space afforded by the canonical forms and limited sequence space, CDR-L3 loops are more likely to pack against the same loops, in the same way (Wong et al. (36)).
We also stratified our data set on light chain subtype. We did not observe any significant differences in RDF densities in / antibodies (Supplementary Figure 2), showing that the density of loop packing is the same in both VH and VL loops, irrespective of the light chain subtype.
Taken together, our results suggest that the VL loops appear to pack more tightly against each other than the VH loops and that the CDR-H3 loop is differentiated from its light chain counterpart by a much broader set of configurations with which it packs against the minor loops. We also show that the CDR loops of the VL domain are more densely packed than those in the VH domain, thereby likely generating a more rigid structure on one side of the FV.
Conformational rearrangements in CDR-H3 do not require major conformational changes in minor loops
In order to test the observations made on crystal structures in a metadynamics context, we created a set of MD simulation data. Briefly, we performed meta dynamics simulations using eight antibody structures (simulation protocol details are provided in the Methods section). Each simulation was allowed to proceed for 1µs and simulations were performed in triplicates. To recover equilibrium ensembles, we re-weighted our trajectories. The simulations were biased on both the CDR-H3 and CDR-L3 loops and therefore extensive sampling of conformational space is expected for these loops only.
In the simulations the minor loops (all loops apart from CDR-H3) generally showed little flexibility. In order to quantify this, we aligned backbone atoms of each loop on three flanking residues on each side and calculated the RMSD and RMSF values relative to the starting structure. The CDR-H3 loop only appears in the starting conformation in a fraction of MD frames, with deviations ranging up to 3Å (Figure 3A). The peak that is within the 1Å cutoff used to define antibody loop conformational differences (Wong et al. (36)) is shifted further to the right for CDR-H3 compared to the major peaks found for other CDR loops (especially when compared to CDR-L3, as this loop was also biased in the simulations) (Figure 3B), suggesting that even in its native conformation, the CDR-H3 loop is more flexible and poised for conformational shifts. In line with previous studies (Fernandez-Quintero et al. (25); Fernandez-Quintero et al. (37)); Fernandez-Quintero et al. (38)), we therefore conclude that the CDR-H3 loop is distinctly flexible. In the context of antigen binding, where CDR-H3 is highly dominant, this might be an adaptation to allow a multitude of loop conformations in order to generate surfaces complementary to the vast range of possible epitope topologies. In our simulated systems, the least flexible CDR-H3 loop is found in non-crystal conformations in roughly 45% of frames whereas the most flexible loop immediately adopts a new conformation without revisiting the native form during the simulation (Figure 3B).
Figure 3 Flexibility of CDR loops in the simulation data set. All loops were individually aligned on their anchor residues, defined as the three residues on either side of the loop. (A) pooled RMSD values for each loop relative to the crystal structure (the histograms were smoothened using a kernel density estimation), (B) the fraction frames where loops were found in their starting conformations for each antibody, defined as being within 1Å of the crystal conformation, and (C) the RMSF of loops for each structure, defined as the time average of the RMSD are reported.
Conversely, the minor loops do not show similar levels of flexibility (Figure 3A). In all cases, the major peak of the distribution is within the 1A cut off and lies around 0.5Å. As is expected based on its length and observed conformational diversity in crystal structures, CDR-L2 is the least flexible loop, with no instances of a conformational shift being observed. The CDR-L3 loop, although biased using the same protocol as CDR-H3, is relatively rigid in its conformation. We hypothesize that this is due to a combination of factors including the length of the loop, the conformational space allowed by CDR-L3 sequences, and the tighter packing of CDR loops in the VL domain. It has been shown that CDR-L3 loops tend to be shorter than CDR-H3 loops on average (Marks and Deane (39); Wong et al. (36)). Further, a number of studies have shown that CDR-L3 loops are both less conformationally diverse and less flexible than CDR-H3 loops (Wong et al. (36); Regep et al. (6); Weitzner et al. (40); Fernandez-Quintero et al. (41)). The former is presumably due to the fact that CDR-L3 loops are generated only by sequence contributions from V and J genes, whereas the D gene also contributes to CDR-H3 sequence, thereby likely increasing the conformational space by virtue of greater sequence space. The presence of a second junction during the CDR-H3 recombination event also leads to greater sequence diversity by virtue of an additional site for nontemplated nucleotide addition (Alt and Baltimore (7)). The reduced flexibility of CDR-L3 compared to CDR-H3, on the other hand, is likely to be driven by a number of factors including loop length, loop sequence, and the presence or lack of stabilizing lateral contacts. In line with this, a much smaller fraction of collected frames of CDR-L3 are found in non-crystal conformations (Figure 3B), though there is a significant range, with the most flexible CDR-L3 loop being in a different conformation in 80% of the trajectories. The CDR-L1 loop shows a much broader distribution in Figure 3B, especially when compared to CDR-H1, suggesting the presence of highly flexible CDR-L1 loops. However, examination of the underlying distribution shows that on average the CDR-L1 loop is found in a conformation much closer to its native pose compared to CDR-H1, as demonstrated by the shift of the major peak towards lower RMSD compared to CDR-H1 (Figure 3A).
To further quantify the observed relative flexibility of the CDR loops, we calculated RMSF values. RMSF values are the time average of RMSD relative to the starting conformation and provide a measure of flexibility. We found that in all cases, the VL CDR loops show decreased flexibility relative to their VH counterparts. However, our biasing scheme was highly aggressive, with biases roughly 10-fold of usual values in metadynamics simulations. This provides a high energetic incentive for conformational space exploration of the biased loops. The lack of significant conformational changes in the minor loops despite this approach suggests that conformational changes in the CDR-H3 loop are not sufficient to induce conformational changes in the minor loops. We hypothesize that the minor loops form a scaffold around the CDR-H3 loop, which then explores feasible conformations available to it based on the available space within this scaffold, as evidenced by the fact that CDR-H3 is the only loop exhibiting RMSF values beyond the 1 Å cutoff for conformational change (Figure 3C).
Our results do not indicate that the minor loops are unable to change conformation - a phenomenon that has been well-documented in the literature (Fernandez-Quintero et al. (25); Fernandez-Quintero et al. (41)). Instead we discuss only the flexibility of the CDR-H3 loop when the minor loops are mainly in their crystal pose conformations. These conformations are thought to be the lowest energy shape for these loops (Fernandez-Quintero et al. (25); Fernandez-Quintero et al. (41)). Our results show that despite significant packing of the CDR-H3 loop against the minor loops, the CDR-H3 loop retains flexibility, exploring non-crystal conformations more than 40% of the time, even in the least flexible antibody in our simulation data set. Further, we show that this movement is possible without significant conformational rearrangement in the minor loops. Taken together, our results point to at least part of the conformational ensemble of CDR-H3 loops being accessible without any movement in the minor loops.
Minor CDR loops create steric boundaries within which CDR-H3 samples conformational space
Based on our observation that CDR-H3 explores the conformational landscape in the simulation data without concurrent movement in the minor loops, we examined whether the observed packing of CDR loops in crystal structures (as shown using the RDFs in Figure 1) changes with CDR-H3 conformation. To this end, we calculated RDFs for CDR loops (Supplementary Figure 1) and inter-loop minimum distances on our simulation data set (Figures 2A–J). The RDFs show that the patterns observed in the crystal data set RDFs are conserved in simulations, with CDR-H3 making a wider range of contacts compared to CDR-L3 (Supplementary Figure 1C, F), and the VL loops showing greater inter-loop packing than their VH counterparts. This is supported by our minimum distance calculations, where in the vast majority of pairs, no significant change in the average minimum distance is observed (Figures 2A–C, E, F, H, J).
The RDFs for CDR-3 loops showed minor differences in interactions with CDR-2 loops (Figures 4A–D). In the simulation data, we did not observe the presence of CDR-L2-CDR-H3 interaction that was found in the crystal data set (Figures 4C, D). Indeed, an examination of the minimum distance distribution for this pair (Figure 2D) shows an average increase of 2.47Å, with the majority of the distribution shifting out of the 0-4.5Å heavy atom distance range used to define contacts (Yuan et al. (42)). This confirms the loss of interactions between CDR-L2 and CDR-H3 in the simulations as is shown in (Figure 2L). This is not always the case, Figure 2D exhibits a minor shoulder within the 4.5A cutoff and Figure 2K confirms that conformations of CDR-H3 that allow interactions with CDR-L2 still exist, especially when the CDR-H3 loop is heavily oriented towards the VL domain. The observation that the CDR-H3 loop generally pulls away from the CDR-L2 loop during exploration of conformational space indicates that the CDR-H3-CDR-L2 interaction might be important in the stabilization of the crystal pose of antibodies.
Figure 4 Changes in loop heavy atom packing patterns in (A, C) simulation and (B, D) crystal structure data sets. For simulation data (A, C), we calculated RDFs using the mdtraj implementation using a bin width of 0.1Å. For crystal structure data (B, D), we used an analogous method without the unit cell correction term. Namely, we generated histograms for all pairwise CDR loop heavy atom distances using a bin width of 0.1Å. Each bin was then normalized by dividing by the product of the volume of a spherical shell centered at 0 and located on the edges of the bin and the number of possible pairs of heavy atoms. p-values are reported based on t-tests conducted at each bin. Shaded areas correspond to 95% confidence intervals.
We also identified a deviation from the crystal data set in the case of the CDR-H2-CDR-L3 interaction. In this case, the opposite effect was observed, with the average minimum distance between these loops decreasing by ∼3Å on average and a concurrent shift in the simulation frame distribution to the minor peak of the crystal data distribution. This is confirmed by the appearance of density in the RDF (Figures 4C, D), showing that in simulations, CDR-L3 and CDR-H2 close the gap between them, thereby providing steric bulk in the VH/VL interface. Based on the fact that the minor loops do not undergo any major conformational changes in our simulations, this could be explained by independent changes in the framework regions of VH and VL domains. We found no correlation between the increase in CDR-H3-CDR-L2 and decrease in CDR-L3-CDR-H2 minimum distances.
The superimposed density maps of representatives from clustered MD trajectories show that the CDR-H3 loop samples conformations to exhaustively fill the space left by the minor loops (Figures 2K, L). This suggests that out of all possible conformations given the CDR-H3 sequence and anchor positions, the loop explores only those that are allowed within the steric mass of the minor loops. This is analogous to a “cage” formed around CDR-H3 by these loops, within which the CDR-H3 loop is able to adopt new conformations. Thus, the conformation of the other loops fine tunes the conformational landscape of CDR-H3, influencing the potential orientations of the paratope residues provided by the CDR-H3 loop. The CDR-L3 loop, being the main VL loop that CDR-H3 packs against (Figures 1C, F, G) would be a major contributor to this, as the CDR-H3 and CDR-L3 loops are oriented such that they diagonally cross each other in FV structures, as exemplified in Figure 1G. Following from the idea of the CDR-H3 loop being “caged” by the minor loops, we propose that different minor loop canonical forms would lead to increases or decreases in CDR-H3 conformational diversity if a canonical form provides less or more steric bulk, thereby making the cage larger or smaller, respectively. Further, the packing between the CDR-H3 loop and minor loops provides another avenue for the modulation of CDR-H3 conformation through the presence or absence of specific interactions, in line with the observation that affinity maturation often targets residues not directly involved in contacts with the antigen, but those that make contact with other loops forming the paratope (Yuan et al. (42)).
Based on our observations that CDR loops are packed against each other and CDR-H3 explores conformational space available to it within the constraints of the minor loops, we propose a model of a “caged” CDR-H3. This cage is constructed of the minor loops, with the precise orientation of these loops forming the boundaries. The CDR-H3 loop is able to undergo conformational shifts that are permitted by the exact shape and dimensions of the cage.
Antibody subtypes, genes, and CDR canonical forms show preference for CDR-H3 conformations with different levels of diversity
Based on our “caged” CDR-H3 model, we predict that minor loops that lead to more steric bulk around the CDR-H3 loop should also lead to a reduction in conformational diversity of CDR-H3 loops. Conversely, we predict that structures with minor loop attributes that widen the physical space available to the CDR-H3 loop should lead to the broadening of conformational diversity. These predictions are in line with our results showing that CDR-H3 explores conformational space within the boundaries allowed by the minor loops in simulations. Since it is expected that the crystal pose will be part of this conformational ensemble, it follows that the steric bulk provided by minor loops should also influence the conformational diversity of CDR-H3 loops. To test these predictions, we examined whether the subtypes, genes, or minor loop canonical forms have an observable effect on CDR-H3 conformational space in crystal structures using dynamic time warping (DTW). DTW allows for the structural comparison of loops of different lengths and reduces to root mean square deviation when two loops have the same number of residues. This has previously been used to effectively compare antibody CDR loops of different lengths (Nowak et al. (5)) and allows us to embed the CDR-H3 conformations in our data set on a manifold in a length-independent manner. We calculated the average DTW CDR-H3 distance for the entire data set, and then stratified by subtype, gene, and canonical form before calculating average DTW CDR-H3 distances for each of these subgroups, which we then compared to baseline CDR-H3 diversity. To test for significance, we used a bootstrapping procedure.
We began by testing for differences based on light chain subtype, and found that κ subtypes lead to a roughly 3.41% reduction in the average DTW distance compared to baseline, whereas λ subtypes lead to a 4.81% increase (Figure 5D), leading to an overall difference in CDR-H3 conformational diversity of 8.22% between κ and λ light chains, suggesting that κ light chains might be more likely to restrict the CDR-H3 loop. In both of these cases, we found the effect to be statistically significant with p < 0.05. However, we found no significant differences for inter-loop packing using RDFs between κ and λ antibodies (Supplementary Figure 2), suggesting that κ antibodies might be more likely to position minor loops in a way that restricts the CDR-H3 and that λ antibodies provide a larger “cage” for the CDR-H3 loop to explore conformations in. In order to examine whether specific genes drive this observation, we further stratified our data by genes. The general trend found for light chain subtypes was recapitulated in the case of κ genes (Figure 5B), but we found that that certain genes have a higher impact on CDR-H3 conformational diversity than others. The effect was most pronounced for IGKV6 and IGKV12, whereas the effect was not statistically significant in the case of IGKV2/3/5/10 (Figure 5B). In the case of IGLV genes, we found no single gene that had a predictable effect on CDRH3 conformational diversity in either direction (Figure 5C). We argue that this might be due to a lack of gene-specific “cage” shapes in these antibodies or due to the more limited number of λ antibodies in our data set. As such, the only conclusion we draw for this subclass is that λ antibodies show a significant increase in CDR-H3 conformational diversity when compared to baseline.
Figure 5 Changes in conformational diversity of CDR-H3 loops compared to baseline after stratification on (A) VH genes, (B, C) VL genes, (D) light chain type, and (E–H) CDR canonical forms based on dynamic time warping distances. (A–D) are based on a set of non-redundant human antibody structures whereas (e-h) are based on a non-redundant set of antibody structures from any species. We aligned CDR-H3 loops on anchoring residues and generated a pairwise distance matrix which was then stratified based on the examined attributes. The difference between the mean diversity of CDR-H3 loops compared to baseline (all CDR-H3 loops in the set) is reported, with * denoting statistical significance based on bootstrapping.
Lastly, we examined the effect of IGHV genes. The majority led to a modest reduction in CDR-H3 diversity, with IGHV7 as an exception (Figure 5A). This result is expected, as the IGHV gene contributes to the CDR-H3 sequence. A less diverse sequence space for the CDR-H3 loop would lead to a less diverse space of possible conformations for the loop due to residue and position-specific preferences of backbone dihedral angles.
We also examined the effect of specific canonical forms on CDR-H3 diversity. For CDR-H1 and CDR-H2, we observed that CDR-H3 diversity is limited by specific canonical forms (with H1-13-C being the single significant outlier) (Figures 5E, F). Since CDR-H1 and CDR-H2 canonical forms are determined by the IGHV gene, which also makes sequence contributions to the CDR-H3, we do not draw conclusions on CDR-H1-2 canonical forms on CDR-H3 conformational diversity. We also did not extend our analysis include to CDR-L2, due to its lack of diverse canonical forms.
In the case of CDR-L1 canonical forms, we observe that longer forms (14-17 residues) tend to lead to a reduction in diversity (Figure 5G), consistent with our previous observations of loop packing. Longer CDR-L1 loops will provide more steric bulk, therefore constraining the CDR-H3 loop. Using our previously outlined framework of the minor loop ‘cage’ around the CDR-H3, this would lead to a smaller space for the CDR-H3 loop. Conversely, shorter CDR-L1 canonical forms generally lead to an increase in conformational diversity compared to baseline. For CDR-L3, we find the opposite effect: longer CDR-L3 canonical forms tend to increase the diversity of CDR-H3 conformations compared to baseline (Figure 5H). We hypothesize that this observation is due to the relative orientation of CDR-H3 and CDR-L3. Usually in antibody crystal structures, CDR-H3 and CDR-L3 cross each other (Figures 3K, L), with CDR-H3 pointing slightly towards the VL domain and CDR-L3 pointing towards the VH domain. Therefore, a shorter CDR-L3 loop would enable a groove to open in the VH/VL interface (Figure 1G). As CDR-H3 makes significant contacts with CDR-L3 (shown in both our crystal data set and in our simulations), this could lead to CDR-H3 packing down towards the rest of the antibody, rather than extending further up, causing the observed decrease in CDR-H3 diversity.
Further, the observed preferences of different canonical forms and CDR-L3 lengths for differing levels of CDR-H3 conformational diversity provides an explanation for the difference observed between λ and κ light chains. It is well-established that human κ antibodies have a preference for shorter CDR-L3 loops compared to their λ counterparts (Rock et al. (43); Mitsunaga and Snyder (44); Townsend et al. (45)). Our results show that longer CDR-L3 loops have a preference for higher CDR-H3 diversity compared to baseline, and we posit that the observed lower CDR-H3 diversity in κ antibodies may be due to the shorter CDR-L3 loops.
These results are consistent with our model of a “caged” CDR-H3 loop, whose conformational diversity partially depends on the minor loops surrounding it, showing that CDR-H3 structural diversity is significantly modulated by VL attributes as granular as CDR canonical forms.
Identical VH sequences show differences in CDR-H3 conformation when paired with different VL sequences
To confirm our results from the DTW analysis of antibodies that canonical loop forms lead to a change in CDR-H3 conformational diversity, we examined a set of antibody pairs from with identical VH but dissimilar VL sequences, yielding a data set of 86 pairs of crystal structures, called the pairing data set (see Methods section). Briefly, we aligned each pair on the VH framework region and calculated RMSD values of the VH CDRs and framework region before calculating Pearson’s correlation coefficients () (Figures 6A–D). It should be noted that at the 99% VL sequence identity cutoff we used (see Methods section), all pairs we identified had at least one difference in the VL CDR loops. We found that the VL sequence identity between these pairs was correlated significantly with CDR-H3 RMSD (, ), but not with CDR-H1 (, ), CDR-H2 (, ), or VH framework (, ) RMSD. We also note one outlier where VL sequence identity was found to be below 55%. In order to ensure that our observed correlation between sequence identity and CDR-H3 conformation is not driven by this pair, we calculated the correlation coefficient after removing it and found the correlation to still be significant (, ).
Figure 6 Changes in CDR-H3 conformation upon changes in VL sequences. We extracted from SAbDab pairs of structures that have identical VH but different VL sequences and aligned structures on the VH framework regions before calculating RMSD values of (A) CDR-H1, (B) CDR-H2, (C) VH framework, and (D) CDR-H3 regions. Lines of best fit are reported with shaded areas corresponding to 95% confidence intervals. (E) shows the comparison of region RMSDs in the VH-identical pairs when all VL loops have the same canonical forms, or when at least one loop is has a different canonical form. Statistical significance was calculated using Mann-Whitney U tests (ns: not significant, *: 0.005< p< 0.05, ***: p< 5 × 10−7) (F) shows the change in CDR-H3 conformation for VH-identical pair 5y2k (orange) and 5y2l (cyan). CDR-H3 loops are shaded using darker colors, and CDR-L3 loops are shown in lighter shades. (G) shows 5y2k with a shallower groove occupied by CDR-H3 due to the longer CDR-L3 and shorter CDR-L1 loop, whereas (H) shows 5y2l with a deeper groove occupied by CDR-H3 due to the shorter CDR-L3 and longer CDR-L1 loop. Minor loops are shown using spheres, with CDR-H3 shown in cyan.
Next, we examined the canonical forms of these pairs. We separated our data set based on whether all VL canonical forms remain the same or whether at least one changes and found that changing VL canonical forms lead to a greater average CDR-H3 RMSD, with an increase of 1.17 Å compared to pairs where VL canonical forms remain identical. We did not observe effects of similar magnitude for CDR-H1, CDR-H2, or VH framework regions, with mean RMSD increases of 0.16 Å, 0.28 Å, and 0.08 Å, respectively (Figure 6E). We found that the correlation between CDR-H3 RMSD and VL sequence identity disappeared when we examined only pairs where all VL canonical forms remain the same (, ). This suggests that even small changed in the VL sequence are able to induce conformational changes in CDR-H3 without accompanying changes in the rest of the VH domain. Further, we show that this is dependent on the canonical forms of VL CDR loops. This supports our “caged” CDR-H3 model and provides evidence for the influence of VL CDR loop steric bulk on the structure of CDR-H3 in crystal structures.
We selected the pair that showed <55% VL sequence identity from this data set for further analysis: 5y2k (VH: chain A, VL: chain B) (Figure 6G) and 5y2l (VH: chain I, VL: chain J) (Figure 6H). 5y2k has one additional residue on CDR-L3 compared to 5y2l, and one fewer residue on CDR-L1. The effect of this is that the space available to CDR-H3 is much broader in 5y2k, whereas the short CDR-L3 coupled with the long CDR-L1 form the hypothesized groove that the CDR-H3 loop packs into. Indeed, when both FV domains are aligned on the VH domain, we observe that in the case of the shorter CDR-L3 loop, the CDR-H3 loop is folded down into the cleft and that this conformation of the loop would clash with the longer CDR-L3 loop (Figure 6F).
Discussion
It has long been hypothesized that the conformational space of CDR-H3 loops depends not only on their sequences, but also on structural features in their vicinity. Several studies have confirmed this to be the case in specific FV regions (Fernandez-Quintero et al. (41); Teplyakov et al. (11)). A question that remains is whether the presence of the VL domain, or indeed specific attributes of the VL domain can generally be shown to modulate CDR-H3 conformation in a predictable manner
In this study, we show that VL domains influence CDR-H3 structure by demonstrating that genetic attributes of the VL domain influence both the structure and structural diversity of CDR-H3 loops. We propose a model to provide a physical explanation for this phenomenon, namely that the minor loops of the VL domain can be arranged in different forms to physically restrain CDR-H3 loops, thereby modulating the conformational space that the latter are able to occupy. We argue that this is likely mediated through CDR-L loops providing steric bulk and scaffolding with which CDR-H3 can make interactions. We show that the CDR-H3 loop forms contacts with the minor loops by packing against them, and therefore hypothesize that if these minor loops were to be found in different conformations, this should be reflected in the conformation of the CDR-H3 loop. We found that in simulations where minor loop conformations do not change, the packing effects are recapitulated, in that CDR-H3 efficiently explores the conformational space that is available based on the minor loop bulk. Further, we show that inter-loop packing is tighter in the case of the VL, suggesting increased rigidity in the CDRs of this domain compared to the VH domain. This is further supported by our observation of reduced flexibility of VL loops compared to VH loops.
Based on this, we show that predictable effects of minor loop attributes on CDR-H3 loop conformational diversity can be identified. We also identify the seemingly paradoxical observation that while longer CDR-L1 loops appear to reduce CDR-H3 diversity, the opposite is true for longer CDR-L3 loops and explain this observation within the context of our “caged” CDR-H3 model.
The effects observed here are one of many CDR-H3 conformation modulators, with the most significant being the specific sequence of the CDR-H3 loop. As is well-documented, the CDR-H3 loop does not lend itself to sequence-based clustering approaches suggesting that a plethora of factors are important in determining its exact structure, among which is the steric bulk provided by the minor loops. This also explains the observation that deep learning-based CDR-H3 loop modelling algorithms perform significantly better when trained using data that contains information on what is in the vicinity of the CDR-H3 loop (Abanades et al. (46)). Furthermore, our observations also provide a strong case for the formulation of antibody design as a multi-objective optimization problem where all loops must be considered together, rather than the traditional approach of modular design, where minor loops are clicked into place before subsequent modelling of CDR-H3.
In the context of B-cell immunity, our model is in line with previous studies suggesting that conformational diversity of antibodies generally decreases over the course of affinity maturation (Julian et al. (47); Wong et al. (48); Fernandez-Quintero et al. (37); Fernandez-Quintero et al. (49); Ovchinnikov et al. (50)). A germline combination begins with a sub-space of possible CDR-H3 conformations due to the initial FV pairing (Briney et al. (51)) and over the course of the maturation process accumulates more and more mutations that home in on and stabilize the required conformation of the CDR-H3 loop for epitope complementarity.
This might then explain the observation that individual repertoires show enrichment of similar VL genes upon similar antigenic insult and in functional antibodies (Jaffe et al. (52)), as these genes may be conducive to the necessary CDR-H3 conformational space being explored to target the antigen. However, this is only one driver of VL selection, as for example in many cases the VL CDR loops also make contact with the antigen, placing emphasis again on antibody design and maturation as a multi-objective problem, in both in silico and natural B-cell immunity contexts.
In summary, we demonstrate the role of the VL CDR loops as modulators of the CDR-H3 loop through modifications of the conformational subspace of the loop. Further research into this role and its examination in dynamical studies of antibodies where minor loop flexibility is also explored will allow us to more accurately understand the interconnected nature of CDR loop conformations. This will be important to fully understand the effect of this on rational antibody design.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.
Author contributions
BG and CD conceived the project and designed the study. BG performed experiments. BG and CD wrote the manuscript. CD supervised the project. All authors contributed to the article and approved the submitted version.
Funding
BG is supported by the Wellcome Trust (Grant ID: 102164/Z/13/Z).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2023.1223802/full#supplementary-material
References
1. Akbar R, Robert PA, Pavlović M, Jeliazkov JR, Snapkov I, Slabodkin A, et al. A compact vocabulary of paratope-epitope interactions enables predictability of antibody-antigen binding. Cell Rep (2021) 34:1–15. doi: 10.1016/j.celrep.2021.108856
2. Schroeder HW, Cavacini L. Structure and function of immunoglobulins. J Allergy Clin Immunol (2010) 125:S41–52. doi: 10.1016/j.jaci.2009.09.046
3. Chothia C, Lesk AM. Canonical structures for the hypervariable regions of immunoglobulins. J Mol Biol (1987) 196:901–17. doi: 10.1006/jmbi.1999.3358
4. North B, Lehmann A, Dunbrack RL. A new clustering of antibody CDR loop conformations. J Mol Biol (2011) 406:228–56. doi: 10.1016/j.jmb.2010.10.030
5. Nowak J, Baker T, Georges G, Kelm S, Klostermann S, Shi J, et al. Length-independent structural similarities enrich the antibody CDR canonical class model. mAbs (2016) 8:751–60. doi: 10.1080/19420862.2016.1158370
6. Regep C, Georges G, Shi J, Popovic B, Deane CM. The H3 loop of antibodies shows unique structural characteristics. Proteins: Structure Funct Bioinf (2017) 85:1311–8. doi: 10.1002/prot.25291
7. Alt FW, Baltimore D. Joining of immunoglobulin heavy chain gene segments: implications from a chromosome with evidence of three d-J(H) fusions. Proc Natl Acad Sci United States America (1982) 79:4118–22. doi: 10.1073/pnas.79.13.4118
8. Phillips AM, Lawrence KR, Moulana A, Dupic T, Chang J, Johnson MS, et al. Binding affinity landscapes constrain the evolution of broadly neutralizing anti-influenza antibodies. eLife (2021) 10:1–40. doi: 10.7554/eLife.71393
9. Khalid Z, Chen Y, Yu D, Abbas M, Huan M, Naz Z, et al. IgNAR antibody: structural features, diversity and applications. Fish Shellfish Immunol (2022) 121:467–77. doi: 10.1016/j.fsi.2022.01.027
10. Mitchell LS, Colwell LJ. Comparative analysis of nanobody sequence and structure data. Proteins: Structure Funct Bioinf (2018) 86:697–706. doi: 10.1002/prot.25497
11. Teplyakov A, Obmolova G, Malia TJ, Luo J, Sweet R, Almagro JC, et al. Structural diversity in a human antibody germline library. mAbs (2016) 8:1045–63. doi: 10.1080/19420862.2016.1190060
12. Fernández-Quintero ML, Kroell KB, Bacher LM, Loeffler JR, Quoika PK, Georges G, et al. Germline-dependent antibody paratope states and pairing specific VH-VL interface dynamics. Front Immunol (2021) 12:675655. doi: 10.3389/fimmu.2021.675655
13. Dunbar J, Krawczyk K, Leem J, Baker T, Fuchs A, Georges G, et al. SAbDab: the structural antibody database. Nucleic Acids Res (2014) 42:1140–6. doi: 10.1093/nar/gkt1043
14. Dunbar J, Deane CM. Anarci: antigen receptor numbering and receptor classification. Bioinformatics (2016) 32:298–300. doi: 10.1093/bioinformatics/btv552
15. Lefranc MP, Pommié C, Ruiz M, Giudicelli V, Foulquier E, Truong L, et al. Imgt unique numbering for immunoglobulin and t cell receptor variable domains and ig superfamily v-like domains. Dev Comp Immunol (2003) 27:55–77. doi: 10.1016/S0145-305X(02)00039-3
16. McGibbon RT, Beauchamp KA, Harrigan MP, Klein C, Swails JM, Hernández CX, et al. Mdtraj: a modern open library for the analysis of molecular dynamics trajectories. Biophys J (2015) 109:1528–32. doi: 10.1016/j.bpj.2015.08.015
17. Eastman P, Swails J, Chodera JD, McGibbon RT, Zhao Y, Beauchamp KA, et al. OpenMM 7: rapid development of high performance algorithms for molecular dynamics. PloS Comput Biol (2017) 13:1–17. doi: 10.1371/journal.pcbi.1005659
18. Maier JA, Martinez C, Kasavajhala K, Wickstrom L, Hauser KE, Simmerling C. ff14sb: improving the accuracy of protein side chain and backbone parameters from ff99sb. J Chem Theory Comput (2015) 11:3696–713. doi: 10.1021/acs.jctc.5b00255
19. Neria E, Fischer S, Karplus M. Simulation of activation free energies in molecular systems. J Chem Phys (1996) 105:1902–21. doi: 10.1063/1.472061
20. Darden T, York D, Pedersen L. Particle mesh ewald: an N·log(N) method for ewald sums in large systems. J Chem Phys (1993) 98:10089–92. doi: 10.1063/1.464397
21. Miyamoto S, Kollman PA. Settle: an analytical version of the SHAKE and RATTLE algorithm for rigid water models. J Comput Chem (1992) 13:952–62. doi: 10.1002/jcc.540130805
22. Ryckaert JP, Ciccotti G, Berendsen HJ. Numerical integration of the cartesian equations of motion of a system with constraints: molecular dynamics of n-alkanes. J Comput Phys (1977) 23:327–41. doi: 10.1016/0021-9991(77)90098-5
23. Hopkins CW, Le Grand S, Walker RC, Roitberg AE. Long-time-step molecular dynamics through hydrogen mass repartitioning. J Chem Theory Comput (2015) 11:1864–74. doi: 10.1021/ct5010406
24. Barducci A, Bussi G, Parrinello M. Well-tempered metadynamics: a smoothly converging and tunable free-energy method. Phys Rev Lett (2008) 100:1–4. doi: 10.1103/PhysRevLett.100.020603
25. Fernández-Quintero ML, Loeffler JR, Kraml J, Kahler U, Kamenik AS, Liedl KR. Characterizing the diversity of the CDR-H3 loop conformational ensembles in relationship to antibody binding properties. Front Immunol (2019) 9:3065. doi: 10.3389/fimmu.2018.03065
26. Branduardi D, Bussi G, Parrinello M. Metadynamics with adaptive gaussians. J Chem Theory Comput (2012) 8:2247–54. doi: 10.1021/ct3002464
27. Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, et al. Scikit-learn: machine learning in Python. J Mach Learn Res (2011) 12:2825–30.
28. Needleman SB, Wunsch CD. A general method applicable to the search for similarities in the amino acid sequence of two proteins. J Mol Biol (1970) 48:443–53. doi: 10.1016/0022-2836(70)90057-4
29. Wong WK, Georges G, Ros F, Kelm S, Lewis AP, Taddese B, et al. SCALOP: sequence-based antibody canonical loop structure annotation. Bioinformatics (2019) 35:1774–6. doi: 10.1093/bioinformatics/bty877
30. Li W, Godzik A. Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics (2006) 22:1658–9. doi: 10.1093/bioinformatics/btl158
31. Hamelryck T, Manderick B. PDB file parser and structure class implemented in Python. Bioinformatics (2003) 19:2308–10. doi: 10.1093/bioinformatics/btg299
32. Virtanen P, Gommers R, Oliphant TE, Haberland M, Reddy T, Cournapeau D, et al. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat Methods (2020) 17:261–72. doi: 10.1038/s41592-019-0686-2
33. Waskom ML. Seaborn: statistical data visualization. J Open Source Software (2021) 6:3021. doi: 10.21105/joss.03021
34. Hunter JD. Matplotlib: a 2d graphics environment. Computing Sci Eng (2007) 9:90–5. doi: 10.1109/MCSE.2007.55
35. Pettersen EF, Goddard TD, Huang CC, Meng EC, Couch GS, Croll TI, et al. UCSF ChimeraX: structure visualization for researchers, educators, and developers. Protein Sci (2021) 30:70–82. doi: 10.1002/pro.3943
36. Wong WK, Leem J, Leem J. Comparative analysis of the CDR loops of antigen receptors. Front Immunol (2019) 10:2454. doi: 10.3389/fimmu.2019.02454
37. Fernández-Quintero ML, Loeffler JR, Bacher LM, Waibl F, Seidler CA. Local and global rigidification upon antibody affinity maturation. Front Mol Biosci (2020) 7:182. doi: 10.3389/fmolb.2020.00182
38. Fernández-quintero ML, Heiss MC, Pomarici ND, Math A, Liedl KR, Heiss MC, et al. Antibody CDR loops as ensembles in solution vs. canonical clusters from X-ray structures structures. mAbs (2020) 12:1–11. doi: 10.1080/19420862.2020.1744328
39. Marks C, Deane CM. Antibody H3 structure prediction. Comput Struct Biotechnol J (2017) 15:222–31. doi: 10.1016/j.csbj.2017.01.010
40. Weitzner BD, Dunbrack RL, Gray JJ. Article the origin of CDR H3 structural diversity. Structure (2015) 23:302–11. doi: 10.1016/j.str.2014.11.010
41. Fernández-Quintero ML, Pomarici ND, Math BA, Kroell KB, Waibl F, Bujotzek A, et al. Antibodies exhibit multiple paratope states influencing VH–VL domain orientations. Commun Biol (2020) 3:1–14. doi: 10.1038/s42003-020-01319-z
42. Yuan C, Chen H, Kihara D. Effective inter-residue contact definitions for accurate protein fold recognition. BMC Bioinf (2012) 12:1–13. doi: 10.1186/1471-2105-13-292
43. Rock EP, Sibbald PR, Davis MM, Chien Y-H. CDR3 length in antigen-specific immune receptors. J Exp Med (1994) 179:323–8. doi: 10.1084/jem.179.1.323
44. Mitsunaga EM, Snyder MP. Deep characterization of the human antibody response to natural infection using longitudinal immune repertoire sequencing. Mol Cell Proteomics (2020) 19:278–93. doi: 10.1074/mcp.RA119.001633
45. Townsend CL, Laffy JM, Wu YCB, O’Hare JS, Martin V, Kipling D, et al. Significant differences in physicochemical properties of human immunoglobulin kappa and lambda CDR3 regions. Front Immunol (2016) 7:388. doi: 10.3389/fimmu.2016.00388
46. Abanades B, Wong WK, Boyles F, Georges G, Deane CM. ImmuneBuilder: deep-learning models for predicting the structures of immune proteins. bioRxiv (2023) 6:1–8. doi: 10.1038/s42003-023-04927-7
47. Julian MC, Li L, Garde S, Wilen R, Tessier PM. Efficient affinity maturation of antibody variable domains requires co-selection of compensatory mutations to maintain thermodynamic stability. Sci Rep (2017) 7:1–13. doi: 10.1038/srep45259
48. Wong SE, Sellers BD, Jacobson MP. Effects of somatic mutations on CDR loop flexibility during affinity maturation. Proteins: Structure Funct Bioinf (2011) 79:821–9. doi: 10.1002/prot.22920
49. Fernández-Quintero ML, Seidler CA, Liedl KR. T-Cell receptor variable β domains rigidify during affinity maturation. Sci Rep (2020) 10:1–8. doi: 10.1038/s41598-020-61433-0
50. Ovchinnikov V, Louveau JE, Barton JP, Karplus M. Role of framework mutations and antibody flexibility in the evolution of broadly neutralizing antibodies. eLife (2018) 7:1–24. doi: 10.7554/eLife.33038
51. Briney B, Inderbitzin A, Joyce C, Burton DR. Commonality despite exceptional diversity in the baseline human antibody repertoire. Nature (2019) 566:393–7. doi: 10.1038/s41586-019-0879-y
Keywords: antibody dynamics, paratope, antibody design, CDR-H3, structural modeling, loop dynamics
Citation: Guloglu B and Deane CM (2023) Specific attributes of the VL domain influence both the structure and structural variability of CDR-H3 through steric effects. Front. Immunol. 14:1223802. doi: 10.3389/fimmu.2023.1223802
Received: 16 May 2023; Accepted: 28 June 2023;
Published: 26 July 2023.
Edited by:
Bryan Briney, The Scripps Research Institute, United StatesReviewed by:
Kannan Sankar, Novartis Institutes for BioMedical Research, United StatesHarry W. Schroeder, University of Alabama at Birmingham, United States
Copyright © 2023 Guloglu and Deane. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Charlotte M. Deane, ZGVhbmVAc3RhdHMub3guYWMudWs=