- 1Molecular Biophysics and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States
- 2David Shin Consulting, Berkeley, CA, United States
- 3Neutron Scattering Division, Oak Ridge National Laboratory, Oak Ridge, TN, United States
- 4Scientific Data Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States
- 5Applied Mathematics and Computational Science Division, Lawrence Berkeley National Laboratory, Berkeley, CA, United States
- 6Department of Chemistry and Biochemistry, University of California Santa Cruz, Santa Cruz, CA, United States
Lipid nanoparticles (LNPs) are being intensively researched and developed to leverage their ability to safely and effectively deliver therapeutics. To achieve optimal therapeutic delivery, a comprehensive understanding of the relationship between formulation, structure, and efficacy is critical. However, the vast chemical space involved in the production of LNPs and the resulting structural complexity make the structure to function relationship challenging to assess and predict. New components and formulation procedures, which provide new opportunities for the use of LNPs, would be best identified and optimized using high-throughput characterization methods. Recently, a high-throughput workflow, consisting of automated mixing, small-angle X-ray scattering (SAXS), and cellular assays, demonstrated a link between formulation, internal structure, and efficacy for a library of LNPs. As SAXS data can be rapidly collected, the stage is set for the collection of thousands of SAXS profiles from a myriad of LNP formulations. In addition, correlated LNP small-angle neutron scattering (SANS) datasets, where components are systematically deuterated for additional contrast inside, provide complementary structural information. The centralization of SAXS and SANS datasets from LNPs, with appropriate, standardized metadata describing formulation parameters, into a data repository will provide valuable guidance for the formulation of LNPs with desired properties. To this end, we introduce Simple Scattering, an easy-to-use, open data repository for storing and sharing groups of correlated scattering profiles obtained from LNP screening experiments. Here, we discuss the current state of the repository, including limitations and upcoming changes, and our vision towards future usage in developing our collective knowledge base of LNPs.
1 Introduction
Lipid nanoparticles (LNPs) have been transformative for delivering oligonucleotide therapeutics. These lipid vesicles protect the nucleic acids from degradation and from triggering unwanted immune responses until they are taken up by cells (Yin et al., 2014; Dowdy et al., 2017). Safe and effective delivery of nucleic acids has been demonstrated by their application as COVID-19 vaccines (Anderson et al., 2020; Polack et al., 2020), as well as therapies to address genetic disorders and cancer (Akinc et al., 2019; Yao et al., 2020). With an established, largely successful and widespread roll-out of LNP therapeutics, there is a new interest in designing LNPs for a wide variety of applications (Tenchov et al., 2021), a process which would benefit from an understanding of the relationships between LNP composition, structure, and efficacy. Figure 1 illustrates the rise in structural investigations of LNPs, as shown by the steady increase of publications containing keywords specific to LNPs from 2018 to 2023.
Figure 1. Literature search results using LNP-relevant keywords. Increase of interest in LNPs and their structures can be observed in recent years. Data was obtained from Digital Science’s Dimensions platform (Digital Science, 2018).
LNPs form a variety of structures, ranging from vesicles to complex nanostructures that are encapsulated by a lipid layer, such as cubosomes (Barriga et al., 2018; Mohammad et al., 2022; Tan et al., 2022; Sivadasan et al., 2023), hexosomes (Abdel-Bar et al., 2020; Mohammad et al., 2022; Tan et al., 2022), and spongosomes (Chen et al., 2015; Zou et al., 2017). Detailed descriptions of the various lipid-based nanoparticles can be found in reviews by Tenchov et al., Zhai et al., and Thi et al. (Zhai et al., 2019; Tenchov et al., 2021; Thi et al., 2021). LNPs have been reported to exhibit multitudes of internal structural features, including inverse hexagonal lipid phase (Yanez Arteta et al., 2018; Aburai et al., 2018; Carrasco et al., 2021), unilamellar (Leung et al., 2012; Kulkarni et al., 2018; Nogueira et al., 2020; Patel et al., 2020; Carrasco et al., 2021), multilamellar (Uebbing et al., 2020; Fan et al., 2021), disordered core (Kulkarni et al., 2018), and combinations of such phases. In addition, phase-separate lipid-RNA cores forming ‘bleb’ structures have also been reported (Cheng et al., 2023; Thelen et al., 2024). Small variations in LNP composition can greatly affect their size, internal structure and chemistry.
The strong dependence of structure on the chemical composition is particularly challenging as LNPs are typically an assembly of many types of molecules. For example, the Pfizer/BioNTech and the Moderna COVID-19 vaccines carry nucleic acid therapeutics and are formulated from a mixture of cationic ionizable lipids, helper phospholipids, cholesterol, and polyethylene glycol-lipids (PEG-lipids). Cationic ionizable lipids facilitate the encapsulation of negatively-charged oligonucleotides and their delivery from inside the endosomes (Maier et al., 2013). Helper phospholipids and cholesterol stabilize the nanoparticle structure and aid in intracellular delivery of the cargo (Cheng and Lee, 2016). PEG-lipids modulate the surface properties of LNPs, including particle size and zeta potential, and affect their interactions with the environment, which can impact circulation time and clearance rate (Mui et al., 2013; Ryals et al., 2020; Kim et al., 2021). The choice of each lipid components (Nogueira et al., 2020; Patel et al., 2020), ratios between the different lipids (Yanez Arteta et al., 2018; Kulkani et al., 2018), ratios between lipids and nucleic acids (Kulkarni et al., 2018; Carrasco et al., 2021), as well as the type and length of oligonucleotide cargo (Aburai et al., 2020; Li et al., 2023), are all factors that could be fine-tuned for desirable outcomes.
In addition to variations in common chemical components, a host of other parameters can also dramatically affect LNP size, structure and chemistry. For example, new chemistries have been explored, such as the covalent addition of peptides to lipids that may aid in targeting LNPs to specific cells (Endsley and Ho, 2012). The physical mixing process to form LNPs can be done with various microfluidic designs or robotic handling (Chen et al., 2012; Sarode et al., 2022). pH-dependent structural changes of LNPs have also been studied to gain further insights on LNP formation and drug release (Yu et al., 2023). LNPs can change as a function of time (Ball et al., 2017), thus limiting shelf life. Storage methods may include highly concentrated solutions, freezing (−20 or −80°C) or lyophilization, and require identification of beneficial excipients. Once delivered to biological tissues, structural changes may enhance targeting and therapeutic release.
The characterization of structural features has depended on small-angle scattering (SAS) and/or cryogenic electron microscopy (cryo-EM). These techniques are complementary to each another, and provide valuable insights into the complex and dynamic LNP structures. Small-angle X-ray scattering (SAXS) and small-angle neutron scattering (SANS) provide population-wide information about the average internal structure and size of LNPs in relevant physiological conditions. SAXS has been used to characterize lipid-based nanoparticles for delivery of small-molecule drugs or macromolecular therapeutics (Angelova et al., 2011; Angelova et al., 2017; Angelova et al., 2018; Mathews et al., 2022) and the nanocomplexes they may form with biomolecules (Angelov et al., 2013; Angelov et al., 2014; Angelov et al., 2017; Yu et al., 2023). SANS can take advantage of the natural neutron contrast between lipids and nucleic acids to probe different structural regions within LNPs. As an alternative approach, SANS can exploit the difference in scattering length density (SLD) between hydrogen (1H) and deuterium (2H or D) by isotopic labeling of LNP components to selectively highlight structural regions within LNPs, thus providing detailed insights into LNP morphology (Di Cola et al., 2016; Sebastiani et al., 2021; Leite et al., 2022; Urimi et al., 2022; Yang et al., 2022; Kangarlou et al., 2023). SANS can also provide quantitative information regarding the distribution of molecules within the LNP structure, including water content in solvated regions. Cryo-EM provides direct visualization of the size and morphology of individual particles in hydrated frozen thin films (Kulkarni et al., 2018; Yanez Arteta et al., 2018; Aburai et al., 2020).
In a recent example, SANS, in combination with SAXS and cryo-EM, was used to structurally characterize a non-spherical morphology of LNPs containing self-amplifying mRNA (SAM-LNPs). SAMs are significantly larger, approximately three to four times, than conventional mRNA. SAM can generate many antigen-encoding mRNA copies and prolong expression of the antigen with lower doses than those required for conventional mRNA-LNPs. Cryo-EM showed that SAM-LNPs had two-compartments surrounded by an external shell. This non-spherical morphology was rationalized as LNPs comprising of lipid and aqueous RNA-rich phase-separated cores, both surrounded by a distearoylphosphatidylcholine (DSPC)/PEG lipid shell, also reported as ‘bleb’ LNPs (Thelen et al., 2024). Modeling of SANS data using partially deuterated LNPs indicated important structural features, including the overall size of aqueous RNA and lipid-rich compartments, water content in the aqueous RNA core, and the overall size of SAM-LNPs. Although this non-spherical morphology is not intrinsically unusual, it has been adopted by a variety of other RNA-LNPs (Kulkarni et al., 2018; Brader et al., 2021).
Compared to the vast parameter space involved in LNP formulation and the wide range of observable phases, SANS and cryo-EM have limited throughput. Furthermore, linking LNP morphologies to function requires a comprehensive understanding of LNP structure-activity relationships (SARs). Structural analyses that match or exceed the pace of producing different LNP formulations and assaying their activity will greatly enhance LNP development. Recently, HT-SAXS has been shown as a promising technique that can match the throughput of LNP formulation and provide structural information to link to cellular activity data (Hammel et al., 2023). Upon generating LNP libraries in an automated and multiplexed process, structural characterization was accomplished via HT-SAXS platform at the SIBYLS beamline (Hura et al., 2009), and efficacy data were obtained through high-throughput in vitro assays. Deconvolution of SAXS data using Lorentzian functions that represent different phases elucidated finer structural details that were correlated with cellular activity. For delivering short antisense oligonucleotides (ASOs), the study showed that efficacy was linked to the structural order in the core of LNPs influenced by the choice of PEG-lipids.
As the amount of data generated for LNP analyses increases, sharing the data on a platform that is easily accessible by the global community becomes critical. As demonstrated by the Protein Data Bank (PDB) (Berman et al., 2007), a comprehensive data bank with unified formats and standards empowers the field of structural biology and its related research areas. Publicly available structural data and models guide characterization of biochemical processes and inspire the development of bioinformatics and machine learning-assisted tools for analyses and predictions of biomolecular structures (Gainza et al., 2020; Baek et al., 2021; Jumper et al., 2021; Renaud et al., 2021). Without such repositories, structural data only exist as figures in the Supplementary Information of scientific articles, scattered entries across different databases, or files on a local drive that are inaccessible to the public.
The Small Angle Scattering Biological Data Bank (SASBDB) is a data archive specifically for SAS data (Kikhney et al., 2020). The primary purpose of SASBDB is to make analyses of SAS profiles available to the public and provide their computation of the underlying atomic structures of the macromolecule being studied. SASBDB is an excellent and leading resource for the SAS community to impose standards for these kinds of studies, as set out in community guidelines. Structural biologists who are not experts in the interpretation of SAS data can use the atomic models within the SASBDB when they become available, along with access to the primary data to test additional models or hypotheses. However, SASBDB contains only a few entries on lipid nanoparticles (Flayhan et al., 2018). This is, in part, because the guidelines that have been established are specific for homogeneous solutions of macromolecules and not heterogenous, mixed-density LNPs. Moreover, experiments that are likely to lead to breakthroughs in the LNP field are not singular measurements but will require integration of information from tens to thousands of measurements.
As discussed above, the large and multi-faceted parameter space of LNPs is a major challenge for optimizing formulations. If previous SAS results are to direct the optimization of a new formulation, an appropriate repository for scattering data from LNPs is indispensably necessary. Connecting an individual scattering profile from an LNP to a real space model requires comparisons to many other scattering profiles, where small changes are made and where additional data from lower throughput techniques are available. The ability to compare and contrast LNP scattering across the community would expedite the process of linking formulation to structure and efficacy. If a comprehensive data repository was available for LNPs, hundreds and thousands of scattering data generated through the high-throughput workflow could be found on a single platform to be mined and analyzed for a deeper understanding of LNP structure-activity relationship.
However, complete standardization of required metadata as done in the PDB or SASBDB may not be practical, and developing a repository that anticipates all variables is likely impossible. This challenge is not unique to LNP data, but is an emerging need across science domains. Several models have been proposed to deal with the diversity of metadata driven by the heterogeneity of scientific experiments. For example, the NWB neurophysiology data standard (Rübel et al., 2022) supports formal, user-defined extension to the data model, and the CORAL system (Novichkov et al., 2022) uses the concepts of micro-typing and contextons to support flexible extension of the data model while adhering to FAIR principles. The Linked Open Data Modeling Language (LinkML) (Moxon et al., 2021) is further an emerging standard for defining standard data schema for linked data integrated with ontologies while enabling integration with a broad range of application models, e.g., JSON-Schema, ShEx, RDF, OWL, GraphOL, and SQL DDL. Here, we propose an intermediate solution with a core metadata model is needed that is strict where possible but easy for users to extend the data model to add metadata fields not covered by the core standard. This would facilitate compliance with FAIR principles through linking of metadata to ontologies and linking of data to related datasets.
2 Simple Scattering
We have begun modifying a data repository we maintain called “Simple Scattering” (simplescattering.com) for the storage of LNP data. Simple Scattering is an open data repository that provides a space for SAS data from high-throughput LNP screening experiments. This data bank was designed to hold SAS data generated from various methods including size-exclusion chromatography and time-resolved experiments (Murray et al., 2023). However, it differs from other SAS repositories such as SASBDB (Kikhney et al., 2020) and BioISIS (Hura et al., 2009), which store extensively analyzed and curated SAS curves and any models derived from these data. Instead, Simple Scattering holds correlated SAS curves generated from a set of related experiments, such as buffer screens, allowing an intuitive and quick deposition of multiple, related datasets by users. Once the user decides to make the datasets public, deposited data entries are downloadable, easily searchable with keywords, and associated with unique, persistent, digital identifier codes. These data entries serve not only as storage and records of scattering data, but also as supplementary data linked to a publication for data validation.
With Simple Scattering, our goal was to create a flexible deposition system inspired by some aspects of Zenodo, while also providing sufficient standardization of data and metadata so that data is findable and accessible via application programming interfaces (APIs) and reusable for computational analyses and machine learning in adherence with FAIR principles. We do not intend to provide a direct and individualized connection of each SAS curve to real space models. We seek to design a system that provides access to the primary scattering data along with a thorough understanding of the formulation parameters including a complete description of composition. The development of Simple Scattering is also connected to an HT-SAXS synchrotron beamline (SIBYLS at the Advanced Light Source in Berkeley, CA) such that data collected at the beamline is immediately uploaded to Simple Scattering after minimal processing (Murray et al., 2023). Significant quantities of data are available with more expected as the application of SAXS to LNPs grows. The repository is open for participation by others, enabling the opportunity for a greater collection of LNP scattering data. In this broader context, we have also been actively collaborating with colleagues at the Biological Small-Angle Neutron Scattering Beamline (Bio-SANS) at Oak Ridge National Laboratory. This collaboration aims to extend the scope of Simple Scattering to include neutron scattering data, further enriching the content of the repository, and facilitating valuable insights into the structure of LNPs. A SANS dataset of LNP is currently deposited and we expect many others from the SANS and LNP communities to be deposited in the future.
Thus far, the majority of the datasets in Simple Scattering have consisted of proteins and nucleic acids. Currently, Simple Scattering holds LNP data entries from 3 publications. Hammel et al. utilized SAS to investigate the effect of PEG-lipids on structure and activity of LNPs loaded with antisense oligonucleotides (ASO-LNPs, dataset codes: XSDM4ADX, XSSOSU8C, XSK1IXUI), varying the type of PEG-lipid as well as their relative molar ratios to other lipid components (Hammel et al., 2023). O’Brien Laramy et al. performed a comparative evaluation of two different formulation methods of preparing ASO-LNPs, coaxial turbulent jet mixing versus microfluidic mixing (dataset code: XS0ZP76H, O’Brien Laramy et al., 2023), and Thelen et al. used a combination of different structural techniques, including cryo-EM, SAXS, and SANS to structurally characterize LNPs encapsulating self-amplifying mRNA molecules (dataset code: XSXEB4SL, Thelen et al., 2024). Numerous literature precedence exists for studying LNPs using SAS: interactions between proteins and LNPs (Flayhan et al., 2018; Sebastiani et al., 2021; Barriga et al., 2022); mixing parameters and post-formulation dilution (Hassett et al., 2021); composition ratios (Yanez Arteta et al., 2018; Barriga et al., 2022); lipid component screens (Aburai et al., 2020; Patel et al., 2020; Carrasco et al., 2021; Wilhelmy et al., 2023); and pH (Li et al., 2023). As the parameter space of LNP formulation continues to be explored, we anticipate a tremendous growth in the number of datasets collected. To reflect the increase in demand for SAS analysis of LNPs, Simple Scattering is undergoing changes to better accommodate datasets related to lipid nanoparticles.
2.1 Data entry system
Previous to the incorporation of LNP datasets, the Simple Scattering site was geared towards SAXS only, and in the context of biological samples. Later, we incorporated the ability to upload and share any type of SAS experiment, including SANS. Given that the site was focusing on biological samples, our users suggested additional records for typical protein and nucleotide databases and their associated IDs, such as the addition of a field for the UniProt database for proteins. Since these fields are options for their specific use case, they would simply be null for LNPs and other macromolecules or particles. Instead of simply expanding the number of fields for numerous databases and IDs to include information for LNPs, we will reconfigure both the database and the website form views to handle multiple databases within a single associated database table column, and an associated database ID column. This is in anticipation that once LNPs are introduced and flourish on the site that other communities that study different nanoparticles may also be included. For lipid nomenclature and classification scheme, Lipid Maps Structure Database (LMSD) (Sud et al., 2007) may be used, and valuable information can be provided from other databases or collections such as LIPIDAT (Caffrey M and Hogan J, 1992) through our planned APIs. In addition, we are building the database so that it is extendable to other hierarchical assemblies with similar needs for characterization.
As described above, effectively recording important and often non-standard metadata for LNP samples is a challenge. Typically, LNP formulations are described using 1) percentage ratios of lipid components that were mixed together and 2) the final concentration of the LNPs in the buffer solution. In case LNP samples were also studied using SANS, the deuteration levels of lipids or nucleic acids also need to be recorded in the metadata. While the concentrations of each component may easily be converted from being relative to a larger mass to that of the solvent, the conversions may introduce floating point errors when going back and forth between the database and the website views. Therefore, we are reengineering the site’s database and rewriting the code for the associated models, controllers and views such that the following will occur (Figure 2A): 1) General Dataset metadata will be stored in a Dataset table. 2) Each Dataset must contain at least one experiment or sample. 3) The composition of the sample under study or the subject of the experiment must contain at least one larger particle (LNP or other nanoparticles) made up of large molecules, or at least one large molecule (protein, DNA, etc.). For example, a lipid nanoparticle sample (“large particle”) can be blank (no “large molecule”) or contain DNA as its active ingredient (“large molecule”). A sample consisting of only proteins and/or nucleic acids will not contain a “large particle.” 4) Each sample or experiment must include at least one small molecule if there is any type of buffer component; otherwise, small molecules may be omitted if the large particle or large molecule is in just a single solvent such as water. The smaller molecules are generally buffer components that will also be subjected to SAS for the purpose of background subtraction. For instance, if deuterium oxide (D2O) was used in a SANS experiment, it can be recorded under small molecule metadata. Other typical small molecules may include excipients such as sucrose and glycerol.
Figure 2. Reconfiguration of Simple Scattering database and website form views. (A) Entity relationship diagram for datasets, experiments or samples belonging to the dataset, and the particles and molecules that make up the sample. The table on the far left represents a single Dataset entry into Simple Scattering (for simplicity, specific metadata are not listed). The Dataset then must have one or more experiments or samples, and records that are common such as filenames, solvent and pH, will be included in this table. Each experiment must have one or more larger (nano)particles or larger molecules. One or more small molecule components (buffer) may also be included as part of the composition of each sample. Legend for arrow heads within the diagrams: 1) Double crossed: mandatory one. 2) Single-crossed with circle: optional one–note, however, it is mandatory to have one large (nano)particle or large protein. 3) Circle chickenfoot: optional many. (B) The original sample information form geared towards proteins and oligonucleotides.
Currently, sample information is entered manually in pre-designated blanks or reported in lists within a “File description” text box (Figure 2B). To make the ability to fill in the form to describe many samples quickly, we will allow an option to upload comma-separated values (csv) in the form of a file. To make generating csv files easier, we will implement several web-based templates, downloadable scripts that can build csv files, and possibly custom conversion programs for regular users. These tools for generating csv files will allow users to provide minimal information necessary to replicate their formulation process, such as lipid components, deuteration levels, and their relative ratios, and enable facile data mining for any future machine learning analysis.
2.2 Linking related data entries
Another useful feature to be added is the ability to link related data entries. Since data collection for HT-SAXS occurs much more rapidly than SANS, HT-SAXS could be used for screening and SANS experiments could be run to provide a deeper insight into the internal structures of a few representative samples. Therefore, we will also join these together within the website’s database along with links within the website views that are determined by the Researcher that is uploading their data to share. By linking the SAXS and SANS data collected from identical samples, the relationship between the two datasets can be made clear to the public.
While in-depth analyses of SAS curves will occur prior to deposition and outside of the repository, users will also benefit from being able to visualize the scattering profiles directly from Simple Scattering. Additionally, since HT-SAXS data collection from a 96-well plate containing LNP libraries takes only 1.5 h (Hura et al., 2009), the number of scattering data will escalate quickly and make manual comparisons between data difficult. Therefore, implementation of SAXS Similarity (Hura et al., 2013) to automatically generate a structural comparison map will provide a useful, global overview of data and instantly identify clusters of similar scattering profiles. With these developments to come, Simple Scattering will hopefully become a resource that is more useful and easier to utilize for LNP research communities.
3 Discussion
Due to the complexity of lipid nanoparticles governed by many factors including different lipid components and nucleic acids, making direct associations between the formation of different structural phases and lipid compositions is difficult. This challenge is exacerbated by the limited throughput of techniques and resources to study them. However, with automation and robotics continuing to increase the throughput of experiments, the scope of experimental parameters that can be tested has expanded dramatically. As demonstrated by Hammel et al., the field of LNP development can leverage the high-throughput capabilities in formulation, structural analyses, and cellular activity assays to explore the vast chemical space and factors involved in LNP drug design and optimization (Hammel et al., 2023). Rapid data collection via HT-SAXS, combined with complementary methods such as SANS, may expedite the creation of far more effective vaccines when necessary.
During the high-throughput LNP workflow, large volumes of data will be generated, and we have introduced Simple Scattering as a data bank to store scattering data in an organized and accessible manner. While limitations do exist, Simple Scattering provides a starting point for depositing thousands of scattering profiles generated using HT-SAXS and SANS, and with the continual growth of interest in LNPs, we envision that the database will grow rapidly. As the current shortcomings are addressed by the upcoming changes, more input and suggestions from the broader community of scientists working with SAS and/or LNPs would be greatly appreciated to develop this repository into a collaborative and useful resource for all. As groups around the world screen for conditions to achieve optimal LNP performance and deposit the resulting data on Simple Scattering, this data bank will prevent unnecessary and unintentional replications of experiments and provide training datasets for future analyses assisted by computational resources. Aided by various machine learning algorithms, key conditions for achieving optimal activity may be identified, and predictions of LNP structure may be made based on specific combination of lipid components and relative molar ratios. Ultimately, we hope to develop Simple Scattering into a collaborative and collective knowledge base to improve our understanding of LNPs and facilitate the development of future LNP therapeutics with optimal performance.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding author.
Author contributions
LK: Conceptualization, Writing-original draft, Writing-review and editing, Visualization. DS: Conceptualization, Methodology, Software, Visualization, Writing-original draft, Writing-review and editing. WCL: Conceptualization, Writing-original draft, Writing-review and editing. HO’N: Conceptualization, Writing-original draft, Writing-review and editing. OR: Conceptualization, Writing-review and editing. AT: Conceptualization, Writing-review and editing. GLH: Conceptualization, Supervision, Writing-original draft, Writing-review and editing.
Funding
The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This work is funded by the following: National Institute of Health through ALS-ENABLE (P30 GM124169), National Institute of General Medical Sciences (NIGMS), Department of Energy through Biopreparedness Research Virtual Environment (BRaVE) Initiative, Advanced Scientific Computing Research (ASCR) program, and Basic Energy Science grant DE-AC02-05CH11231.
Acknowledgments
The authors acknowledge Chunwan Yen, Yuchen Fan, and Sarah Poon for insightful discussion. Simple Scattering is managed by the SIBYLS Advanced Light Source beamline, and GLH and LK acknowledge support from the following sources: National Institutes of Health grant ALS-ENABLE (P30 GM124169), National Cancer Institute grant SBDR (CA92584), Department of Energy through Basic Energy Science grant DE-AC02-05CH11231 and Biological Environmental Research grant IDAT. HO’N and WL acknowledge the Center for Structural Molecular Biology (FWP ERKP291) that supports the Bio-SANS instrument funded by the Office of Biological and Environmental Research (OBER), U. S. Department of Energy (DOE).
Conflict of interest
Author DS is employed by David Shin Consulting.
The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Abdel-Bar, H. M., Khater, S. E., Ghorab, D. M., and Al-mahallawi, A. M. (2020). Hexosomes as efficient platforms for possible fluoxetine hydrochloride repurposing with improved cytotoxicity against HepG2 cells. ACS Omega 5 (41), 26697–26709. doi:10.1021/acsomega.0c03569
Aburai, K., Hatanaka, K., Takano, S., Fujii, S., and Sakurai, K. (2020). Characterizing an siRNA-containing lipid-nanoparticle prepared by a microfluidic reactor: small-angle X-ray scattering and cryotransmission electron microscopic studies. Langmuir 36 (42), 12545–12554. doi:10.1021/acs.langmuir.0c01079
Akinc, A., Maier, M. A., Manoharan, M., Fitzgerald, K., Jayaraman, M., Barros, S., et al. (2019). The Onpattro story and the clinical translation of nanomedicines containing nucleic acid-based drugs. Nat. Nanotechnol. 14, 1084–1087. doi:10.1038/s41565-019-0591-y
Anderson, E. J., Rouphael, N. G., Widge, A. T., Jackson, L. A., Roberts, P. C., Makhene, M., et al. (2020). Safety and immunogenicity of SARS-CoV-2 mRNA-1273 vaccine in older adults. N. Engl. J. Med. 383, 2427–2438. doi:10.1056/NEJMoa2028436
Angelov, B., Angelova, A., Filippov, S. K., Drechsler, M., Štěpánek, P., and Lesieur, S. (2014). Multicompartment lipid cubic nanoparticles with high protein upload: millisecond dynamics of formation. ACS Nano 8, 5216–5226. doi:10.1021/nn5012946
Angelov, B., Angelova, A., Filippov, S. K., Narayanan, T., Drechsler, M., Štěpánek, P., et al. (2013). DNA/fusogenic lipid nanocarrier assembly: millisecond structural dynamics. J. Phys. Chem. Lett. 4, 1959–1964. doi:10.1021/jz400857z
Angelov, B., Garamus, V. M., Drechsler, M., and Angelova, A. (2017). Structural analysis of nanoparticulate carriers for encapsulation of macromolecular drugs. J. Mol. Liq. 235, 83–89. doi:10.1016/j.molliq.2016.11.064
Angelova, A., Angelov, B., Mutafchieva, R., Lesieur, S., and Couvreur, P. (2011). Self-assembled multicompartment liquid crystalline lipid carriers for protein, peptide, and nucleic acid drug delivery. Acc. Chem. Res. 44, 147–156. doi:10.1021/ar100120v
Angelova, A., Drechsler, M., Garamus, V. M., and Angelov, B. (2018). Liquid crystalline nanostructures as PEGylated reservoirs of omega-3 polyunsaturated fatty acids: structural insights toward delivery formulations against neurodegenerative disorders. ACS Omega 3, 3235–3247. doi:10.1021/acsomega.7b01935
Angelova, A., Garamus, V. M., Angelov, B., Tian, Z., Li, Y., and Zou, A. (2017). Advances in structural design of lipid-based nanoparticle carriers for delivery of macromolecular drugs, phytochemicals and anti-tumor agents. Adv. Colloid Interface Sci. 249, 331–345. doi:10.1016/j.cis.2017.04.006
Baek, M., DiMaio, F., Anishchenko, I., Dauparas, J., Ovchinnikov, S., Lee, G. R., et al. (2021). Accurate prediction of protein structures and interactions using a three-track neural network. Science 373 (6557), 871–876. doi:10.1126/science.abj8754
Ball, R. L., Bajaj, P., and Whitehead, K. A. (2017). Achieving long-term stability of lipid nanoparticles: examining the effect of pH, temperature, and lyophilization. Int. J. Nanomedicine 12, 305–315. doi:10.2147/IJN.S123062
Barriga, H. M. G., Holme, M. N., and Stevens, M. M. (2018). Cubosomes: the next generation of smart lipid nanoparticles? Angew. Chem. Int. Ed. 58 (10), 2958–2978. doi:10.1002/anie.201804067
Barriga, H. M. G., Pence, I. J., Holme, M. N., Doutch, J. J., Penders, J., Nele, V., et al. (2022). Coupling lipid nanoparticle structure and automated single-particle composition analysis to design phospholipase-responsive nanocarriers. Adv. Mater 34, 2200839. doi:10.1002/adma.202200839
Berman, H., Henrick, K., Nakamura, H., and Markley, J. L. (2007). The worldwide Protein Data Bank (wwPDB): ensuring a single, uniform archive of PDB data. Nucleic Acids Res. 35, D301–D303. doi:10.1093/nar/gkl971
Brader, M. L., Williams, S. J., Banks, J. M., Hui, W. H., Zhou, Z. H., and Jin, L. (2021). Encapsulation state of messenger RNA inside lipid nanoparticles. Biophys. J. 120, 2766–2770. doi:10.1016/j.bpj.2021.03.012
Caffrey, M., and Hogan, J. (1992). LIPIDAT: a database of lipid phase transition temperatures and enthalpy changes. DMPC data subset analysis. Chem. Phys. Lipids 61, 1–109. doi:10.1016/0009-3084(92)90002-7
Carrasco, M. J., Alishetty, S., Alameh, M.-G., Said, H., Wright, L., Paige, M., et al. (2021). Ionization and structural properties of mRNA lipid nanoparticles influence expression in intramuscular and intravascular administration. Commun. Biol. 4, 956. doi:10.1038/s42003-021-02441-2
Chen, D., Love, K. T., Chen, Y., Eltoukhy, A. A., Kastrup, C., Sahay, G., et al. (2012). Rapid discovery of potent siRNA-containing lipid nanoparticles enabled by controlled microfluidic formulation. J. Am. Chem. Soc. 134 (16), 6948–6951. doi:10.1021/ja301621z
Chen, Y., Angelova, A., Angelov, B., Drechsler, M., Garamus, V. M., Willumeit-Römer, R., et al. (2015). Sterically stabilized spongosomes for multidrug delivery of anticancer nanomedicines. J. Mater Chem. B 3, 7734–7744. doi:10.1039/C5TB01193K
Cheng, M. H. Y., Leung, J., Zhang, Y., Strong, C., Basha, G., Momeni, A., et al. (2023). Induction of bleb structures in lipid nanoparticle formulations of mRNA leads to improved transfection potency. Adv. Mater 35 (31), 2303370. doi:10.1002/adma.202303370
Cheng, X., and Lee, R. J. (2016). The role of helper lipids in lipid nanoparticles (LNPs) designed for oligonucleotide delivery. Adv. Drug Deliv. Rev. 99, 129–137. doi:10.1016/j.addr.2016.01.022
Di Cola, E., Grillo, I., and Ristori, S. (2016). Small angle X-ray and neutron scattering: powerful tools for studying the structure of drug-loaded liposomes. Pharmaceutics 8 (2), 10. doi:10.3390/pharmaceutics8020010
Digital Science, (2018). Dimensions [software]. https://app.dimensions.ai (available from under license agreement Accessed on Sept.13, 2023.
Dowdy, S. F. (2017). Overcoming cellular barriers for RNA therapeutics. Nat. Biotechnol. 35, 222–229. doi:10.1038/nbt.3802
Endsley, A. N., and Ho, R. J. Y. (2012). Design and characterization of novel peptide-coated lipid nanoparticles for targeting anti-HIV drug to CD4 expressing cells. AAPS J. 14 (2), 225–235. doi:10.1208/s12248-012-9329-6
Fan, Y., Yen, C.-W., Lin, H.-C., Hou, W., Estevez, A., Sarode, A., et al. (2021). Automated high-throughput preparation and characterization of oligonucleotide-loaded lipid nanoparticles. Int. J. Pharm. 599, 120392. doi:10.1016/j.ijpharm.2021.120392
Flayhan, A., Mertens, H. D. T., Ural-Blimke, Y., Martinez Molledo, M., Svergun, D. I., and Löw, C. (2018). Saposin lipid nanoparticles: a highly versatile and modular tool for membrane protein research. Structure 26 (2), 345–355. doi:10.1016/j.str.2018.01.007
Gainza, P., Sverrisson, F., Monti, F., Rodolà, E., Boscaini, D., Bronstein, M. M., et al. (2020). Deciphering interaction fingerprints from protein molecular surfaces using geometric deep learning. Nat. Methods 17, 184–192. doi:10.1038/s41592-019-0666-6
Hammel, M., Fan, Y., Sarode, A., Byrnes, A. E., Zang, N., Kou, P., et al. (2023). Correlating the structure and gene silencing activity of oligonucleotide-loaded lipid nanoparticles using small-angle X-ray scattering. ACS Nano 17 (12), 11454–11465. doi:10.1021/acsnano.3c01186
Hassett, K. J., Higgins, J., Woods, A., Levy, B., Xia, Y., Hsiao, C. J., et al. (2021). Impact of lipid nanoparticle size on mRNA vaccine immunogenicity. J. Control Release 335, 237–246. doi:10.1016/j.jconrel.2021.05.021
Hura, G. L., Budworth, H., Dyer, K. N., Rambo, R. P., Hammel, M., McMurray, C. T., et al. (2013). Comprehensive macromolecular conformations mapped by quantitative SAXS analyses. Nat. Methods 10, 453–454. doi:10.1038/nmeth.2453
Hura, G. L., Menon, A. L., Hammel, M., Rambo, R. P., Poole, F. L., Tsutakawa, S. E., et al. (2009). Robust, high-throughput solution structural analyses by small angle X-ray scattering (SAXS). Nat. Methods 6, 606–612. doi:10.1038/nmeth.1353
Jumper, J., Evans, R., Pritzel, A., Green, T., Figurnov, M., Ronneberger, O., et al. (2021). Highly accurate protein structure prediction with AlphaFold. Nature 596, 583–589. doi:10.1038/s41586-021-03819-2
Kangarlou, B., Hoy, D., Scott, H. L., Pingali, S. V., Khalil, N., Chung, B., et al. (2023). Water content in nanoparticles determined by small-angle neutron scattering and light scattering. Langmuir 39 (1), 227–235. doi:10.1021/acs.langmuir.2c02420
Kikhney, A. G., Borges, C. R., Molodenskiy, D. S., Jeffries, C. M., and Svergun, D. I. (2020). SASBDB: towards an automatically curated and validated repository for biological scattering data. Protein Sci. 29 (1), 66–75. doi:10.1002/pro.3731
Kim, J., Eygeris, Y., Gupta, M., and Sahay, G. (2021). Self-assembled mRNA vaccines. Adv. Drug Deliv. Rev. 170, 83–112. doi:10.1016/j.addr.2020.12.014
Kulkarni, J. A., Darjuan, M. M., Mercer, J. E., Chen, S., van der Meel, R., Thewalt, J., et al. (2018). On the formation and morphology of lipid nanoparticles containing ionizable cationic lipids and siRNA. ACS Nano 12 (5), 4787–4795. doi:10.1021/acsnano.8b01516
Leite, W. C., Wu, Y., Pingali, S. V., Lieberman, R. L., and Urban, V. S. (2022). Change in morphology of dimyristoylphosphatidylcholine/bile salt derivative bicelle assemblies with dodecylmaltoside in the disk and ribbon phases. J. Phys. Chem. Lett. 13 (42), 9834–9840. doi:10.1021/acs.jpclett.2c02445
Leung, A. K. K., Hafez, I. M., Baoukina, S., Belliveau, N. M., Zhigaltsev, I. V., Afshinmanesh, E., et al. (2012). Lipid nanoparticles containing siRNA synthesized by microfluidic mixing exhibit an electron-dense nanostructured core. J. Phys. Chem. C 116 (34), 18440–18450. doi:10.1021/jp303267y
Li, Z., Carter, J., Santos, L., Webster, C., van der Walle, C. F., Li, P., et al. (2023). Acidification-induced structure evolution of lipid nanoparticles correlates with their in vitro gene transfections. ACS Nano 17 (2), 979–990. doi:10.1021/acsnano.2c06213
Maier, M. A., Jayaraman, M., Matsuda, S., Liu, J., Barros, S., Querbes, W., et al. (2013). Biodegradable lipids enabling rapidly eliminated lipid nanoparticles for systemic delivery of RNAi therapeutics. Mol. Ther. 21 (8), 1570–1578. doi:10.1038/mt.2013.124
Mathews, P. D., Mertins, O., Angelov, B., and Angelova, A. (2022). Cubosomal lipid nanoassemblies with pH-sensitive shells created by biopolymer complexes: a synchrotron SAXS study. J. Colloid Interface Sci. 60, 440–450. doi:10.1016/j.jcis.2021.08.187
Mohammad, Y., Prentice, R. N., Boyd, B. J., and Rizwan, S. B. (2022). Comparison of cubosomes and hexosomes for the delivery of phenytoin to the brain. J. Colloid Interface Sci. 605, 146–154. doi:10.1016/j.jcis.2021.07.070
Moxon, S., Solbrig, H., Unni, D., Jiao, D., Bruskiewich, R., Balhoff, J., et al. (2021). The linked data Modeling Language (LinkML): a general-purpose data modeling framework grounded in machine-readable semantics. ICBO 3073, 148–151.
Mui, B. L., Tam, Y. K., Jayaraman, M., Ansell, S. M., Du, X., Tam, Y. Y. C., et al. (2013). Influence of polyethylene glycol lipid desorption rates on pharmacokinetics and pharmacodynamics of siRNA lipid nanoparticles. Mol. Ther. Nucleic Acids 2, 139. doi:10.1038/mtna.2013.66
Murray, D. T., Shin, D. S., Classen, S., Brosey, C. A., and Hura, G. L. (2023). Visualizing and accessing correlated SAXS data sets with Similarity Maps and Simple Scattering web resources. Methods Enzymol. 678, 411–440. doi:10.1016/bs.mie.2022.09.024
Nogueira, S. S., Schlegel, A., Maxeiner, K., Weber, B., Barz, M., Schroer, M. A., et al. (2020). Polysarcosine-functionalized lipid nanoparticles for therapeutic mRNA delivery. ACS Appl. Nano Mater 3 (11), 10634–10645. doi:10.1021/acsanm.0c01834
Novichkov, P. S., Chandonia, J.-M., and Arkin, A. P. (2022). CORAL: a framework for rigorous self-validated data modeling and integrative, reproducible data analysis. GigaScience 11, giac089. doi:10.1093/gigascience/giac089
O’Brien, L. M. N., Costa, A. P., Cebrero, Y. M., Joseph, J., Sarode, A., Zang, N., et al. (2023). Process robustness in lipid nanoparticle production: a comparison of microfluidic and turbulent jet mixing. Mol. Pharm. 20, 4285–4296. doi:10.1021/acs.molpharmaceut.3c00390
Patel, S., Ashwanikumar, N., Robinson, E., Xia, Y., Mihai, C., Griffith, J. P., et al. (2020). Naturally-occurring cholesterol analogues in lipid nanoparticles induce polymorphic shape and enhance intracellular delivery of mRNA. Nat. Comm. 11, 983. doi:10.1038/s41467-020-14527-2
Polack, F. P., Thomas, S. J., Kitchin, N., Absalon, J., Gurtman, A., Lockhart, S., et al. (2020). Safety and efficacy of the BNT162b2 mRNA Covid-19 vaccine. N. Engl. J. Med. 383, 2603–2615. doi:10.1056/NEJMoa2034577
Renaud, N., Geng, C., Georgievska, S., Ambrosetti, F., Ridder, L., Marzella, D. F., et al. (2021). DeepRank: a deep learning framework for data mining 3D protein-protein interfaces. Nat. Commun. 12, 7068. doi:10.1038/s41467-021-27396-0
Rübel, O., Tritt, A., Ly, R., Dichter, B. K., Ghosh, S., Niu, L., et al. (2022). The Neurodata without Borders ecosystem for neurophysiological data science. eLife 11, e78362. doi:10.7554/eLife.78362
Ryals, R. C., Patel, S., Acosta, C., McKinney, M., Pennesi, M. E., and Sahay, G. (2020). The effects of PEGylation on LNP based mRNA delivery to the eye. PLoS One 15 (10), e0241006. doi:10.1371/journal.pone.0241006
Sarode, A., Fan, Y., Byrnes, A. E., Hammel, M., Hura, G. L., Fu, Y., et al. (2022). Predictive high-throughput screening of PEGylated lipids in oligonucleotide-loaded lipid nanoparticles for neuronal gene silencing. Nanoscale Adv. 4 (9), 2107–2123. doi:10.1039/d1na00712b
Sebastiani, F., Yanez Arteta, M., Lerche, M., Porcar, L., Lang, C., Bragg, R. A., et al. (2021). Apolipoprotein E binding drives structural and compositional rearrangement of mRNA-containing lipid nanoparticles. ACS Nano 15 (4), 6709–6722. doi:10.1021/acsnano.0c10064
Sivadasan, D., Sultan, M. H., Alqahtani, S. S., and Javed, S. (2023). Cubosomes in drug delivery–a comprehensive review on its structural components, preparation techniques and therapeutic applications. Biomedicines 11 (4), 1114. doi:10.3390/biomedicines11041114
Sud, M., Fahy, E., Cotter, D., Brown, A., Dennis, E. A., Glass, C. K., et al. (2007). LMSD: LIPID MAPS structure database. Nucleic Acids Res. 35, D527–D532. doi:10.1093/nar/gkl838
Tan, C., Hosseini, S. F., and Jafari, S. M. (2022). Cubosomes and hexosomes as novel nanocarriers for bioactive compounds. J. Agric. Food Chem. 70 (5), 1423–1437. doi:10.1021/acs.jafc.1c06747
Tenchov, R., Bird, R., Curtze, A. E., and Zhou, Q. (2021). Lipid nanoparticles–from liposomes to mRNA vaccine delivery, a landscape of research diversity and advancement. ACS Nano 15 (11), 16982–17015. doi:10.1021/acsnano.1c04996
Thelen, J. L., Leite, W., Urban, V. S., O’Neill, H. M., Grishaev, A. V., Curtis, J. E., et al. (2024). Morphological characterization of self-amplifying mRNA lipid nanoparticles. ACS Nano 18 (2), 1464–1476. doi:10.1021/acsnano.3c08014
Thi, T. T. H., Suys, E. J. A., Lee, J. S., Nguyen, D. H., Park, K. D., and Truong, N. P. (2021). Lipid-based nanoparticles in the clinic and clinical trials: from cancer nanomedicine to COVID-19 vaccines. Vaccines 9 (4), 359. doi:10.3390/vaccines9040359
Uebbing, L., Ziller, A., Siewert, C., Schroer, M. A., Blanchet, C. E., Svergun, D. I., et al. (2020). Investigation of pH-responsiveness inside lipid nanoparticles for parenteral mRNA application using small-angle X-ray scattering. Langmuir 36 (44), 13331–13341. doi:10.1021/acs.langmuir.0c02446
Urimi, D., Hellsing, M., Mahmoudi, N., Söderberg, C., Widenbrig, R., Gedda, L., et al. (2022). Structural characterization study of a lipid nanocapsule formulation intended for drug delivery applications using small-angle scattering techniques. Mol. Pharm. 19 (4), 1068–1077. doi:10.1021/acs.molpharmaceut.1c00648
Wilhelmy, C., Keil, I. S., Uebbing, L., Schroer, M. A., Franke, D., Nawroth, T., et al. (2023). Polysarcosine-functionalized mRNA lipid nanoparticles tailored for immunotherapy. Pharmaceutics 15 (8), 2068. doi:10.3390/pharmaceutics15082068
Yanez Arteta, M., Kjellman, T., Bartesaghi, S., Wallin, S., Wu, X., Kvist, A. J., et al. (2018). Successful reprogramming of cellular protein production through mRNA delivered by functionalized lipid nanoparticles. Proc. Natl. Acad. Sci. U. S. A. 115 (15), E3351-E3360–60. doi:10.1073/pnas.1720542115
Yang, Y., Kozlovskaya, V., Zhang, Z., Xing, C., Zaharias, S., Dolmat, M., et al. (2022). Poly(N-vinylpyrrolidone)-block-Poly(dimethylsiloxane)-block-Poly(N-vinylpyrrolidone) triblock copolymer polymersomes for delivery of PARP1 siRNA to breast cancers. ACS Appl. Bio Mater 5 (4), 1670–1682. doi:10.1021/acsabm.2c00063
Yao, Y., Zhou, Y., Liu, L., Xu, Y., Chen, Q., Wang, Y., et al. (2020). Nanoparticle-based drug delivery in cancer therapy and its role in overcoming drug resistance. Front. Mol. Biosci. 7, 193. doi:10.3389/fmolb.2020.00193
Yin, H., Kanasty, R. L., Eltoukhy, A. A., Vegas, A. J., Dorkin, J. R., and Anderson, D. G. (2014). Non-viral vectors for gene-based therapy. Nat. Rev. Genet. 8, 541–555. doi:10.1038/nrg3763
Yu, H., Angelova, A., Angelov, B., Dyett, B., Matthews, L., Zhang, Y., et al. (2023). Real-time pH-dependent self-assembly of ionizable lipids from COVID-19 vaccines and in situ nucleic acid complexation. Angew. Chem. Int. Ed. 62, e202304977. doi:10.1002/anie.202304977
Zhai, J., Fong, C., Tran, N., and Drummond, C. J. (2019). Non-lamellar lyotropic liquid crystalline lipid nanoparticles for the next generation of nanomedicine. ACS Nano 13 (6), 6178–6206. doi:10.1021/acsnano.8b07961
Keywords: lipid nanoparticles, SAS, small-angle scattering, database, vaccines, structure-activity relationship, web application
Citation: Kim LJ, Shin D, Leite WC, O’Neill H, Ruebel O, Tritt A and Hura GL (2024) Simple Scattering: Lipid nanoparticle structural data repository. Front. Mol. Biosci. 11:1321364. doi: 10.3389/fmolb.2024.1321364
Received: 13 October 2023; Accepted: 19 February 2024;
Published: 22 March 2024.
Edited by:
Zehra Sayers, Sabancı University, TürkiyeReviewed by:
Angelina Angelova, UMR8612 Institut Galien Paris Sud (IGPS), FranceCopyright © 2024 Kim, Shin, Leite, O’Neill, Ruebel, Tritt and Hura. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Greg L. Hura, Z2xodXJhQGxibC5nb3Y=
†These authors have contributed equally to this work and share first authorship