Machine learning approaches for improving atomic force microscopy instrumentation and data analytics

Masud, Nabila; Rade, Jaydeep; Hasib, Md. Hasibul Hasan; Krishnamurthy, Adarsh; Sarkar, Anwesha

doi:10.3389/fphy.2024.1347648

REVIEW article

Front. Phys., 24 September 2024

Sec. Biophysics

Volume 12 - 2024 | https://doi.org/10.3389/fphy.2024.1347648

Machine learning approaches for improving atomic force microscopy instrumentation and data analytics

Nabila Masud¹^†

Jaydeep Rade¹^†

Md. Hasibul Hasan Hasib¹

Adarsh Krishnamurthy^1,2

Anwesha Sarkar¹*

¹Electrical and Computer Engineering, Iowa State University, Ames, IA, United States
²Mechanical Engineering, Iowa State University, Ames, IA, United States

Atomic force microscopy (AFM) is a part of the scanning probe microscopy family. It provides a platform for high-resolution topographical imaging, surface analysis as well as nanomechanical property mapping for stiff and soft samples (live cells, proteins, and other biomolecules). AFM is also crucial for measuring single-molecule interaction forces and important parameters of binding dynamics for receptor-ligand interactions or protein-protein interactions on live cells. However, performing AFM measurements and the associated data analytics are tedious, laborious experimental procedures requiring specific skill sets and continuous user supervision. Significant progress has been made recently in artificial intelligence (AI) and deep learning (DL), extending into microscopy. In this review, we summarize how researchers have implemented machine learning approaches so far to improve the performance of atomic force microscopy (AFM), make AFM data analytics faster, and make data measurement procedures high-throughput. We also shed some light on the different application areas of AFM that have significantly benefited from applications of machine learning frameworks and discuss the scope and future possibilities of these crucial approaches.

1 Introduction

Members of the scanning probe microscopy (SPM) family, such as atomic force microscopy (AFM) [1, 2], scanning tunneling microscopy (STM) [3] and scanning near-field optical microscopy (SNOM) [4] possess sharp probes to scan the surface of the substrate and measure physical quantities, i.e., tip-sample interaction force as a function of tip-sample separation distance and quantum tunneling current, respectively. Single-molecule techniques such as AFM [5, 6], optical tweezers [7, 8], and magnetic tweezers [9, 10] have established themselves as powerful experimental techniques to investigate single molecules at a time producing crucial information about the topographical features, and parameters of binding kinetics [11, 12]. AFM [2, 6, 13–19] has several advantages over other single-molecule techniques as it can perform high-resolution 2.5D imaging and nanomechanical property characterization of hard substrate (graphene) as well as soft substrates (soft biological samples, live cells, proteins, DNA, RNA) without using complicated sample preparation protocols such as freezing, drying, or metal coating. AFM produces 2.5D images by offering an “above-view” perspective that excludes the bottom surface of a 3D structure, as the sample is either immobilized or adhered to the substrate. It measures perpendicular forces to maintain a consistent tip-sample distance throughout the imaging process [20, 21]. It can also measure single molecule forces in the piconewton (pN) range on carbon nanotubes (CNTs), dry powders, and evaporated suspensions, as well as on biomolecules and live cells such as Lostao et al. [22] demonstrated how AFM can advance the inter-biomolecular interactions at nanoscale. Also, many ligand-receptor systems [12, 23–25] have been investigated using AFM to shed light on their single molecule interaction forces [26–28], binding probability, dissociation and association rates, affinity, and receptor density on live cells.

In AFM, a sample is scanned with a probe consisting of a cantilever and a sharp tip. As the tip scans the sample surface, the interaction force between the tip and the sample is measured by monitoring the cantilever’s deflection. A laser beam, directed onto the back of the cantilever, reflects onto a position-sensitive detector (PSD), and any deflection causes a deviation in the laser spot, inducing a voltage difference in the PSD. This voltage is used by a feedback system to generate a topographic image in imaging mode or to calculate the force acting on the cantilever in force spectroscopy mode. Typically, it maintains a constant force (in contact mode) or a constant amplitude (in tapping mode) by adjusting the height of the cantilever or the sample position relative to the cantilever (see Figure 1 middle panel mentioned “AFM”). Image contrast arises from the forces between the tip and sample, influenced by their separation and material properties [33].

Figure 1

Figure 1. Overview of different steps of the AFM enhanced with the help of machine learning. Machine learning models can improve both the experimental and the data analytics steps of AFM by improving (1) sample or scanning site selection ([29] developed ML-guided cell shape detection framework for automatic AFM tip navigation), (2) AFM scanning process ([30] developed a particle smoother (PS) method for Bayesian data assimilation to integrate molecular dynamics simulations with asynchronous HS-AFM movie data), (3) AFM data analytics ([31] proposed using machine learning to identify atomic structures of interfacial water and ionic hydrates based on AFM images) and (4) creating virtual AFM ([32] proposed GPU-accelerated volume rendering technique to generate synthetic AFM images of protein samples).

AFM plays a major role in improving targeted drug delivery approaches [11, 19], helping to select a better binding partner by measuring the binding force and binding probability among different types of ligands and the targeted receptors. However, on both organic and inorganic samples, AFM results suffer from image artifacts due to the morphology of the cantilever tip. The rough, contaminated surface of the substrate can also cause misleading, corrupt data and damage to the cantilever tip. Also, high-resolution imaging and single molecule interaction force measurement with AFM and data analytics require special skill sets of the experimentalists and continuous human supervision, and these are also time-consuming. In state-of-the-art AFM, human experimentalists manually choose a region of interest on the substrate and assess the quality of obtained images based on their experiences, and trial and error. Over the last decade, astounding progress in machine learning, ML (especially deep learning, DL), has started a revolution in imaging of hard substrates, as well as bioimaging and image post-processing. These approaches have improved our structural understanding of biomolecules by enhancing the quality of data and the characterization of their nanomechanical properties. Implementing several machine-learning approaches can benefit all aspects of the AFM measurement process, including probe conditioning, imaging efficacy, manual operation, and selecting regions of interest. Some previous studies aimed to enhance imaging efficiency using inverse imaging of the cantilever tip through features of the sample, manipulation of the AFM probe, and analytical simulations [34, 35]. For example, Montelius et al. [36] introduced an in-situ method for directly observing the actual shape of the AFM tip through inverse AFM mode, significantly improving the accuracy and reliability of AFM imaging. Another way to reduce human operation is SPM automation. Huang et al. [37] have explored examples of automatic sample selection for AFM in specific applications using machine learning. Some research studies have explored examples of automatic sample selection for AFM in specific applications using machine learning. Instead of abiding by traditional rules, ML models extract decision strategies from training data sets. Results/predictions of ML models surpass humans, and these results are often dependent on neural networks (NNs) [38] or convolutional neural networks (CNNs) [39] that are popularly used for processing images.

Machine learning serves not only to assist human experimentalists in effectively selecting scanning locations but also to improve measurement quality, as evidenced by the findings of a recent investigation conducted by Rashidi and Wolkow [40]. This research illustrates how machine learning supports human operators in identifying and correcting specific probe defects within hydrogen-terminated silicon in the context of SPM. In this study, a trained CNN gauged the quality of the obtained SPM images and performed the SPM probe-conditioning protocol afterward. In a different study [41], machine learning approaches were implemented for determining the imaging quality from just a small number of the obtained scan lines, not complete images. Other than the ones referred to above, multiple pioneering research studies were performed to improve scanning probe microscopy using machine learning approaches. For example, Gordon and Moriarty [42] integrated machine learning algorithms into scanning probe microscopy, particularly STM, offering the transformative potential for mapping and manipulating matter at the atomic and molecular levels. Kelley et al. [43] presented a method to enhance SPM acquisition speed using spiral scanning combined with image reconstruction, demonstrating that Gaussian process regression achieves higher accuracy than compressed sensing despite higher computational costs. Ziatdinov et al. [44] investigated the general framework for applying Bayesian active learning methods in scanning probe microscopy (SPM), with a specific focus on piezoresponse force microscopy (PFM) as an illustrative example. Krull et al. [45] utilized the deepSPM machine learning for autonomous SPM operation, integrating sample selection and image assessment via neural networks. However, achieving a fully autonomous and generalized operation of AFM, including handling variations in probe defects and probe conditioning protocols, remains a challenge. AFM represents a more complex and diverse subset of SPM with specific operational demands. Optimal probe selection and conditioning are crucial for achieving high performance in AFM. These processes depend on various physical characteristics, including the tip radius [46], spring constant [47], and cantilever material [48]. These factors must be customized to match the specific requirements of the sample being analyzed. For instance, the tip radius impacts the resolution of imaging, while the spring constant affects force measurements, and the cantilever material determines compatibility with different sample types. Understanding these parameters is essential for selecting the appropriate probe and optimizing its performance. Besides supervised models such as a CNN, machine learning algorithms can also learn via trial and error if a large labeled data set is unavailable. This class of models, known as deep reinforcement learning (RL), has successfully improved SPM measurements by selecting appropriate SPM probe conditioning actions [49].

Machine learning and artificial intelligence (AI) have transformed our way of living and multiple application areas, for example, diagnosing cancer from medical images [50] and recognizing our voices from our phones [51]. AI has not only helped medical science but also propelled the discovery of new compounds. It has also improved image analysis, especially for images obtained from different types of microscopy and spectroscopy, where it supersedes the performance of human experimentalists in pattern recognition. Many of these AI-enabled image analyses are concentrated on data post-processing. In some cases, it is essential to generate a response to the images or data obtained on the fly, for example, by obtaining extra data in important regions of interfaces of materials [52, 53]. The discussions above are critical as they substantially improve the efficiency and throughput of atomic force microscopy (AFM), facilitating more rapid and accurate analyses. Since ML in AFM research and application is still in its early stages, according to our understanding, improving data acquisition and interpretation can significantly accelerate research in materials science, biology, and nanotechnology. This work will provide a foundation for researchers to advance ML-AFM applications further. In this article, we will summarize some of the leading and trailblazing research studies executed so far that implemented machine-learning approaches in atomic force microscopy (see Figure 1) to make it efficient and high-throughput. We will also shed some light on the areas that still need improvement and the open questions and opportunities.

2 Autonomous, effective, and high-throughput AFM using machine learning approaches

We have organized this review in primarily four categories of research studies (as seen in Figure 1) that focus on the use of AI and ML tools in (1) efficient sample or scanning site selection for AFM measurements, (2) improving the AFM scanning process, (3) accelerating AFM data analysis, and (4) building virtual AFM for performing in silico experiments. This section summarizes some pioneering works in these four research directions to articulate how researchers have combined machine learning approaches with AFM to enhance its performance, reduce human effort, and generate novel insights from data.

2.1 AFM sample or scanning site (important feature) selection

In one of the early works for using ML to select important AFM scanning sites, Huang et al. [37] developed an artificial intelligence-based AFM (AI-AFM) that integrates Support Vector Machine (SVM) algorithms with AFM control recognize patterns and identify features (see Figure 2). This AI-AFM also responds to adaptive testing by adding extra probing at key spots, all in real-time and without the involvement of a human, saving human users the hassle. In this vision, an AFM feeds real-time scanning data to a machine learning algorithm. This real-time data was used to pre-train the algorithm for feature recognition and material classification. This study specifically demonstrated how machine learning and artificial intelligence-powered AFM could improve the study of ferroelectric materials and electrochemical systems by implementing feature identification and pattern recognition and retorting to classification using adaptive experimentation. In real-time, this was achieved using additional probing at grain boundaries and critical domain walls (DW) locations. They implemented a ‘support vector machine (SVM)’ based machine learning approach with the ability to perform high-fidelity pixel-by-pixel recognition. Dynamic strain-based scanning probe microscopy was employed to investigate electromechanical coupling at the nm range. Both electrochemical strain microscopy (ESM) and piezoresponse force microscopy (PFM) probed different substrates using a charged conductive probe to measure the local deformation. This study aimed to determine if a trained machine learning algorithm can distinguish between non-ferroelectric and ferroelectric mappings and respond accordingly to perform required extra probing at particular critical locations for future analysis. Following identifying GBs in electrochemical materials and domain walls (DWs) in ferroelectrics using machine learning strategies, additional probing spots were decided at important material interfaces. While CNN relies on full mapping for recognition, SVM can adaptively alter experimental parameters during scanning. This study created an AI-AFM platform that instantly categorizes ferroelectric materials and uses on-the-fly adaptive experimentation to test the specific properties. The proposed system of Huang et al. [37] is capable of autonomously classifying ferroelectric materials with 180° domain walls in real time, recognizing patterns, and identifying features. Figure 2 illustrates this capability, showing amplitude and phase mappings of ferroelectric PMN-Pt single crystals (left panel of Figure 2C) and electrochemical ceria ceramics (right panel of Figure 2C). The system effectively identified domain walls (left panel of Figure 2D) and grain boundaries (right panel of Figure 2D) based on these mappings. The autonomous prediction methods outlined above are not limited to AFM images; they can also be extended to scanning electron microscopy (SEM) images [54] and combinational methods like hybrid approaches that integrate both AFM and SEM images. Li et al. [55] and Crouzier et al. [56] showcased nanoscale mechanical property analysis of coal and nanoparticle dimensional metrology analysis using a hybrid approach combining AFM and SEM images respectively. Zhang and Zhou [54] applied a fully convolutional network (FCN) that replaces the fully connected layer in CNN with extra convolutional layers and classifies every image pixel post-learning, retaining the spatial information of the original image.

Figure 2

Figure 2. A complete framework to identify domain walls (DWs) and grain boundaries (GBs) of ferroelectric materials and electrochemical systems from unknown samples by training real-time data in machine learning: (A, B) Visual representation of feeding amplitude and phase mapping data set to a machine learning algorithm; (C, D) Identify DWs or GBs based on pre-trained data for classifying materials and recognizing features; (E) Additional experiments are performed on the fly close to critical features; (F) Measurements of ferroelectric hysteresis, butterfly loops, and first and second-harmonic piezo responses on DWs and GBs, respectively; (G) Confusion matrix shows how well the classifier performed on a set of 7,174 different dynamic strain-based scanning probe microscopy (ds-SPM) maps of different materials [37].

More recently, Arias et al. [57] proposed that AI-guided automation is used to overcome the challenges posed during the imaging by high-resolution atomic force microscopy (HR-AFM) that are both time-consuming and specialized knowledge-intensive. HR-AFM experiments can be extremely time-consuming, often requiring days to months to gather enough data. This is due to the need for individual imaging of molecules and optimizing this imaging to capture detailed information. Even under ideal conditions, imaging a single molecule can take hours, especially when working to find the best imaging conditions for complex molecules. For a dataset to be statistically relevant, particularly in complex molecular mixtures, it’s necessary to image at least 50 molecules. This article proposed an automated approach using a Python script named Auto-HR-AFM that combines machine learning (ML), artificial intelligence (AI), and HR-AFM. Using ML approaches, the program independently takes HR-AFM images and modifies the imaging environment. A ResNet + feature pyramid network (FPN) architecture, initially trained on the COCO database, forms the basis of the main model. It uses an instance segmentation neural network to identify the features in HR-AFM images and categorizes them into three classes based on the proximity of the probe to the molecule; these classes are too close, too far, or at an ideal distance for imaging the molecules. The instance segmentation model can locate the unique instances of molecules in an image and assign them an appropriate class label. For the segmentation model, they used a Detectron2 [58] architecture that uses feature-pyramid network (FPN) [59] as the backbone to extract feature maps from the input image. Further, a region proposal network (RPN) [60] processed these feature maps to detect the region of interest, which are then fed into the region of interest (ROI) [61] heads to predict the location and the class label of the instances in an image. They generated and labeled 599 HR-AFM images representing 160 different petroleum chemical compounds (divided into 60 images for training and 539 for testing) and achieved $69 %$ accuracy for finding the molecules in the class of ideal distance region. Moreover, if it isn’t ideal distance it labels these molecules either “too far” or “too close”. This information can further be used to move the probe accordingly to collect the optimal images [61].

Specifically, in the context of biological samples, in a recent study [29], our group has addressed one of the yet-to-be-addressed categories: autonavigation of AFM probe towards samples, specifically live cells of desired shapes. Selecting the samples manually and navigating the AFM probe toward the samples is tedious and time-consuming, especially in soft biological samples such as live cells. Biological samples are tedious due to their significantly softer and more fragile nature compared to most non-biological materials. Hence, rough handling during sample selection and probe positioning can easily damage these samples and make them difficult for AFM imaging [62]. Furthermore, the viscoelastic nature of live cells introduces additional challenges. Cells display both viscous and elastic characteristics, allowing them to deform under applied force and gradually return to their original shape, which complicates the positioning of the AFM probe and the imaging process [63, 64]. Additionally, dynamic cellular activities such as blebbing and cell division can swiftly change the cell’s structure and mechanical properties, making it difficult to achieve consistent and precise measurements. These continuous changes necessitate frequent adjustments, further increasing the complexity and duration of AFM imaging [65, 66]. Also, AFM experiments on biomolecules or live cells are time-consuming and low-throughput due to the absence of appropriate tools to select and detect samples automatically and AFM auto-navigation [28]. For measurement of nanomechanical properties [67] of live cells, the usual steps are manual engaging of the AFM cantilever tip [68] on a live cell followed by retraction of the AFM probe and manual re-engaging on a new cell. Our study addressed these limitations by proposing a machine learning-enabled AFM operational framework to make the measurement process fast and save the effort and time of the experts. We proposed an AI-guided AFM navigation framework [29] that includes detection of different cell shapes (round, spindle, polygonal) and AFM stage motor controls for automated navigation of AFM cantilever tip during the AFM measurement (see Figure 3). Cell shape detection was performed using a deep learning model called YOLOv3 [69], yielding the coordinates of cell shapes of interest in the phase contrast images captured via the AFM stage camera. Those coordinates were used for intelligent navigation to the desired cell shape by regulating the AFM stage motors with navigation speed enhancement by 60 $\times$ . We trained the deep neural network on a dataset of phase-contrast images of various kinds of cell shapes (round, polygonal, and spindle) labeled by the expert. Our annotated data, framework, and algorithm are publicly available, propelling future research and helping to interpret the interrelationship between the morphology and behavior of the live cells and AFM biomechanical measurements of cells. The proposed system would be beneficial for researchers to eliminate multiple tedious manual steps and conduct numerous experiments with less expert effort.

Figure 3

Figure 3. AFM probe navigation accelerated using deep learning model-based cell shape identification and localization [29]. (I) We collected 221 images using the camera system available in the AFM. We used the Labelbox platform to annotate 3 cell shapes: round (red boxes), spindle (blue boxes), and polygonal (green boxes). (II) We then fine-tune the widely used DL mode YOLOv3 [69] on the annotated data for cell shape detection. (III) The coordinates detected by the deep learning model are then fed into the AFM navigation control program to guide it to the target cell shape location.

2.2 Improving AFM scanning process

In addition to selecting important scanning sites for AFM, ML can be used to improve the scanning process itself. For example, the work proposed by Vekinis and Constantoudis [70] explains how an artificial neural network (ANN) can determine the impact of an improper geometrical design of the AFM tip on AFM measurements on a rough substrate. To predict the impact, they considered the parameters of the rough sample and the geometry of the AFM tip as input to the ANN. They trained a multi-layer perceptron (MLP), a fully connected style neural network. They compared the impact in terms of a “certainty map” that compared the height values of the simulated AFM image with the original images of the rough substrate. Input to the ANN was a 10-dimensional vector passed through 6 MLP layers with 10, 50, 100, 200, 100, and 50 neurons (number of units in each layer), respectively. The output was a scalar, which was predicted as a certainty metric. The ANN was trained using the ADAM optimizer with a learning rate 0.001 with a weight decay value of 0.001 to prevent the overfitting problem. The model was then trained for 1000 epochs using around 4.3K training samples, and testing was performed on 1.4K samples with the mean absolute percentage error as a loss function. It approximately converged to $\sim$ 0.4%–0.6% for training and testing samples demonstrating a good fit.

Dynamic AFM [71] generates spatially detailed maps of the sample’s mechanical, chemical, and biological properties by indirectly assessing the tip-sample contact force. Analytical and experimental techniques are the two basic categories that correspond to reconstruction techniques in dynamic AFM. Even though they are flexible, analytical techniques can only measure the averaged interaction force, not the instantaneous variations that occur during oscillations. In contrast, experimental techniques either need to measure the experimental transfer function or use specialized harmonic probes. Chandrashekar et al. [72] utilized current data science and machine learning developments to create a unique technique for forecasting the tip-sample interaction forces of dynamic AFM. This technique can measure the elasticity of distinct components in the sample and focus on how much energy is lost during contact (see Figure 4). They demonstrated that their data-driven algorithm could estimate the tip-sample interaction forces on numerically generated data with 90% accuracy. The authors executed multiple experiments on stiffer polystyrene (PS) polymeric samples and softer low-density polyethylene (LDPE) samples with successful prediction of the nanoscale interaction forces with high resolution.

Figure 4

Figure 4. Improvement of AFM scanning process using machine learning approaches. (left) Quantification of single molecule forces utilizing machine learning in dynamic AFM: (A) Experimental data obtained directly from the AFM photodetector. (B) Data was processed via FPGA to create state vector channels. (C) State vectors were used in a sparse identification algorithm to model the system. (D) The model estimated the tip-sample interaction force [72]; (right) Assimilation of asynchronous movie data of high-speed AFM (HS-AFM) with MD simulations using a particle smoother (PS): (E) Asynchronous AFM data acquisition for moving objects on a 3 $\times$ three-pixel area, measuring heights pixel by pixel with a raster scan (red). An image obtained after one frame scan is shown on the right side. (F) Hypothetical synchronous AFM data acquisition, approximating frame image likelihood using the last time point in each frame. (G) Comparison of pseudo-AFM images: (left) nucleosome snapshot from MD trajectory, (center) instantaneous AFM image without time difference, (right) asynchronous AFM image with time difference [30].

With a resolution of roughly 1 nm, high-speed atomic force microscopy (HS-AFM) [5, 73–75] is a helpful instrument for analyzing biomolecular structures and motions at close to their physiological settings. Its drawbacks include the difficulty of clearly seeing three-dimensional structures and the fuzziness of soft biomolecule pictures. To solve these difficulties, computational techniques like Bayesian data assimilation and particle filter simulations combined HS-AFM data with molecular dynamics simulations, addressing problems with molecular motion and data asynchronicity. Kato et al. [30] described the development of a particle smoother (PS) simulation approach (see Figure 4) that took time differences in HS-AFM movie data using the pixel-by-pixel (PBP) method into account. It was compared to a technique that ignored time differences (PS-AAO simulation). In locations with flexible linker DNAs, the PS-PBP approach reduced errors by handling data asynchronicity in HS-AFM measurements. The research found difficulties duplicating dynamic behavior, particularly when a significant molecular motion occurred during the probe scanning time. The study also examined the effect of re-sampling frequency on PS-PBP simulation performance, finding that higher frequencies, especially the 1F re-sampling frequency, produced better outcomes. The authors executed CG-MD simulations of a nucleosome. The simulations worked well with the genuine HS-AFM experimental data, especially for bigger biomolecular complexes.

The inherent components of AFM, including the probe, scanner, electronic circuits, and possible electromagnetic or mechanical noise or improper scanning parameters, can lead to distortion and artifacts in the AFM output. While post-processing filters are available to correct these distortions, it’s essential to comprehend the specific nature of the distortion or artifact to apply the necessary corrections effectively. Kocur et al. [76] presented the CNN-based approach, where the CNN learns from pairs of distorted images and the ground truth image to either remove or suppress the artifacts and produce the filtered image autonomously (without any prerequisite information about the distortions or any other human knowledge). Kocur et al. [76] used ResU-Net [77] architecture for the CNN and trained on the synthetic data. The AFM, LiteScope, was used to analyze the materials. It was designed to work with Scanning Electron Microscopy (SEM) because of its distinctive probe detection method, which combines self-actuating and self-sensing probes. Furthermore, the Akiyama probe [78] was used to obtain topographical information. Synthetic data was utilized for the network to overcome the difficulties associated with acquiring actual AFM training data. A novel dataset was also introduced, and the code has been made openly available to strengthen the method’s reliability in preserving topographical details while mitigating distortions and noise.

2.3 Improving AFM data analysis

There is a significant body of work in leveraging different ML techniques to improve AFM data analysis in terms of both analysis of AFM images as well as AFM force curves. Here, we review some of the major studies in this regard.

2.3.1 Improving AFM image analysis

We have summarized the articles referenced in this section in Table 1 for efficient and easy understanding.

Table 1

Table 1. Overview of different papers on improving AFM image analysis.

We begin with the study presented in Borodinov et al. [79], where the authors introduced a multidomain processing technique based on machine learning and applied it to segment atomically and molecularly resolved atomic force microscopy (AFM) images. This study used tapping mode for AFM imaging of melem on boron nitride and ionic liquid layers constructed on top of graphite as benchmarks for the algorithm. The authors developed a supervised three-step method that was easily adaptable to the characteristics of various images or data sets, even with high noise levels. The steps were to extract structural descriptors from a picture, use dimensionality reduction algorithms, assign pixels to an image, and compute pertinent statistics. This work proposed a robust and scalable method for segmenting AFM images based on pre-selected classification criteria. This study demonstrated the applicability of this method for the segmentation of high-resolution AFM images based on the observed domains’ crystal orientation, automated boundary selection, and gathering pertinent data.

In another study by Oinonen et al. [80], a machine-learning method was developed to accurately forecast electrostatic fields from a collection of typical experimental AFM images. Numerous processes, including catalysis, chemical reactions, and biological activities, heavily rely on the electrostatic characteristics of nanoparticles. Understanding the relationship between molecular functionality and electrostatics provides significant tools for controlling and precisely designing functionality at the nanoscale. High-resolution local potential maps of molecules adsorbed on surfaces were generated using Kelvin probe force microscopy (KPFM). A convolutional neural network (CNN) was used as the foundation for the ED-AFM (Electrostatic Discovery-Atomic Force Microscopy) approach (see Figure 5). This network was trained to connect input from a collection of constant-height AFM images. This approach used only simulated data to evaluate three molecular systems. ED-AFM is an effective instrument for comprehending the characteristics of molecules. The electrostatic (ES) Map descriptor, the vertical component of the electrostatic field over the sample molecule, was translated by the neural network using two sets of AFM pictures as input. For the model’s training, they used a database of tens of thousands of molecular geometries to construct simulated sets of input-output pairings. The trained model can then estimate the sample electric field using experimental AFM images. The CNN architecture used in this work was called Attention-U-Net, which utilized an attention-gate (AG) block in the skip connections. The input to the network was $128 \times 128 \times 6$ AFM images stacked together, and the output was the ES map descriptor. The encoder contained three blocks of 3D convolutional layers and average pooling layers. In the bottleneck, the 2D feature maps were formed by concatenating the 3D feature maps as channel information in the 2D feature maps. The decoder then sampled this bottleneck 2D feature map and fed it into AG. The contribution of AG was to underline the essential features in the map and attenuate the unimportant features. The output was again $128 \times 128 \times 1$ ES map. The total number of parameters in the CNN was 15.6M. The total dataset number was 235K, split into training, validation, and test sets (75%, 10%, and 15%, respectively). The model was trained using the Adam optimizer with a learning rate 0.0001. Mean-squared error (MSE) was used as a loss function for training the network, and the value of MSE after training was $2.17 \times 1 0^{-} 5$ and $2.49 \times 1 0^{-} 5$ and $2.47 \times 1 0^{-} 5$ on training, validation and testing dataset, respectively.

Figure 5

Figure 5. Improvement of AFM data analysis using machine learning approaches: (left) example of improvement of AFM image analysis using machine learning approaches example: a machine learning based method that yields immediate maps of the electrostatic potential directly from atomic force microscopy images with functionalized tips [80]; (right) example of improvement of AFM force curve analysis using machine learning approaches example: Schematic of the few-shot learning algorithm using triplet loss for training AFM force curves [92].

Understanding the different exceptional physical and chemical characteristics of the ionic/solid interface depends on identifying the structural details of ionic hydrates at the atomic level. Identifying hydrogens from AFM images is difficult since they are scarcely discernible. Tang et al. [81] described a transfer learning-based cost-effective technique for determining the atomic structure of interfacial ionic hydrates from AFM images. The neural network (NN) was trained to extract the H-bond network structure information from AFM height images. After that, it was retrained using transfer learning on a modest amount of expensive interfacial Na + hydrates data to achieve effective structural identification. The retrained NN prediction was with an accuracy of 95% for sodium and oxygen and 85% for hydrogen. The authors implemented the 3D U-Net architecture of CNN, popularly used for biomedical image segmentation and various computer vision tasks. It consists of an encoder-decoder part with skip connections going from encoder to decoder for better output representation quality. The network input was a stack of 10 AFM images at different tip-sample distances with the resolution of $128 \times 128$ , and the output was a three-channel image of the same spatial shape as the input. Each channel in the output was a grayscale representation of the atom type and positive and negative charge. The network was first trained on the interfacial water data with a learning rate 0.001 using the Adam optimizer [93] for the first 30 epochs and then reduced to 0.0001 for the following 30 epochs. Then, using the trained parameters, the network was trained further on ${N a}^{+}$ hydrates for 60 more epochs.

Molecular identification based on only AFM images is a challenging problem because of ambiguity of AFM images for different molecules with similar shapes but with different atoms. Carracedo-Cosme et al. [82] proposed to use deep learning-based approach for automatic molecular classification of AFM images. This study used SPMTH-60 datatset (consists of a lot of theoretical constant-height AFM images built from 60 selected flat organic molecules), adding more variability for different molecular orientations. In total, they simulated 483,840 images. With this big dataset, authors initially experimented with two of the prominent DL-based image classification architectures: MobileNetV2 [94] and VGG16 [95]. They observed that these standard models did not perform well and led to overfitting problem. To tackle this challenge, they introduced the ML-AFM model, consisting of convolutional, pooling, dropout, flattening, and fully connected layers. They employed various strategies, including optimizing the number of convolutional layers and filters using concatenation to create alternative paths between input and output layers, integrating dropout layers, and applying regularization within convolutional layers. These strategies collectively enhanced the model’s performance and flexibility on theoretical dataset but does not perform well on the experimental AFM images. For this, they proposed to use the variational autoencoder (VAE) to augment the training dataset to include the images with the similar features as experimental images. With this augmented data, the accuracy on experimental images improved significantly.

The work by Ziatdinov et al. [83] highlights the effectiveness of a deep learning-based workflow in converting high-speed atomic force microscopy (HS-AFM) data into a comprehensive map of time-dependent particle positions and orientations. This, in turn, facilitated the reconstruction of classical two-point correlation-based descriptors like 2D fast Fourier transforms (FFT), correlation functions, and pair distribution functions. Leveraging principal component analysis and non-negative matrix factorization (NMF), this study further explored the dynamic evolution from metastable states toward equilibrium. Additionally, machine learning-based feature extraction was crucial in defining particle neighborhoods and trajectories without imposing physics constraints. This approach enabled the separation of various particle behavior classes and the identification of associated transition probabilities, slow modes, and configurations. As a result, it promoted systematic exploration and predictive modeling of the temporal dynamics of the system. These findings had significant implications for both the analysis of HS-AFM data related to protein self-assembly and the development of a quantitative understanding of the underlying energy landscapes governing this process. Notably, abundant proteins within a single frame offered a robust dataset for training neural networks. Extracted particle descriptors provided insights into individual and collective properties, including positional and angular trajectories, configuration distributions, and transitions between metastable and stable orientations.

Kobayashi et al. [84] highlighted a notable gap in the application of machine learning to enhance the performance of organic photovoltaics (OPVs) related to bulk heterojunction structures. The study investigated the potential of atomic force microscopy (AFM) images as a key input for an ML model for predicting power conversion efficiency (PCE) in polymer: non-fullerene molecular acceptor OPVs. The authors collected AFM images, conducted data preprocessing, and executed a range of image analyses and ML linear regression techniques. Surprisingly, the study revealed that augmenting the model with AFM data alongside chemical structure fingerprints, material properties, and process parameters did not significantly improve accuracy. Furthermore, the application of gray-level co-occurrence matrix (GLCM) and histogram analysis (HA) methods, encompassing parameters like homogeneity, correlation, and skewness, demonstrated the potential for expanding the horizons of image analysis and artificial intelligence within the realm of materials science research. This study demonstrated the importance of addressing bulk heterojunction structures with the ML-driven optimization of OPVs and suggested promising avenues for AI-based approaches in materials science.

More recently, Corrias et al. [85] introduced an open-source tool, AiSurf, developed for automated analysis of atomically resolved images in atomic force microscopy (AFM). The tool extracted unit cells, structural distortions, and primitive lattice vectors from the images utilizing scale-invariant feature transform (SIFT) and algorithms for clustering with minimal to no user interaction. As a filter, a difference of Gaussians was used to preprocess the images before applying SIFT, which acted as a Fourier domain bandpass filter to avoid lower frequency background variations and higher frequency noise. The AiSurf tool was tested on different substrates with atomically resolved AFM images, including oxygen-deficient rutile $T i O_{2}$ , $S r T i O_{3}$ , anatase $T i O_{2}$ , and graphene that delivered excellent results in terms of lattice extraction and identification of defects.

There has also been significant progress in the context of Bio-AFM image analysis and enhancement using machine learning. For example, Zeng et al. [86] proposed a novel approach using multidimensional cell parameters extracted by AFM and machine learning to identify human hepatocytes and hepatoma cells simultaneously. This study used Atomic Force Microscopy (AFM) to analyze the nanofeatures of three different cell lines: normal human hepatic cells (HL-7702), hepatocellular carcinoma cells (SMMC-7721), and Hep G2 cells. The AFM height images revealed that cancer cells were shorter compared to normal human cells, with distinct pseudopodia on the edges. The elastic modulus at the center of cancer cells was smaller than at the edges. However, the differences in these features were not significant, making it challenging to distinguish between cell types based on a single nanoscale parameter. Statistical analysis of the nanofeatures showed overlapping parameter distributions, making it challenging to classify cells accurately using traditional statistical methods. So, this study employed machine learning algorithms, including Gaussian Naive Bayes [96], logistic regression [97], and Support Vector Machine (SVM) [98], to classify and identify cell types based on multiple nanofeatures to overcome these traditional limitations. Among these algorithms, Gaussian Naive Bayes achieved the highest accuracy of $94.69 %$ , while SVM exhibits the lowest accuracy of $91.96 %$ . The confusion matrix, precision, recall, specificity, and F1-score were used to evaluate the classifiers, with Gaussian Naive Bayes performing the best in terms of F1-score and recall.

Multiple physical channels, for example, height, adhesion, viscoelastic losses, stiffness, deformation, etc., can be captured simultaneously during AFM measurements. However, collecting such multidimensional AFM data sets of soft biomolecules, live cells, or tissues could be challenging. Dokukin and Dokukina [87] suggested an innovative, generalized method for reducing the number of dimensions in AFM data when only a few parameters were derived from each AFM map during quantitative nanomechanical mapping [99]. In this study, various types of data were gathered from live cells, and there were a few options for handling training set size. The first option involved rotating and flipping a single-tagged sample image to increase the number of effective samples. The second option involved the usage of the transfer learning technique in reusing models that have already been trained for a base function. However, both approaches could have been more helpful in analyzing AFM data. The dimensionality of data space was significantly decreased by implementing multiple parameters calculated for every AFM map, preventing issues related to applying deep learning techniques. They reduced the dimensionality of AFM data by maintaining a fixed set of parameters with the lowest inter-parameter correlation and the highest segregation power. Machine learning algorithms like Random Forest, which is robust for overfitting, were then utilized, and these algorithms were almost independent of data splitting as a training and testing set. The authors successfully evaluated the performance of this algorithm to detect bladder cancer by using the cells from urine samples. This study achieved the highest accuracy, up to 94%, using 5-cell samples per patient.

Similarly, Sokolov et al. [88] also explored detecting bladder cancer using AFM and ML. Bladder cancer is among the most common cancers and one of the main reasons for cancer-related fatalities. Cystoscopy, tumor excision, and biopsy are the gold standard for diagnosis. Patients’ participation in screening and early detection evaluation programs will increase significantly with the development of low-cost, accurate, effective, and noninvasive tests. A strategy for finding cancer via imaging of bodily fluids—in this case, urine—was described by Sokolov et al. [88]. They used sub-resonant tapping AFM methods for imaging cell surfaces (Peak Force and Ringing mode [100]). The AFM Ringing imaging mode is substantially faster (P $<$ 0.05) and less prone to possible artifacts than the current cystoscopy method for diagnosing bladder cancer. Machine learning techniques have been shown to identify various objects and patterns in medical applications effectively. They evaluated 3 ML models: Random Forests, Extremely Randomized Forests, and Gradient Boosting Trees. Two cohorts were used for training and testing (validation) to avoid unnatural over-training. The data set was randomly divided 1,000 times for each training and testing process. Height and adhesion were the ideal channel-specific characteristics with strong segregation power and low inter-parameter correlation. Keeping the number of surface parameters between 8 and 10 was sufficient for good precision. The percentage split between the training and testing sets had no bearing on the statistical robustness of this outcome. After 1,000 arbitrary diagnosis set selections, the method’s accuracy for detecting cancer was $53 \pm 10 %$ .

In another similar work, Prasad et al. [89] demonstrated the detection of bladder cancer cells extracted from urine samples using AFM adhesion maps. AFM allows visualizing these cell surfaces at the nanoscale level, and AFM imaging of fixed cells can be used to detect all stages of progression toward cancer. This study distinguished between 2 cell lines by analyzing cellular mechanics. It recovered all three physical metrics, such as equilibrium length of the pericellular coat, effective grafting density, and Young’s modulus of the pericellular coat molecules, by indenting cells with an AFM probe. Combining all these physical parameters, the authors could differentiate between HT29 and Csk cells with 94% accuracy. Ringing mode (RM) [100] of AFM allowed physical property mapping of sample surfaces with higher resolution and speed, leveraging advanced processing of the dynamic response of the cantilever post AFM-probe withdrawal. In this investigation, three unique RM channels were measured in addition to the standard height and adhesion channels: RM adhesion, RM restored adhesion, and RM viscoelastic adhesion. They used the Gaussian process classifier for the classification task using RBF kernel [101] and Laplace approximation, and the maximum number of iterations was set to 500. To overcome the issue of acquiring a large number of AFM images and the dimensionality of data space, they reduced each channel of AFM data to 35 surface parameters. They further reduced the number of surface parameters based on the Gini index [102] ranking. The dataset was divided into training and verification (testing) subsets to measure classification accuracy. The addition of RM channels in the classifier improves the accuracy to 94%.

In another cancer detection application, Yu et al. [90] demonstrated a method that combined atomic force microscopy (AFM) imaging with machine learning to identify prostate cancer cells with higher accuracy. The authors used a support vector machine (SVM) [98] for classification and compared it with other classifiers such as stochastic gradient descent (SGD) [103], random forest (RF) [104], and logistic regression (LR) [97], as well as K-nearest neighbors (KNN) [105]. They used an open-source software known as Gwyddion for image processing and the BorutaShap algorithm for feature selection of the final 18 datasets to reduce the dimensionality of data and improve the classification performance of most classifiers. A comparison of four classifiers on the prostate cancer dataset showed that SVM provided the highest accuracy of $82.5 %$ after the feature selection. KNN classifier had the best accuracy of $78.7 %$ without feature selection. However, SVM outperformed it after feature selection was included. SVM and RF classifiers showed the most improvement in classification accuracy after feature selection, and SVM showed an increase of $9.6 %$ at that time. This study demonstrated that AFM cell imaging combined with SVM can accurately identify prostate cancer cells.

Another recent study performed by Raila et al. [91] explored AI-based AFM image analysis for defect detection caused by pore-forming proteins in tethered bilayer membranes (tBLMs) and predicted the electrochemical impedance spectroscopy (EIS) response of such membranes. For defect detection, the authors used convolutional neural network (CNN) architecture called SSD FPN [106], popularly used for object detection tasks. To use this CNN model for defect detection in AFM images, the authors re-trained the model with 15 AFM images consisting of 510 annotated defects. Although the model was trained using a minimal number of images, it was observed that the AI-based prediction of EIS curves matched with manually derived EIS curves. Furthermore, this study demonstrated that employing an automated AI-based algorithm for AFM image analysis enables the generation of EIS spectra predictions. These predictions can serve as valuable tools for evaluating critical physical parameters of tBLMs, including submembrane-specific resistance.

2.3.2 Improving AFM force curve analysis

Compared to ML-enabled analysis of AFM images, the use of ML in analyzing AFM force measurements is a lesser-explored area. However, many opportunities remain, as evidenced by the studies reviewed here. For example, Müller et al. [107] presented a machine learning approach (Python package - nanite) that provided a fully controlled evaluation of biological force-distance (FD) data. In this work, AFM is used to evaluate the mechanical characteristics of individual cells from spinal cord sections of zebrafish. In this regard, preprocessing and fitting force-distance (FD) curves are tedious tasks and require specialists to go through large data sets. This study used ML (specifically, neural networks) to reduce human effort in such tasks. In addition, artifacts seen in nano-indentation studies are typically removed from the analysis manually. However, this process is also tedious for densely sampled FD maps of cell segments. This research presented an innovative approach for automating AFM FD data quality assessment. To assess the quality of AFM FD data, they proposed to use supervised learning that used quality-dependent features, three of which were binary features used for preprocessing and an additional twelve features used for training. They fitted the experimental FD curves with an appropriate model function to generate the data to train the regressor model. They manually rated them, generating around 1132 FD curves from zebrafish spinal cord sections. They extensively compared by training regression models such as the Decision Tree regressor, the linear Support Vector-machine regressor (SVR), AdaBoost, Random Forest, Gradient Tree Boosting, and Extra Trees. They used MSE to compare the performance and observed that the Extra Trees regressor [108] gave the best MSE value, below 1.0, and obtained an accuracy of 87.4%.

One of the critical issues in deploying (advanced) ML tools for AFM FD data is the lack of large data sets for specific problems. In one of our recent works [92], we addressed this problem by building a few-shot learning-based framework for AFM force curve characterization in single-molecule interactions. In measuring single molecule interaction forces and binding kinetics for protein-protein interactions or ligand-receptor interactions, ligand and receptor are attached to the AFM cantilever tip and substrate, respectively, or vice versa. Usually, in AFM force measurement studies, a ligand-functionalized cantilever is brought near the live cells with complementary receptors or functionalized substrates so that the ligand can bind to the receptor. The minimum force needed to break the bond is measured as the unbinding or rupture force. Currently, experimentalists analyze thousands of force curves manually to identify four different types of rupture events: single, double, multiple, and no rupture events (see Figure 6, which is time-consuming. However, building an extensive, annotated data set to train supervised DL models can be difficult. So, we explored sample-efficient deep learning (DL) approaches. It is suitable to do baseline corrections on AFM force curves to highlight the spikes in the data, making the learning of the DL model easier. We introduced a few-shot DL approach for the classification of these force curves (see Figure 6) automatically to prevent the wastage of significant expert effort and time.

Figure 6

Figure 6. Efficient AFM force curve characterization (red arrows and circles show the no rupture, single rupture, two ruptures, and multiple ruptures in the curve) using the few-shot deep learning method [92].

Similar to the studies for bladder cancer detection using AFM imaging discussed earlier, Zhu et al. [109] introduced the idea of using AFM to measure cellular mechanical properties (CMPs) as potential indicators, intending to improve the accuracy of detecting malignant cells in bladder cancer cases. The study mainly aimed to group urothelial tumor cells into four distinct grades of bladder carcinoma cells (G1, G2, G3, and G4) based on their CMPs using machine learning classification algorithms. The AFM indentation technique was used to generate FD curves, from which various CMPs such as elastic modulus, work of adhesion (WoA), cellular membrane tension (CMT), and adhesiveness were extracted. Only FD curves closely matching a theoretical model with a coefficient of determination (R2) exceeding 0.85 were considered for analysis. Laser scanning confocal microscopy (LSCM) and immunofluorescence were employed to visualize and study cellular components like F-actin and cellular nuclei. This research utilized a Genetic Algorithm-Back Propagation Neural Network (GA-BPNN) model to classify bladder carcinoma cells into four grades based on their CMPs. The model was trained using $80 %$ of the data and tested on the remaining $20 %$ , achieving an impressive accuracy rate of $91.25 %$ . Further validation using receiver operating characteristic (ROC) curves and the area under the curve (AUC) values demonstrated effective classification performance, and these results remained consistent when different data subsets and numbers of CMPs were considered.

Finally, we discuss another ML-based AFM force curve analysis work where Weber et al. [110] proposed an unsupervised artificial neural network using self-organizing maps (SOMs) to analyze the mechanical properties of breast cancer cells obtained using AFM. Various substances that impact estrogen receptor signaling were used in the measurement. The overall methodology involved estimating stress relaxation (from force vs distance and force vs time curves) by AFM to calculate the mechanical characteristics of the cells and then fitting a five-element Maxwell model to the stress relaxation curves. The input data for the system consisted of nine properties determined by the stress relaxation fitting sections, which were $δ$ , $E_{1}$ , $E_{2}$ , $E_{α}$ , $E_{inst}$ , $η_{1}$ , $η_{2}$ , $τ_{1}$ , and $τ_{2}$ . Then, the input mechanical properties were used to train SOMs and visualize the results of an artificial neural network (ANN). To achieve better performance and select the map with the lowest quantization error, 50,000 iterations were conducted, with repeated calculations 50 times. To test the trained ANN’s unsupervised capacity to distinguish between control, estrogen-treated, and resveratrol-treated cells, breast cancer cells (MCF-7 cell line) were treated with various chemicals known to affect cell mechanical characteristics. Using this approach, the authors successfully provided insights into the mechanisms underlying mechanical changes observed in the cells.

2.4 Virtual AFM

Finally, in this section, we review how researchers have begun to combine ML approaches with AFM to build virtual AFM simulations. Virtual AFM (Atomic Force Microscopy) refers to simulation or computational models that mimic the operation and results of an actual atomic force microscope. This technology allows researchers and students to understand and predict how an AFM would interact with various samples without physically conducting the experiments. Virtual AFM tools are used for educational purposes, to train users on the intricacies of AFM operations, and for experimental planning, allowing scientists to simulate outcomes before actual laboratory testing. This can save significant time and resources and enhance the understanding of potential results and experimental errors. Additionally, virtual AFM can generate extensive data sets useful for training ML and DL algorithms. Such platforms can help alleviate various scale-up issues related to AFM, as it is typically a slow and low-throughput platform.

We begin our discussion with Alldritt et al. [111], where the authors developed a deep learning framework to predict molecular structures and automated structure discovery. This ML infrastructure matched a set of AFM images using a unique descriptor to characterize the molecular configuration. AFM is an outstanding platform to probe the atomic structure of organic substrates at a single molecule level. However, many previous studies were restricted to mostly planar aromatic molecular structures due to several obstacles in interpreting distorted AFM images of nonplanar molecules. Multiple adsorption configurations of 1S-camphor on Cu (111) dependent on AFM measurements at low temperatures were resolved using this method. The variation of resonant frequency was the measured signal in CO-AFM. This signal was caused by all significant tip-sample interactions and determined by the interaction of the nearest atom of the substrate to the AFM cantilever tip and oxygen of the CO molecule. This study described a method to invert the imaging process and directly produce coordinates of specific atoms from a collection of AFM height images (see Figure 7). A CO molecule linked to a metal tip was used in low-temperature ultrahigh vacuum AFM to capture high-resolution images of molecules on surfaces. Bond order, charge distributions, and the different stages of on-surface chemical reactions can all be studied using this method. In this study, the virtual synthesis of 3D AFM data from 134,000 isolated molecules was performed to identify the molecular geometry of any experimental image. This study proposed a CNN architecture implemented as an encoder-decoder style framework to solve the inverse imaging problem. The input to the CNN was the 3D stack of 10 $128 \times 128$ images, and the output was a single $128 \times 128$ image. They implemented several 3D convolutional layers for the encoding part with LeakyReLU non-linearity and average pooling layer to downsample the input by $8 \times$ factor. For the decoder part, they upsampled the feature map. They processed it further using 2D convolutional layers with LeakyReLU activation, except in the last layer, which had ReLU activation until the expected image resolution was achieved. They used the mean square error as a loss function and Adaptive Moment Estimation (Adam) optimizer with a learning rate of 0.0001 and weight decay of $1 0^{-} 5$ for training the CNN. This computerized pattern-finding AFM approach will make it possible to use high-resolution AFM on many systems.

Figure 7

Figure 7. Simplified demonstration of CO-tip AFM imaging process and inverse imaging process from [111]: (A) molecular geometry X; (B) Displacement of a probe particle (PP) because of the interactions with sample atoms; (C) Relaxed PP integrates forces along its path, which leads to shifts in the measured oscillation frequency, shown in (D). (E) Approximation to a molecular geometry: vdW-spheres, from sample atomic structures and in accord with 3D AFM height images, shown in (G) at various vertical positions; (F) NN as an effective fitting scheme. (H–K) Experiment: The CNN model receives the 3D experimental image set ((H) far, (I) middle, (J) close tip-sample distances) and predicts (K) 2D image (vdW-Spheres) using this information. (L–P) Simulation: The complete molecular structure (M) produced by the best match to the experiment corresponds to the vdW-Spheres representation rendering (L). Inverse image processing predicts AFM images ((N) far, (O) middle, (P) close) of 1S-camphor adsorption on Cu(111).

As discussed, AFM imaging captures the surface topography of the sample under consideration, such as bio-molecules. It outputs the image of the sample’s height map lying on the AFM bed. Molecules of the sample adhere to the AFM bed in random orientation, capturing the height map of the sample in different orientations and giving different views of the sample. This information can be crucial in the problem of determining the 3D structure of bio-molecules such as proteins by combining these multi-view height maps. Although reconstructing the 3D structure of a protein from multi-view AFM images is a non-trivial task, in Rade et al. [32], we proposed to leverage the deep learning methods for this 3D reconstruction task. Training a DL model demands a large data set of AFM images of different proteins, and capturing AFM images for a large number of proteins is a non-pragmatic task. To overcome this, we developed the concept of virtual AFM, a GPU-accelerated volume rendering technique [112] to generate virtual AFM images for the proteins, illustrated in Figure 8. Similar to actual AFM images, these virtual images give the height map of the proteins. With this, the dataset of virtual AFM images for a large number of proteins can be constructed, which can further be used to train the DL models for predicting the 3D structure of proteins.

Figure 8

Figure 8. (left) Volume rendering of 3D voxelized protein structure to obtain AFM-like 2D images. (right) Comparing the virtual and actual AFM images [32].

AFM is well-known for its remarkable atomic-level resolution. However, its operation demands meticulous parameter configuration and operator proficiency, posing a significant challenge. Traditional simulation platforms fail to comprehensively visualize and analyze AFM operations. In response to this challenge, Yang et al. [113] introduced an innovative approach that combines machine learning and virtual reality to replicate the predictive imaging process of AFM in contact mode using the Unity3D platform. This involved utilizing a standard grating as the experimental sample and creating a virtual representation of the AFM probe and optical system. The imaging prediction process revolved around adjusting optical spot positions to account for probe undulations, thereby enabling the prediction of sample topography. Furthermore, the research encompassed data collection, calibration, and correlation tests strengthening the connection between RGB components and the scan data from the virtual system. A range of machine learning algorithms, including XgBoost, Random Forest, AdaBoost, GBDT, CatBoost, and KNN, were employed to construct these predictive models. Among these, XgBoost is the most promising choice due to its optimal balance between accuracy and computational efficiency. The evaluation of the predictive models employed the Structural Similarity Index (SSIM) to gauge the likeness between predicted and actual topographical maps. Notably, CatBoost exhibits superior performance compared to other models in this context.

3 Discussion

This paper presents an overview of ML and AI applications in advancing AFM. We articulate how AI can enhance the performance of AFM in imaging and manipulating various samples, such as biological molecules, material interfaces, and nanoscale structures. We also highlight recent studies implementing machine-learning approaches in AFM for data post-processing and on-the-fly feedback. For example, we review how AI can help segment and classify AFM images, optimize the scanning parameters and tip-sample interactions, reconstruct and correct AFM images, and extract and analyze relevant features and properties from AFM data. The paper also discusses the challenges and opportunities of using AI/ML in AFM, such as improving the accuracy, speed, and reliability of AFM measurements, overcoming the limitations of human operators, and enabling new functionalities and capabilities of AFM. For instance, we review how AI can assist in performing high-resolution and high-speed AFM imaging, control and manipulate single molecules and atoms with AFM, perform multimodal and multifunctional AFM measurements, and discover new phenomena and insights with AFM.

The results from the previous studies summarized in this paper indicate that incorporating ML/AI approaches before, during, and after AFM operation has been extremely helpful in making AFM comparatively high-throughput and efficient. Although significant improvements have been demonstrated in all these studies, there are still opportunities to improve in multiple categories. Machine learning enhanced AFM can become substantially high-throughput if more exploration can be done in the following areas: a) AFM sample or scanning site (important features or region of interest) selection and improving the AFM scanning process in a closed-loop manner with feedback from human experimentalists, b) application of auto-navigation of AFM on soft biological samples especially live cells as well as optimization of sample-specific loading forces, protecting both cantilever tip and samples, (c) high-fidelity 3D reconstruction from AFM images, and (d) fusion of multimodal AFM measurements (height sensor images, peak force error images, nanomechanical property maps, and force curves). Finally, the recent success of virtual AFM frameworks shows the promise of building a comprehensive AI-based digital twin framework for AFM by accurately modeling and characterizing the interactions of the AFM probe with the surface. If successful, such digital twins can be used for various applications, including providing decision support to experimentalists, accelerated data analysis, enhancement and characterization, large-scale synthetic AFM data generation, and designing effective AFM controllers for ultra-precision, high-quality measurements.

Author contributions

NM: Investigation, Writing–original draft, Writing–review and editing. JR: Investigation, Writing–original draft, Writing–review and editing. MH: Investigation, Writing–original draft. AK: Funding acquisition, Supervision, Writing–review and editing. AS: Conceptualization, Funding acquisition, Investigation, Supervision, Visualization, Writing–original draft, Writing–review and editing.

Funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This work was supported by the National Science Foundation (NSF) under grants DMREF-2323715/2323716 (AK) and CPS-2409359 (AS, AK), the ISU College of Engineering Exploratory Research Program (AS, AK) grant, and Translational AI center, ISU SEED grant (AS, AK).

Acknowledgments

We would like to acknowledge Prof. Soumik Sarkar (Department of Mechanical Engineering, Iowa State University) for providing his guidance and valuable feedback on this review article.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Gaczynska M, Osmulski PA. Afm of biological complexes: what can we learn? Curr Opin Colloid and Interf Sci (2008) 13:351–67. doi:10.1016/j.cocis.2008.01.004

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Muller DJ. Afm: a nanotool in membrane biology. Biochemistry (2008) 47:7986–98. doi:10.1021/bi800753x

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Binnig G, Rohrer H. Scanning tunneling microscopy. Surf Sci (1983) 126:236–44. doi:10.1016/0039-6028(83)90716-1