Safe contact-based robot active search using Bayesian optimization and control barrier functions

Vinter-Hviid, Frederik; Sloth, Christoffer; Savarimuthu, Thiusius Rajeeth; Iturrate, Iñigo

doi:10.3389/frobt.2024.1344367

BRIEF RESEARCH REPORT article

Front. Robot. AI, 29 April 2024

Sec. Robotic Control Systems

Volume 11 - 2024 | https://doi.org/10.3389/frobt.2024.1344367

Safe contact-based robot active search using Bayesian optimization and control barrier functions

Frederik Vinter-Hviid¹*

Christoffer Sloth²

Thiusius Rajeeth Savarimuthu^1,2

Iñigo Iturrate²

¹ROPCA ApS, Odense, Denmark
²SDU Robotics, Maersk McKinney Moller Institute, University of Southern Denmark, Odense, Denmark

In robotics, active exploration and learning in uncertain environments must take into account safety, as the robot may otherwise damage itself or its surroundings. This paper presents a method for safe active search using Bayesian optimization and control barrier functions. As robot paths undertaken during sampling are continuous, we consider an informative continuous expected improvement acquisition function. To safely bound the contact forces between the robot and its surroundings, we leverage exponential control barrier functions, utilizing the derivative of the force in the contact model to increase robustness to uncertainty in the contact boundary. Our approach is demonstrated on a fully autonomous robot for ultrasound scanning of rheumatoid arthritis (RA). Here, active search is a critical component of ensuring high image quality. Furthermore, bounded contact forces between the ultrasound probe and the patient ensure patient safety and better scan quality. To the best of our knowledge, our results are both the first demonstration of safe active search on a fully autonomous robot for ultrasound scanning of rheumatoid arthritis and the first experimental evaluation of bounding contact forces in the context of medical robotics using control barrier functions. The results show that when search time is limited to less than 60 s, informative continuous expected improvement leads to a 92% success, a 13% improvement compared to expected improvement. Meanwhile, exponential control barrier functions can limit the force applied by the robot to under 5 N, even in cases where the contact boundary is specified incorrectly by −1 or +4 mm.

1 Introduction

Active learning, along with related problems such as active object exploration and localization, find widespread applications across various robotic domains. From mobile robot navigation (Lluvia et al., 2021) to grasping (Kroemer et al., 2010) to object and scene reconstruction (Jamali et al., 2016), many robotic problems involve variations of the exploration–exploitation dilemma, where an autonomous system must simultaneously regress and optimize an unknown or uncertain function.

Several methods have been proposed to tackle the above problems, including many based on deep learning (Ren et al., 2021). For the low-data regime, which is often the case for online learning in robotics, the most common are methods based on Gaussian process regression (Jamali et al., 2016; Yi et al., 2016; Driess et al., 2019; De Farias et al., 2021) and Bayesian optimization (Nogueira et al., 2016; Goel et al., 2022; Roveda et al., 2022; Berkenkamp et al., 2023). Koopman operators have also been suggested for the case of learning the nonlinear dynamics of a physical system for control (Abraham and Murphey, 2019), but this method is only applicable to learning dynamics and not to a more general function approximation problem.

As active exploration is, by definition, an uncertain process, a key consideration is the assurance of safety despite operating in fully or partially unknown environments. Safety can be handled algorithmically in the learning process by considering, for example, the uncertainty in a Gaussian process regression (Turchetta et al., 2019). However, here we are more interested in methods that consider the safety of the controlled system, that is, the robot, as a query deemed safe algorithmically can still result in an unsafe robot action due to uncertainty in the control and environmental dynamics. Various methods have been proposed, such as safe control based on energy functions (Pandya and Liu, 2022), safe model predictive control (Koller et al., 2018), and Bayesian meta-learning through alternating sequential optimal control problems for exploration and exploitation (Lew et al., 2022). A prominent approach to safety is control barrier functions (CBFs), a framework for ensuring the safety of nonlinear control affine systems (Ames et al., 2014; Romdlony and Jayawardhana, 2014; Ames et al., 2019). The resulting equations can be solved using quadratic programming, making the implementation feasible in control loops. The methodology has been expanded to high relative degree systems, making it generally applicable (Nguyen and Sreenath, 2016). Model uncertainty has been considered in CBFs in terms of parametric uncertainty (Cohen and Belta, 2022) and non-parametric uncertainty (Castañeda et al., 2021). These works ensure that safety can be guaranteed despite having a bounded model uncertainty.

In this paper, we consider the combination of the above two problems, namely, safe active exploration, in the context of an autonomous robot system for ultrasound (US) scanning of rheumatoid arthritis (RA). US imaging is a popular method for diagnosis and monitoring of RA and various other diseases, as it allows inspection of tissue and joint structures at low cost and without the use of radiation. The use of ultrasound does, however, rely on trained professionals with limited availability, so automation of the process has high potential value.

Active exploration is a major component of US scanning for RA and is, in fact, how the procedure is performed manually by trained professionals. During the procedure, the probe is placed in contact with joints in the patient’s hand, allowing the subsurface structures to be inspected on a connected monitor. The professional must ensure that contact between the probe and the patient’s hand is maintained and find an optimal placement in order to capture potential disease activity. When an appropriate area has been found, the probe is kept stationary, and Doppler mode imaging is used to assess the flow of synovial fluid in the joint cavity, indicating inflammation. Correct placement of the probe will ensure high-quality scans. Different approaches have been proposed to optimize the ultrasound image quality. A commonly used method is based on ultrasound confidence maps (Karamalis et al., 2012; Chatelain et al., 2017; Jiang et al., 2020) that can be used to detect if contact is poor and adjustments to the probe position must be made. Force-based methods have also been explored to optimize the image quality by attempting to estimate the surface normal and position of the probe accordingly (Jiang et al., 2021a). As these methods do not consider the structure of tissues in the image but only the overall quality, they are not suited for RA, where the structure is vital. Bayesian optimization based on segmented ultrasound images is proposed by Goel et al. (2022). This allows certain structures in the image to be optimized efficiently, and as the optimization is guided by a statistical model, uncertainties are handled implicitly. While the above methods consider the positioning of the probe, limiting the maximum applied force is also critical. First, the contact force must be kept low to ensure the flow of synovial fluid is not blocked by the pressure (Möller et al., 2017). Second, patients with high levels of disease activity will experience pain from excessive pressure.

To address the above use case, we propose to draw from the wider active exploration and safe control literature. To determine the probing location, we use Bayesian optimization, similar to Goel et al. (2022), but make use of a classifier trained to estimate the US image quality instead of a segmentation model. This is done because assessing the quality of an ultrasound image for RA is a complex task requiring the correct placement, appearance, and relative size of different tissue types. We extend the method by considering the ability to sample images continuously while positioning the probe on the patient, leading to a more efficient search. To establish and maintain contact with the patient’s hand, we use direct force control and add an additional level of safety in the form of a high-order CBF to bound the contact forces below a safe threshold.

Our main contributions are as follows:

• We present, to the best of our knowledge, the first results on CBFs applied to safe force control in the medical domain of a real (not simulated) robotic system in a laboratory setting.

• We present a method for autonomous robot scanning of RA. While research has been performed on autonomous US scanning for various applications such as scanning of vessels (Virga et al., 2016; Jiang et al., 2021b) and breasts (Welleweerd et al., 2020), to the best of our knowledge, no previous work within autonomous scanning for RA exists.

• Additionally, a minor contribution is the extension of informative continuous expected improvement (ICEI) as an acquisition function for Bayesian optimization to the context of robot motion planning.

2 Problem formulation

We consider two sub-problems in the context of ultrasound scanning for RA:

Problem 1. (Active search). This problem is concerned with finding areas of high ultrasound image quality in the presence of uncertain measurements. We consider the search problem as finding a global maximum p_opt of an unknown underlying function f(p) with uncertainty ϵ where p is a subset of the Cartesian space.

p_{opt} = \underset{p \in R^{n}}{\arg} \max f (p) + ϵ . (1)

Function estimates are achieved using a classifier trained on ultrasound images. It is assumed that a starting point is given from external sensing, such as a RGB-D camera combined with a method of estimating joint locations in a RGB image. Solutions for this exist, at least for simple, well-lit scenes (Zhang et al., 2020).

Problem 2. (Bounding the contact force). This problem consists of ensuring bounded contact force with the patient despite uncertainties in the patient’s position. A robot manipulator with an ultrasound probe mounted—as seen in Figure 1—is employed. The robot is controlled at the joint position level. We consider the problem of finding a safe control input u^* given a potentially unsafe nominal control input u_no that minimizes the differences between u^* and u_no. Safety is defined in terms of bounded contact force f_c ≤ f_max, where f_c and f_max are the contact force and maximum allowed force, respectively.

Figure 1

Figure 1. Left, the experimental setup for autonomous US scanning of RA. Right, a flowchart of the proposed system for active search for high-quality US images while achieving bounded contact force.

3 Preliminaries

This section will present theoretical background on Gaussian process regression and Bayesian optimization, which are used as the basis of our solution to Problem 1, and exponential control barrier functions, which are used as part of our solution to Problem 2.

3.1 Gaussian process regression

Gaussian process regression (GPR) allows unknown functions, along with their uncertainty, to be estimated based on limited samples (Rasmussen and Williams, 2005). A Gaussian process (GP) is a distribution over functions and is defined fully by a mean function m(p) and a covariance function k (p, p′). It is assumed that the vector input $p \in R^{d}$ is related to the scalar output y by the function f (⋅) with the following relationship: y = f(p) + ϵ, where $ϵ \sim N (0, σ_{ϵ}^{2})$ . The training input is defined as P with y being the associated targets. Together, this is denoted as $D : {(p_{1}, y_{1}), (p_{2}, y_{2}), \dots, (p_{n}, y_{n})}$ . Observations y and discrete function values to be predicted f_* then follow a multivariate normal distribution:

[\begin{matrix} y \\ f_{*} \end{matrix}] \sim N ([\begin{matrix} m (P) \\ m (P_{*}) \end{matrix}], [\begin{matrix} K_{ϵ} & K (P, P_{*}) \\ k (P_{*}, P) & K (P_{*}, P_{*}) \end{matrix}]), (2)

where $K_{ϵ} = K (P, P) + σ_{ϵ}^{2} I$ , with $σ_{ϵ}^{2}$ being the inherent noise on the measurements—allowing optimization under uncertainty—and

K (P, P^{'}) = [\begin{matrix} k (p_{1}, p_{1}^{'}) & k (p_{1}, p_{2}^{'}) & \dots & k (p_{1}, p_{k}^{'}) \\ k (p_{2}, p_{1}^{'}) & k (p_{2}, p_{2}^{'}) & \dots & k (p_{2}, p_{k}^{'}) \\ ⋮ & ⋮ & ⋱ & ⋮ \\ k (p_{k}, p_{1}^{'}) & k (p_{k}, p_{2}^{'}) & \dots & k (p_{k}, p_{k}^{'}) \end{matrix}] . (3)

The mean vector associated with f_* is then

{\bar{f}}_{*} = m (P_{*}) + K (P_{*}, P) K_{ϵ}^{- 1} (y - m (P)) (4)

and the covariance is given by:

cov (f_{*}) = K (P_{*}, P_{*}) - K (P_{*}, P) K_{ϵ}^{- 1} K (P, P_{*}) . (5)

The estimate of the standard deviation along the estimated function is then the diagonal of cov (f_*) and is denoted σ_f(p). The mean function is often set to zero unless some underlying information about the process exists. The covariance function is generally defined such that points near each other are more correlated. A common choice is the Matern class of covariance functions. This class of functions leads to different levels of differentiability and, thus, smoothness of f(p) based on a smoothness parameter ν. For $ν = \frac{3}{2}$ , the Matern kernel is

k_{ν = \frac{3}{2}} (p, p^{'}) = (1 + \sqrt{3} d (p, p^{'})) \exp (- \sqrt{3} d (p, p^{'})), (6)

where d (p, p′) is the Euclidean distance between p and p′ scaled by a diagonal matrix Θ:

d = (p - p^{'}) Θ^{- 1} (p - p^{'}), (7)

Θ = diag (l_{1}, l_{2}, \dots, l_{d}) = diag (θ), (8)

with $θ = {[\begin{matrix} l_{1} & l_{2} & \dots l_{d} \end{matrix}]}^{T}$ as the length scales of the function. The length scales encode how rapidly f (⋅) changes along each dimension. These are considered free parameters that must be determined from the training data. This is done by maximizing the marginal log-likelihood of the training examples given the parameters.

3.2 Bayesian optimization

Bayesian optimization provides a framework for maximizing functions that are costly to evaluate, where uncertainty is present, and where following the gradient is infeasible (Shahriari et al., 2016). The method consists of three elements: a statistical surrogate model, an acquisition function, and a stopping criterion. A common choice for the surrogate model is Gaussian processes, as defined above. The acquisition function α(⋅) determines where to sample next in the search space based on the surrogate model by balancing exploration and exploitation:

p_{n + 1} = \underset{p \in R^{d}}{\arg} \max α (p; D) . (9)

A common choice for the acquisition function is expected improvement (EI) (Mockus et al., 1978). EI maximizes the expected gain over the previous best sampled score, f^*.

E I (p) ≔ E [\max ((f (p) - f^{*}), 0)], (10)

EI can be calculated analytically as

E I (p) = \{\begin{cases} ({\bar{f}}_{*} (p) - f^{*}) Φ (\frac{{\bar{f}}_{*} (p) - f^{*}}{σ_{f} (p)}) + σ_{f} (p) ϕ (\frac{{\bar{f}}_{*} (p) - f^{*}}{σ_{f} (p)}) & if σ_{f} (p) > 0 \\ 0 & if σ_{f} (p) = 0 \end{cases}, (11)

where Φ(⋅), ϕ(⋅) are the cumulative distribution function (CDF) and the probability density function (PDF), respectively.

3.3 Control barrier functions

Control barrier functions provide a framework for ensuring system safety (Ames et al., 2014). Throughout this section, a nonlinear control affine system of the following form is considered:

\dot{x} = f (x) + g (x) u, (12)

where $f : R^{n} \to R^{n}$ and $g : R^{n} \to R^{n \times m}$ are local Lipschitz functions, with $x \in X \subset R^{n}$ and $u \in U \subset R^{m}$ being the admissible states and control inputs, respectively. The goal of the CBF is to keep the system in a safe region of the state space. This safe region $C$ is defined based on a continuously differentiable function $h : X \subset R^{n} \to R$ as

\begin{align} C & = \{x \in X | h (x) \geq 0\}, \\ \partial C & = \{x \in X | h (x) = 0\}, \\ Int (C) & = \{x \in X | h (x) > 0\} . \end{align} (13)

The system is thus safe if a control law input u leads to the scalar function $h (x) \geq 0 \forall t \land \forall x_{0} \in C$ . Equivalently, this can be described as the set $C$ being rendered forward invariant by the control law. When the relative degree r of the system with respect to h is larger than one, higher-order CBFs can be used to render a set $C$ safe. The relative degree of a continuously differentiable function h(x) with respect to System (12) is the number of times it can be differentiated along (12) before the control input u explicitly shows. Exponential control barrier functions (ECBFs) are a class of higher-order CBFs (Nguyen and Sreenath, 2016). This formulation is based on calculating higher-order time derivatives of h(x). The r^th time derivative is

h^{(r)} (x, u) = L_{f}^{r} h (x) + L_{g} L_{f}^{r - 1} h (x) u, (14)

where $L_{f}^{r} h (x)$ is the r^th Lie derivative of h along f. ECBFs are defined based on a series of integrators that relate h^(r)(x, u) to h(x):

\begin{aligned} \dot{η} (x) & = F η (x) + G μ, \\ h (x) & = C η (x) . \end{aligned} (15)

With

\begin{align} η (x) & = [\begin{matrix} h (x) \\ L_{f} h (x) \\ L_{f}^{2} h (x) \\ ⋮ \\ L_{f}^{r - 1} h (x) \end{matrix}] = [\begin{matrix} h (x) \\ \dot{h} (x) \\ \ddot{h} (x) \\ ⋮ \\ h^{(r - 1)} (x) \end{matrix}], \\ μ & = L_{f}^{r} h (x) + L_{g} L_{f}^{r - 1} h (x) u = h^{(r)} (x, u) \end{align} (16)

and F : r × r, G : r × 1, C : 1 × r

F = [\begin{matrix} 0 & 1 & 0 & \dots & 0 \\ 0 & 0 & 1 & \dots & 0 \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & 0 & \dots & 1 \\ 0 & 0 & 0 & \dots & 0 \end{matrix}], G = [\begin{matrix} 0 \\ 0 \\ ⋮ \\ 0 \\ 1 \end{matrix}], C = [\begin{matrix} 1 & 0 & \dots & 0 . \end{matrix}] (17)

Choosing the state feedback μ = −Kη(x) leads to h(x) being an explicit function of time t and the initial state x₀:

h (x) = C e^{(F - G K) t} η (x_{0}), (18)

where h(x) is then said to be an ECBF—rendering $C$ forward invariant—if there exists a row vector K_γ such that

\sup_{u \in U} [L_{f}^{r} h (x) + L_{g} L_{f}^{r - 1} h (x) u] \geq - K_{γ} η (x), \forall x \in I n t (C) (19)

results in

h (x (t)) \geq C e^{F - G K_{γ}} t η (x_{0}) \geq 0, (20)

where h (x₀) ≥ 0. A K_γ can be found that satisfies this using the pole placement method, with all poles being real and negative. Additionally, the eigenvalues λ_i of system F − GK_γ must adhere to a condition based on the initial conditions x₀:

λ_{i} (F - G K_{γ}) \geq - \frac{{\dot{ν}}_{i - 1} (x_{0})}{ν_{i - 1} (x_{0})}, (21)

where

\begin{align} ν_{0} (x) & = h (x), \\ ν_{1} (x) & = {\dot{ν}}_{0} (x) + s_{1} ν_{0} (x), \\ ⋮ \\ ν_{r} (x) & = {\dot{ν}}_{r - 1} (x) + s_{r} ν_{r - 1} (x), \end{align} (22)

and s₀, … , s_r are defined as the roots of the characteristic polynomial of F − GK_γ.

4 Approach

This section presents our proposed method for safe active search under uncertainty applied to ultrasound image acquisition.

4.1 Ultrasound image quality optimization

We will now present our proposed solution to Problem 1. In regular Bayesian optimization, sampling from anywhere in the search space is assumed to be associated with the same cost. As the robot must move to acquire images, this assumption is not ideal for robotic ultrasound scanning, as switching between sampling points far apart from each other will incur a large time cost while the robot moves between them. To account for this, we consider a non-standard acquisition function that incorporates the ability to sample ultrasound images throughout the path traveled by the robot. Similar methods have been applied to robotic environmental monitoring using unmanned aerial vehicles (Marchant and Ramos, 2014). We parameterize the path as straight lines to ensure the optimization time of the acquisition function is kept low. The path $P$ is defined as

P (u, p_{n + 1} | p_{n}) = (1 - u) p_{n + 1} + u p_{n}, (23)

with u ∈ [0, 1]. The modified acquisition function is denoted α_IC(⋅) and is said to be informative continuous as information throughout the path is considered. It is defined by integrating the base acquisition function over the path and scaling with the inverse of the path length:

\begin{align} α_{I C} (P (u, p_{n + 1} | p_{n})), α) & = \frac{1}{‖ p_{n + 1} - p_{n} ‖_{2}} \\ \int_{0}^{1} α (P (u, p_{n + 1} | p_{n})) d u . \end{align} (24)

The scale is included to account for the time spent moving between p_n and p_n+1. Assuming that the robot moves at a constant velocity, maximizing the modified acquisition function leads to a path that maximizes the integral of the base acquisition function per unit of time spent. The formulation leads to the entire path being considered—and thus less time spent in areas with low uncertainty and/or low expected scores. We employ the EI acquisition function as the base acquisition function and denote the informative continuous version as ICEI. A score y is associated with each position in the search space p using a classifier trained on labeled ultrasound images. The range of the output is [0; 1], with 0 indicating poor quality and 1 good. Defining an appropriate stopping criterion is important to ensure a good area has been found without spending excessive time scanning a patient. A statistical stopping criterion is proposed based on the maximum lower bound over the samples, y. The lower bound for a sample is

y_{1 - δ} = y - Z_{δ} σ_{y}, (25)

where Z is the standard normal distribution, Z_δ is the critical value for a confidence level of 1 − δ, and σ_y is the prediction of standard deviation for the corresponding sample y based on the surrogate model. As the range of possible scores is known beforehand, defining an appropriate threshold is possible.

4.2 Safe direct force control

Our proposed solution to Problem 2 combines a hybrid force/position controller—also referred to as the nominal controller—with a CBF defined such that bounded contact force is ensured. The hybrid force/position controller allows direct control over the force to ensure consistent ultrasound image quality while making positioning possible during contact. The robot considered in this work is controlled at the joint position level. The CBF is defined in Cartesian space and at the acceleration level to allow for constraints on the force. End-effector accelerations are related to joint accelerations using the following kinematic relationship:

\ddot{q} = J^{†} (q) (\ddot{x} - \dot{J} (q) \dot{q}), (26)

where J(q) is the manipulator Jacobian, ^† denotes the Moore–Penrose pseudo inverse, q is the joint configuration, and $\ddot{x}$ is the Cartesian space acceleration. The resulting joint accelerations are then double integrated for the robot’s position controller. Figure 2 illustrates the proposed control architecture.

Figure 2

Figure 2. Block diagram of hybrid force/position controller with CBF to ensure bounded contact force. T_d specifies the desired pose, and h_d specifies the desired wrench.

4.2.1 Hybrid force/position controller

The error between the desired and measured end-effector wrench is denoted Δh. Similarly, the positional error used is denoted Δx; note that the orientational error is obtained using quaternions. The Cartesian space is divided such that force control is applied in some subset of the space, and position control is applied in the remainder of the space (Siciliano et al., 2008). New variables are defined to achieve this:

\underset{m \times 1}{Δ h^{*}} = \underset{m \times 6}{S_{f}^{†}} \underset{6 \times 1}{Δ h}, (27)

\underset{n \times 1}{Δ x^{*}} = \underset{n \times 6}{S_{p}^{†}} \underset{6 \times 1}{Δ x}, (28)

where $S_{p}^{†}$ and $S_{f}^{†}$ select the position-controlled and force-controlled subsets, respectively. Force control is applied along the length of the probe, defined as the z-axis, while positional control is used for the remaining degrees of freedom (DOFs). The force controller acting on a subset of the Cartesian space is then

{\bar{a}}_{f} = K_{p}^{f} S_{f}^{†} Δ h - K_{d}^{f} S_{f}^{†} {\dot{x}}_{e}, (29)

where ${\dot{x}}_{e}$ is the end-effector velocity, ${\bar{a}}_{f}$ is the resulting acceleration reference, and $K_{p}^{f}, K_{d}^{f}$ are diagonal m × m positive definite gain matrices. Note that ${\dot{x}}_{e}$ is used over the more natural choice of using the end-effector wrench derivative ${\dot{h}}_{e}$ . This is a method of avoiding the problem of taking the time derivative of the often highly noisy force measurements (Siciliano et al., 2008). The position controller is defined similarly as

{\bar{a}}_{p} = K_{p}^{p} S_{p}^{†} Δ x - K_{d}^{p} S_{p}^{†} {\dot{x}}_{e}, (30)

with ${\bar{a}}_{p}$ as the acceleration reference, and $K_{p}^{p}, K_{d}^{p}$ are n × n positive definite gain matrices. The full acceleration of the nominal controller is then

{\ddot{\bar{x}}}_{n o} = S_{p} {\bar{a}}_{f} + S_{p} {\bar{a}}_{p} . (31)

4.2.2 CBF for bounded contact force

The acceleration reference from the hybrid force/position control is modified by a control barrier function to ensure bounded contact force. This requires the definition of a control affine model relating commanded end-effector accelerations to contact force. This is defined as being one-dimensional, only considering the force from movement along the z-axis, thus neglecting friction in the axes of motion. This is a valid assumption, as the US gel placed on the patient’s hand while scanning will render friction between the probe and hand negligible. The contact force is modeled using the Kelvin–Voigt model as

f_{c} = k (z - z_{r}) + b \dot{z}, (32)

where f_c is the contact force, k and b are positive scalars, and z_r is the position of the contact surface.

A second-order system is used to model the inner position control loop and account for the actuator dynamics. The relationship between position reference $\bar{z}$ and actual position z is modeled by a second-order system:

\ddot{z} = ω_{n}^{2} (\bar{z} - z) + 2 ζ ω_{n} \dot{z}, (33)

where ζ is the damping factor and ω_n is the undamped natural frequency. The control input to the CBF from the nominal force controller is defined at the acceleration level $u = \ddot{\bar{z}}$ . To relate this to $\bar{z}$ , it is double integrated:

\bar{Z} (s) = \frac{1}{s^{2}} \ddot{\bar{Z}} (s) . (34)

The full control affine model is then

\begin{gathered} x = [\begin{matrix} f_{c} \\ z \\ \dot{z} \\ \bar{z} \\ \dot{\bar{z}} \end{matrix}] = [\begin{matrix} x_{1} \\ x_{2} \\ x_{3} \\ x_{4} \\ x_{5} \end{matrix}], u = \ddot{\bar{z}} \\ [\begin{matrix} {\dot{x}}_{1} \\ {\dot{x}}_{2} \\ {\dot{x}}_{3} \\ {\dot{x}}_{4} \\ {\dot{x}}_{5} \end{matrix}] = [\begin{matrix} k x_{3} + b (ω_{n}^{2} (x_{4} - x_{2}) + 2 ζ ω_{n} x_{3}) \\ x_{3} \\ ω_{n}^{2} (x_{4} - x_{2}) + 2 ζ ω_{n} x_{3} \\ x_{5} \\ 0 \end{matrix}] + [\begin{matrix} 0 \\ 0 \\ 0 \\ 0 \\ 1 \end{matrix}] u . \end{gathered} (35)

Note that this utilizes the derivative of the force, such that the method does not directly depend on z_r. However, it will lead to an overly conservative barrier function outside of contact, as the non-contact situation is not modeled. The barrier function is defined as

h (x) = f_{\max} - f_{c}, (36)

where f_max is the maximum allowable force. This leads to a relative degree of three for the system, and an ECBF is therefore employed. The ECBF is defined as a constraint to a quadratic program (QP). The objective function is defined as the squared error between the nominal control input u_no and the resulting input u

| u - u_{no} |^{2} = u^{2} - 2 u_{no} u + u_{no}^{2} . (37)

This ensures the nominal control is followed whenever possible, and the ECBF only limits the input whenever the system is near the boundary of the safe set. The resulting QP is then

\begin{aligned} u^{*} (x, u_{no}) = \underset{u \in R}{\arg} \min & u^{2} - 2 u_{no} u \\ s . t . & L_{f}^{r} h (x) + L_{g} L_{f}^{r - 1} h (x) u + K_{γ} η (x) \geq 0, \end{aligned} (38)

where u^* is the modified control input and u_no is the nominal control input from the hybrid force/position controller.

5 Results

In this section, results for the image quality optimization method and safe force controller are presented. The standard EI acquisition function is compared with the proposed ICEI variant. The addition of a CBF for keeping bounded contact force is evaluated under uncertainty in the estimate of surface location z_r.

5.1 Comparing image quality optimization methods

The ultrasound image quality optimization approach assumes a starting position is given, and predetermined bounds of the search space are defined in relation to this. The search was performed over two DOFs such that the input to the Bayesian optimizer was $p = {[x, R_{x}]}^{T}$ , where R_x is rotation about the x-axis. The inherent noise of the measurements was set to σ_ϵ = 0.1. Bayesian optimization was implemented using BoTorch (Balandat et al., 2020). The informative continuous extension was implemented in PyTorch, estimating the integral using a discrete sum. Our proposed method for optimizing ultrasound image quality has been evaluated in a simulated setting. The simulation is based on data gathered from scanning 32 joints across four test subjects. The search space was sampled in a grid using nearest-neighbor interpolation to construct an approximated ground truth. The stopping criterion was set to a maximum lower bound of 0.9 with a confidence level of 95%. Three methods were tested: random sampling, EI, and ICEI. The search space was normalized such that the range was equal for the two DOFs. A distance budget of 10 times the length of the search space was given. In order to simulate the ability to sample continuously, a sample was taken for every 4% of the search space traveled. Based on this, random sampling, EI, and ICEI resulted in 60.7%, 67.8%, and 75.0% success rates, respectively, with mean distance traveled being 8.65, 8.13 and 7.64 times the length of the normalized search space respectively. Figure 3 shows the search patterns resulting from the different methods. Random sampling is, as expected, inefficient. The tendency of ICEI to cause less overlap of paths compared to EI is clear.

Figure 3

Figure 3. Ground truth and results for random sampling, EI, and ICEI are shown. The predictions of the Gaussian process regression are displayed with the samples and the location of the maximum lower bound.

EI and ICEI were also compared in a live test. In these tests the search was extended such that $p = {[x, R_{x}, R_{y}, R_{z}]}^{T}$ with a range of [−7.5mm, 7.5 mm] for x and [−4°, 4°] for the rotational DOFs. The test compared the results of searching using EI and ICEI. For each test, the methods were started at the same position. The methods were run until the stopping criterion was reached. When this occurred, the probe was positioned at the maximum lower bound, and a measurement was taken to ensure the success criterion actually had been reached. This cannot be guaranteed due to uncertainties in positioning. If not, the search was continued. If a solution had not been found within 60 s, it was considered a failed scan. Fourteen joints were scanned. EI lead to a success rate of 79% and ICEI to a rate of 92%. The mean recorded time for optimizing the acquisition function during this test was 0.10 ± 0.06 s for EI and 0.14 ± 0.10 s for ICEI. The training time for the Gaussian process using maximum log-likelihood was 0.16 ± 0.08 s. Even with the ICEI acquisition method, querying a new sample point only took around 0.3 s

5.2 Evaluation of safe force controller

A KUKA AG IIWA 7 R800 was used to evaluate the safe controller with the fast research interface (FRI) for communication with the robot through the IIWA ROS package (Chatzilygeroudis et al., 2019). The communication rate with the inner joint position controller was set to 200 Hz. The KYOTO KAGAKU Rheumatism Hand Phantom was used with parameters estimated to $\hat{k} = 1285$ and $\hat{b} = 15$ . For the inner joint position controller, the following parameters were found: ${\hat{ω}}_{n} = 31.25$ and $\hat{ζ} = 1$ ; it is not possible to change these parameters, as it is an inbuilt controller. The parameters of the joint controller were estimated by minimizing the sum of squared errors between the model and measurements taken during an initial experiment where the robot was moved into contact with the hand phantom.

The parameter K_γ used in the safety filter must be defined such that the poles of System (20) are negative and real and adhere to the constraints on initial conditions (21). Pole placement within these constraints affects how conservative the barrier function is and will always ensure safety. The following poles p were utilized in the experiments: $p = [\begin{matrix} - 3 & - 6 & - 9 \end{matrix}]$ . A maximum force of f_max = 5N was specified.

The nominal force control parameters where set to $K_{p}^{f} = 0.001$ and $K_{d}^{f} = 1$ . The controller was implemented in Python using CVXOPT (Andersen et al., 2020) to solve the ECBF QP problem. The end-effector wrench was estimated from the robot’s joint torque sensors using the following kinematic relationship:

h_{e} = {(J {(q)}^{T})}^{†} τ_{external}, (39)

where τ_external are the measured external joint torques available through the FRI. Figure 4 illustrates the invariance of the proposed method to the contact surface point in the simulation and when running on the robot and shows that the ECBF is able to bound the contact forces below the specified threshold.

Figure 4

Figure 4. Experiments in simulation (left column) and on the robot (right column) of the robot establishing contact with the environment under initial condition z₀ relative to the surface position z_r. The nominal control alone (denoted NOM) does not bound the contact force during contact establishment and leads to varying violations depending on the initial condition. The addition of the proposed CBF ensures contact force is bounded below 5N for varying initial conditions ranging from starting 1 mm (z₀ | = z_r − | 1 mm) inside the contact to starting 4 mm above (z₀ | = z_r + | 4 mm) the contact.

6 Discussion

Our methodology is applicable to other robotic applications involving safety-constrained active exploration where contact between the robot and an unknown environment is concerned, such as tactile exploration and manipulation. Our proposed formulation for search using Bayesian optimization with ICEI can help model the kind of sampling scheme applicable to robot tactile exploration, that is, where the full travel path of the robot end-effector must be considered. Our approach to the safe bounding of the contact forces using ECBFs is applicable in the case of robots in contact with the environment where force feedback is available. By directly measuring the contact force and considering its first derivative in the contact model, we increase the system’s robustness to uncertainty in the contact boundary, that is, uncertainty in the switching of the environment dynamics.

However, our method has a number of limitations in real-world applications. Although our CBF formulation is robust to uncertainty in the contact boundary, it is not robust to a time-varying boundary. In an ultrasound scanning application, this might occur if the patient, for example, lifted their hand during the scan. In this case, knowledge of the derivative of the contact location would be needed to guarantee safety. Similarly, the Bayesian optimization approach assumes a static function, that is, that the patient’s hand does not move. Nevertheless, small motions can be accounted for by increasing the uncertainty term σ_ϵ.

7 Conclusion

Methods for safe active exploration of an unknown environment have been presented in this article. Active search under uncertainty is performed using Bayesian optimization, and safety is achieved through the use of control barrier functions. The approach has been applied to autonomous ultrasound scanning of rheumatoid arthritis, which involves finding areas of high ultrasound image quality by positioning the ultrasound probe on the patient. The method has been demonstrated in both a simulated and real setting. An extension to the standard expected improvement, referred to as informative continuous expected improvement, has been proposed. The method has been shown to increase the success rate from 79% to 92%, given a limited time budget of 60 s. Safety in the context of autonomous ultrasound scanning is defined in terms of keeping contact force bounded. We have shown both in simulation and on a robot that, by using a higher-order control barrier function, we are able to bound contact forces during contact establishment despite incorrectly specifying the contact boundary by −1 or +4 mm.

Data availability statement

The raw data supporting the conclusion of this article will be made available by the authors, without undue reservation.

Author contributions

FV-H: writing–original draft and writing–review and editing. II: writing–original draft and writing–review and editing. CS: writing–review and editing and writing–original draft. TS: writing–review and editing and writing–original draft.

Funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. The work undertaken in this paper was supported by the Odense Robotics Control and Learning of Contact Transitions (CoLeCT) project, funded by the Danish Ministry of Higher Education and Science, and by the SDU I4.0Lab at the University of Southern Denmark.

Acknowledgments

The authors would also like to thank ROPCA ApS for allowing them to use their robot platform, ARTHUR, for data acquisition.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Abraham, I., and Murphey, T. D. (2019). Active learning of dynamics for data-driven control using koopman operators. IEEE Trans. Robotics 35, 1071–1083. doi:10.1109/tro.2019.2923880

CrossRef Full Text | Google Scholar

Ames, A. D., Coogan, S., Egerstedt, M., Notomista, G., Sreenath, K., and Tabuada, P. (2019). “Control barrier functions: theory and applications,” in 2019 18th European Control Conference (ECC), Naples, Italy, 25-28 June 2019 (IEEE), 3420–3431. doi:10.23919/ECC.2019.8796030

CrossRef Full Text | Google Scholar

Ames, A. D., Grizzle, J. W., and Tabuada, P. (2014). “Control barrier function based quadratic programs with application to adaptive cruise control,” in 53rd IEEE Conference on Decision and Control, Los Angeles, CA, USA, 15-17 December 2014 (IEEE), 6271–6278.

CrossRef Full Text | Google Scholar

Andersen, M., Dahl, J., and Vandenberghe, L. (2020). CVXOPT: convex optimization.

Google Scholar

Balandat, M., Karrer, B., Jiang, D. R., Daulton, S., Letham, B., Wilson, A. G., et al. (2020). BoTorch: a framework for efficient monte-carlo bayesian optimization. Adv. Neural Inf. Process. Syst. 33.

Google Scholar

Berkenkamp, F., Krause, A., and Schoellig, A. P. (2023). Bayesian optimization with safety constraints: safe and automatic parameter tuning in robotics. Mach. Learn. 112, 3713–3747. doi:10.1007/s10994-021-06019-1

PubMed Abstract | CrossRef Full Text | Google Scholar

Castañeda, F., Choi, J. J., Zhang, B., Tomlin, C. J., and Sreenath, K. (2021). “Pointwise feasibility of Gaussian process-based safety-critical control under model uncertainty,” in 2021 60th IEEE Conference on Decision and Control (CDC) (IEEE), 6762–6769.

Google Scholar

Chatelain, P., Krupa, A., and Navab, N. (2017). Confidence-Driven control of an ultrasound probe. IEEE Trans. Robotics 33, 1410–1424. doi:10.1109/TRO.2017.2723618

CrossRef Full Text | Google Scholar

Chatzilygeroudis, K., Mayr, M., Fichera, B., and Billard, A. (2019). Iiwa_ros: a ROS stack for KUKA’s IIWA robots using the fast research interface

Google Scholar

Cohen, M. H., and Belta, C. (2022). “High order robust adaptive control barrier functions and exponentially stabilizing adaptive control lyapunov functions,” in 2022 American Control Conference (ACC), Atlanta, GA, USA, 08-10 June 2022 (IEEE), 2233–2238. doi:10.23919/ACC53348.2022.9867633

CrossRef Full Text | Google Scholar

De Farias, C., Marturi, N., Stolkin, R., and Bekiroglu, Y. (2021). Simultaneous tactile exploration and grasp refinement for unknown objects. IEEE Robotics Automation Lett. 6, 3349–3356. doi:10.1109/lra.2021.3063074

CrossRef Full Text | Google Scholar

Driess, D., Hennes, D., and Toussaint, M. (2019). “Active multi-contact continuous tactile exploration with Gaussian process differential entropy,” in2019 International Conference on Robotics and Automation (ICRA), Montreal, QC, Canada, 20-24 May 2019 (IEEE), 7844–7850.

CrossRef Full Text | Google Scholar

Goel, R., Abhimanyu, F., Patel, K., Galeotti, J., and Choset, H. (2022). “Autonomous ultrasound scanning using bayesian optimization and hybrid force control,” in 2022 International Conference on Robotics and Automation (ICRA), Philadelphia, PA, USA, 23-27 May 2022 (IEEE), 8396–8402. doi:10.1109/ICRA46639.2022.9812410

CrossRef Full Text | Google Scholar

Jamali, N., Ciliberto, C., Rosasco, L., and Natale, L. (2016). “Active perception: building objects’ models using tactile exploration,” in 2016 IEEE-RAS 16th International Conference on Humanoid Robots (Humanoids), Cancun, Mexico, 15-17 November 2016 (IEEE), 179–185.

Google Scholar

Jiang, Z., Grimm, M., Zhou, M., Esteban, J., Simson, W., Zahnd, G., et al. (2020). Automatic normal positioning of robotic ultrasound probe based only on confidence map optimization and force measurement. IEEE Robotics Automation Lett. 5, 1342–1349. doi:10.1109/LRA.2020.2967682

CrossRef Full Text | Google Scholar

Jiang, Z., Grimm, M., Zhou, M., Hu, Y., Esteban, J., and Navab, N. (2021a). Automatic force-based probe positioning for precise robotic ultrasound acquisition. IEEE Trans. Industrial Electron. 68, 11200–11211. doi:10.1109/TIE.2020.3036215

CrossRef Full Text | Google Scholar

Jiang, Z., Li, Z., Grimm, M., Zhou, M., Esposito, M., Wein, W., et al. (2021b). Autonomous robotic screening of tubular structures based only on real-time ultrasound imaging feedback. arXiv:2011.00099.

Google Scholar

Karamalis, A., Wein, W., Klein, T., and Navab, N. (2012). Ultrasound confidence maps using random walks. Med. Image Anal. 16, 1101–1112. doi:10.1016/j.media.2012.07.005

PubMed Abstract | CrossRef Full Text | Google Scholar

Koller, T., Berkenkamp, F., Turchetta, M., and Krause, A. (2018). “Learning-based model predictive control for safe exploration,” in 2018 IEEE conference on decision and control (CDC), Miami, FL, USA, 17-19 December 2018 (IEEE), 6059–6066.

CrossRef Full Text | Google Scholar

Kroemer, O. B., Detry, R., Piater, J., and Peters, J. (2010). Combining active learning and reactive control for robot grasping. Robotics Aut. Syst. 58, 1105–1116. doi:10.1016/j.robot.2010.06.001

CrossRef Full Text | Google Scholar

Lew, T., Sharma, A., Harrison, J., Bylard, A., and Pavone, M. (2022). Safe active dynamics learning and control: a sequential exploration–exploitation framework. IEEE Trans. Robotics 38, 2888–2907. doi:10.1109/tro.2022.3154715

CrossRef Full Text | Google Scholar

Lluvia, I., Lazkano, E., and Ansuategi, A. (2021). Active mapping and robot exploration: a survey. Sensors 21, 2445. doi:10.3390/s21072445

PubMed Abstract | CrossRef Full Text | Google Scholar

Marchant, R., and Ramos, F. (2014). “Bayesian Optimisation for informative continuous path planning,” in 2014 IEEE International Conference on Robotics and Automation (ICRA), Hong Kong, China, 31 May 2014 - 07 June 2014 (IEEE), 6136–6143. doi:10.1109/ICRA.2014.6907763

CrossRef Full Text | Google Scholar

Mockus, J., Tiesis, V., and Zilinskas, A. (1978). The application of Bayesian methods for seeking the extremum. Towards Glob. Optim. 2, 2.

Google Scholar

Möller, I., Janta, I., Backhaus, M., Ohrndorf, S., Bong, D. A., Martinoli, C., et al. (2017). The 2017 EULAR standardised procedures for ultrasound imaging in rheumatology. Ann. Rheumatic Dis. 76, 1974–1979. doi:10.1136/annrheumdis-2017-211585

CrossRef Full Text | Google Scholar

Nguyen, Q., and Sreenath, K. (2016). “Exponential Control Barrier Functions for enforcing high relative-degree safety-critical constraints,” in 2016 American Control Conference (ACC), Boston, MA, USA, 06-08 July 2016 (IEEE), 322–328. doi:10.1109/ACC.2016.7524935

CrossRef Full Text | Google Scholar

Nogueira, J., Martinez-Cantin, R., Bernardino, A., and Jamone, L. (2016). “Unscented bayesian optimization for safe robot grasping,” in 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Daejeon, Korea, 09-14 October 2016 (IEEE), 1967–1972.

CrossRef Full Text | Google Scholar

Pandya, R., and Liu, C. (2022). “Safe and efficient exploration of human models during human-robot interaction,” in 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Kyoto, Japan, 23-27 October 2022 (IEEE), 6708–6715.

CrossRef Full Text | Google Scholar

Rasmussen, C. E., and Williams, C. K. I. (2005). Gaussian processes for machine learning. United States: The MIT Press. doi:10.7551/mitpress/3206.001.0001

CrossRef Full Text | Google Scholar

Ren, P., Xiao, Y., Chang, X., Huang, P.-Y., Li, Z., Gupta, B. B., et al. (2021). A survey of deep active learning. ACM Comput. Surv. (CSUR) 54, 1–40. doi:10.1145/3472291

CrossRef Full Text | Google Scholar

Romdlony, M. Z., and Jayawardhana, B. (2014). “Uniting control lyapunov and control barrier functions,” in 53rd IEEE Conference on Decision and Control, Los Angeles, CA, USA, 15-17 December 2014 (IEEE), 2293–2298. doi:10.1109/CDC.2014.7039737

CrossRef Full Text | Google Scholar

Roveda, L., Maroni, M., Mazzuchelli, L., Praolini, L., Shahid, A., Bucca, G., et al. (2022). Robot end-effector mounted camera pose optimization in object detection-based tasks. J. Intelligent Robotic Syst. 104, 16. doi:10.1007/s10846-021-01558-0

CrossRef Full Text | Google Scholar

Shahriari, B., Swersky, K., Wang, Z., Adams, R. P., and de Freitas, N. (2016). Taking the human out of the loop: a review of bayesian optimization. Proc. IEEE 104, 148–175. doi:10.1109/JPROC.2015.2494218

CrossRef Full Text | Google Scholar

Siciliano, B., Sciavicco, L., Villani, L., and Oriolo, G. (2008). Robotics: modelling, planning and control. 1st edn. Incorporated: Springer Publishing Company.

Google Scholar

Turchetta, M., Berkenkamp, F., and Krause, A. (2019). Safe exploration for interactive machine learning. Adv. Neural Inf. Process. Syst. 32.

Google Scholar

Virga, S., Zettinig, O., Esposito, M., Pfister, K., Frisch, B., Neff, T., et al. (2016). “Automatic force-compliant robotic ultrasound screening of abdominal aortic aneurysms,” in 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Daejeon, Korea, 09-14 October 2016 (IEEE), 508–513. doi:10.1109/IROS.2016.7759101

CrossRef Full Text | Google Scholar

Welleweerd, M., de Groot, A., de Looijer, S., Siepel, F., and Stramigioli, S. (2020). “Automated robotic breast ultrasound acquisition using ultrasound feedback,” in 2020 IEEE International Conference on Robotics and Automation (ICRA), Paris, France, 31 May 2020 - 31 August 2020 (IEEE), 9946–9952. doi:10.1109/ICRA40945.2020.9196736

CrossRef Full Text | Google Scholar

Yi, Z., Calandra, R., Veiga, F., van Hoof, H., Hermans, T., Zhang, Y., et al. (2016). “Active tactile object exploration with Gaussian processes,” in 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Daejeon, Korea, 09-14 October 2016 (IEEE), 4925–4930.

CrossRef Full Text | Google Scholar

Zhang, F., Bazarevsky, V., Vakunov, A., Tkachenka, A., Sung, G., Chang, C.-L., et al. (2020). MediaPipe hands: on-device real-time hand tracking. arXiv. doi:10.48550/arXiv.2006.10214

CrossRef Full Text | Google Scholar

Keywords: control barrier function, Bayesian optimization, active search, autonomous ultrasound scanning, robot force control

Citation: Vinter-Hviid F, Sloth C, Savarimuthu TR and Iturrate I (2024) Safe contact-based robot active search using Bayesian optimization and control barrier functions. Front. Robot. AI 11:1344367. doi: 10.3389/frobt.2024.1344367

Received: 25 November 2023; Accepted: 07 March 2024;
Published: 29 April 2024.

Edited by:

Mohammadreza Davoodi, University of Memphis, United States

Reviewed by:

Loris Roveda, Dalle Molle Institute for Artificial Intelligence Research, Switzerland
Yazdan Batmani, University of Kurdistan, Iran
Hamid Hafezi, Herff College of Engineering, University of Memphis, Memphis, United States in collaboration with reviewer YB

Copyright © 2024 Vinter-Hviid, Sloth, Savarimuthu and Iturrate. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Frederik Vinter-Hviid, ZnZAcm9wY2EuY29t

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.