- 1Department of Brain Robot Interface, Computational Neuroscience Laboratories, Advanced Telecommunications Research Institute International (ATR), Kyoto, Japan
- 2Graduate School of Informatics, Kyoto University, Kyoto, Japan
Identifying an accurate dynamics model remains challenging for humanoid robots. The difficulty is mainly due to the following two points. First, a good initial model is required to evaluate the feasibility of motions for data acquisition. Second, a highly nonlinear optimization problem needs to be solved to design movements to acquire the identification data. To cope with the first point, in this paper, we propose a curriculum of identification to gradually learn an accurate dynamics model from an unreliable initial model. For the second point, we propose using a large-scale human motion database to efficiently design the humanoid movements for the parameter identification. The contribution of our study is developing a humanoid identification method that does not require the good initial model and does not need to solve the highly nonlinear optimization problem. We showed that our curriculum-based approach was able to more efficiently identify humanoid model parameters than a method that just randomly picked reference motions for identification. We evaluated our proposed method in a simulation experiment and demonstrated that our curriculum was led to obtain a wide variety of motion data for efficient parameter estimation. Consequently, our approach successfully identified an accurate model of an 18-DoF, simulated upper-body humanoid robot.
1 Introduction
Due to the nonlinear nature of robotic systems, leveraging accurate models of their dynamics is crucial for fast and precise motion control.
System identification is a methodology that estimates parameters of a dynamical model. For robot systems, the inertial parameters of each link (mass, center of mass position, and inertia matrix) can be estimated from the measurement data of motion trajectories (e.g., joint angles, angular velocities, and torques). Identification methods for robot manipulators have been studied for decades (Mayeda et al., 1984; Atkeson et al., 1986) and applied to complicated multi-degree-of-freedom (DoF) robots, such as humanoid robots (Ayusawa et al., 2008; Mistry et al., 2009). However, identifying reliable inertial parameters remains challenging for humanoid robots.
To estimate reliable inertial parameters, a wide variety of measurement data is required to excite the robot’s dynamics. For robot manipulators, such measurement data can be sampled by designing appropriate reference trajectories by optimization and generating them on the robots (Gautier and Khalil, 1992; Presse and Gautier, 1993). For humanoid robots, however, designing reference trajectories is difficult. Although some strategies have been developed to design robot reference trajectories (Bonnet et al., 2016; Ayusawa et al., 2017), a good model is necessary to consider the constraints of balance of a humanoid robot to generate reference trajectories without falling over. Optimization can also be intractable since the balance constraints are imposed by nonlinear inequality constraints.
To avoid the balance problem, Bonnet et al. (2018) proposed to use a rigid harness to restrain a humanoid robot’s body. Since the harness obviated the risk of falling over, the balance constraints were not necessary. However, the above approach was not able to identify some inertial parameters since the link was fixed by the harness and could not freely move. Therefore, a good initial model remains necessary, and the highly nonlinear optimization problem must be solved to estimate reliable inertial parameters for all links.
Alternatively reference trajectories which ensure the stability of the robots could be chosen from a list of prepared motion candidates (Venture et al., 2009; Jovic et al., 2015). Recently, Aller et al. (2021) proposed to use a large set of static poses for precise estimation of the mass and the center of mass position of each link. Since such static motions are less sensitive to acceleration, it is easy to control balance and the method does not require a highly accurate model. However, the above approach was not able to estimate the moment of inertia due to the lack of dynamic motions. Therefore, it is necessary to derive an accurate initial model so that the robots can perform not only static but also dynamic motions.
To overcome this problem, we propose a curriculum-based identification approach. Our curriculum first provides conservative reference motions that can be generated with an unreliable model. As identification progresses, the curriculum offers more aggressive reference motions so that the robot’s dynamics are fully excited. The initial unreliable model is iteratively updated with the measurement data obtained by generating reference motions. Therefore, since an accurate dynamics model can be learned gradually from an unreliable initial model, a good initial model is unnecessary. Here, a curriculum is represented as a sequence of reference motions for efficient humanoid model identification.
In our proposed method, the curriculum needs to provide a wide variety of reference trajectories, including not only static but also dynamic motions. We utilized captured human motions to obtain such data. Due to the recent development of large human motion databases (Mandery et al., 2015; 2016; Krebs et al., 2021), they now contain a wide variety of motion trajectories. By selecting appropriate human motions from databases and transforming the motion data into the robot’s data, we can easily obtain various reference trajectories. Therefore, our method does not need to solve the highly nonlinear optimization problem to design references. We especially use a recently available database that includes a wide variety of upper-body behaviors of humans (Krebs et al., 2021) to obtain the reference trajectories for an upper-limb humanoid robot.
A schematic diagram of our identification curriculum is shown in Figure 1. We first convert the trajectories in the human database into those of the robot to construct a dataset of robot reference motions (Figure 1A). We next search for a generable and effective reference motion for parameter identification through the dataset (Figure 1B). The selected reference is provided to the robot and generated to obtain measurement data (Figure 1C). Finally, the new inertial parameters are estimated with the measured motion trajectories (Figure 1D). The robot can generate more aggressive motions from the next iteration since the model has become more reliable. Thus, in the next iteration such an aggressive motion is selected and provided from the dataset (Figure 1E). This iterative process is repeated until an accurate dynamics model is identified.
FIGURE 1. Schematic diagram of proposed identification method: Our method constructs identification curriculum using large-scale human motion database (Krebs et al., 2021) to estimate a dynamics model of humanoid robot from unreliable initial model without solving complex optimization problems: (A) Transform human motion database into robot data. (B) Search for generable and effective reference motions. (C) Provide selected reference and generation to obtain measurement data. (D) Estimate new inertial parameters with measurement data. (E) Proceed to next iteration and search for new reference motion.
The following are the contributions of this study:
• We proposed a curriculum-based humanoid robot identification method that can be adopted even without access to a good initial robot model. Although quasi-static motions are selected with the inaccurate initial model in the first identification step, more aggressive motions become generable from the next step of the iterative identification since the model’s accuracy is gradually improved by the previous iterations.
• In our proposed method, by using large-scale human motion data, the optimization process was greatly simplified for deriving appropriate movement data for system identification than conventional humanoid identification methods. We derived reference data by simply selecting the most suitable data for efficient parameter identification from a large-scale dataset. Note that the most suitable data means one of the candidate motions which can estimate the most reliable parameters. The optimization process to select the suitable data is, in fact, much less computationally intensive than directly solving complex optimization problems considered in the conventional approaches.
• We empirically showed that our identification method can efficiently find the parameters of a humanoid robot model. To identify an accurate model, the data sampling order is significantly important since it affects the sensitivity of the identification results to torque output errors. Through our simulated experiments, we confirmed that our method properly derived the suitable identification sequence and showed that our curriculum-based method achieved efficient parameter estimation.
The paper is organized as follows. Section 2 introduces related studies. Section 3 gives preliminaries on system identification. In Section 4, we describe our curriculum-based identification method for a humanoid robot, and Section 5 shows the results of curriculum-based learning. In Section 6, we summarize and discuss future works.
2 Related works
Identification methods for humanoid robots have been developed for over a decade, demonstrating that the inertial parameters can be identified with limited sensor measurements (Ayusawa et al., 2008; Mistry et al., 2009). The physical consistency of inertial parameters can be maintained by an optimization approach (Jovic et al., 2016).
However, the identification of reliable inertial parameters remains challenging for humanoid robots. The strategies that design robot reference trajectories require good initial models to incorporate balance constraints (Baelemans et al., 2013; Bonnet et al., 2016). Although initial models can be obtained from computer-aided-design (CAD) software, the models are often insufficient since a CAD model cannot capture every dynamical effect of the robot (Swevers et al., 2007). Our proposed approach, on the other hand, can estimate reliable inertial parameters without a good initial model using the identification’s curriculum.
A human motion capture database was used for system identification to estimate a human dynamics model (Venture et al., 2009). Human motion data can be utilized to identify a humanoid robot by converting human motion trajectories into those of a robot (Yamane, 2011). However, if the database is small-scale and contains a limited range of motion data, selecting reference trajectories that fully excite the system is difficult (Bonnet et al., 2016). Therefore, we leveraged a large-scale human motion database and show that the dynamics of a humanoid robot are successfully excited since various reference trajectories can be provided from such databases.
To the best of our knowledge, ours is the first work that creates a curriculum of system identification for a humanoid robot. Curriculum-based learning has been used to learn control policies for humanoid robots (e.g. (Karpathy and Van De Panne, 2012; Rodriguez and Behnke, 2021)). We demonstrate that curriculum-based learning is also useful for the identification of a humanoid robot.
Recently, neural-network-based identification methods have been explored for legged robots (Hwangbo et al., 2019; Yu et al., 2019). The robot’s dynamics were modeled with neural networks, and networks parameters were estimated. These studies focused on identifying task-specific dynamics models that did not generalize across multiple motions. Our proposed approach, on the other hand, can identify a non-task-specific model by estimating the physical parameters from a wide variety of measurement data. In the last part of the experiment (Section 5), we demonstrate that our method can obtain more reliable model than the task-specific models.
3 Preliminaries on system identification
3.1 Dynamics model
In this paper, we address the following dynamics model of a robot system:
where θ,
3.2 Identification process
An identification method estimates the so-called base parameters, which are the minimal identifiable set of the robot’s inertial parameters. They comprise the mass, the CoM position, and the independent inertia tensor coefficients of each link. These parameters are estimated using the fact that the equation of motion 1) can be written in a linear form with respect to base parameter vector w ∈ RD (Kawasaki et al., 1991):
where ϕ is called the regressor matrix for the base parameter. If we obtain T time-step measurement data of angle trajectories, their velocities and accelerations
and the regressor matrix becomes
and (2) can be rewritten:
Thus, the base parameters can be identified as the least-squares solution w⋆:
3.3 Optimization of robot reference trajectory
In practical applications, estimation w⋆ is perturbed by errors due to noise-contaminated measurements. For example, the accuracy of the parameters is affected by the torque output errors δf, including the external disturbances.
The sensitivity of estimated parameters w⋆ to the measurement errors can be evaluated by the condition number of the regressor matrix (Gautier and Khalil, 1992), which can be defined:
where σmax(Φ) and σmin(Φ) denote the maximum and minimum singular values of the regressor matrix. If the condition number is small, the solution is insensitive to the errors. Therefore, it is important to obtain a wide variety of measurement data to reduce the condition number to reliably estimate the base parameters.
A conventional strategy to collect such measurement data is to first design a reference trajectory of robot
The derived movement
To derive the optimal excitation trajectory for a humanoid robot, the constraints of balance must be considered so that the reference trajectory is generable by a humanoid robot without falling over. Even for an upper-limb humanoid robot, the balance constraints is important since the robot can fall over if the supporting base is not fixed to the ground.
The balance constraints are imposed on (6) by nonlinear inequality constraints:
The calculation of the balance constraints requires base parameters w. For example, the constraints of the Zero Moment Point (ZMP) are imposed to guarantee the robot’s balance (Bonnet et al., 2016). The ZMP can be represented:
where (x, y, z) denotes the center of the mass position of the humanoid robot, and lz and g are the height of the floor and the gravitational acceleration. The support polygon is a convex hull formed by the floor contact points. If the ZMP lies within the support polygon, the floor contact points do not change. Then, the robot can maintain the balance by using the ground reaction force from the contact points:
where we denote ZMP’s upper and lower boundaries as
where
However, the ZMP cannot be computed precisely if the initial model is unreliable. To account for inaccuracies, the conservative lower and upper bounds of the support polygon are considered instead of (9) in practice:
where α is the reduction coefficient of the support polygon (0 < α ≤ 1). As the reduction coefficient increases, the size of the support polygon is reduced. This restricts the variations of the reference trajectories, and the dynamics may not be fully excited. Therefore, a good initial model is necessary to estimate an accurate dynamics model.
The optimization (6) becomes more complex by enforcing the nonlinear inequality constraints (7). Deriving the reference trajectories can be intractable due to the large number of optimization variables and the nonlinear balance constraints.
4 Proposed method
Our identification method, on the other hand, requires neither preparing a good initial model nor solving the complex optimization problem. We construct a curriculum for system identification so that an accurate dynamics model is iteratively learned from an unreliable initial model. We select the reference trajectories from a robot motion dataset which is constructed using a large-scale human motion database. Our curriculum-based identification method is summarized in Algorithm 1. The details of our proposed method are described in this section.
Algorithm 1.Curriculum-Based Identification Algorithm.
1: Given
2: Human motion dataset
3: Initial base parameters w1
4: Initial inequality constraints h1
5: Run
6: for k = 1 to K do
7: Transfer
8: end for
9: return robot motion dataset
10: Run
11: for i = 1 to I do
12: Select reference Q⋆ from
13: Sample measurement data Qi and fi by using Q⋆
14: Compute new base parameters wi+1 by (4)
15: Construct new inequality constraints hi+1
16: end for
17: return estimated parameters wI+1
4.1 Iterative identification
We define a curriculum as a series of robot reference trajectories. By sequentially generating the reference trajectories by the robot, a wider variety of measurement data can be sampled. Consequently, an accurate model can be learned gradually without a good initial model. To perform this process, we developed an iterative identification method (lines 10 to 17 of Algorithm 1). Our method repeats an identification process until the iteration reaches the maximum number of iterations: I.
At each iteration, we obtain reference trajectory Q⋆ (line 12 of Algorithm 1). In i-th iteration, the reference trajectory is selected from a dataset of K robot motions
such that
Since this simple optimization avoids solving the complex optimization problem (6, 7), a wide variety of reference trajectories can be easily obtained.
Note that in our proposed method, the robot motion dataset for iterative identification was constructed by converting the captured human data into robot motion, i.e., motion retargeting. Due to the similarity of the structure to humans, the motion data for a humanoid robot can be acquired by capturing a human movement (Yamane, 2011; Ayusawa and Yoshida, 2017). Thus, we are able to obtain a sufficient amount of motion data for robots from a large-scale human motion dataset. The retargeting process is described in Section 4.2.
In the optimization (11), we consider a concatenated regressor matrix with all the measurement data that were sampled in the previous iterations from 1 to i − 1 and the newly selected reference trajectory in current iteration i:
This enables us to find an effective reference trajectory to estimate the unreliable base parameters that were not excited by the previously obtained measurement data.
By reproducing the reference trajectory on the robot, the measurement data of
where Kp and Kd denote the position and velocity gains. These gains can be small in the computed torque controller since we use an inverse dynamics controller. Figure 2 shows a block diagram of the data sampling. We investigate the noise perturbation of the joint torques.
FIGURE 2. Data sampling process: Each data point of reference trajectories
After sampling the new measurements, we stack the regressor matrix and the torque sequence with the previously obtained measurements:
and new parameters wi+1 are estimated using (4) (line 14 of Algorithm 1).
Finally, we construct new inequality constraints hi+1 for next iteration i + 1 (line 15 of Algorithm 1). At the first iteration, initial parameters w1 are unreliable. Thus, initial inequality constraints h1 restrict the reference trajectories to be generable with the unreliable dynamics model. In many instances, quasi-static motions are selected in the first iteration. But, as the iteration proceeds, the robot might generate more aggressive motions than the previous iterations since the parameters are iteratively updated. Therefore, we relax the inequality constraints at each iteration so that more dynamic reference trajectories can be selected in the optimization (11).
4.2 Motion retargeting
Before initiating the iterative identification process, we transformed the human motion data into those of the robot (lines 5 to 9 of Algorithm 1) in a process called motion retargeting (Gleicher, 1998).
We denote a sequence of three-dimensional positions of markers measured by a motion capture system as pm. We define the human motion data with T time steps as
We achieve this by first defining the kinematics model for representing the human motion data. Next we compute geometric parameters ξh and angle trajectories
We respectively define the N control points equally spaced between t = 1 to T (assuming N < T) and their corresponding basis functions of the B-spline function:
where
Then joint angles
By using (14), we formulate motion retargeting as the following optimization problem:
where η is a penalty function and wη is its weight. In this paper, η was used to constrain the joint angles so that they do not exceed their limitations. Moreover, wg is a weight corresponding to the series of relative position errors of the markers between the human model and the robot.
The computational cost of Algorithm 1 is dominated by the optimization process for motion retargeting. We adopted the quasi-Newton algorithm implemented as a Matlab function for solving the retargeting problem in (15), where the computational complexity of the quasi-Newton method is known as O(n2). Since we used the B-spline functions to represent the movements, knot points were variables to be optimized. The number of optimized variables for each iterative computation was n = nξ + 2Nnθ, where nξ is the number of geometric parameters ξh, and nθ denotes the number of joint angles θh or θr.
The derivatives of the angle trajectories are calculated using the finite difference method:
where Δt denotes the time step of motion retargeting.
5 Experiments
We conducted experiments on the simulated environment to evaluate our curriculum-based identification approach. The details of the experimental setups and results are provided in this section.
5.1 Evaluation setups
We first verified that our method could identify a robot’s accurate dynamical model without a good initial model. We did 200 identification experiments. For each identification, different sequences of zero-mean Gaussian noise δf were added to the computed torque (Figure 2) to simulate torque output errors in the identification results. The standard deviation of the noise was set to 5% of the torque limits (Presse and Gautier, 1993). The maximum number of iterations of our iterative identification method was set to I = 5 so that the identification process might not be finished before the accurate model was obtained.
We prepared an unreliable initial model and examined whether the estimation error of the base parameters was gradually reduced as the learning iterations proceeded. The estimation error is measured by
where wtrue is the true base parameter vector and D is the dimension of the base parameter vector, i.e., the number of parameters. (The relevant results are shown in Figure 6.)
The reliability of the parameters was evaluated using the relative standard deviation of each estimated parameter:
where
The reliability was also evaluated by validation. We prepared reference motion trajectories for validation and obtained measurement data
(The relevant results is shown in Figure 8.)
We next evaluated our simple optimization (11). Our proposed method created a curriculum by selecting appropriate reference trajectories from the robot motion dataset. We compared our proposed method with a non-curriculum-based approach that did not solve the optimization problem (11). This approach randomly picked a reference trajectory under the constraints (12). We examined whether the base parameters can be learned efficiently if the sampling process was guided by the curriculum (The relevant results are shown in Figure 7).
In addition, we also evaluated our curriculum by comparing the obtained model with the other models which were identified by using single motions. Here, we called the former “the non-task-specific model” and the latter “the task-specific models.” At first, we identified the task-specific models by using each of the two groups of motions (Open and Scoop), which could be selected in the first iteration of the identification curriculum in Figure 4. After identifying each model, we evaluated the reliability of the model by using (19). (The relevant results are shown in Figure 8).
Moreover, we evaluated the efficiency of using B-spline functions for motion retargeting (Section 4.2). To compare with using B-spline functions, we prepared another strategy which directly optimized the joint angles
5.2 Experimental setups
5.2.1 Robot model
We conducted simulated experiments using an upper-limb humanoid robot model of Torobo (Tokyo Robotics Inc.) to evaluate our proposed method. Assuming a risk of falling over, we investigated the balance constraints using ZMP. The robot has a total 18 DoFs; each arm has 7 DoFs, its torso has 2 DoFs, and its head has 2 DoFs.
A dynamic simulation was performed with a physics engine MuJoCo (Todorov et al., 2012). The sensor information of each joint angle (and its derivatives) and torque were measured at 10-ms intervals. Gains Kp and Kd for the proportional derivative (PD) controller in (13) were manually designed. Note that the robot could not follow the reference trajectories by the PD controller since we only set the gains to be small.
The robot has a total of D = 132 identifiable base parameters. In the experiment, we regarded the CAD parameters as the true inertial parameters w′true to derive true base parameters wtrue. Inertial parameters w′1 for deriving initial base parameters w1 were randomly generated from a uniform distribution. If the j-th element of w′true exceeds 0, the uniform distribution interval was
5.2.2 Relaxation of inequality constraints
At each iteration of our identification method, we relaxed the inequality constraints (12) by computing a reduction coefficient of the support polygon at each iteration by
where reduction coefficient α in (10) is decreased by the following criteria:
where αmax, a, and b respectively denote the maximum reduction coefficient, scaling, and bias factors. In the first iteration, α was initialized with αmax. We set αmax = 0.74 so that only 5% of the reference trajectories could be selected from the robot motion dataset at the first iteration. We used a = 2.5 × 10−4 and b = 1.0 so that α did not become too small value in the initial iteration.
Since condition number cond(Φ) might decrease as the iteration proceeds, reduction coefficient α was also decreased. Therefore, the support polygon gradually expanded, and our proposed method selected more aggressive motions as learning progressed.
5.2.3 Human motion database and setups for retargeting
We utilized for motion retargeting the KIT Bimanual Manipulation Dataset(Krebs et al., 2021), which contains a large number of capture data of a human’s bimanual daily household activities, e.g., opening and closing lids, wiping dishes, etc. Each captured motion was stored in the Coordinate 3D (C3D) format, which recorded the three-dimensional coordinates of each part of the human body at 100 Hz sampling frequency (Δt = 10 ms period). We took 110 motions from the dataset, 98 of which were used as a reference motion dataset for identification, and 12 of which were used for validation.
The motion retargeting was performed using the minFunc function (Schmidt, 2005) (which had been made for solving optimization problems) in a Matlab environment by an AMD EPYC 7742 CPU, 3.4-GHz computer. The weights of the penalty function were set to wg = 100 and wη = 50. The angle trajectories were generated by interpolating the via points, which were optimized at 200-ms intervals. After retargeting, each bit of data was labeled by the motion names shown in Table 1, where the names were taken from the used dataset.
TABLE 1. Motion dataset (Krebs et al., 2021) used for system identification and validation.
5.3 Experimental result
Our proposed method selected a sequence of the reference trajectories from the robot motion dataset to create a curriculum. Figure 3 shows the robot motion dataset. Since we constructed it using a large-scale human motion database, it contains various motion trajectories (e.g., Figure 3).
FIGURE 3. Reference motion dataset for upper-limb humanoid robot: Each block represents reference motion trajectory. Trajectories were sorted according to labels of motion names. By retargeting large-scale human motion database, we contained a wide variety of motion trajectories. Snapshots of some motions are shown as examples.
By selecting appropriate trajectories from the dataset, 200 learning curricula were created in the experiments because we performed identification experiments 200 times in different noise settings. A representative curriculum is depicted in Figure 4. At each iteration, the reference trajectory with the minimum condition number was selected from the entire dataset. The varieties of reference trajectories increased as iterations progressed since the upper and lower boundaries of the ZMP were expanded with (20) (Figure 5). For example, the sweep motions were unavailable in the first and second iterations, although they became generable from the third iteration and had the smallest condition numbers.
FIGURE 4. Identification curriculum for upper-limb humanoid robot: Each rectangle enclosed by thick black line represents generable reference motion dataset for each iteration. Color of each block represents relative magnitude of condition number for each iteration. Here, darker color represents smaller condition number than lighter color. Black arrows represent curriculum which connects motion with smallest condition number (red dots) at each iteration.
FIGURE 5. Relaxation of balance constraints for each iteration of representative curriculum: Red dot (pZMP) represents ZMP, computed using reference trajectory. Yellow rectangle
Figure 6 shows the RMSE between the true and estimated parameters (17) for each iteration of the identification. The error of the initial model is indicated with a solid line. The estimation errors gradually reduced as iterations proceeded (Figure 6). Finally, the error median was 0.243 at the fifth iteration. We evaluated the reliability of the parameters by relative standard deviation (18). Our proposed method successfully estimated 75.8% of the parameters (100 of 132) at the fifth iteration. In the initial model, 57.6% of the parameters (76 of 132) were reliable. Therefore, our method learned an accurate dynamics model where a good initial model was unavailable.
FIGURE 6. Estimation errors of base parameters: Blue line represents root mean square error (RMSE) between true and initial parameters of robot model. Each red box represents RMSEs between true and estimated parameters of each iteration. The red crosses represent the outliers in each iteration of the evaluation using the proposed method.
Figure 7 compares our curriculum-based and the non-curriculum-based approaches. As shown in Figure 7, the condition number was significantly large at each iteration if the reference trajectories were selected randomly from the robot motion database. Our curriculum-based approach, on the other hand, efficiently learned the accurate base parameters. Therefore, the results showed the importance of guiding the sampling process by creating the curriculum.
FIGURE 7. Comparison of curriculum-based and non-curriculum-based approaches: Each red box represents condition numbers of our curriculum-based approach. Each green box represents condition numbers of non-curriculum-based approach which randomly picked reference motions for each iteration. The red crosses represent the outliers in each iteration of the evaluation using the proposed method. The green crosses represent the outliers using non-curriculum-based approach in each iteration.
Figure 8 compare the non-task-specific model and the task-specific models. As shown in Figure 8, in case of the non-task-specific model, the errors became smaller than that of the task-specific models for each iteration. Therefore, our curriculum estimated more reliable model than the task-specific models.
FIGURE 8. Comparison of task-specific and non-task-specific models: Blue band represents RMSEs between estimated joint torques using initial base parameters and their measurements. Each box represents RMSEs between estimated joint torques using identified parameters of each iteration and their measurements. Each yellow box represents RMSEs between measurements and estimations using identified base parameters by leveraging only “Open” motions. Each violet box represents RMSEs between measurements and estimations using identified base parameters by leveraging only “Scoop” motions. Each red box represents RMSEs between measurements and estimations using base parameters of the non-task-specific model. The crosses represent the outliers in each iteration of the evaluation.
Figures 9A, B compare the performances of the motion retargeting: The blue boxes show the performance of retargeting without B-spline and the red boxes show the performance of retargeting with using B-spline. As shown in Figures 9A, B, when optimization was performed using (14), we could obtain results with lower optimization costs and less computation time than when B-spline functions were not used. Therefore, motion retargeting using the B-spline functions was shown to be more efficient.
FIGURE 9. Comparison of the strategies of the motion retargeting. (A) The calculation times which were taken for motion retargeting. (B) the optimization costs. In both subfigures, blue boxes represent the results when motion retargeting is performed without using the B-Spline function. Red boxes represent the results when motion retargeting is performed using the B-Spline function.
6 Conclusion
We proposed a curriculum-based approach for the system identification of a humanoid robot. Since the curriculum helps gradually learn accurate dynamics from an unreliable initial model, our method does not require a good initial model. Unlike the conventional approach, our method does not need to solve the complex optimization problem. By using a large-scale human motion database, various reference motions can be provided by converting the human motion data into those of the robot. By selecting an appropriate reference trajectory, our curriculum successfully guided the robot’s sampling process. Consequently, an accurate dynamics model was gradually learned from an unreliable initial model by our iterative identification method.
Future work will apply our curriculum-based identification method to a physical robot. In addition, we will evaluate our approach on a whole-body humanoid robot. In this case, we would not only change the robot model, but also consider whether there are other motion databases which would be suitable for identifying the model. Once the accurate initial model is obtained, we would try to identify more accurate model for the fast and precise control. Then, it is necessary to consider whether it is more preferable to carry out our identification curriculum using a database containing more aggressive behaviors or to use a conventional method without using a database.
Moreover, we will verify the end conditions of iterative identifications in the future work. In this paper, the maximum number of iterations was set in advance to demonstrate the effectiveness of the identification curriculum. However, it should be determined to ensure that a highly accurate model is obtained. To do so, some criteria could be used as the end conditions of identification [e.g., the condition number of the angle trajectories (5) and the reliability of the model (19)]. Then, the tuning of the hyperparameters for the relaxation of the inequality constraint might also be discussed to avoid generating a redundant curriculum.
Data availability statement
The datasets presented in this article are not readily available because the data is not open to the public. Requests to access the datasets should be directed to kang@atr.jp.
Author contributions
SK: Conceptualization, Data curation, Formal Analysis, Investigation, Methodology, Software, Writing–original draft, Writing–review and editing. KI: Conceptualization, Investigation, Methodology, Writing–original draft, Writing–review and editing. NS: Data curation, Resources, Writing–review and editing. JM: Conceptualization, Funding acquisition, Methodology, Project administration, Supervision, Writing–review and editing.
Funding
The authors declare financial support was received for the research, authorship, and/or publication of this article. This paper is based on results obtained from a project, JPNP20006, commissioned by the New Energy and Industrial Technology Development Organization (NEDO). Part of this work was supported by MIC-SCOPE (182107105), JST-Mirai Program Grant JPMJMI18B8, JSPS KAKENHI JP22H03669 and JP22H04998. Tateishi Science and Technology Foundation.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Aller, F., Harant, M., and Mombaur, K. (2021). “Motion capture based model identification of the humanoid robot REEM-C using static poses,” in ECCOMAS thematic conference on multibody dynamics (Budapest: Budapest University of Technology and Economics), 310–321.
Atkeson, C. G., An, C. H., and Hollerbach, J. M. (1986). Estimation of inertial parameters of manipulator loads and links. Int. J. Robotics Res. 5, 101–119. doi:10.1177/027836498600500306
Ayusawa, K., Nakamura, Y., and Venture, G. (2009). “Optimal estimation of human body segments dynamics using realtime visual feedback,” in 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems (IEEE), St. Louis, MO, USA, 10-15 October 2009, 1627–1632.
Ayusawa, K., Rioux, A., Yoshida, E., Venture, G., and Gautier, M. (2017). “Generating persistently exciting trajectory based on condition number optimization,” in 2017 IEEE International Conference on Robotics and Automation (ICRA) (IEEE), Singapore, 29 May 2017 - 03 June 2017, 6518–6524.
Ayusawa, K., Venture, G., and Nakamura, Y. (2008). “Identification of humanoid robots dynamics using floating-base motion dynamics,” in 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems (IEEE), Nice, France, 22-26 September 2008, 2854–2859.
Ayusawa, K., and Yoshida, E. (2017). Motion retargeting for humanoid robots based on simultaneous morphing parameter identification and motion optimization. IEEE Trans. Robotics 33, 1343–1357. doi:10.1109/tro.2017.2752711
Baelemans, J., Van Zutven, P., and Nijmeijer, H. (2013). “Model parameter estimation of humanoid robots using static contact force measurements,” in 2013 IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR) (IEEE), Sweden, 21-26 October 2013, 1–6.
Bonnet, V., Crosnier, A., Venture, G., Gautier, M., and Fraisse, P. (2018). “Inertial parameters identification of a humanoid robot hanged to a fix force sensor,” in 2018 IEEE International Conference on Robotics and Automation (ICRA) (IEEE), Brisbane, QLD, Australia, 21-25 May 2018, 4927–4932.
Bonnet, V., Fraisse, P., Crosnier, A., Gautier, M., González, A., and Venture, G. (2016). Optimal exciting dance for identifying inertial parameters of an anthropomorphic structure. IEEE Trans. Robotics 32, 823–836. doi:10.1109/tro.2016.2583062
Gautier, M., and Khalil, W. (1992). Exciting trajectories for the identification of base inertial parameters of robots. Int. J. robotics Res. 11, 362–375. doi:10.1177/027836499201100408
Gleicher, M. (1998). “Retargetting motion to new characters,” in Proceedings of the 25th annual conference on Computer graphics and interactive techniques (New York, NY, United States: Association for Computing Machinery), 33–42.
Hwangbo, J., Lee, J., Dosovitskiy, A., Bellicoso, D., Tsounis, V., Koltun, V., et al. (2019). Learning agile and dynamic motor skills for legged robots. Sci. Robotics 4, eaau5872. doi:10.1126/scirobotics.aau5872
Jovic, J., Escande, A., Ayusawa, K., Yoshida, E., Kheddar, A., and Venture, G. (2016). Humanoid and human inertia parameter identification using hierarchical optimization. IEEE Trans. Robotics 32, 726–735. doi:10.1109/tro.2016.2558190
Jovic, J., Philipp, F., Escande, A., Ayusawa, K., Yoshida, E., Kheddar, A., et al. (2015). “Identification of dynamics of humanoids: systematic exciting motion generation,” in 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE), Hamburg, Germany, 28 September 2015 - 02 October 2015, 2173–2179.
Karpathy, A., and Van De Panne, M. (2012). “Curriculum learning for motor skills,” in Canadian conference on artificial intelligence (Berlin, Heidelberg: Springer), 325–330.
Kawasaki, H., Beniya, Y., and Kanzaki, K. (1991). “Minimum dynamics parameters of tree structure robot models,” in Proceedings IECON’91: 1991 International Conference on Industrial Electronics, Control and Instrumentation (IEEE), Kobe, Japan, 28 October 1991 - 01 November 1991, 1100–1105.
Khosla, P. K., and Kanade, T. (1989). Real-time implementation and evaluation of the computed-torque scheme. IEEE Trans. Robotics Automation 5, 245–253. doi:10.1109/70.88047
Krebs, F., Meixner, A., Patzer, I., and Asfour, T. (2021). “The kit bimanual manipulation dataset,” in 2020 IEEE-RAS 20th International Conference on Humanoid Robots (Humanoids) (IEEE), Munich, Germany, 19-21 July 2021, 499–506.
Mandery, C., Terlemez, O., Do, M., Vahrenkamp, N., and Asfour, T. (2015). “The kit whole-body human motion database,” in 2015 International Conference on Advanced Robotics (ICAR) (IEEE), Istanbul, Turkey, 27-31 July 2015, 329–336.
Mandery, C., Terlemez, O., Do, M., Vahrenkamp, N., and Asfour, T. (2016). Unifying representations and large-scale whole-body motion databases for studying human motion. IEEE Trans. Robotics 32, 796–809. doi:10.1109/tro.2016.2572685
Mayeda, H., Osuka, K., and Kangawa, A. (1984). A new identification method for serial manipulator arms. IFAC Proc. Vol. 17, 2429–2434. doi:10.1016/s1474-6670(17)61346-6
Mistry, M., Schaal, S., and Yamane, K. (2009). “Inertial parameter estimation of floating base humanoid systems using partial force sensing,” in 2009 9th IEEE-RAS International Conference on Humanoid Robots (IEEE), Paris, France, 07-10 December 2009, 492–497.
Ogawa, Y., Venture, G., and Ott, C. (2014). “Dynamic parameters identification of a humanoid robot using joint torque sensors and/or contact forces,” in 2014 IEEE-RAS International Conference on Humanoid Robots (IEEE), Madrid, Spain, 18-20 November 2014, 457–462.
Presse, C., and Gautier, M. (1993). “New criteria of exciting trajectories for robot identification,” in 1993 Proceedings IEEE International Conference on Robotics and Automation (IEEE), Atlanta, GA, USA, 02-06 May 1993, 907–912.
Rackl, W., Lampariello, R., and Hirzinger, G. (2012). “Robot excitation trajectories for dynamic parameter estimation using optimized b-splines,” in 2012 IEEE International Conference on Robotics and Automation (IEEE), Saint Paul, MN, USA, 14-18 May 2012, 2042–2047.
Rodriguez, D., and Behnke, S. (2021). “Deepwalk: omnidirectional bipedal gait by deep reinforcement learning,” in 2021 IEEE International Conference on Robotics and Automation (ICRA) (IEEE), Xi'an, China, 30 May 2021 - 05 June 2021, 3033–3039.
Schmidt, M. (2005). minfunc: unconstrained differentiable multivariate optimization in matlab. Available at: http://www.cs.ubc.ca/∼schmidtm/Software/minFunc.html.
Swevers, J., Ganseman, C., Tukel, D. B., De Schutter, J., and Van Brussel, H. (1997). Optimal robot excitation and identification. IEEE Trans. robotics automation 13, 730–740. doi:10.1109/70.631234
Swevers, J., Verdonck, W., and De Schutter, J. (2007). Dynamic model identification for industrial robots. IEEE control Syst. Mag. 27, 58–71. doi:10.1109/MCS.2007.904659
Todorov, E., Erez, T., and Tassa, Y. (2012). “Mujoco: a physics engine for model-based control,” in 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems (IEEE), Vilamoura-Algarve, Portugal, 07-12 October 2012, 5026–5033.
Venture, G., Ayusawa, K., and Nakamura, Y. (2009). “A numerical method for choosing motions with optimal excitation properties for identification of biped dynamics-an application to human,” in 2009 IEEE International Conference on Robotics and Automation (IEEE), Kobe, Japan, 12-17 May 2009, 1226–1231.
Yamane, K. (2011). “Practical kinematic and dynamic calibration methods for force-controlled humanoid robots,” in 2011 11th IEEE-RAS International Conference on Humanoid Robots (IEEE), Bled, Slovenia, 26-28 October 2011, 269–275.
Keywords: human motion database, humanoid robots, motion retargeting, system identification, dynamics model
Citation: Kang S, Ishihara K, Sugimoto N and Morimoto J (2023) Curriculum-based humanoid robot identification using large-scale human motion database. Front. Robot. AI 10:1282299. doi: 10.3389/frobt.2023.1282299
Received: 23 August 2023; Accepted: 13 November 2023;
Published: 30 November 2023.
Edited by:
Melih Kandemir, University of Southern Denmark, DenmarkReviewed by:
Andreas Look, Robert Bosch, GermanyErhan Oztop, Osaka University, Japan
Bengisu Guresti, Washington University in St. Louis, United States, in collaboration with reviewer EO
Copyright © 2023 Kang, Ishihara, Sugimoto and Morimoto. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Sunhwi Kang, kang@atr.jp; Jun Morimoto, xmorimo@atr.jp