- 1Department of Mechanical and Electrical Engineering, Hunan University of Science and Technology, Xiangtan, China
- 2Intelligent Manufacturing College, Hunan Vocational Institute of Technology, Xiangtan, China
- 3Department of Information and Electrical Engineering, Hunan University of Science and Technology, Xiangtan, China
In order to deal with the multi-target search problems for swarm robots in unknown complex environments, a multi-target coordinated search algorithm for swarm robots considering practical constraints is proposed in this paper. Firstly, according to the target detection situation of swarm robots, an ideal search algorithm framework combining the strategy of roaming search and coordinated search is established. Secondly, based on the framework of the multi-target search algorithm, a simplified virtual force model is combined, which effectively overcomes the real-time obstacle avoidance problem in the target search of swarm robots. Finally, in order to solve the distributed communication problem in the multi-target search of swarm robots, a distributed neighborhood communication mechanism based on a time-varying characteristic swarm with a restricted random line of sight is proposed, and which is combined with the multi-target search framework. For the swarm robot kinematics, obstacle avoidance, and communication constraints of swarm robots, the proposed multi-target search strategy is more stable, efficient, and practical than the previous methods. The effectiveness of this proposed method is verified by numerical simulations.
Introduction
Inspired by the group behavior of social insects such as ants and bees, the concept of swarm intelligence is put forward by scholars (Bonabeau, 1999), which is defined as the collective intelligence emerging from a group of simple agents. The swarm robot system (Doty and Van Aken, 2002) is a typical artificial swarm intelligence system, which consists of a large number of homogeneous autonomous robots with a simple structure. By the coordination and cooperation of robots with limited individual capabilities under a specific mechanism, the system can present intelligent behavior and complete relatively complex tasks.
The common research contents of swarm robot systems include target search (Alfeo et al., 2019; Booth et al., 2020), task assignment (Liang et al., 2018), cluster avoidance (Khan et al., 2019), path planning (Ryan, 2008; Luo et al., 2017), and cluster formation (Anonymous, 1993; Alsamman, 2011). In this paper, the target search problem of swarm robots in unknown complex environments is mainly studied, such as forest fire detection (Yao et al., 2018; Marzaeva, 2019), toxic gas leak detection (Zhang et al., 2010; Moshayedi and Gharpure, 2013), search and rescue of missing personnel (Goodrich et al., 2009; Kamegawa et al., 2020), military target detection (Ha and Cho, 2018; Jiong et al., 2019) and so on. In order to solve this type of search problem, there are mainly composed of two main categories of design strategies, namely, behavior-based search and learning-based search (Cizek and Faigl, 2019; Berscheid et al., 2020; Suzuki et al., 2020), and this article mainly discusses the former.
According to the number of search targets, searches can be divided into single-target searches and multi-target searches. When the swarm robot system is applied to single-target search, it is necessary to pay attention to the cooperation mechanism between individual robots. Gudise (2004) proposed an extended particle swarm optimization (EPSO) algorithm, which was successfully applied to single-target searches. Ducatelle et al. (2011) used the local wireless network communication strategy to strengthen the communication ability between robots and enhance the robustness of the swarm robot system. Majid and Arshad (2017) mainly focused on the performance indicators in the EPSO coordinated search algorithm such as trajectory smoothness, search success rate, and search time, and studied the impact of the inertial weight on the search performance of swarm robots. Tang et al. (2020) proposed an improved adaptive bat algorithm (IABA) search algorithm by focussing on the problem of obstacle avoidance and improving the performance of the algorithm in the single-target search process of swarm robots. Aiming at the distributed communication problem in the single-target search process of swarm robots, Yang et al. (2019) proposed a time-varying characteristics swarm of visual limited (V-TVCS) model.
However, when the swarm robot system is applied to the actual neighborhood search, the number of search targets is more than one. Therefore, how to set up a multi-target search algorithm considering the actual search environment is the focus of scholars at home and abroad. Manic (2009) proposed a multi-target task allocation model with response threshold (TRT) to realize self-organizing task allocation, and then robots with the same objective task used the EPSO algorithm for coordinated search. Zhang and Xue (2014) proposed a dynamic task division strategy with closed-loop adjustment for the problem of uneven subgroup size of the TRT model. Xinjie (2020) established a simplified virtual force model (SVFM) for the unknown and complex environment, and successfully solved the obstacle avoidance problem in the multi-target search process. Zhang and Xue (2015) proposed the strategies of competition and cooperation and cooperation for the problem of subgroup interaction in parallel search. Jie (2019) proposed a probabilistic finite state machine search framework for the multi-target search problem of swarm robots. Xinjie (2020) extended the two-dimensional SVFM (2D-SVFM) to 3D-space, and successfully implemented this type of search method to achieve multi-target search in the Unmanned Aerial Vehicle (UAV) cluster system.
Based on the above literature analysis, the above methods can be applied to specific target search scenarios, but there are the following problems. First of all, there is no standard multi-target search algorithm framework in these methods. Most of the algorithms' settings are only suitable for searching for a specific number of targets, not for searching for any number of targets. Secondly, most of the algorithms only start to study a specific performance index of swarm robots, and do not consider the algorithm performance, obstacle avoidance, and swarm communication problem of swarm robots in actual search scenarios at the same time.
Aiming at the static multi-target search problem of swarm robots in unknown complex environments, a multi-target coordinated search algorithm for swarm robots considering practical constraints (MSRCPC) is proposed in this paper. The main work of this paper is as follows. First, based on the mechanism of finite state machines, an ideal multi-target search framework for swarm robots is proposed. Then, on the basis of the entire framework, combined with the simplified virtual force model, the obstacle avoidance problem of the swarm robot in the multi-target search process is solved. Finally, considering the communication interaction problem in the coordination and cooperation of swarm robots and the random line-of-sight problem of individual robots in the actual communication process, the distributed neighborhood interaction model based on a time-varying characteristic swarm with a restricted random line of sight (RS-TVCS) is constructed. By embedding the sub-algorithms in the whole algorithm framework, the MSRCPC algorithm proposed in this paper can greatly improve the search performance of the swarm robot system, making the entire system more scalable and practical.
The remaining parts of this paper are summarized as follows. In section 1, the research background of this algorithm and the research progress at home and abroad was introduced. In section 2, the ideal multi-target search framework for swarm robots is introduced. In section 3, the obstacle avoidance mechanism and distributed communication mechanism of the swarm robot system are described, and the multi-objective search framework of swarm robots considering practical constraints is proposed. The simulation test analysis on the proposed algorithm is conducted in section 4. Finally, the main work is summarized.
The Framework of Ideal Search Algorithm
In a closed two-dimensional space R2, the task environment for multi-target search of swarm robots can be described by the set {R, T, S, D}.where, R = {R1,…, Ri,…, Rm, m > 1} is the search subject (swarm robots); T = {T1,…, Tj,…, Tn, n > m} is the searched target; S = {S1,…, So,…, Sp, p > 1} is the static obstacle and D = {D1,…, Dl,…, Dq, q > l} is the dynamic obstacle. In addition, we let w exist in the task set {R, T, S, D}.
The set targets can send out a continuous specific signal, and are randomly distributed in the search map. The sensors carried by swarm robots can detect the strength of the target signal, which cannot determine the direction of the signal. The initial positions of swarm robots are randomly in a certain corner of the search map. In the case without considering obstacles and ideal communication interaction, the multi-target search algorithm framework of swarm robots can be described in the form of a finite state machine. The specific description is shown in Figure 1.
As shown in Figure 1, the basic multi-target search algorithm framework can be described as: when the robot detects the target signal, it enters the coordinated search state, and uses the swarm intelligence optimization algorithm to coordinate the search; when the robot does not detect the target signal, it will follow a certain roaming mechanism to detect the target signal.
The Multi-Assignment Model Based on Response Threshold
Sensor Detection Model
Sensors with different detection distances have different response strengths for target signal, and the function to describe the target signal strength can be set as follows (Manic, 2009):
where Q is the constant power signal sent from the center of the target, dij denotes the distance between the robot and the target, d0 is the maximum detection distance of the sensor, s is the signal attenuation factor, rand is the random disturbance of the signal, and I(i, j) is the signal strength between robot and target.
Multi-Target Allocation and Design
In the robot roaming search process, the robot may detect multiple target signals. How to make self-organizing decisions on the target signals and find subgroup alliances is the key to the coordinated search of swarm robots. First, the target response function is used to calculate the detection of each target signal for each robot at time t. Then, the probability that the robot selects the target is calculated via the target response signal strength, Finally, the decision about the target based on the roulette probability decision algorithm is made. As shown in Table 1, the induction about the target signal strength of the robot at the moment t is as follows:
The probability response process of the i-th robot to the j-th target is:
where Ij is the signal strength of the target Tj detected by the robot Ri. If the robot can detect the number of targets, i.e., m, the probability that Ri responds to the excitation from target Tj is p(i, j). The Ri decision-making process of the robot Ri is as follows:
where rand () is subject to a uniform score between 0 and 1, and k is the smallest target sequence number satisfying its condition. According to the processed decision-making method, it can be determined from Table 1 that the subgroup alliances composed of the task target set are T1 = {R1, R5}, T2 = {R2, R3}, and T3 = {R4}, and the members of R6 are in the roaming search state and do not participate in the coordinated search.
The Roaming Search Algorithm Based on Nearest Neighbor Exclusion Diffusion
At the initial moment, the robot cannot detect the target signal. Therefore, it is very important to design an effective individual roaming search model to detect the target signal at the fastest speed. Typical roaming search models include Levy Flight (Viswanathan et al., 1999) and Intermittent Search (Bénichou et al., 2006). However, the roaming search strategies of these models suffer from the following disadvantages: (1) the search efficiency is not high, and (2) the factor of obstacle avoidance is not considered in the search process. Therefore, a new roaming search algorithm, namely, the Nearest Neighbor Exclusion Diffusion (NNED) Algorithm is introduced in this section.
Suppose the position information of the i-th robot in the search space at time t is expressed as Xri(t) = [Xi(t), Yi(t)]T, and the maximum speed of the roaming robot is Vm. The NNED algorithm is described below.
Without considering obstacles, the distance matrix Dim between the i-th robot and other robots at time t can be expressed as follows:
where dik is the Euclidean distance between the i-th robot and the k-th robot. Sort equation (4) by row from small to large to obtain the distance sorting matrix Dis.
The position sequence information index of the neighboring robot can be expressed as follows:
The repulsion angle θ between the i-th robot and the index-th robot is expressed as follows:
where diindex is the Euclidean distance between the i-th robot and the index-th robot. Set the expected position of the robot at time t + 1 as , and the step size is updated as follows:
Taking into account the boundary constraints, the actual position of the roaming robot is updated as follows:
where L is the search boundary. In the same way, the y-axis velocity component considering the boundary limit can be updated. Set the actual updated position of the robot at time t + 1 as Xri(t + 1) = [xi(t + 1), yi(t + 1)]T, and the position update of the roaming robot considering boundary constraints is as follows:
where Vic(t + 1) = [V′(x), V′(y)]T is update step of the robot roaming speed.
Coordinated Search Algorithm of Particle Swarm Based on Kinematics Constraints
By analyzing and comparing several benchmark concepts in the cooperative search state of the particle swarm algorithm and swarm robots, it can be found that there is a certain mapping relationship between them. Based on the inertial weight particle swarm algorithm, kinematic constraints can be used to describe this mapping relationship, and the specific expression is as follows (Gudise, 2004):
where Vie(t + 1) is the expected speed of the robot at the next moment, VRi(t + 1) is the speed of the robot at time t, (t) is the historical optimal position of the individual robots, g * Ri (t) is the optimal position of the robot at time t, VRi(t + 1) is the actual expected speed considering the kinematics of the robot, XRi(t) is the position coordinate of the robot at time t, XRi(t + 1) is the expected position of the robot at the next time, w is the inertial weight, c1 and c2 are the individual and social cognitive coefficients of the robot, r1 and r2 are random numbers uniformly distributed between 0 and 1, α is the inertia coefficient, δ is the step size control factor of the robot, and Vm is the limited maximum speed.
Setting the target position as [Xot,Yot]T, the fitness function of the coordinated search of the robot is as follows:
Because the particle swarm optimization algorithm easily falls into the local best optimum, its inertia weight is improved by combining the actual search situation of the robot in this paper. The basic idea is as follows: when the distance between the particle and the target exceeds a certain threshold, w remains large and the global search is performed; when the distance between the particle and the target is less than a given threshold, w uses its fitness value to performs adaptive non-linear decrement value, fine-grained search and continuously approach the target point. The sigmoid function in the neural network has a strong non-linear approximation ability, whose extreme value ranges between 0 and 1. Since the value of the inertia weight w in the particle swarm is almost the same, the mapping relationship is as follows:
Then, a function is introduced into a distance-dependent robots system to adapt the value of inertia weight, and the specific expression is as follows:
where dl is the set distance threshold and d = f(i) is the fitness value of the robot.
The Framework of Swarm Robot Search Algorithm Considering Practical Constraints
On the basis of the ideal multi-target search algorithm framework in the previous section, in this section, the problems of real-time obstacle avoidance and distributed communication in the search process of swarm robots are considered, and a multi-target search algorithm framework for swarm robots considering practical constraints is designed.
Simplified Virtual Force Model
Aiming at the obstacle avoidance problem in the multi-target search process of swarm robots, introducing a simplified virtual force model can not only perfectly overcome the collision avoidance problem between robots, but also can be well integrated with the entire search algorithm framework, and the performance of the algorithm is also guaranteed.
The Construction of Obstacle Avoidance Model
The idea of this model is described in Figure 2. Supposing that the position of the i-th robot at time t is Xri(t), the position of the robot at time t + 1 under the framework of the ideal multi-target search algorithm is Xri(t + 1). It is obvious from Figure 2 that the local path planned by the robot from t to t + 1 will coincide with the position of the obstacle.
First, find out the position information of two neighboring obstacles or robots based on the information of obstacles or neighboring robots detected by the sensor of i-th robot sensor, which are Xo1 and Xo2 respectively.
Then it is assumed that the robot will be affected by the virtual introduction fac at the next moment and two neighboring obstacles or robot repulsion which are fio1 and fio2 respectively.
Now define the rotation matrix TR of the new coordinate system XOY generated by rotating the xoy coordinate system counterclockwise by angle a as follows:
Set gravity fac as follows:
The rotation matrix parameter a can be expressed as follows:
The force function of a given neighbor obstacle or robot is as follows:
where da is the obstacle avoidance distance of the object (static obstacles, robots, and dynamic obstacles) in the search process, and dik is the distance between the robot and the obstacle in the search process, and k1 is the obstacle avoidance parameter of the robot.
Therefore, the coordinate components of obstacles (robots) Xo1 and Xo2 to robot i in the XOY coordinate system can be respectively obtained by the simultaneous equations (15)-(18), which are as follows:
or
where, fio1x and fio1y represent the components of the repulsive force fio1 in the coordinate system xoy, and fio2x and fio2y represent the components of the repulsive force fio2 in the coordinate system xoy. These components can be all obtained by equation (18).
Finally, only considering the deflection force in the X-axis direction and ignoring the resistance in the Y-axis direction, the final motion direction fri of the robot is expressed as follows:
where, vri(t + 1) is the actual speed required by the robot at the next moment, which can be regarded as the vector sum of the robot state update speed vie(t + 1) at time t + 1 under the framework of the ideal search algorithm and the deflection speed vif (t) of the neighboring obstacles in the X-axis direction.
The Strategy of Robot State Step Update Considering Obstacle Avoidance Constraints
Based on the idea of the SVFM combined with the ideal search algorithm framework with SVFM, the step size update strategy of the robot in different search states is given in this section. When the robot is in the roaming search state, rstate = 0; when the robot is in the coordinated search state, rstate = 1. The speed step update strategy of swarm robots in different states is as follows:
The position update strategy of swarm robots considering obstacle avoidance constraints is as follows:
The Distributed Neighborhood Communication Mechanism Based on Time-Varying Characteristic 179 Swarm With Restricted Random Line of Sight (RS-TVCS)
The Communication Model Based on RS-TVCS
In biological research, perception and communication between animal groups are often limited by perception distance. For example, when the Ouqiong bird population flies in formation, its individuals can only exchange information with neighboring individuals within its communication radius to form a local communication network. There is a common neighboring individual between two individuals, and they cannot directly communicate and interact. Through sharing the information of common neighboring individuals, it can spread to the individuals outside their neighbors to form a global communication network. Based on this idea, a representation based on distributed neighborhood communication is defined. The communication-based neighborhood of robot i is a set of teammates within a fixed radius dc to the position of robot i, which can be written as (Xue et al., 2009):
where Ω is the communication-based neighborhood, m is the number of members in the swarm, and ri denotes the robot i. xri and xrj are the spatial positions of robots i and j, robots respectively. dc is the maximum communication radius.
During swarm moving, the neighborhoods may change over time, causing the whole swarm to be divided into several dynamically changing sub-swarms. Xue et al. defined those sub-swarms with the concept of Time-Varying Characteristic Swarm (TVCS). The TVCS of robot i at time t can be represented as follows (Junior and Nedjah, 2016):
where Ω(ri)(t) represents the TVCS of robot i. The number of members in a TVCS is dynamically changing, i.e., ri can only able to communicate with other agents in Ω(ri)(t) at the time t. Taking into account the limited field of view in the robot signal interaction process, Yang et al. (2019) defined a notation of visual limited TVCS (V-TVCS), which can be written as:
where Ωv(ri)(t) is the V-TVCS. ω is the single of view of i-th robot, and its sight range is generally set to ϕi,j ϵ (0, 2π]. ϕi,j is the sight judgment vector of robots i and j, which is expressed as follows:
where, rij(t) is the location vector of robots i and j, vri(t) is the speed vector of the ith robot, and 〈 rij(t), vri(t)〉 is the angle between vectors rij(t) and vri(t).
Since the line of sight of the robot is not always in the direction of its speed in the process of motion, it is assumed that the line of sight of individual robots changes randomly along the direction of movement in this paper and that the change law obeys the normal distribution, namely, η ~ N(0, σ2), where σ is the standard deviation of the deflection angle of the line of sight, and the mean value is 0, indicating that the probability of the individual going straight ahead is greater than that of information interaction to the diagonal side. Considering the limitation of the random line of sight of the robot, the relationship structure diagram of the neighborhood distributed neighborhood communication based on RS-TVCS designed in this paper is shown in Figure 3.
The distributed neighborhood communication mechanism based on RS-TVCS is defined as followed:
where, the expression of ϕi,j is as follows:
where LOSri(t) is the vector of line of sight. When LOSri(t) = vri(t), it indicates that the line of sight of the robot is consistent with its moving direction. Therefore, the V-TVCS distributed communication mechanism based on V-TVCS is only a special case of RS-TVCS. RS-TVCS has better scalability and practicability than V-TVCS.
RS-TVCS Distributed Network Connected Subset Judgment Based on BFS Algorithm
The global communication network based on the RS-TVCS will change with the dynamic migration of swarms. Under the ideal search algorithm framework, it will iteratively change with the position of the robots, which will make it impossible for some robots to interact with each other, thus forming connected subgroups. Therefore, based on graph theory, assuming that the position of each robot at a certain moment represents a dynamic node, the connected subgroup of each robot is determined based on the idea of the breadth first search (BFS) algorithm. Through this algorithm, the interactive information of each robot under the entire global communication network based on RS-TVCS can be obtained, so as to realize the coordinated search of swarm robots.
The specific ideas are as follows:
1) Taking the position of the robot at time t as the node, the weight matrix dij is constructed by using the distance between the two points as follows:
2) Through the neighborhood judgment conditions of equations (28) and (29), the neighborhood weight matrix is constructed. When the neighborhood judgment conditions are not satisfied between the robots i and j, the weight between the two robots (nodes) is 0; otherwise, the weight between the two robots (nodes) is Euclidean distance value.
3) Based on the idea of the BFS (Awerbuch and Gallager, 1987; Jia et al., 2008; Wang et al., 2020) algorithm, all the connected nodes of the neighborhood weight matrix are found to obtain the neighborhood communication information of each robot in the global network.
The Flow of Multi-Target Search Algorithm Swarm Robots Considering Practical Constraints
Under the ideal multi-target search algorithm framework, the distributed communication problem in the search environment is combined with the real-time obstacle avoidance problem. The flow chart of the multi-target search algorithm for swarm robots considering practical constraints (i.e., MSRCPC) is shown in Figure 4.
The main sub-algorithms involved in the proposed algorithm include NNED roaming search algorithm, IAEPSO coordinated search algorithm, TRT multi-target task assignment, obstacle avoidance algorithm based on SVFM, and distributed communication algorithm based on RS-TVCS. The entire algorithm framework basically considers all the problems in the search process of swarm robots, which greatly enhances the scalability and usability of the algorithm.
Simulation
In this section, the proposed MSRCPC algorithm has been verified by several experiments in Matlab2019a. First, the MSRCPC algorithm is described in detail by simulating the search behavior of swarm robots in single-target environments and multi-target environments. Then, four multi-target search comparison modes are set up, and the simulation tests are carried out 30 times by using different modes simulation tests 30 times under different group sizes. The effectiveness of the MSRCPC algorithm is verified by comparing and analyzing the simulation results.
The MSRCPC Algorithm Test
In this part, the basic parameter settings of the MSRCPC algorithm are shown in Table 2.
With constant basic parameters, the algorithm is applied to single-target and multi-target simulation environments. In view of the randomness of the algorithm, an algorithm search process is randomly recorded to describe the search mechanism and show the performance of the algorithm in detail.
The Single-Target Search Test in Unknown Complex Environments
The initial environment settings of the swarm robotics single-target search simulation for swarm robots are shown in Figure 5A. As shown in Figure 5A, at T = 0, swarm robots are distributed in the corners of the search space, represented by red dots. The position of the target to be searched is set in the middle of the search space, represented by a black regular hexagon. The various black shapes in the figure represent obstacles in the search space. For the robot, the maximum speed is 10, the direction of its initial speed is random, the communication range is limited to 150 degrees, and the direction of moving speed is inconsistent with the direction of the line of sight, and meets the communication conditions of robot in RS-TVCS. Since the robot does not detect the target signal at the initial moment, the NNED algorithm is used to perform random search and diffusion. When T = 40, the robot still does not detect the target signal point, and the NNED algorithm continues to be used to randomize, as shown in Figure 5B. As shown in Figure 5C, at T = 80, the No. 1 robot detects the target signal, and then based on the RS-TVCS algorithm proposed in this paper, the number of the robots is learned that can communicate, and the group communication is conducted to form sub swarms. The robot that can detect the target signal through group information sharing switches from the roaming search state to the coordinated search state, and uses the IAEPSO algorithm to coordinate the search for the target point. Finally, as shown in Figure 5D, at T = 128, the robots numbered 5, 6, 7, 8, 9, 11, and 12 basically converge to the target point, and the target search is successful. The simulation search process with the MSRCPC algorithm can basically be divided into two stages: roaming search and coordinated search.
Figure 5. The figure of MSRCPC single-target search. (A) T = 0, (B) T = 40, (C) T = 80, (D) T = 124.
The search path of the robot recorded in this single-target simulation is shown in Figure 6, and it can be seen that the MSRCPC algorithm can not only search for targets quickly and accurately, but also can intelligently avoid obstacles, and has good cluster avoidance performance.
The Multi-Target Search Test in Unknown Complex Environments
Given that the initial number of robots is 30 and the number of targets is 5, other algorithm parameters are consistent with those of the single-target search algorithm in the previous section. The specific simulation search process is shown in Figures 7A–D.
Figure 7. The figure of MSRCPC multi-target search. (A) T = 0, (B) T = 87, (C) T = 123, (D) T = 186.
In Figure 7A, at T = 0, the drone swarm is randomly distributed at 200 × 200 unit positions in the search space, and the target points are randomly distributed in the 1,000 × 1,000 search space. The black irregular shape represents the obstacles in the search environment, and the proposed RS-TVCS method is adopted by the robot group to communicate. Based on the RS-TVCS communication rules, using the BFS algorithm, it can be known that at T = 0, the 30 robots are neighbors and can maintain information sharing. As shown in Figure 1, the 30 fan-shaped shared areas of the robots are group global communication area of the robot group. The robot does not detect the target signal in the global communication area, and the robot is in a roaming search state, that is, it uses the NNED algorithm to perform a roaming search at its maximum speed.
In Figure 7B, when T = 87, some robots detect the No. 2 target signal and the No. 3 target signal. At this time, the robots in the RS-TVCS global communication neighborhood share local information, and then perform target assignment based on the TRT model to form a subgroup alliance and enter the coordinated search state. However, the robots that fail to communicate with their subgroups continue to maintain the roaming search state and perform roaming searches. In addition, the No. 4 robot detects the No. 2 target signal and the No. 13 robot detects the No. 3 target signal. Based on the RS-TVCS neighborhood communication algorithm, it can be seen that the robots 6, 8, 9, 11, 17, 21, 24, and 29 that share information with the No. 4 robot form a subgroup alliance. Their state changes to the coordinated search state, and then a collaborative search will be conducted on the No. 3 target. In the same way, the No. 29 robot that shares information with the No. 13 robot forms a subgroup alliance, and then performs an accurate collaborative search on the No. 3 target. Since the remaining robots cannot communicate with the two subgroup alliances, or detect the target signal, they continue to maintain the roaming search state for random diffusion.
As shown in Figure 7C, when the MSRCPC algorithm iterate to T = 123, the subgroup alliances that perform a coordinated search on the targets No. 2 and No. 3 converge to the vicinity of targets No. 2 and No. 3, respectively, and the search for targets No. 2 and No. 3 succeeds. At the same time, the search target information disappears, and the subgroup alliance is disbanded. The formation of robot is the No. 9 robot and the No. 26 robot detect the signal of the No. 1 target and the No. 5 target, respectively. Similarly, according to the solution of the RS-TVCS distributed communication model, it can be seen that 8 robots (2, 18, etc.), which can share information with the No. 9 robot form a subgroup alliance to conduct a collaborative search for the No. 1 target, while the No. 26 robot that fails to interact with other robots cannot obtain communication and maintains a coordinated search alone. In addition, the remaining individual robots that fail to communicate with the target groups No. 1 and No. 5 continue to roam and search using the NNED algorithm.
Finally, as shown in Figure 7D, at T = 186, the robots successfully detect the No. 4 target, and the search of swarm robots ends.
The Simulation Analysis of the MSRCPC Algorithm
In the test of the single-target and multi-target search process, the MSRCPC algorithm proposed in this paper has the following advantages. (1) The search process of the algorithm mainly includes roaming search processes and coordinated search processes. In the roaming search process, the robot cannot obtain the prior information of the target, and spreads the search space at the fastest speed; in the coordinated search process, by obtaining the target information, the robots are determined by the RS-TVCS communication interaction model in the global communication range, and then the sub-group alliances approach the optimal position of the target point based on the group optimal information and individual optimal information in the IAEPSO algorithm. (2) Self-organization and adaptability are embodied in the process of the target search of swarm robots. In the process of target searching, swarm robots adaptively transform their own state by acquiring information of the external environment or sharing local information and participating in task collaboration. (3) The intelligence of swarm robots in the target search process is also reflected. In the process of roaming search and coordinated search, individual robots can realize intelligent obstacle avoidance by sensing the information of the external environment and successfully avoiding obstacles. In order to verify the performance of the MSRCPC algorithm, a series of comparative experiments are carried out in the next part.
Comparison and Discussion of MSRCPC Algorithm Simulation
In this part, the four sets of comparison modes are set up to further verify the superiority of the MSRCPC algorithm based on the multi-target search framework of the finite state machine. The settings of the four comparative search models are shown in Table 3.
Based on its framework, the search algorithm is divided into the following five parts, namely, multi-target task allocation model, roaming search algorithm, coordinated search algorithm, cluster obstacle avoidance, and distributed communication model.
For Mode 1, the NNED algorithm is adopted for roaming search, the traditional TRT model is used to assign tasks to the target, the EPSO algorithm proposed in Gudise (2004) is applied to coordinated search, the 2D-SVFM model (Xinjie, 2020) is applied to group obstacle avoidance, and the V-TVCS model proposed in Yang et al. (2019) is used for robot communication. For Mode 2, the IABA algorithm proposed in Tang et al. (2020) is applied to the robot coordinated search, and the other sub-algorithms remain constant. For Mode 3, the proposed IAEPSO algorithm is applied to the robot coordinated search, and the remaining sub-algorithms remain unchanged. For Mode 4, the proposed MSRCPC algorithm is used to set up the search experiment.
When the number of targets in the search environment of swarm robots is 10, by changing the number of swarm robots, these four modes were used to conduct 30 simulation search experiments. The change of the search path S and the mean value of the search time T of the swarm robots with the population number is shown in Figure 8; Table 4.
It can be seen from Figure 8; Table 4 that when the number of constant search targets is 10, as the scale of the swarm robots increases, the search time of the swarm robot system will decrease, but the system energy consumption of the swarm robots will increase. Therefore, we are surprised to find that how to balance the search time and energy consumption of the entire system by balancing the scale of swarm robots is a basic problem in the practical application of swarm robot systems.
By comparing Mode 3 with Mode 1 and Mode 2, it can be seen that in the entire multi-target search framework, the proposed IAEPSO coordinated search algorithm has better performance than EPSO and IABA algorithms in different population sizes. The main reason is that the adaptive inertia weight set by the IAEPSO algorithm can satisfy the coordinated search behavior of the robot. However, when the target signal exceeds a certain threshold, the robot has a larger inertia weight and can conduct a large-scale coordinated search. When the target signal is less than a certain threshold, in order to avoid the robot oscillating around the target, the algorithm can adaptively adjust the motion behavior of the robot to avoid the oscillation of the path, thereby reducing system energy consumption.
From the performance comparison curves of Mode 4 and Mode 3, it can be seen that in the case of the other sub-algorithms being the same, the search performance of swarm robots using the RS-TVCS distributed communication algorithm is better than that of Mode 3. The main reason is that the RS-TVCS distributed neighborhood communication model can meet the communication interaction performance of actual swarm robots. Using the RS-TVCS model in the process of forming subgroup alliances will make the configuration of the robot members within each member more reasonable, which can greatly improve the utilization of robot members and provide a more efficient search for the entire algorithm framework.
All in all, compared with the first three modes, the search performance of the swarm robotics can be improved by at least 25 by using the proposed MSRCPC algorithm (Mode 4).
Conclusion
The multi-target search problem of swarm robots in unknown complex environments is studied in this paper. The main innovations are as follows. (1) Aiming at the target search problem of swarm robots in actual environments, a target search framework based on a finite state machine is proposed. The proposed framework can not only solve the single-target search problem, but also solve the multi-target search problem, which improves the applicability of this algorithm in actual search scenarios. (2) In this algorithm, the problem of cluster obstacle avoidance is considered as a problem in the actual search environment, and the intelligence of cluster search for the robot is reflected. (3) In order to solve the distributed communication interaction problem in the unknown environments, by considering the random communication between individual robots and the limited visual area, a RS-TVCS model is proposed, which overcomes the shortcomings of the V-TVCS communication model.
Simulation analysis and comparison experiments show that this proposed algorithm has good search performance and strong scalability and stability, and can adapt to any search environment. In addition, we find, surprisingly, that the balance of search performance of the swarm robot system is related to the number of swarm robots. Therefore, how to balance the search path and search time of swarm robot systems by setting a certain number of swarm robots is the focus of further research.
Data Availability Statement
The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author/s.
Author Contributions
YZ: conceptualization, methodology, and writing—original draft preparation. AC: methodology, software, investigation, and writing—reviewing and editing. XH and XB: software and writing—reviewing and editing. All authors contributed to the article and approved the submitted version.
Funding
This work was supported in part by the National Defense Basic Research Program of China under Grant JCKY2019403D006, the Outstanding Youth Project of the Education Department of Hunan Province of China under Grant 19B200, the Doctoral Scientific Research Initial Funds of the Human University of Science and Technology under Grant E56126, and the Special Project of Engineering Research Center (Item No: Lgy18gz006).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher's Note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Alfeo, A. L., Cimino, M. G. C. A., and Vaglini, G. (2019). Enhancing biologically inspired swarm behavior: Metaheuristics to foster the optimization of uavs coordination in target search. Comput. Operat. Res. 110, 34–47. doi: 10.1016/j.cor.2019.05.021
Alsamman, A. (2011). Arizona, all seven of the graduating seniors on the robotics team at an inner-city high school in phoenix will be attending college this year on full scholarships. Optics Lett. 36, 645–647. doi: 10.1364/OL.36.000645
Anonymous (1993). Innova holdings and mesa robotics team up to integrate their products for military, homeland security and law enforcement arenas. Urology. 41, 384–386.
Awerbuch, B., and Gallager, R. G. (1987). A new distributed algorithm to find breadth first search trees. IEEE Trans. Inform. Theory. 33, 315–322. doi: 10.1109/TIT.1987.1057314
Bénichou, O., Loverdo, C., Moreau, M., and Voituriez, R. (2006). Two-dimensional intermittent search processes: an alternative to lévy flight strategies. Physic. Rev. 74:020102. doi: 10.1103/PhysRevE.74.020102
Berscheid, L., Meibner, P., and Kroeger, T. (2020). Self-supervised learning for precise pick-and-place without object model. IEEE Robot. Automat. Lett. 99, 1–1. doi: 10.1109/LRA.2020.3003865
Bonabeau, E. (1999). Swarm intelligence: from natural to artificial systems. Santa Fe Inst Stud ences Complex. 9, 32–77. doi: 10.1093/oso/9780195131581.001.0001
Booth, K. E. C., Piacentini, C., Bernardini, S., and Beck, J. C. (2020). Target search on road networks with range-constrained uavs and ground-based mobile recharging vehicles. IEEE Robot. Automat. Lett. 99, 1–1. doi: 10.1109/LRA.2020.3015464
Cizek, P., and Faigl, J. (2019). Self-supervised learning of the biologically-inspired obstacle avoidance of hexapod walking robot. Bioinspirat. Biomimetics. 14:046002. doi: 10.1088/1748-3190/ab1a9c
Doty, K. L., and Van Aken, R. E. (2002). “Swarm robot materials handling paradigm for a manufacturing workcell,” in IEEE International Conference on Robotics and Automation. New York, NY: IEEE.
Ducatelle, F., Di, G. A., Pinciroli, C., Mondada, F., and Gambardella, L. M. (2011). Communication assisted navigation in robotic swarms: Self-organization and cooperation. IEEE/RSJ Int. Conf. Intell. Robots Syst. 484:9. doi: 10.1109/IROS.2011.6094454
Goodrich, M. A., Morse, B. S., Engh, C., Cooper, J. L., and Adams, J. A. (2009). Towards using unmanned aerial vehicles (uavs) in wilderness search and rescue: Lessons from field trials. Interact. Stud. 10, 453–478. doi: 10.1075/is.10.3.08goo
Gudise, S. D. V. (2004). Optimal pso for collective robotics search applications. IEEE Trans. Evol. Comput. 2:15. doi: 10.1109/CEC.2004.1331059
Ha, I. K., and Cho, Y. Z. (2018). A probabilistic target search algorithm based on hierarchical collaboration for improving rapidity of drones. Sensors. 18:8. doi: 10.3390/s18082535
Jia, Y., Andrieu, C., Piechocki, R., and Sandell, M. (2008). Depth-first and breadth-first search based multilevel sga algorithms for near optimal symbol detection in mimo systems. IEEE Trans. Wireless Commun. 7, 1052–1061. doi: 10.1109/TWC.2008.060813
Jie, T. Y. L. (2019). A probabilistic finite state machine based strategy for multi-target search using swarm robotics. Appl. Soft Comput. 77, 467–483. doi: 10.1016/j.asoc.2019.01.023
Jiong, L. I., Jianshi, L. I., Feng, M., and Zhu, Y. (2019). Research on multi-target tracking method based on improved evolutionary hungarian algorithm. J. Military Transport. Univ. 8:298. doi: 10.3389/fbioe.2020.00298
Junior, L. S., and Nedjah, N. (2016). Efficient strategy for collective navigation control in swarm robotics. Procedia Comput. Sci. 80, 814–823. doi: 10.1016/j.procs.2016.05.371
Kamegawa, T., Akiyama, T., Sakai, S., Fujii, K., Une, K., Ou, E., et al. (2020). Development of a separable search-and-rescue robot composed of a mobile robot and a snake robot. Int. J. Adv. Robotics. 34, 132–139. doi: 10.1080/01691864.2019.1691941
Khan, Z. A., Awais, M., Alghamdi, T. A., Khalid, A., Fatima, A., Akbar, M., et al. (2019). Region aware proactive routing approaches exploiting energy efficient paths for void hole avoidance in underwater wsns. IEEE Access. 19:39155. doi: 10.1109/ACCESS.2019.2939155
Liang, G. Q., Kang, Y. H., Xing, Z. C., and Yin, G. Y. (2018). Uav cooperative multi-task assignment based on discrete particle swarm optimization algorithm. Comput. Simulat. 2, 81–86. doi: 10.3969/j.issn.1006-9348.2018.02.005
Luo, T., Liang, S., Zeyin, H. E., and Zhang, X. (2017). Path planning of robot based on glowworm swarm optimization algorithm of scene understanding. J. Comput. Applicat. 17:12. doi: 10.11772/j.issn.1001-9081.2017.12.3608
Majid, M., and Arshad, A. M. (2017). An analysis of pso inertia weight effect on swarm robot source searching efficiency. IEEE 447, 173–178. doi: 10.1109/I2CACIS.2017.8239053
Manic, K. D. (2009). Multi-robot, multi-target particle swarm optimization search in noisy wireless environments. Conf. Hum. Syst. Interact. 8, 76–87. doi: 10.1109/HSI.2009.5090958
Marzaeva, V. I. (2019). Mathematical modeling of canopy forest fire spread in the presence of fire breaks and barriers. Technic. Phys. 64, 1073–1081. doi: 10.1134/S1063784219080139
Moshayedi, A. J., and Gharpure, D. (2013). Implementing breath to improve response of gas sensors for leak detection in plume tracker robots. Adv. Intell. Syst. Comput. 259:13. doi: 10.1007/978-81-322-1768-8_31
Ryan, M. R. K. (2008). Exploiting subgraph structure in multi-robot path planning. J.Artific. Intell. Res. 31, 497–542. doi: 10.1613/jair.2408
Suzuki, K., Yokota, Y., Kanazawa, Y., and Takebayashi, T. (2020). “Online self-supervised learning for object picking:Detecting optimum grasping position using a metric learning approach,” in 2020 IEEE/SICE International Symposium on System Integration (SII).
Tang, H., Sun, W., Yu, H., Lin, A., and Xue, M. (2020). A multirobot target searching method based on bat algorithm in unknown environments. Exp. Syst. Applic. 141:112945. doi: 10.1016/j.eswa.2019.112945
Viswanathan, G. M., Buldyrev, S. V., Havlin, S., Luz, M. D.a, Raposo, E., et al. (1999). Optimizing the success of random searches. Nature. 401, 911–914. doi: 10.1038/44831
Wang, Y., Shi, K., Zheng, X., You, S., and Wang, N. (2020). Thermo-hydraulic coupled analysis of meshed district heating networks based on improved breadth first search method. Energy. 205:117950. doi: 10.1016/j.energy.2020.117950
Xinjie, W. (2020). Multiobjective coordinated search algorithm for swarm of uavs based on 3d-simplified virtual forced model. Int. J. Syst. Sci. 51, 367–386. doi: 10.1080/00207721.2020.1799110
Xue, S., Zhang, J., and Zeng, J. (2009). Parallel asynchronous control strategy for target search with swarm robots. Int. J. Bio-inspired Comput. 1, 151–163. doi: 10.1504/IJBIC.2009.023811
Yang, J., Wang, X., and Bauer, P. H. (2019). Extended pso based collaborative searching for robotic swarms with practical constraints. IEEE Access. 7, 328–76 341. doi: 10.1109/ACCESS.2019.2921621
Yao, J. Y., Raffuse, S. M., Brauer, G. J., Bowman, D. M., and Johnston, J. S. (2018). Predicting the minimum height of forest fire smoke within the atmosphere using machine learning and data from the calipso satellite. Rem. Sens. Environ. 206, 98–106. doi: 10.1016/j.rse.2017.12.027
Zhang, X. J., Zhang, M. L., Zhang, J. H., and Sun, L. Y. (2010). A multi-sensory robot used for searching toxic gas leak source. Adv. Mater. Res. 139, 2150–2153. doi: 10.4028/www.scientific.net/AMR.139-141.2150
Zhang, Z. J. C., and Xue, Y. Z. (2014). Dynamic task allocation with closed-loop adjusting in swarm robot search for multiple targets. Robot. 36, 57–68. doi: 10.1017/S0263574717000157
Keywords: multi-target search, swarm robots, roaming search, coordinated search, simplified virtual force model, distributed neighborhood communication
Citation: Zhou Y, Chen A, He X and Bian X (2021) Multi-Target Coordinated Search Algorithm for Swarm Robotics Considering Practical Constraints. Front. Neurorobot. 15:753052. doi: 10.3389/fnbot.2021.753052
Received: 04 August 2021; Accepted: 01 November 2021;
Published: 06 December 2021.
Edited by:
Chen Qiao, Xi'an Jiaotong University, ChinaReviewed by:
Weichao Yue, Zhengzhou University of Light Industry, ChinaLaihao Jiang, Harbin Engineering University, China
Ping Zhou, Northeastern University, China
Copyright © 2021 Zhou, Chen, He and Bian. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: You Zhou, emhvdXlvdTYxMyYjeDAwMDQwOzE2My5jb20=