- 1Thales Research and Technology, Singapore, Singapore
- 2Engineering Product Design, Singapore University of Technology and Design, Singapore, Singapore
- 3Mechanical Engineering, University of Ottawa, Ottawa, ON, Canada
The field of multi-robot systems (MRS) has recently been gaining increasing popularity among various research groups, practitioners, and a wide range of industries. Compared to single-robot systems, multi-robot systems are able to perform tasks more efficiently or accomplish objectives that are simply not feasible with a single unit. This makes such multi-robot systems ideal candidates for carrying out distributed tasks in large environments—e.g., performing object retrieval, mapping, or surveillance. However, the traditional approach to multi-robot systems using global planning and centralized operation is, in general, ill-suited for fulfilling tasks in unstructured and dynamic environments. Swarming multi-robot systems have been proposed to deal with such steep challenges, primarily owing to its adaptivity. These qualities are expressed by the system’s ability to learn or change its behavior in response to new and/or evolving operating conditions. Given its importance, in this perspective, we focus on the critical importance of adaptivity for effective multi-robot system swarming and use it as the basis for defining, and potentially quantifying, swarm intelligence. In addition, we highlight the importance of establishing a suite of benchmark tests to measure a swarm’s level of adaptivity. We believe that a focus on achieving increased levels of swarm intelligence through the focus on adaptivity will further be able to elevate the field of swarm robotics.
1 Introduction
The field of multi-robot systems (MRS) has recently been gaining popularity among various research groups, practitioners, and industrial actors, seeking productivity gains through advanced automation. MRS are able to perform tasks more efficiently and effectively and accomplish missions that, in some cases, are simply out of reach for a single unit. This makes MRS the ideal candidate for carrying out distributed tasks in large and rapidly changing environments—e.g., performing object retrieval, search-and-rescue, and surveillance (Darmanin and Bugeja, 2017). However, the classical centralized approach to MRS can rapidly become unfeasible when dealing with tasks in unstructured and/or dynamic environments. Swarming MRS have been utilized to address some of these shortcomings and challenges, given that swarming robots exhibit the highly desirable features of flexibility, robustness, and scalability, thereby giving a system’s agents to carry out complex tasks as a collective unit (Dorigo et al., 2021). Moreover, given the ease of mass production of robotic units, many researchers and industries alike are turning toward the use of low-cost and easy-to-manufacture robotic units in swarms (Bouffanais, 2016). The versatility of robot swarms is attested by a wealth of applications spanning from environmental monitoring (Thenius et al., 2016; Vallegra et al., 2018; Zoss et al., 2018) to area mapping (Kit et al., 2019; Mitchell and Michael, 2019) and area defense (Strickland et al., 2018; Shishika and Paley, 2019).
The popularity of swarming MRS can partly be credited to the three main advantages that such systems have over their centralized MRS counterparts: scalability, robustness, and flexibility. Of the three key properties, the word “flexibility” has often been used interchangeably with “adaptivity.” However, as defined by Dorigo et al. (2021), flexibility is the system’s capacity to perform tasks that depart from those chosen at design time, while adaptivity is the system’s capacity to learn or change its behavior to respond to new operating conditions. In its current state, swarming MRS have been demonstrated to be flexible; swarms using the same strategies and behavioral parameters have demonstrated their ability to carry out a given task over a wide range of scenarios (Esterle, 2018a; Kit et al., 2019), albeit with varying levels of performance.
Despite their flexibility, current swarms have only shown limited levels of adaptivity; the development of swarm strategies and behaviors has mostly been limited to optimizing a system to fit a narrow range of pre-specified environmental conditions. This has been achieved through the modification of various behavioral and strategy parameters, according to locally or globally measured metrics, and endows the swarming MRS with what could be qualified as “narrow” adaptivity (Pang et al., 2019; Birattari et al., 2020; Nauta et al., 2020). Although doing so allows a swarm to adapt its behavior and maximizes its performance within a narrow range of conditions, the MRS will be unable to adequately operate should the environment vary outside these conditions or in an unexpected manner. A truly adaptive swarm, i.e., one that displays a “general” swarm intelligence, would be able to cope and adapt its behaviors to any conditions presented to it, and be able to achieve different types of tasks depending on an agent’s physical capabilities (e.g., one cannot expect an area mapping swarm to carry out an object retrieval task).
In this perspective, we give a preliminary definition of the concept of general swarm intelligence and address what we believe is required to achieve true adaptivity, i.e., a system’s ability to learn or change its behavior in response to new operating conditions. We also discuss the inherent challenges in evaluating a concept, such as adaptivity, and propose the use of a benchmark test suite for its quantification and comparison. Finally, we discuss what is needed for a general swarm intelligence algorithm to be obtained. In light of this, we firmly believe that future research in the field of swarm robotics should focus on making robotic swarms more adaptive, thereby increasing their viability.
2 General intelligence
2.1 Strong and weak artificial (swarm) intelligence
The key feature of flexibility that defines the effectiveness of MRS can be seen as reminiscent of “weak” or “narrow” AI. Consider the flexibility exhibited by a school of fish when it carries out a rapid evasive maneuver, following a predator’s attack: swarm intelligence at its best. Searle (1980) stated that with weak AI, “the principal value of the computer in the study of the mind is that it gives us a very powerful tool.” For our school of fish, the group’s collective escape strategy from its predators is indeed a powerful survival tool. However, this particular strategy would have to be adapted if it were to remain as effective and relevant for a different species; a flock of birds would have to use a different strategy, given that they live in different mediums and deal with different predators. In contrast, with artificial general intelligence (AGI)—also referred to as “strong” AI—“the computer is not merely a tool in the study of the mind: rather, the appropriately programmed computer really is a mind, in the sense that computers, given the right programs, can be literally said to understand and have other cognitive states” (Searle, 1980). As such, systems with AGI should be able to generalize their knowledge and use them in various different contexts (Goertzel, 2014). General swarm intelligence would then be akin to AGI, one in which a system demonstrates high levels of adaptivity, thereby allowing it to effectively function in any highly dynamic environment under any given set of circumstances.
It is worth reminding that AI is usually defined and constructed using some characteristics associated with human intelligence. Interestingly, natural swarm intelligence (SI) exhibited by animal groups constitutes yet another form of intelligence—albeit a collective and decentralized one—that can be seen as distinct from individual human intelligence. To better appreciate the concept of SI, it can be useful to consider each bird in a flock of birds as a neuron within a brain, with the intelligence emerging from the repeated interactions between the constituting units. As a matter of fact, the current framework for AI does not always lend itself to SI with its decentralized information gathering, social information transfer, and distributed processing. Nonetheless, there is broad consensus within the scientific community about SI being a particular subset of AI, as currently defined, without necessarily knowing where SI exactly fits (Bonabeau et al., 1999; Sadiku et al., 2021). It is important to note that the central difference between AI and SI is more than a simple question of application. The key distinction is that collective intelligence emerges from the actions of the swarm’s agents that ultimately lead to complex collective behaviors that are greater than the sum of its parts. The macro-level behavior cannot be directly programmed. Instead, the algorithm must be designed to empower individual agents to come together as an emergent swarm to accomplish a desired task. This extra layer of separation between the programming and the desired output behavior generally makes achieving swarm intelligence much more complicated than the equivalent AI behavior (Birattari et al., 2020; Nguyen et al., 2020; Bianconi et al., 2023).
Following this line of thought, the development of current non-adaptive swarming algorithms can be compared to that of current non-adaptive narrow AI algorithms. In both cases, some form of reprogramming or intervention is required by a human operator to match a system to its environment and/or task (Goertzel, 2014). This parallel leads us to believe that breakthroughs in AGI development will concomitantly lead to more adaptive outcomes for swarms. For instance, Butz (2021) argues that current (narrow) machine learning “optimizes the best possible strategy within the status quo” and that current algorithms are “reflective rather than prospective.” On the other hand, strong AI needs to be predictive to infer the hidden causes behind its observations and explain the reasoning behind its decision-making process. This offers us prospective routes to improve the current SI status quo.
2.2 Defining swarm intelligence
Currently, there is no concise and universally accepted definition of SI. Given the search for a definition of intelligence by both the SI and AI communities, we have taken inspiration from the various definitions proposed by the AI community to develop one for SI. To this end, we have made a selection of definitions from Legg and Hutter (2007) shown in Table 1. In this table, it can be seen that the key common element shared across definitions is a system’s ability to make necessary behavioral adjustments to cope with the changing environment and to achieve its goals that may vary over time. Together with the importance of collective behavior and adaptivity discussed previously, we believe that swarm intelligence can be defined as follows: “Swarm Intelligence is the emergent ability of a decentralized system of agents to make the appropriate adjustments to its collective behavior, thereby allowing the system to achieve changing goals in dynamic environments.”
TABLE 1. List of AI definitions selected from Legg and Hutter (2007).
3 Adaptivity
3.1 Current state of the art
Our proposed definition of SI is strongly linked with a system’s level of adaptivity—i.e., the system’s ability to learn or change its behavior in response to new operating conditions (Dorigo et al., 2021). Despite the importance of adaptivity for swarm intelligence, the vast majority of research carried out on swarm robotic systems aims to optimize a strategy or a certain set of behavioral parameters to maximize its performance for a task-specific and/or environmental conditions. In these works, adaptivity is usually demonstrated as an afterthought, with systems exhibiting that they are able to function in various operating conditions, albeit with a certain degree of performance degradation (Coquet et al., 2019; Kwa et al., 2022a).
Adaptive swarms can be achieved through either offline or online swarm design methods, as suggested by Birattari et al. (2020). These two radically different approaches toward adaptivity have also been described by Hasbach and Bennewitz (2022) as two different controllers—(1) a robust controller that allows a system to control its behavior when the environmental conditions vary within the expected range (i.e., offline methods) and (2) an adaptive controller that allows the system to implement rules and new system goals to adjust the robust controller when the fit between the robust controller and the situation set is not optimal (i.e., online design methods). A visual representation of the differences between these two controllers is shown in Figure 1.
FIGURE 1. Flowchart of controllers used in swarming MRS to promote adaptivity. (A) Robust controllers. (B) Adaptive controllers.
Currently, adaptivity in swarms is mostly achieved using robust controllers. In doing so, swarms develop a base set of strategies that can be further optimized to maximize the system’s performance over a specific range of task conditions, thereby achieving narrow adaptivity. Such a system was used by Rausch et al. (2019), who identified an optimal degree of connectivity between agents and varied the communication ranges of the individual agents to maintain a consistent number of neighbors, thereby maintaining the coherence of the system. Similarly, by optimizing an individual agent’s movements to maintain the inter-agent spacing and a constant Voronoi cell area, Vallegra et al. (2018) and Kouzeghar et al. (2023) were able to maintain a high level of coverage of two different types of dynamic environments using swarms composed of autonomous buoys and UAVs, respectively. Using this approach, systems will be able to perform well within a specific range of conditions. However, should the operating conditions vary outside this range—i.e., when a high level of adaptivity is demanded from the system—the performance of the swarm will inevitably suffer due to the fact that the system has been optimized prior to its implementation (Hunt, 2020).
3.2 Toward general adaptivity
We believe that attaining general swarm intelligence would require a system to have general adaptivity. This would, therefore, involve using adaptive controllers. In doing so, systems will be able to optimize, learn, or develop new behavioral and strategy parameters as the mission progresses, thereby allowing systems to display the sought-after general adaptivity. This naturally allows for a higher degree of adaptivity due to the increased abilities of an MRS to modulate and vary its behavior, better allowing the system to maintain its performance should the operating conditions change. It is worth noting that a few initial attempts have been made in using these methods, for instance, in systems making use of Lévy walks, individual agents autonomously varying their individual Lévy parameters to modulate the level of exploration and exploitation being carried out (Pang et al., 2019; Nauta et al., 2020; Garcia-Saura et al., 2021; Kwa et al., 2022a). Along the same vein, Esterle (2018b) developed a swarm robotic system with agents autonomously switching between two states, thereby allowing the system to track targets that continuously appear and disappear in the environment. This was achieved using a variable threshold value that was calculated based on the number of targets currently tracked and the number of targets to be tracked.
In making swarms more adaptive, regardless of the approach, system designers need to identify the correct set of cues to determine if adaptation should be triggered and how these cues will be sensed (Hunt, 2020). In addition, designers should also be wary of agents responding to false alarms when detecting these cues, i.e., systems adapting to changes in the operating conditions that have not actually taken place. This usually entails a process, where a system’s responsiveness to environmental changes needs to be balanced against its resilience to false alerts, also known as the stability–flexibility dilemma in neural systems (Liljenström, 2003; Wahby et al., 2019; Leonard and Levin, 2022).
4 Swarm intelligence benchmark framework
4.1 The need for a swarm intelligence benchmark
Given the need to push toward increasing levels of swarm intelligence, i.e., system adaptivity, we believe that a common benchmark is required to facilitate the comparison between different strategies, a belief also shared by Dorigo et al. (2021). The concept of benchmarking is rather common and already exists in other more mature research fields such as optimization, where algorithms are tested against various benchmark functions (Huband et al., 2006; Li et al., 2013). Similarly, various test environments have been created to evaluate control strategies developed by means of reinforcement learning algorithms for single agents (Duan et al., 2016; Cobbe et al., 2020) and multiple agents (Lowe et al., 2017). The introduction of such a benchmark for a system’s adaptivity has the potential to unify the research community by providing a basis for measurement and comparison, and allow practitioners to understand how high levels of adaptivity are derived.
Although swarm intelligent solutions have been established for multiple problems (Kit et al., 2019; Kwa et al., 2022b; Esterle and King, 2022; Kouzeghar et al., 2023), they are often evaluated based on their own operational performance for a specified scenario and fixed conditions. This is to be expected as such systems are developed and applied to solve a particular problem or carry out a specific task. Therefore, the evaluation of a system’s operational performances allows it to be compared to previous solutions and strategies found in the literature. However, measuring the operational performance in one scenario cannot possibly be considered equivalent to measuring swarm intelligence. Indeed, a system displaying high levels of operational performance may not necessarily have swarm intelligence. Take, for example, a multi-robot system tasked with retrieving and delivering packages to and from designated locations within a warehouse. When operating within a predictable, organized, and mostly static environment, a centralized pre-planned strategy that has been optimized for a specific warehouse is often the ideal solution (Ma et al., 2017; Bredeche and Fontbonne, 2022). Recently, attempts have also been made at developing decentralized strategies that feature reduced optimization times when the number of robots in the system is increased while maintaining similar levels of performance (Claes et al., 2017). However, both strategies would be unable to deliver the same level of performance should they be deployed in different warehouses without another round of optimization and planning. On the other hand, while swarm intelligent strategies may not perform as well as the situation-optimized solutions, they are able to adapt to changes in the environment autonomously and deliver similar levels of performances across different operating conditions. The use of an adaptivity benchmark test suite would highlight this higher level of adaptivity present within swarm intelligent MRS.
4.2 Testing narrow and general swarm intelligence
As previously stated, current MRS are capable of displaying narrow adaptivity, where a system is able to carry out one specific task in a dynamic environment. As such, to test for adaptivity, the system must be able to demonstrate that it is able to perform over a possibly wide range of different environmental conditions for a given class of problems. For example, in a target tracking scenario, an MRS using an adaptive strategy should be able to maintain a certain level of performance should there be changes in the number of targets to be tracked, target speed, target movement profile, etc. (Esterle, 2018a; Kwa et al., 2020; Kwa et al., 2021; Kwa et al., 2022b). In many engineered systems, there often exist performance indicators and metrics, such as those for first- and second-order linear time-invariant (LTI) controllers that allow a system’s properties to be fully understood by its users and also facilitate its comparison to other similar systems. Similar benchmark frameworks and characteristic graphs could also be developed for MRS that could reveal how a system’s performance is expected to evolve over varying environmental conditions, thereby giving an indication of a system’s level of adaptivity.
To progress from narrow adaptivity toward general swarm intelligence, the community should focus its efforts on developing systems with general adaptivity. This will endow systems with not just the ability to adapt to different environmental conditions but also with the ability to adjust their behaviors to accomplish different tasks, e.g., changing behaviors from a set suited to collective mapping to another set suited to target tracking. This taps into the central concept of transferability, where one can evaluate the system’s ability to operate in different scenarios while maintaining its level of performance. Measuring general adaptivity requires more than one class of benchmark environment. In view of this, a suite of benchmark tests should be developed and implemented, similar to the Procgen Benchmark proposed and developed by OpenAI for the testing of reinforcement learning agents (Cobbe et al., 2020). Specifically, with the benchmark, developers are able to measure how quickly their agents learn generalizable skills in 16 procedurally generated environments, in effect measuring their agent’s level of general adaptivity. It is worth adding that these 16 environments have been selected to assess different characteristics. In our proposed benchmark framework, as shown in Figure 2, each benchmark test environment would be associated with its own set of environmental parameters and performance metrics, thereby allowing the user to determine which system is more adaptive in any of the given contexts.
FIGURE 2. Flowchart describing our proposed benchmark framework. The systems to be compared are tested in multiple environments (environments 1, 2, …, n), each with different environmental parameters (a, b, …, x). Their performances are determined by various metrics specific to each environment, thereby allowing the ranking of the systems’ adaptivity in different contexts.
Although more standardized benchmarks can improve the development of adaptivity in swarms, it is not a panacea. The existence of such benchmarks may give clear targets and methods to evaluate a given set of strategies, which may inadvertently lead to other factors not included by the benchmark to be ignored. As stated by Raji et al. (2021), there is no test dataset or environment that can capture the full details and complexity of all the possible scenarios in which an MRS can be deployed. Hence, any implemented benchmark framework must, like general swarm intelligence algorithms, evolve to prevent what is essentially the overfitting of strategies to particular benchmarks. Such evolving environments have already been implemented by Cobbe et al. (2020); the benchmark uses procedurally generated environments, essentially offering a “near-infinite supply of randomized content,” preventing an agent from memorizing an optimal policy. The ultimate goal of any proposed benchmark framework should not be to declare if an algorithm has attained general adaptivity but to enable practitioners to understand how systems are able to adapt to their immediate surroundings and compare different systems while they are executing similar tasks.
5 Discussion
Many parallels can be established between the development of artificial general intelligence and general swarm intelligence. This includes the search for a general (swarm) intelligence algorithm and a fixation of the community on developing tailor-made strategies for very specific circumstances. However, we believe that these two are contradictory objectives. Developing environment-specific strategies will essentially maximize the performance of a swarming MRS for a particular set of conditions. However, these high levels of performance would most likely falter should the conditions deviate from those considered at the design stage. A system designed for adaptivity would avoid such declines in performance when changing conditions, and thereby, moves us closer to developing general swarm intelligence algorithms.
We believe the key to unlocking these general swarm intelligence algorithms is through increasing the system’s level of adaptivity. This would allow a swarm to categorize and understand its environment, thereby giving it the ability to change its collective behavior to match its changing goals in dynamic environments. As previously mentioned, such a system needs to be multipurpose and be able to optimize its own behavioral parameters, or even develop new ones, to achieve its new objectives in the most efficient manner (Birattari et al., 2020; Hasbach and Bennewitz, 2022). For example, in the absence of any targets, a swarm with general SI originally performing a target tracking task may find it more appropriate to transition to carry out area mapping and, therefore, optimize its actions and behavior for this new task. Should a single target appear in the environment, the system may even deem it appropriate to carry out both tracking and mapping tasks concurrently.
As a first step toward general SI, adaptivity must be embedded into the system design process, essentially allowing the system to learn to be adaptive. From the AI standpoint, this means that the robotic system is capable of dealing with an open world, such that techniques to enable adaptivity during operation—and not at the design stage—will be the key. To this end, we expect designers to train their agents in gradually more open environments using multi-agent reinforcement learning (MARL), allowing for the effects of dynamic environmental factors to be included in the training process. Indeed, several research groups have already started using MARL techniques to develop policies for their swarming agents in dynamic environments (Kouzehgar et al., 2020; Wang et al., 2022a; Wang et al., 2022b; Kouzeghar et al., 2023). Learning from demonstration, experience replay, and transfer learning offer promising opportunities to exploit prior knowledge, e.g., from another domain or task (Karimpanal and Bouffanais, 2018; Karimpanal and Bouffanais, 2019). However, these powerful techniques will have to be extended to multi-agent systems.
Although current MARL-trained swarms only focus on achieving a single objective, enabling a swarm to switch between tasks or carry out tasks simultaneously can possibly be achieved by setting the swarm’s priorities. Doing so would allow a system to identify environmental cues to trigger the switch between scheduled tasks, thereby allowing it to ascertain when and how to accomplish different tasks as it learns and adapts to its environment. However, to attain such a high level of adaptivity, a system needs to account for a wide range of factors and missions while simultaneously being able to avoid reacting to false alarms. It is not feasible for MARL practitioners to handcraft a reward function and perform hyperparameter tuning for a system with so many parameters and goals over all possible environments. As such, further advances to swarm adaptivity may come from the newly established field of automated reinforcement learning (AutoRL), essentially enabling a swarming MRS to train itself, i.e., self-learn (Faust et al., 2019; Parker-Holder et al., 2022).
Given the need to design for adaptivity, it would also be beneficial to quantify this level of adaptivity and see how different swarming systems compare with each other. Similar to the testing of “standard” artificial intelligence and computational optimization algorithms, there is a critical need for the swarm intelligence community to implement a suite of evolving benchmark problems, along with metrics to evaluate the performance of swarming systems, with a focus on adaptivity. Although there are works that compare the performance of different swarming algorithms, such comparisons are usually incomplete, with the two strategies usually being tested within a narrow range of environmental conditions and tasks. Since it has previously been shown that MRS performances are highly sensitive to the demands of the task and those of the operating environment, the current limited form of testing may give a false impression that one swarm strategy is able to outperform another over all conditions and settings. Therefore, an implementation of a standardized benchmark framework, consisting of a suite of benchmark problems, would allow for a more complete and accurate comparison of different swarm algorithms and also contribute to faster algorithm development and evaluation.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary Material; further inquiries can be directed to the corresponding author.
Author contributions
All authors listed have made a substantial, direct, and intellectual contribution to the work and approved it for publication.
Funding
HLK was funded by Thales Solutions Asia Pte Ltd., under the Singapore Economic Development Board Industrial Postgraduate Program (IPP). JP, NH, MS, RB were supported by the Natural Sciences and Engineering Research Council of Canada (NSERC), under the grant # RGPIN-2022-04064.
Conflict of interest
HLK was employed by Thales Solutions Asia Pte Ltd. The company was not involved in the writing of the manuscript or the decision to submit it for publication.
The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Bianconi, G., Arenas, A., Biamonte, J., Carr, L. D., Kahng, B., Kertesz, J., et al. (2023). Complex systems in the spotlight: Next steps after the 2021 nobel prize in physics. J. Phys. Complex 4, 010201. doi:10.1088/2632-072X/ac7f75
Birattari, M., Ligot, A., and Hasselmann, K. (2020). Disentangling automatic and semi-automatic approaches to the optimization-based design of control software for robot swarms. Nat. Mach. Intell. 2, 494–499. doi:10.1038/s42256-020-0215-0
Bonabeau, E., Dorigo, M., and Theraulaz, G. (1999). Swarm intelligence: From natural to artificial systems. Oxford University Press. doi:10.1093/oso/9780195131581.001.0001
Bouffanais, R. (2016). Design and control of swarm dynamics. Springer Singapore. doi:10.1007/978-981-287-751-2
Bredeche, N., and Fontbonne, N. (2022). Social learning in swarm robotics. Philosophical Trans. R. Soc. B Biol. Sci. 377, 20200309. doi:10.1098/rstb.2020.0309
Butz, M. V. (2021). Towards strong AI. KI - Künstliche Intell. 35, 91–101. doi:10.1007/s13218-021-00705-x
Claes, D., Oliehoek, F., Baier, H., and Tuyls, K. (2017). “Decentralised online planning for multi-robot warehouse commissioning,” in AAMAS’17: Proceedings of the 16th international conference on autonomous agents and multiagent systems (Sao Paulo, Brazil, 492–500.
Cobbe, K., Hesse, C., Hilton, J., and Schulman, J. (2020). International conference on machine learning (PMLR), 2048–2056.
Coquet, C., Aubry, C., Arnold, A., and Bouvet, P. J. (2019). “A local charged particle swarm optimization to track an underwater mobile source,” in Oceans 2019 - marseille (Marseille, France: IEEE). doi:10.1109/OCEANSE.2019.8867527
Darmanin, R. N., and Bugeja, M. K. (2017). “A review on multi-robot systems categorised by application domain,” in 2017 25th mediterranean conference on control and automation (MED). doi:10.1109/med.2017.7984200
Dorigo, M., Theraulaz, G., and Trianni, V. (2021). Swarm robotics: Past, present, and future [point of view]. Proc. IEEE 109, 1152–1165. doi:10.1109/jproc.2021.3072740
Duan, Y., Chen, X., Houthooft, R., Schulman, J., and Abbeel, P. (2016). “Benchmarking deep reinforcement learning for continuous control,” in Proceedings of the 33rd international conference on machine learning. Editors M. F. Balcan, and K. Q. Weinberger (New York, New York, USA: PMLR), 48, 1329–1338. Proceedings of Machine Learning Research.
Esterle, L. (2018a). “Chainmail: Distributed coordination for multi-task κ-Assignment using autonomous mobile IoT devices,” in Proceedings - 14th annual international conference on distributed computing in sensor systems, DCOSS 2018 (New York City, New York, U.S.A: Institute of Electrical and Electronics Engineers Inc.), 85–92. doi:10.1109/DCOSS.2018.00019
Esterle, L. (2018b). “Goal-aware team affiliation in collectives of autonomous robots,” in 2018 IEEE 12th international conference on self-adaptive and self-organizing systems (SASO) (Trento, Italy, 90–99. doi:10.1109/SASO.2018.00020
Esterle, L., and King, D. W. (2022). Loosening control — A hybrid approach to controlling heterogeneous swarms. ACM Trans. Aut. Adapt. Syst. 16, 1–26. doi:10.1145/3502725
Faust, A., Francis, A., and Mehta, D. (2019). “Evolving rewards to automate reinforcement learning,” in 6th ICML workshop on automated machine learning (CA, USA: Long Beach). doi:10.48550/arXiv.1905.07628
Fogel, D. B. (1995). Review of computational intelligence: Imitating life [book reviews]. Proc. IEEE 83, 1588. doi:10.1109/JPROC.1995.481636
Garcia-Saura, C., Serrano, E., Rodriguez, F. B., and Varona, P. (2021). Intrinsic and environmental factors modulating autonomous robotic search under high uncertainty. Sci. Rep. 11, 24509. doi:10.1038/s41598-021-03826-3
Goertzel, B. (2014). Artificial general intelligence: Concept, state of the art, and future prospects. J. Artif. General Intell. 5, 1–48. doi:10.2478/jagi-2014-0001
Hasbach, J. D., and Bennewitz, M. (2022). The design of self-organizing human–swarm intelligence. Adapt. Behav. 30, 361–386. doi:10.1177/10597123211017550
Huband, S., Hingston, P., Barone, L., and While, L. (2006). A review of multiobjective test problems and a scalable test problem toolkit. IEEE Trans. Evol. Comput. 10, 477–506. doi:10.1109/TEVC.2005.861417
Hunt, E. R. (2020). Phenotypic plasticity provides a bioinspiration framework for minimal field swarm robotics. Front. Robotics AI 7, 23. doi:10.3389/frobt.2020.00023
Karimpanal, T., and Bouffanais, R. (2019). Self-organizing maps for storage and transfer of knowledge in reinforcement learning. Adapt. Behav. 27, 111–126. doi:10.1177/1059712318818568
Karimpanal, T. G., and Bouffanais, R. (2018). Experience replay using transition sequences. Front. Neurorobotics 12, 32. doi:10.3389/fnbot.2018.00032
Kit, J. L., Dharmawan, A. G., Mateo, D., Foong, S., Soh, G. S., Bouffanais, R., et al. (2019). “Decentralized multi-floor exploration by a swarm of miniature robots teaming with wall-climbing units,” in 2019 international symposium on multi-robot and multi-agent systems (MRS) (New Brunswick, NJ, USA: IEEE). doi:10.1109/MRS.2019.8901058
Kouzeghar, M., Song, Y., Meghjani, M., and Bouffanais, R. (2023). “Multi-target pursuit by a decentralized heterogeneous uav swarm using deep multi-agent reinforcement learning,” in 2023 international conference on robotics and automation (ICRA) (London, UK.
Kouzehgar, M., Meghjani, M., and Bouffanais, R. (2020). “Multi-agent reinforcement learning for dynamic ocean monitoring by a swarm of buoys,” in Global OCEANS 2020: Singapore-U.S gulf coast (Singapore). doi:10.1109/IEEECONF38699.2020.9389128
Kwa, H. L., Babineau, V., Philippot, J., and Bouffanais, R. (2022b). Adapting the exploration-exploitation balance in heterogeneous swarms: Tracking evasive targets. Artif. Life 29, 21–36. doi:10.1162/artl_a_00390
Kwa, H. L., Kit, J. L., and Bouffanais, R. (2022a). Balancing collective exploration and exploitation in multi-agent and multi-robot systems: A review. Front. Robotics AI 8, 771520. doi:10.3389/frobt.2021.771520
Kwa, H. L., Kit, J. L., and Bouffanais, R. (2020). “Optimal swarm strategy for dynamic target search and tracking,” in Proc. Of the 19th international conference on autonomous agents and multiagent systems (Auckland, New Zealand: AAMAS), 672–680. 2020. doi:10.5555/3398761.3398842
Kwa, H. L., Kit, J. L., and Bouffanais, R. (2021). “Tracking multiple fast targets with swarms: Interplay between social interaction and agent memory,” in Alife 2021: The 2021 conference on artificial life (Prague, Czech Republic. doi:10.1162/isal_a_00376
Legg, S., and Hutter, M. (2007). A collection of definitions of intelligence. Front. Artif. Intell. Appl. 157, 17.
Legg, S., and Hutter, M. (2006). A formal measure of machine intelligence. arXiv preprint cs/0605024.
Leonard, N. E., and Levin, S. A. (2022). Collective intelligence as a public good. Collect. Intell. 1, 263391372210832–18. doi:10.1177/26339137221083293
Li, X., Tang, K., Omidvar, M. N., Yang, Z., Qin, K., and China, H. (2013). Benchmark functions for the CEC 2013 special session and competition on large-scale global optimization. gene 7, 8.
Liljenström, H. (2003). Neural stability and flexibility: A computational approach. Neuropsychopharmacology 28, S64–S73. doi:10.1038/sj.npp.1300137
Lowe, R., Wu, Y., Tamar, A., Harb, J., Abbeel, P., and Mordatch, I. (2017). “Multi-agent actor-critic for mixed cooperative-competitive environments,” in Neural information processing systems (NIPS).
Mitchell, D., and Michael, N. (2019). “Persistent multi-robot mapping in an uncertain environment,” in Proceedings - IEEE international conference on robotics and automation (Montreal, Quebec, Canada), 4552–4558. doi:10.1109/ICRA.2019.8794469
Nauta, J., Havermaet, S. V., Simoens, P., and Khaluf, Y. (2020). “Enhanced foraging in robot swarms using collective levy walks,” in 24th European conference on artificial intelligence - ecai 2020 (Santiago de Compostela, Spain. doi:10.3233/FAIA200090
Newel, A., and Simon, H. A. (1976). Computer science as empirical inquiry: Symbols and search. Commun. ACM 19, 113–126. doi:10.1145/360018.360022
Nguyen, T. T., Nguyen, N. D., and Nahavandi, S. (2020). Deep reinforcement learning for multiagent systems: A review of challenges, solutions, and applications. IEEE Trans. Cybern. 50, 3826–3839. doi:10.1109/TCYB.2020.2977374
Pang, B., Song, Y., Zhang, C., Wang, H., and Yang, R. (2019). A swarm robotic exploration strategy based on an improved random walk method. J. Robotics 2019, 1–9. doi:10.1155/2019/6914212
Parker-Holder, J., Rajan, R., Song, X., Biedenkapp, A., Miao, Y., Eimer, T., et al. (2022). Automated reinforcement learning (AutoRL): A survey and open problems. J. Artif. Intell. Res. 74, 517–568. doi:10.1613/jair.1.13596
Poole, D., Mackworth, A., and Goebel, R. (1998). Computational intelligence A logical approach. New York: Oxford University Press.
Raji, I. D., Bender, E. M., Paullada, A., Denton, E., and Hanna, A. (2021). “AI and the everything in the whole wide world benchmark,” in 35th conference on neural information processing systems (NeurIPS 2021) track on datasets and benchmarks (virtual).
Rausch, I., Reina, A., Simoens, P., and Khaluf, Y. (2019). Coherent collective behaviour emerging from decentralised balancing of social feedback and noise. Swarm Intell. 13, 321–345. doi:10.1007/s11721-019-00173-y
Sadiku, M. N., Musa, S. M., and Ajayi-Majebi, A. (2021). A primer on multiple intelligences. Springer. doi:10.1007/978-3-030-77584-1
Searle, J. R. (1980). Minds, brains, and programs. Behav. brain Sci. 3, 417–424. doi:10.1017/S0140525X00005756
Shishika, D., and Paley, D. A. (2019). Mosquito-inspired distributed swarming and pursuit for cooperative defense against fast intruders. Aut. Robots 43, 1781–1799. doi:10.1007/s10514-018-09827-y
Strickland, L., Baudier, K., Bowers, K., Pavlic, T. P., and Pippin, C. (2018). “Bio-inspired role allocation of heterogeneous teams in a site defense task,” in Distributed autonomous robotic systems 2018 (Boulder, CO, USA: Springer International Publishing). doi:10.1007/978-3-030-05816-6_10
Thenius, R., Moser, D., Varughese, J. C., Kernbach, S., Kuksin, I., Kernbach, O., et al. (2016). subCULTron - cultural development as a tool in underwater robotics. Artif. Life Intelligent Agents 732, 27–41. doi:10.1007/978-3-319-90418-4_3
Vallegra, F., Mateo, D., Tokić, G., Bouffanais, R., and Yue, D. K. P. (2018). “Gradual collective upgrade of a swarm of autonomous buoys for dynamic ocean monitoring,” in IEEE-MTS oceans 2018 (SC, USA: Charleston). doi:10.1109/OCEANS.2018.8604642
Wahby, M., Petzold, J., Eschke, C., Schmickl, T., and Hamann, H. (2019). “Collective change detection: Adaptivity to dynamic swarm densities and light conditions in robot swarms,” in The 2019 conference on artificial life. Editors H. Fellermann, J. Bacardit, Á. Goñi-Moreno, and R. M. Füchslin (Newcastle, UK: MIT Press), 642–649. doi:10.1162/isal_a_00233.xml
Wang, G., Wei, F., Jiang, Y., Zhao, M., Wang, K., and Qi, H. (2022a). A multi-AUV maritime target search method for moving and invisible objects based on multi-agent deep reinforcement learning. Sensors 22, 8562. doi:10.3390/s22218562
Wang, P. (1995). “On the working definition of intelligence,” in Center for research on concepts and cognition (Indiana University). Tech. rep., Technical Report 94.
Wang, Y., Damani, M., Wang, P., Cao, Y., and Sartoretti, G. (2022b). Distributed reinforcement learning for robot teams: A review. Curr. Robot. Rep. 3, 239–257. doi:10.1007/s43154-022-00091-8
Keywords: adaptivity, collective robotics, multi-agent systems, multi-robot systems, swarm robotics, swarm intelligence
Citation: Kwa HL, Kit JL, Horsevad N, Philippot J, Savari M and Bouffanais R (2023) Adaptivity: a path towards general swarm intelligence?. Front. Robot. AI 10:1163185. doi: 10.3389/frobt.2023.1163185
Received: 10 February 2023; Accepted: 17 April 2023;
Published: 09 May 2023.
Edited by:
Konstantinos Karydis, University of California, Riverside, United StatesReviewed by:
Lidong Zhang, East China Normal University, ChinaCopyright © 2023 Kwa, Kit, Horsevad, Philippot, Savari and Bouffanais. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Roland Bouffanais, cm9sYW5kLmJvdWZmYW5haXNAdW90dGF3YS5jYQ==