Introduction

Stigmergy1,2,3 is a coordination mechanism in which agents self-organise through indirect, local communication mediated by the environment. When using stigmergy, agents leave indications of their presence or actions in the environment, and these indications stimulate or inhibit the behaviours of their peers4. Some animals physically transform the environment, thus producing visual cues that influence their peers. For instance, humans leave footprints on the ground and flatten vegetation while walking in the wild, thereby creating visually detectable paths that others tend to follow5. Other animals secrete chemicals that their peers can detect and to which they react—for instance, Argentine ants lay pheromone trails that are then followed by nestmates6.

For many social insects, pheromone-based stigmergy plays an important role in self-organisation7. These insects can sense environmental features, locally interact with other members of the colony and with the environment, and process information to make decisions8. However, they have short perception and communication ranges, are not aware of the global state of the colony, are unable to remember their actions, and are unable to plan their contributions to the collective activities of the colony8. The pheromones laid in the environment function as a collective and distributed memory: they effectively encode the state of the colony. The pheromones enable coordination, as the individuals can work together and self-organise without the need to communicate directly or receive instruction on the tasks they must perform9,10.

In a robot swarm, which operates similarly to a colony of social insects11, a collective behaviour emerges from local interactions between individual robots and between the robots and the environment12. A robot swarm, like an insect colony, can use pheromone-based indirect communication mediated by the environment13. Designers of robot swarms can develop pheromone-based interaction strategies for specific missions. However, giving real robots the capability to mark the environment with indications of their activities is still an open technological challenge14. In some studies, researchers have developed smart environments to enable pheromone-based stigmergy, for instance, by using: (i) a system of stationary devices (e.g., RFID tags) spread throughout the environment to store virtual pheromones15,16,17,18,19, (ii) devices to display or project virtual pheromones on the ground20,21,22,23, or (iii) augmented reality to immerse the robots in a virtual environment in which they can lay and sense pheromones24,25,26. These systems are flexible, powerful, and enable the implementation of complex coordination mechanisms. However, as they rely on external infrastructure (for tracking robots, displaying the pheromones, and storing information), they can be expensive and are only suitable under restricted conditions. Alternatively, several approaches to physically deposit artificial pheromones have been proposed, using specialised onboard actuators to lay trails of alcohol or wax, without the assistance of any external infrastructure27,28,29. However, these solutions would be impractical in most real-world applications due to the hazards of using flammable material (alcohol) or heating devices (for melting wax). To address the issue, we have recently proposed a hardware module for robots that projects UV light downwards, laying an artificial pheromone trail on ground that has previously been coated with photochromic material30. The part of the ground that is exposed to UV light changes in colour from white to magenta. Once the UV light is removed, the magenta colour fades back to white in about 50 s, mimicking the evaporation of a pheromone. This approach does not present safety risks and does not rely on complex or expensive infrastructure; however, it still requires the environment to be prepared before deploying the robots.

The technological problem of endowing the robots with the ability to lay and sense artificial pheromones is not the only one to be addressed. The concept of stigmergy is not easy to grasp intuitively31 and, therefore, designing collective behaviours based on stigmergy is itself a challenge. Even without stigmergy, designing any collective behaviour for a robot swarm is already complex: individuals are autonomous and loosely coupled, and the interactions between individuals and between them and the environment become fully defined only at run time32,33. The design problem becomes even more complex if the interaction strategies that enable coordination are regulated by modifications to the environment. No formal design method specifies under what conditions and in what amounts individuals should release pheromone, nor how they should react to pheromone trails, so that a desired collective behaviour emerges. In the swarm robotics literature, pheromone-based stigmergy has been predominantly designed manually, via trial and error, to address specific missions under specific conditions26,34,35. Manual design is a time-consuming approach in which a human designer conceives, tests, and iteratively improves the control software of the robots until a desired collective behaviour is obtained36,37. The quality of the results obtained via manual design is not consistent and greatly depends on the experience of the designer. Typically, a manual design process is neither easily repeatable nor directly generalisable to other—albeit similar—robotic platforms or missions38. The only exception to manual design is one study in which deep reinforcement learning was used to develop a collision-avoidance behaviour based on a virtual pheromone39. Although restricted to simulation-only experiments, this study showed that control software produced through deep reinforcement learning can outperform that generated via manual design. The proposed approach was conceived for scenarios in which a centralised infrastructure stores global pheromone information and makes it accessible to the robots. On the one hand, this approach provides a solution to the problem of designing pheromone-based behaviours in virtual environments. On the other hand, it is not directly applicable in scenarios where the robots are expected to autonomously lay and sense artificial pheromones in their physical environment.

In this paper, we focus on the automatic design of stigmergy-based collective behaviours for robot swarms. We present Habanero, an automatic off-line design method that belongs to the AutoMoDe family40. In AutoMoDe, as is customary in automatic off-line design38,41, the design problem is reformulated as an optimisation problem that is solved in simulation, prior to the deployment of the robots in their target environment41,42. The solution space of the optimisation problem comprises the instances of control software that can be obtained by selecting and combining pre-existing software modules (i.e., low-level behaviours and the conditions to transition between them) into a modular architecture (e.g., finite-state machines, behaviour trees) and by tuning their free parameters43. Once the optimisation process is completed, the selected control software is uploaded to the robots without undergoing any manual transformation, and the robots are eventually deployed in the target environment. It has been observed that control software produced by AutoMoDe crosses the reality gap44,45,46,47 better than that produced by traditional approaches based on neuroevolution43,48, in which robots are controlled by a neural network optimised using an evolutionary algorithm49,50,51. This improvement can be attributed to AutoMoDe’s constraint that control software must be generated by assembling the given modules within a specific architecture (e.g., a probabilistic finite-state machine). By applying this constraint, AutoMoDe limits the size of the design space to the set of possible combinations of modules and therefore reduces the variance of the design process43. This reduces the risk of over-fitting the produced control software to the idiosyncrasies of the simulation environment, which is the main reason why control software might fail to cross the reality gap satisfactorily47.

AutoMoDe is a general framework. To define a specific design method that conforms to it and produces control software to address a specific class of missions, the following steps must be taken: (1) select a target robot platform that is appropriate for the given class of missions, (2) define software modules for the selected robot platform, (3) specify the architecture into which the software modules will be assembled, (4) select a simulator to be used in the automatic design process, and (5) define an appropriate optimisation algorithm to search the space of the possible ways in which the software modules can be assembled and tuned. Our proposed AutoMoDe method, Habanero, designs collective behaviours to address missions in which the robot swarm relies on stigmergy to coordinate. The target robot platform is the e-puck52 augmented with the Overo Gumstix Linux board, the aforementioned hardware module that lays artificial pheromone trails by focusing UV light onto ground coated with photochromic material30, and an omnidirectional camera to detect artificial pheromone trails. The software modules of Habanero are based on those previously defined for TuttiFrutti53, another AutoMoDe method that generates control software for robots that can display colours via RGB LEDs and react to them. The main difference between TuttiFrutti and Habanero is that the latter features original hardware and software devices to lay and detect pheromone trails. The architecture into which these modules are assembled is the probabilistic finite-state machine. The simulator used in the design process is ARGoS54 with an original library for the simulation of pheromone trails. The optimisation algorithm is Iterated F-race55, as originally used in TuttiFrutti53 and in Chocolate, the state-of-the-art AutoMoDe method56. See Fig. 1 for a graphical illustration of Habanero, Fig. 2 for a description of the platform for which Habanero was developed, and the Methods section for further details. The collective behaviours designed by Habanero enable the robots to operate in a fully autonomous and distributed way, without requiring any form of centralised control or coordination.

Fig. 1: AutoMoDe-Habanero.
figure 1

a Habanero automatically produces control software for e-puck robots by assembling predefined, mission-independent software modules into a probabilistic finite-state machine. A set of seven low-level behaviours and six transition conditions function as the states and edges of the finite-state machine, respectively. Using the Iterated F-race algorithm, the design process determines the topology of the finite-state machine by maximising the performance of the robot swarm. The performance of an instance of control software is assessed in simulation, before the swarm is deployed. b The set of low-level behaviours: operations a robot can execute. c The set of transition conditions: criteria to switch from one low-level behaviour to another.

Fig. 2: The e-puck robot, its reference model, and the experimental setup.
figure 2

a An e-puck robot equipped with a Linux board, a hardware module to focus UV light onto the ground, and an omnidirectional camera. b The experimental arena. The floor is coated with photochromic material: it changes in colour from white to magenta when exposed to UV light, and gradually returns to white when the UV light is removed. The walls of the arena are built from modular RGB (red, green, blue) blocks that can display various colours. A tracking system is used to automatically measure performance indicators. c The reference model RM 4.1, which formally describes the interface between the robot and the control software.

In this study, we demonstrate Habanero by generating control software for a swarm of eight e-puck robots. We consider four missions in which the robots should rely on stigmergy-based coordination: AGGREGATION, DECISION MAKING, RENDEZVOUS POINT, and STOP. See Fig. 3 and the Methods section for details. To assess the quality of the control software produced by Habanero, we compare its performance to that of several alternatives, shown in Fig. 4: (1) control software produced via neuroevolution (EvoPheromone), (2) control software manually produced by human designers (Human-Designers), and (3) a random-walk behaviour (Random-Walk).

Fig. 3: Construction of the arenas for the four missions.
figure 3

Technical drawings of the arena with dimensions and positions of different regions, along with photos of the real arena, in the four mission configurations: a AGGREGATION, b DECISION MAKING, c RENDEZVOUS POINT, and d STOP. All measurements are expressed in meters. The missions are described in the Methods section.

Fig. 4: Pictorial representation of the design methods under analysis.
figure 4

a Habanero, b EvoPheromone, c Human-Designers, and d Random-Walk. Habanero is an automatic modular design method. EvoPheromone is an implementation of the neuroevolutionary approach. Human-Designers is a manual design method. Although Random-Walk is not a design method, we include it to serve as a lower bound of performance. See the Methods section for the details.

The results of the experiments indicate that: (i) Habanero is a viable approach to designing pheromone-based stigmergy; (ii) it can produce control software that is comparable to, or even outperforms, control software produced by a human designer; and (iii) although its modules are conceived in a mission-agnostic way, the interaction strategies it devises are mission-specific.

Results

Habanero designed stigmergy-based collective behaviours that proved to be effective: the robots used the artificial pheromone to complete each mission in a way that is meaningful and appropriate to the mission considered. Statistical analysis shows that the control software generated by Habanero performed significantly better than the alternatives included in the empirical study. In the following sections, we first present the results on a per-mission basis, and then we aggregate them across all missions. Simulation-only experiments with different swarm sizes are provided as Supplementary Note 1. We also provide an analysis of the robustness to the reality gap as Supplementary Note 2.

AGGREGATION

In this mission, the robots must aggregate anywhere in the arena. To aggregate, the robots cannot rely on any form of direct communication, nor on the ability to directly sense the presence of their peers in their vicinity. The only way in which they can coordinate is by laying and detecting artificial pheromone trails. They can leverage this ability to attract their peers and aggregate using stigmergy. However, as all robots could release pheromone at the same time in different areas, they could saturate the environment and/or become trapped in local accumulations of their own pheromone.

Habanero, EvoPheromone, and Human-Designers produced control software that performed equally well in simulation—see Fig. 5a. However, when transferred to the real robots, the control software produced by Habanero performed significantly better than that produced by the other design methods.

Fig. 5: Results of the empirical analysis.
figure 5

We report the results of the evaluation of 160 instances of control software: 10 per method and per mission. All instances of control software were evaluated once in simulation and once with physical robots—more details on the protocol are provided in Methods. The results are presented using boxplots on a per-mission basis: a AGGREGATION, b DECISION MAKING, c RENDEZVOUS POINT, and d STOP. In all missions, for each method, we report the performance obtained in simulation and with physical robots using thin and thick boxes, respectively. e Friedman rank sum test on the real-robot performance, aggregating the overall performance of each method across the four missions—the lower the rank, the better. An explanation of the graphical conventions adopted in the boxplots and in the Friedman test is provided in the Methods section under the heading Statistics.

Habanero produced collective behaviours in which the robots laid pheromone trails only for short periods of time and kept searching the environment for pheromone traces left by their peers. By laying pheromone trails only intermittently, the robots avoided saturating the environment and marked only isolated spots, which then served as aggregation points. Around these points, they eventually gathered in clusters—see Fig. 6 and Supplementary Video 1.

Fig. 6: Behaviours produced by Habanero and Human-Designers.
figure 6

For each mission, we show: a snapshot of robots executing an instance of Habanero control software in (a–d) simulation and (e–h) real-robot experiments, as well as a plot of the aggregate execution time of each software module in the control software produced by (i–l) Habanero and (m–p) Human-Designers. We use the aggregate execution time of the modules to qualify the behaviour we observe in the robot swarms. In the aggregate plots, the colour gradient shows the percentage of time one behaviour was executed throughout all instances of control software produced for a mission. We identify the behaviour modules using the labels defined in Fig. 1.

EvoPheromone produced a different strategy: the robots laid pheromone trails while moving along a circular trajectory and followed the pheromone trails to gather at places where pheromone concentration was high. This strategy produced good results in simulation but not on the real robots. The robots did not properly avoid the walls and failed to reproduce the behaviour observed in the simulation.

In the control software produced by Human-Designers, the robots continuously laid pheromone trails with the expectation that all robots would gather in one place. Results were good in simulation but failed to transfer to reality. In the real-robot experiments, the robots remained trapped in local pheromone accumulations and eventually gathered in separate clusters.

DECISION MAKING

In this mission, the robots must make the decision to congregate in one of two regions of the arena, designated by RGB blocks that display blue or green colour, respectively—see Fig. 3b. Each robot scores one point for each time step spent in the green region and two points for each time step spent in the blue one. Halfway through each run of the experiment, the blue and green RGB blocks are switched off, leaving the robots without any visual cue to identify the two regions. In order to maximise the score, the robots must quickly congregate in the region that provides the highest score per time step—i.e., the blue one—and remain there even once the environmental cues are removed.

When evaluated in simulation, the control software produced by Habanero and Human-Designers performed equally well, and significantly better than the one produced by EvoPheromone—see Fig. 5b. However, in the real-robot experiments, the control software produced by Habanero performed significantly better than that of Human-Designers. The control software produced by both Habanero and Human-Designers performed significantly better than that of EvoPheromone, which obtained results comparable with those of Random-Walk.

In all experimental runs, the robot swarm designed by Habanero correctly selected the blue region in which to congregate. The robots relied on stigmergy not only to attract other robots to the blue region, but also to stay there after the cues were removed. The behaviour displayed in the real-robot experiments was qualitatively similar to the one displayed in simulation—see Fig. 6 and Supplementary Video 2. However, in the real-robot experiments, some robots that gathered in the blue region spilled out of its boundaries, while remaining in its vicinity. Because of this, the performance in the real-robot experiments was lower than in simulation. The robot swarm generated by EvoPheromone was unable to congregate in a single region: the robots stayed in the first region they entered. Consequently, the score was significantly worse than those obtained by the other design methods. The robot swarm produced by Human-Designers was able to correctly congregate in the blue region but was unable to remain there once the cues were removed.

RENDEZVOUS POINT

In this mission, a wall with a narrow gate laterally divides the arena into two sections: the left side, where the robots are deployed at the beginning of the experiment; and the right side, which contains two regions designated by RGB blocks that display blue or green colour, respectively—see Fig. 3c. Similar to DECISION MAKING, halfway through each run of RENDEZVOUS POINT, the blue and green RGB blocks are switched off, leaving the robots without any visual cue to identify the two regions. The robots must cross the narrow gate to gather in the green region. The score is given by the number of robots that, at the end of the experimental run, are positioned in the green region.

When evaluated in simulation, the control software produced by all design methods performed equally well—see Fig. 5c. However, in the real-robot experiments, the control software produced by Habanero performed significantly better than that produced by any other method, and the control software produced by EvoPheromone performed significantly worse than that produced by any other method.

The robot swarms designed by Habanero relied on a random walk to cross the gate and find the green region. Once the robots reached the green region, they took advantage of stigmergy to attract their peers and to keep themselves inside the region even when the green light was removed. The robots laid pheromone trails to mark the green region and kept re-laying them there to counteract their fading—see Fig. 6 and Supplementary Video 3.

In the control software produced by EvoPheromone, the robots did not randomly search for the narrow passage. Instead, they moved along the walls of the arena to eventually cross the gate and reach the green region—see Supplementary Video 3. Although this behaviour worked effectively in simulation, it failed in the real-robot experiments: the robots were unable to move along the walls and remained stuck. Consequently, they were unable to cross the gate. In the real-robot experiments, the performance of the robot swarm designed by EvoPheromone was even significantly worse than that of Random-Walk.

In the control software produced by Human-Designers, the robots were mostly able to reach the green region. However, the swarm produced by Human-Designers was not always effective in using stigmergy to remain in the green region, especially after the green light was removed.

STOP

In this mission, the robots must halt and stand still as soon as a stop signal is perceived. The stop signal is emitted by a randomly chosen RGB block that switches on at a random moment in time and displays blue light—see Fig. 3d. Before the signal, each robot scores one point for each time step during which it moves. After the signal, each robot scores one point for each time step during which it stays in place. As the robots considered in this study are incapable of direct communication, the individuals that detect the signal can only rely on stigmergy to inform any peers that are in a position from which the signal cannot be seen.

The control software produced by Habanero and Human-Designers performed similarly well when evaluated both in simulation and reality, and performed significantly better than the one produced by EvoPheromone—see Fig. 5d.

In the robot swarms designed by Habanero, the robots kept moving to search for a block emitting the stop signal. As soon as a robot detected the signal, it stopped or started waggling in place, while laying a pheromone trail to alert its peers. Other robots also stopped and started laying pheromone trails either after detecting the signal or the pheromone trails laid by their peers—see Supplementary Video 4.

Human-Designers produced collective behaviours similar to those generated by Habanero, and so no significant difference in the performance could be observed—see Fig. 5d.

The collective behaviours produced by EvoPheromone achieved good scores in some cases, but did not accomplish the mission as intended. The robots took advantage of stigmergy to gradually repel each other, approach the walls, and eventually stop against them. The evolutionary process tuned the timing of this behaviour to match the typical amount of time that elapsed between the beginning of the experiment and the moment when the blue signal appeared. This allowed the robots to score points by moving towards the walls before the appearance of the signal and remaining still against the walls after it. Although this behaviour was reasonably well synchronised with the typical case, its failure to properly react to the appearance of the signal prevented it from achieving good scores consistently. Consequently, the performance achieved by EvoPheromone was significantly worse than that achieved by both Habanero and Human-Designers.

Aggregate results

To aggregate the performance of each design method across the four missions, we used a Friedman rank sum test on the performance observed in the real-robot experiments. The test indicates that, in the experiments presented, Habanero outranked all other design methods, with a confidence of at least 95%—see Fig. 5e. Human-Designers performed significantly better than both EvoPheromone and Random-Walk.

Figure 6 shows the aggregated execution time of the behaviour modules in the finite-state machines produced by Habanero and Human-Designers—measured in simulation. The results indicate that the finite-state machines produced by the two methods differ: across all missions, the execution time of the behaviour modules is different between Habanero and Human-Designers. Although Habanero and Human-Designers used the same set of modules, they combined them in different ways. The aggregated execution-time plot highlights four major differences. First, Habanero used the exploration module considerably less than Human-Designers. Second, Habanero relied more than Human-Designers on the modules that react to pheromone information. Third, Human-Designers relied for longer than Habanero on the modules that respond to the colour of the walls. Finally, Habanero made greater use of the waggle module than Human-Designers.

While our experiments highlight performance differences between the two methods, we cannot definitively determine how the design choices made by Habanero and Human-Designers influence the overall performance. More precisely, our experimental setup cannot adequately explain the rationale behind the selection, tuning, and combination of the modules for either Habanero or Human-Designers, and its relationship with the performance obtained.

Discussion

Automating the production of control software for pheromone-based robot swarms is a step further towards their real-world application. Automatic design can ease the realisation of robot swarms across different missions, while minimising human intervention36,41,42,57. The experiments presented in this paper show that this holds true also in the case of robot swarms that rely on pheromone-based stigmergy. Indeed, Habanero automatically designed stigmergy-based collective behaviours that were effective across all missions considered. For each mission, it found appropriate ways to use the pheromone effectively. Although the software modules on which Habanero operates were conceived in a mission-agnostic way, the interaction strategies that Habanero eventually generated for each mission were tailored to each of them and are different from one another. In these interaction strategies, the limited perception and computation capabilities of the individual robots are compensated at the swarm level by exploiting pheromone-based stigmergy. The e-puck used in the experiments, as a single robot, has limited spatial coordination, memory, and communication abilities. However, spatial organisation, external memory, and communication in the swarm emerged at the collective level thanks to pheromone-based stigmergy. Spatial organisation: In AGGREGATION, DECISION MAKING, and RENDEZVOUS POINT, the e-pucks self-organised and distributed in space guided by their pheromone trails and other environmental cues. Memory: In DECISION MAKING and RENDEZVOUS POINT, the swarm of e-pucks retained relevant information about the past state of the environment by laying pheromone trails. Communication: The semantics of pheromone trails is mission-specific. For example, the pheromone trails that the e-pucks laid in STOP had a meaning (stop where you are) that is radically different from the meaning in AGGREGATION (come here). It is interesting to note that spatial organisation, memory, and communication (including the semantics of pheromone trails) were not hand-coded in the modules on which Habanero operates: they were the product of the way in which Habanero automatically combined these modules on a per-mission basis.

The study leaves two main questions open. (i) Can automatic design leverage the intensity of pheromone trails and their decay time? In the experiments presented, a robot either did or did not sense the pheromone, in a binary fashion. A more thorough investigation is required to determine whether an automatic method can simultaneously tune the concentration of the pheromone deployed and the concentration to which a robot should react. (ii) Can automatic design methods realise robot swarms that alternatively, or simultaneously, operate with direct and indirect communication? We have shown in the past that direct communication can emerge from an automatic design process53,58. In this paper, we have shown that indirect communication can emerge as well. Further research is required to determine whether an automatic method can select direct or indirect communication as more suitable for a specific mission. In this sense, we deem particularly interesting the idea of automatically designing collective behaviours in which the robots operate with combinations of the two.

In this study, we adopted an existing technology to enable pheromone-based stigmergy with real robots—the photochromic artificial pheromone system30. Although viable, it is a technology that—like all the existing solutions—has some critical limitations: namely, it is only suitable for indoor applications in which the environment can be prepared beforehand with the photochromic material. As of today, no technology exists to provide robots with a universally applicable capability to mark their environment with indications of their activities. However, by analysing the strengths of the available solutions, we can outline desirable properties for such a technology. First, pheromones should be produced by the robots, minimising the need for environment preparation and/or external infrastructure. Additionally, robots should have the ability to modulate the intensity of the pheromones they lay and respond to, enabling precise control over their behaviour. We also envision that pheromone-based stigmergy should facilitate the design of more complex behaviours, possibly by functioning over diverse types of pheromones that communicate different information. The devices that lay and sense pheromones should be easy to build and integrate into modern robot platforms at different scales—from small educational robots to larger platforms. Finally, the pheromone laid by the robots must be safe and nondestructive, and any marks left by the robots should disappear once the swarm completes its operation. Engineering solutions that meet these properties would facilitate their broad adoption, development, and validation, as well as the establishment of benchmarks for stigmergy in robotics.

With Habanero we demonstrated that it is possible to generate pheromone-based collective behaviours through an automatic process that is repeatable and generally applicable. We contend that this result can motivate further research to overcome the limitations of the currently available hardware solutions to implement pheromone-based stigmergy.

Methods

Arena

All experiments were performed in a rectangular arena whose walls were realised with modular RGB blocks that display colours according to the mission requirements53,59—see Fig. 2b. The technical diagrams of the arenas used in the study are shown in Fig. 3. The floor of the arena was white and coated with a photochromic material that acts as the medium encoding the pheromone trails30. The coating was realised using an acrylic binder with a 20% (w/w) concentration of photochromic pigments. Technical information to reproduce the arena is provided as Supplementary Note 5. The photochromic material turns magenta when exposed to UV light. Once the UV light is removed, the magenta colour gradually fades and the floor returns to white in about 50 s—see Supplementary Video 5.
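To illustrate how such a fading pheromone can be represented in software (for instance, when simulating the trails during the design process), the following minimal sketch models the floor as a grid of intensities that decay back to white over roughly 50 s, with one update per 100 ms control step. The grid resolution, the linear decay law, and the detection threshold are assumptions made for the example; this is not the implementation used in the argos3-epuck-phormica library.

```python
import numpy as np

# Illustrative grid model of the photochromic floor (assumption: linear fading).
CELL = 0.01            # cell size in metres (assumed resolution)
STEP = 0.1             # control step duration in seconds (100 ms, as in RM 4.1)
FADE_TIME = 50.0       # approximate time for magenta to fade back to white

class PheromoneGrid:
    def __init__(self, width_m, height_m):
        self.grid = np.zeros((int(height_m / CELL), int(width_m / CELL)))

    def deposit(self, x_m, y_m, radius_m=0.02):
        """Mark the cells under a robot's UV LEDs with full intensity."""
        rows, cols = self.grid.shape
        r, c = int(y_m / CELL), int(x_m / CELL)
        k = int(radius_m / CELL)
        self.grid[max(0, r - k):min(rows, r + k + 1),
                  max(0, c - k):min(cols, c + k + 1)] = 1.0

    def update(self):
        """Fade every cell towards white; called once per control step."""
        self.grid = np.maximum(0.0, self.grid - STEP / FADE_TIME)

    def sense(self, x_m, y_m, threshold=0.1):
        """Binary pheromone detection, as used in the experiments."""
        return self.grid[int(y_m / CELL), int(x_m / CELL)] > threshold

# Example: a mark deposited at (0.5 m, 0.5 m) becomes undetectable after roughly 45-50 s.
floor = PheromoneGrid(2.0, 2.0)
floor.deposit(0.5, 0.5)
for _ in range(500):               # 500 control steps = 50 s
    floor.update()
print(floor.sense(0.5, 0.5))       # False: the mark has faded
```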

The e-puck robot

The experiments were performed with e-puck robots—small-sized differential-drive robots that are widely adopted in swarm robotics research52,60. We used an extended version of the e-puck that is equipped with the Overo Gumstix computer-on-module to run Linux on the robot; the ground sensor module to detect the gray-level colour of the floor; a UV-light module and an omnidirectional camera to deposit and detect artificial pheromone trails, respectively. The UV-light module is a ring-shaped add-on module for e-puck that is equipped with nine down-facing UV LEDs positioned at the rear of the robot30. A picture of the hardware configuration of the e-puck robot adopted in the research is given in Fig. 2a. The capabilities of the e-puck for laying and detecting the artificial pheromone are illustrated in Supplementary Video 5.

Reference model: the extended version of e-puck adopted is described by reference model RM 4.1, which formally defines the input and output variables associated with sensors and actuators, respectively—see Fig. 2c. The control software of the robot reads/writes the input/output variables at every control step, which has a duration of 100 ms61.
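The complete list of input and output variables of RM 4.1 is given in Fig. 2c. The sketch below only illustrates the general idea of a reference model as a fixed read/write interface between control software and hardware; the variable names, ranges, and sensor counts used here are illustrative assumptions, not the actual RM 4.1 definition.

```python
from dataclasses import dataclass
from typing import List

# Hypothetical interface inspired by RM 4.1 (the actual variables are defined in Fig. 2c).
@dataclass
class SensorReadings:          # input variables, refreshed at every control step
    proximity: List[float]     # e.g. infrared proximity readings in [0, 1]
    ground: List[float]        # gray-level readings of the floor in [0, 1]
    wall_colour: str           # colour displayed by RGB blocks in view, or "none"
    pheromone: bool            # binary detection of a pheromone trail (camera)

@dataclass
class ActuatorCommands:        # output variables, applied at every control step
    left_wheel: float          # wheel velocities
    right_wheel: float
    uv_on: bool                # switch the down-facing UV LEDs on or off

def control_loop(controller, robot, steps=1800):
    """One 100 ms control step: read inputs, compute, write outputs (1800 steps = 180 s).

    controller.step(), robot.read_sensors() and robot.apply() are hypothetical names
    used only to show the read/compute/write cycle."""
    for _ in range(steps):
        readings = robot.read_sensors()            # fills a SensorReadings
        commands = controller.step(readings)       # returns ActuatorCommands
        robot.apply(commands)
```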

Simulator: all simulations were performed using ARGoS3 Version 48, along with the argos3-epuck-phormica library—see section Code Availability. ARGoS was specifically developed to simulate robot swarms54; the argos3-epuck-phormica library enables the cross-compilation of control software for the e-puck so that it can be ported to the robots without any manually applied modification.

Habanero

Habanero is an instance of AutoMoDe40 specialised in the design of swarms of robots that can lay and detect pheromone trails. Habanero produces control software by assembling predefined software modules into probabilistic finite-state machines in which the states are low-level behaviours performed by the robots and the transitions are enabled by conditions on the contingencies experienced by the robots.

Habanero operates on seven low-level behaviours and six conditions. Both the low-level behaviours and the conditions have free parameters that affect their functioning. The space of solutions that Habanero can produce comprises all the possible probabilistic finite-state machines—with at most 4 states and at most 4 outgoing transitions per state—that can be obtained by assembling the available modules and by fine-tuning their free parameters. There are a total of 105 parameters to be tuned: categorical parameters for the selection of software modules, and categorical, integer, and real parameters that affect their functioning. The optimisation problem is therefore mixed-variable in nature62. Habanero searches this space using Iterated F-race55 with the goal of maximising a given mission-specific objective function. Iterated F-race samples, fine-tunes, and selects candidate solutions by performing simulations in ARGoS3. Habanero has a limited number of simulations available to produce an instance of control software—a simulation budget. Once the budget is exhausted, Habanero returns the best control software found up to that moment. A pictorial representation of Habanero is given in Fig. 1a.

The seven low-level behaviours are: exploration, stop, go-to-colour, avoid-colour, go-to-pheromone, avoid-pheromone, and waggle. The six conditions are: white-floor, gray-floor, black-floor, colour-detected, pheromone-detected, fixed-probability—see Fig. 1b,c and Table 1. All the low-level behaviours and the conditions interact with the e-puck hardware (sensors and actuators) via the input/output variables defined in reference model RM 4.1—see Fig. 2b.

Table 1 Habanero’s low-level behaviours and transition conditions
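To make the probabilistic finite-state-machine architecture concrete, the following sketch executes such a machine using the behaviour and condition names listed above. The data structures, the way conditions are checked against a dictionary of sensor readings, and the act() method of the behaviour modules are assumptions made for the example; they do not reproduce Habanero's actual implementation.

```python
import random

# States are low-level behaviours; edges carry a transition condition and a probability.
BEHAVIOURS = ("exploration", "stop", "go-to-colour", "avoid-colour",
              "go-to-pheromone", "avoid-pheromone", "waggle")
CONDITIONS = ("white-floor", "gray-floor", "black-floor",
              "colour-detected", "pheromone-detected", "fixed-probability")

class Edge:
    def __init__(self, condition, probability, target):
        self.condition, self.probability, self.target = condition, probability, target

class ProbabilisticFSM:
    """At most 4 states and 4 outgoing edges per state, as in Habanero."""
    def __init__(self, behaviours, edges, initial):
        self.behaviours = behaviours   # dict: state name -> behaviour module (with an act() method)
        self.edges = edges             # dict: state name -> list of Edge
        self.current = initial

    def step(self, readings):
        """One control step: possibly transition, then run the active behaviour."""
        for edge in self.edges.get(self.current, []):
            if condition_holds(edge.condition, readings) and random.random() < edge.probability:
                self.current = edge.target
                break
        return self.behaviours[self.current].act(readings)   # returns actuator commands

def condition_holds(name, readings):
    """Illustrative mapping from condition names to sensor readings (a dict here)."""
    if name == "pheromone-detected":
        return readings["pheromone"]             # binary pheromone detection
    if name == "colour-detected":
        return readings["wall_colour"] != "none"
    if name == "fixed-probability":
        return True                              # the edge's probabilistic test decides
    if name in ("white-floor", "gray-floor", "black-floor"):
        return readings["floor"] == name.split("-")[0]
    return False
```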

We chose Iterated F-race to conduct Habanero’s optimisation process because, for historical reasons, it is the de facto standard optimisation algorithm in the AutoMoDe family. Notably, Iterated F-race outperformed human experts in the modular design of control software for robot swarms56. Moreover, Iterated F-race was successful when applied to the problem of producing collective behaviours with a diverse set of AutoMoDe methods40. Iterated F-race has properties that make it suitable for tackling problems in the automatic modular design of control software. In particular, it was conceived for the statistical selection of candidate solutions when (i) the problem instances are stochastic and (ii) the solutions comprise discrete and continuous parameter spaces55,63,64. Recent studies have shown that other optimisation algorithms are also suitable for the AutoMoDe family (e.g., simulated annealing65 and sequential model-based algorithm configuration66,67). However, there is no evidence that they offer a definite advantage over Iterated F-race—see Kuckling68 for a recent in-depth discussion.
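The core idea of F-race can be summarised as follows: candidate configurations are evaluated instance by instance and, as soon as a Friedman test on the results collected so far detects significant differences, the candidates that rank clearly worse than the best are discarded; the survivors then seed the sampling of the next iteration. The sketch below is a deliberately simplified, single-race illustration of this idea; in particular, it uses a fixed rank-gap threshold in place of the post-hoc pairwise tests of the actual algorithm.

```python
import numpy as np
from scipy.stats import friedmanchisquare, rankdata

def race(candidates, evaluate, instances, alpha=0.05, rank_gap=1.5, min_instances=5):
    """Evaluate candidates on successive instances, discarding poor ones early.

    candidates: list of candidate configurations (opaque objects)
    evaluate(candidate, instance): runs one simulation, returns a cost to be minimised
    """
    alive = list(range(len(candidates)))
    results = []                                   # one row of costs per instance
    for k, instance in enumerate(instances, start=1):
        results.append([evaluate(candidates[i], instance) for i in alive])
        if k >= min_instances and len(alive) > 2:
            columns = [np.array([row[j] for row in results]) for j in range(len(alive))]
            _, p = friedmanchisquare(*columns)
            if p < alpha:
                # Average rank of each surviving candidate across the instances seen so far.
                ranks = np.mean([rankdata(row) for row in results], axis=0)
                best = ranks.min()
                keep = [j for j in range(len(alive)) if ranks[j] <= best + rank_gap]
                alive = [alive[j] for j in keep]
                results = [[row[j] for j in keep] for row in results]
    return [candidates[i] for i in alive]          # survivors seed the next iteration
```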

Comparisons

EvoPheromone is an adaptation of EvoStick, a standard neuroevolutionary method to design robot swarms43. EvoPheromone produces control software for an extended version of the e-puck robot formally described by reference model RM 4.1—the same as Habanero. The architecture of the control software is a fully connected feed-forward artificial neural network. The neural network has 61 input nodes, 7 output nodes, and no hidden layer. The input and output nodes are directly connected by weighted synaptic connections. There are a total of 427 parameters to be tuned—all real values, which encode the synaptic weights. The optimisation problem is therefore continuous in nature62. EvoPheromone tunes the synaptic weights of the neural network via elitism and mutation43. The evolutionary process is based on simulations executed in ARGoS3 with the argos3-epuck-phormica library—the same setting as Habanero. The design process ends when a predefined simulation budget is exhausted. We developed EvoPheromone on the basis of EvoStick, as the latter is a readily available method for the e-puck that has served in the past as a yardstick to appraise the performance of AutoMoDe methods43,56. EvoStick is the only neuroevolutionary method that has been tested in the automatic design of robot swarms for several missions without undergoing any mission-specific modification48. Moreover, EvoStick served as a starting point for developing other neuroevolutionary methods for robots endowed with enhanced capabilities—see, for example, adaptations of EvoStick to study direct communication53,58 and spatial organisation69. EvoStick, and therefore EvoPheromone, are simple and straightforward implementations of the neuroevolutionary approach. We do not consider more advanced neuroevolutionary methods (e.g., CMA-ES70, xNES71, and NEAT72), as previous research has shown that they do not provide any performance advantage over EvoStick when applied off the shelf48.
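As an illustration of this setup, the sketch below implements a single-layer network mapping 61 inputs to 7 outputs (61 × 7 = 427 weights) and an elitist, mutation-only evolutionary loop that optimises the weight vector. The population size, mutation strength, activation function, and the fitness stub are assumptions made for the example and do not reproduce EvoStick's exact settings.

```python
import numpy as np

N_INPUTS, N_OUTPUTS = 61, 7            # as in EvoPheromone: 61 x 7 = 427 weights

def network(weights, inputs):
    """Fully connected feed-forward network with no hidden layer (assumed tanh output)."""
    return np.tanh(weights.reshape(N_OUTPUTS, N_INPUTS) @ inputs)

def evolve(fitness, population=100, elites=20, sigma=0.2, generations=100, rng=None):
    """Elitist, mutation-only evolutionary loop (illustrative settings).

    fitness(weights): average performance over a few simulated runs, to be maximised.
    """
    rng = rng or np.random.default_rng()
    pop = rng.normal(0.0, 1.0, size=(population, N_OUTPUTS * N_INPUTS))
    for _ in range(generations):
        scores = np.array([fitness(w) for w in pop])
        order = np.argsort(scores)[::-1]           # best individuals first
        elite = pop[order[:elites]]
        # Next generation: keep the elites, fill the rest with mutated copies of them.
        offspring = elite[rng.integers(0, elites, population - elites)]
        offspring = offspring + rng.normal(0.0, sigma, offspring.shape)
        pop = np.vstack([elite, offspring])
    scores = np.array([fitness(w) for w in pop])
    return pop[int(np.argmax(scores))]             # best weight vector found
```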

Human-Designers is a manual design method in which 10 human designers were asked to produce control software using the software modules of Habanero. In a sense, a human designer acts as an optimisation agent that assembles a finite-state machine and fine-tunes its parameters. Human-Designers produces control software for an extended version of the e-puck robot formally described by reference model RM 4.1—the same as Habanero. The human designers who participated in this study had various levels of expertise in swarm robotics, ranging from bachelor students to post-doctoral researchers. Seven of them had previous experience with real robots, seven had previous experience with ARGoS3, and six had experience with the e-puck—either in simulation or in reality. We provided the designers with a visualisation tool to produce and manipulate finite-state machines, to visualise simulations, and to compute the value of the objective function73. All simulations were executed in ARGoS3 with the argos3-epuck-phormica library—the same setting as Habanero. The designers were allotted 4 hours per mission—see Supplementary Note 4. The guidelines and experimental description given to the designers are provided as Supplementary Note 3.

Random-Walk, although not an automatic design method, is included in the study as a lower bound on the performance of robot swarms. In Random-Walk, the robots move straight in the arena; when they encounter an obstacle, they rotate for a random number of control steps and then resume their straight motion. Random-Walk was conceived for an extended version of the e-puck robot formally described by reference model RM 4.1—the same as Habanero.
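A minimal sketch of this behaviour, taking one decision per 100 ms control step, is given below; the wheel speed, obstacle threshold, and maximum turn duration are assumed values.

```python
import random

class RandomWalk:
    """Move straight; on detecting an obstacle, rotate for a random number of steps."""
    def __init__(self, speed=0.12, max_turn_steps=30, obstacle_threshold=0.8):
        self.speed = speed                          # wheel speed (assumed value)
        self.max_turn_steps = max_turn_steps
        self.obstacle_threshold = obstacle_threshold
        self.turn_steps_left = 0
        self.turn_direction = 1

    def step(self, proximity):
        """proximity: list of readings in [0, 1]; returns (left, right) wheel speeds."""
        if self.turn_steps_left == 0 and max(proximity) > self.obstacle_threshold:
            self.turn_steps_left = random.randint(1, self.max_turn_steps)
            self.turn_direction = random.choice([-1, 1])
        if self.turn_steps_left > 0:
            self.turn_steps_left -= 1
            return (-self.turn_direction * self.speed, self.turn_direction * self.speed)
        return (self.speed, self.speed)             # straight motion
```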

Missions

The empirical study is based on four missions. Each mission must be performed within T = 180 s by a swarm of N = 8 robots. The swarm size was determined by the number of robots available for the experiments.

AGGREGATION: initially, the robots are randomly placed in the arena—see Fig. 3a. The robots must approach one another to form a cluster and remain close until the end of the mission. Formally, the mission is specified by the following objective function, which must be minimised:

$$F_a=\sum_{t=1}^{T/100\,\mathrm{ms}} d_{\mathrm{avg}}(t).$$
(1)

At each control step \(t\), the average distance \(d_{\mathrm{avg}}\) between the robots is added to \(F_a\).

DECISION MAKING: initially, the robots are randomly placed in the arena—see Fig. 3b. The robots must select between a green and a blue region: at every control step t, the score is increased by 1 for every robot that is in the green region, and by 2 for every robot that is in the blue one. Both the green and blue light signals disappear after a random amount of time, which is uniformly sampled between 70 and 90 s. Formally, the mission is specified by the following objective function, which must be maximised:

$$F_d=\sum_{t=1}^{T/100\,\mathrm{ms}}\sum_{i=1}^{N} I_i(t); \qquad I_i(t)=\begin{cases}1 & \text{if robot } i \text{ is in the green region},\\ 2 & \text{if robot } i \text{ is in the blue region},\\ 0 & \text{otherwise.}\end{cases}$$
(2)

RENDEZVOUS POINT: initially, the robots are placed on the left side of the arena. The robots must reach the green region and stay there until the end of the mission. A blue region is added as a decoy to possibly confuse the robots—see Fig. 3c. As in DECISION MAKING, both the green and blue light signals disappear after a random amount of time, which is uniformly sampled between 70 and 90 s. Formally, the mission is specified by the following objective function, which must be maximised:

$$F_r=K_{\mathrm{in}}-K_{\mathrm{out}};$$
(3)

where \(K_{\mathrm{in}}\) is the number of robots inside the green region at the end of the mission, and \(K_{\mathrm{out}}\) is the number of robots outside it.

STOP: initially, the robots are randomly placed in the arena. A blue light signal appears after a random amount of time \(\bar{t}\), which is uniformly sampled between 70 and 90 s—see Fig. 3d. All the robots must stop as soon as the signal appears, but not before. Formally, the mission is specified by the following objective function, which must be minimised:

$$F_s=\sum_{t=1}^{\bar{t}}\sum_{i=1}^{N}\bar{I}_i(t)+\sum_{t=\bar{t}+1}^{T}\sum_{i=1}^{N} I_i(t); \qquad I_i(t)=\begin{cases}1 & \text{if robot } i \text{ is moving},\\ 0 & \text{otherwise};\end{cases} \qquad \bar{I}_i(t)=1-I_i(t).$$
(4)
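To illustrate how these objective functions translate into code, the sketch below computes \(F_a\), \(F_d\), \(F_r\), and \(F_s\) from per-step robot positions and states such as those recorded by the tracking system; the data layout is an assumption made for the example.

```python
import numpy as np

def f_aggregation(positions):
    """positions: array of shape (steps, N, 2). F_a (to be minimised): sum over all
    control steps of the average pairwise distance between robots."""
    total = 0.0
    for frame in positions:
        diffs = frame[:, None, :] - frame[None, :, :]
        dists = np.linalg.norm(diffs, axis=-1)
        n = len(frame)
        total += dists.sum() / (n * (n - 1))     # mean over ordered pairs (diagonal is 0)
    return total

def f_decision(in_green, in_blue):
    """in_green, in_blue: boolean arrays of shape (steps, N). F_d (to be maximised)."""
    return int(in_green.sum() + 2 * in_blue.sum())

def f_rendezvous(in_green_at_end):
    """in_green_at_end: boolean array of shape (N,). F_r = K_in - K_out (to be maximised)."""
    k_in = int(in_green_at_end.sum())
    return k_in - (len(in_green_at_end) - k_in)

def f_stop(moving, signal_step):
    """moving: boolean array of shape (steps, N); signal_step: step at which the blue
    signal appears. F_s (to be minimised): robot-steps spent still before the signal
    plus robot-steps spent moving after it."""
    before = (~moving[:signal_step]).sum()
    after = moving[signal_step:].sum()
    return int(before + after)
```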

In the absence of well-established benchmark missions, we chose a set of missions that allowed us to estimate the expected performance of Habanero in typical swarm robotics tasks. AGGREGATION, DECISION MAKING, RENDEZVOUS POINT, and STOP belong to the same class of missions—they all lend themselves to pheromone-based coordination of the robots. Yet, they are sufficiently different to benefit from a tailored design—they vary in the nature of their goals and in the presence of reference points of interest. By selecting a varied set of missions, we also aimed at testing Habanero’s ability to handle diverse challenges without undergoing any mission-specific adjustment.

It is worth noting that these missions—like Habanero itself—are not suitable for drawing conclusions on whether automatic methods can handle more complex missions or design relatively more complex stigmergy-based interactions: for instance, missions that require precise behavioural control via careful modulation of pheromone deposition and response, or missions that involve more complex communication strategies through various types of pheromones.

Protocol

All experiments were executed without any human intervention or any mission-specific modification in the design process. In the case of Habanero and EvoPheromone, for each mission, we independently executed the design process 10 times to obtain 10 instances of control software. Both methods operated with a budget of 100,000 simulation runs for each execution of the design process. We executed all automatic design processes on a high-performance computing cluster with about 1500 cores. In the case of Human-Designers, 10 human designers were involved and each of them produced one instance of control software for each mission. After obtaining all the instances of control software, we assessed their performance once in simulation and once in reality. We varied the initial position of the robots when assessing the instances of control software of a single method, and we used the same set of initial positions across the four methods. To perform the experiments in reality, the instances of control software, regardless of the design method that produced them, were automatically cross-compiled and deployed on the e-puck robots without undergoing any manually applied modification.

Tracking system

We used a tracking system to automatically compute the performance of a robot swarm during each run of a real-robot experiment74. The tracking system uses an overhead camera to record the positions of the robots by recognising square markers mounted on the robots. We also used the overhead camera to record videos of the experiments—see Supplementary Video 6. The overhead camera was used only to measure the performance of the swarm; it did not provide any information to the robots.

Statistics

We present the performance of the different methods with notched box-and-whiskers plots on a per-mission basis. In these plots, boxes represent the interquartile range, covering the central 50% of the values observed. Whiskers extend from the lower quartile to the lowest recorded performance, and from the upper quartile to the highest one. The horizontal line in the middle of each box plot represents the median performance, and the notches on the box represent a 95% confidence interval on the median. If the notches of two boxes do not overlap, then the difference between their respective medians is significant, with a confidence of at least 95%75. For each method, we present the performance obtained in simulation and in real-robot experiments using thin and thick boxes, respectively. We executed a mission-specific comparison of the performance of methods with Wilcoxon paired rank sum tests at 95% confidence76.

We also performed a Friedman rank sum test76 to aggregate the performance of each method across all four missions. More precisely, we applied a Friedman two-way analysis of variance to the performance recorded in the experiments with physical robots, across all missions and for all methods. The Friedman test is nonparametric and implements a block design. In our protocol, the treatment factor is the method under analysis and the blocking factor is the mission. By operating on ranks, the Friedman test is invariant to the magnitude of the objective functions of the missions considered. Also, due to its nonparametric nature, it can be applied with no assumption on the distribution of the performance. These properties are instrumental for aggregating the performance observed across the four missions. We present the results of the test with the average rank of each method (computed across all missions) and its 95% confidence interval. A method is significantly better than another if it has a lower average rank and the confidence intervals of the two methods do not overlap.
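As a minimal sketch of this aggregation (omitting the confidence intervals on the ranks), the following code computes the average rank of each method across blocks and the Friedman test p-value; the data layout, and the assumption that performance values are oriented so that higher is better, are made for the example.

```python
import numpy as np
from scipy.stats import friedmanchisquare, rankdata

def friedman_ranks(performance):
    """performance: array of shape (blocks, methods); one observation per block (here a
    block is a mission, or a mission/run pair) and per method, oriented so that higher
    is better. Returns each method's average rank (lower is better) and the p-value."""
    ranks = np.apply_along_axis(rankdata, 1, -performance)   # rank methods within each block
    _, p_value = friedmanchisquare(*performance.T)
    return ranks.mean(axis=0), p_value

# Per-mission paired comparisons between two methods on the same runs could instead
# use scipy.stats.wilcoxon(perf_method_a, perf_method_b).
```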