Deep learning reduces sensor requirements for gust rejection on a small uncrewed aerial vehicle morphing wing

Haughn, Kevin P. T.; Harvey, Christina; Inman, Daniel J.

doi:10.1038/s44172-024-00201-8

Download PDF

Article
Open access
Published: 21 March 2024

Deep learning reduces sensor requirements for gust rejection on a small uncrewed aerial vehicle morphing wing

Communications Engineering volume 3, Article number: 53 (2024) Cite this article

811 Accesses
5 Altmetric
Metrics details

Subjects

Abstract

Uncrewed aerial vehicles are integral to a smart city framework, but the dynamic environments above and within urban settings are dangerous for autonomous flight. Wind gusts caused by the uneven landscape jeopardize safe and effective aircraft operation. Birds rapidly reject gusts by changing their wing shape, but current gust alleviation methods for aircraft still use discrete control surfaces. Additionally, modern gust alleviation controllers challenge small uncrewed aerial vehicle power constraints by relying on extensive sensing networks and computationally expensive modeling. Here we show end-to-end deep reinforcement learning forgoing state inference to efficiently alleviate gusts on a smart material camber-morphing wing. In a series of wind tunnel gust experiments at the University of Michigan, trained controllers reduced gust impact by 84% from on-board pressure signals. Notably, gust alleviation using signals from only three pressure taps was statistically indistinguishable from using six pressure tap signals. By efficiently rejecting environmental perturbations, reduced-sensor fly-by-feel controllers open the door to small uncrewed aerial vehicle missions in cities.

Machine learning for flow-informed aerodynamic control in turbulent wind conditions

Article Open access 16 December 2022

WindSeer: real-time volumetric wind prediction over complex terrain aboard a small uncrewed aerial vehicle

Article Open access 25 April 2024

Grasping extreme aerodynamics on a low-dimensional manifold

Article Open access 14 October 2023

Introduction

Although both the public sector and defense agencies are interested in urban uncrewed aerial vehicle (UAV) mission performance, fixed winged aircraft are still incapable of adapting to the complex aerodynamics within a city environment^1,2,3. Currently, the most dynamic environments are dominated by multirotor flight vehicles; however, the highly maneuverable and responsive quadrotor design suffers from substantial weight and power constraints, limiting the operational range and on-board computational capabilities needed for autonomy^4,5,6,7. Current fixed wing UAVs have greater range but are not as maneuverable⁸. Counter to both rotorcraft and traditional fixed wing UAV design, birds can adapt their wing shape as the environment changes to achieve both efficient and maneuverable flight^9,10. This ability supports birds of prey in navigating through complex environments¹¹, or rejecting perturbations in a gusty environment^12,13. UAVs can achieve a similar adaptive gust rejection by changing the shape of their wings with camber morphing (Fig. 1a).

**Fig. 1: Natural flyers use wing shape morphing to reject gusts.**

Wing morphing brings several challenges regarding mechanical complexity and compliance with the weight and volume constraints of small UAV design. Recent advances in smart materials offer a clever way to address these challenges^14,15. Macro-fiber composites (MFC) have been used for bio-inspired soft robotics and can act as both the skin and actuator of a camber-morphing wing^16,17. By rapidly changing the wing’s curvature, MFCs can actively reduce the aerodynamic forces experienced during gusts without the mechanical complexity associated with large scale shape changes. In addition, the smooth shape change offered by MFC camber-morphing improves aerodynamic efficiency, speed, weight reduction, and overall control authority when compared to traditional rigid flap actuation methods^18,19,20. However, MFCs suffer from hysteresis, creep, and inconsistent performance under out-of-plane loading. These challenges informed our autonomous gust alleviation (GA) controller design for a camber morphing wing with three active MFC sections (Fig. 1b).

Autonomous gust rejection is a key part of the puzzle that must be achieved to enable small, fixed wing UAVs to complete missions in complex aerodynamic environments, thus expanding the operational range compared to their quadrotor counterparts. Perturbations, such as gusts, impact flight performance and complicate tracking of predefined trajectories²¹. This is especially true for small UAVs due to their lightweight nature. Historically, gust response requires a pilot or autopilot to respond to a perturbation with an antagonistic action^22,23. However, these corrections occur after the external force has already perturbed the aircraft, and pilot reaction times typically fall between 0.4 s and 1.3 s after a perturbation signal before providing an input reaction²⁴. This may compromise mission success when strict altitude caps are in place, such as during nap-of-the-earth flight²⁵. Autopilot systems following classical control theory have used traditional control surfaces with strain gauges for feedback to achieve 50% gust load and flight ride quality improvement²³. Recently this response has been improved to 80% when assuming a Doppler light detection and ranging (LIDAR) system was available to provide a preview of incoming gusts²⁶. Instead of responding to a perturbation after it occurs, or spending computation and weight resources on LIDAR systems to look ahead for future perturbations, our fly-by-feel (FBF) active GA senses environmental changes on the wing in real time, beginning the initial morphing reaction in as little as one discrete timestep (0.05 s) to mitigate unintended changes in aerodynamic forces during a gust.

Successful adaptation, such as that provided by GA, relies on an accurate representation of the changing environment^27,28,29. FBF is a biologically inspired paradigm that uses distributed sensors to inform UAVs of environmental changes^{29,30,31,32,33,34}. Recently, FBF achieved up to 76% mean gust rejection on a servo-driven camber morphing wing by using incremental nonlinear dynamic inversion with quadratic programming and virtual shape functions (INDI-QP-V), incorporating sixteen on-board piezoelectric pressure sensors to detect changes in the airflow for state inference as well as fourteen fiberoptic cables, twelve strain gauges, and a wing root mounted camera to detect camber deflection with proprioceptive modeling³⁵. However, the expansive sensing networks used to inform decision making through proprioception and state inference add weight and challenge the computational power capabilities offered by small UAVs^5,6,7,28,36. Instead of relying on vast amounts of sensory data for decision making, we used intelligent controller design to determine if fewer sensors could be used to achieve GA while reducing computational cost. The model-based controllers often used for GA require highly accurate predictions to achieve sufficient control because any errors produced prior to action selection propagate through the controller. This dramatically increases computational costs^27,37,38,39. Alternatively, model-free deep reinforcement learning (DRL) can train neural networks to make action decisions directly from raw sensor inputs without using dynamics or state inference models^40,41. Proximal policy optimization (PPO) is a DRL algorithm that has shown to account for MFC hysteresis and produce effective camber control in a morphing airfoil^42,43. For this reason, we used PPO to develop the GA policies (i.e., controllers) directly from three different sensor combinations (Supplementary Fig. 1). Controllers were trained to make decisions in a gusting wind tunnel environment based on pressure signals provided by up to six pressure taps installed on the top surface of the morphing wing (Fig. 1c).

Most successful DRL applications are trained in simulation due to the repetitive nature of DRL’s trial-and-error training format^44,45. However, accurately simulating complex, gusty environments requires large computational time and cost^46,47. We avoided the computational costs as well as the uncertainty associated with simplified approximation by training directly on the physical hardware environment. Although training in the physical hardware space offers unique challenges, we found success using methods emphasizing efficiency and autonomy in state-action exploration through a pseudo-episodic training method^48,49. This training format requires an automatic transition between episodes. Therefore, we adapted methods previously established in the literature to automate a gusting environment^29,35,36. By deflecting a rigid wing, mounted in a wind tunnel upstream of our morphing wing, we exposed the morphing wing to a broad range of repeatable gusts during training to facilitate thorough exploration of the dynamic environment’s state and action spaces (Fig. 1d). Exploration is crucial for developing a robust controller capable of effectively rejecting the various degrees of perturbation experienced in a city. Therefore, during training the gust generator induced a variety of wakes representative of the updrafts and downdrafts experienced when flying over the complex street systems between buildings (Fig. 1e). Autonomously rejecting these types of gusts with reduced-sensor FBF will open the door to urban flight for fixed wing UAVs.

Results

Gust impact and reduction

The gust generator used in this wind tunnel environment perturbed the local angle of attack for the incoming airflow in a manner analogous to common flight situations in natural and urban environments (Supplementary Fig. 2). The controller experienced the gusts as instantaneous changes in wind speed and direction, similar to a sharp-edged gust model (see materials and methods). This model is often used to imitate an aircraft encountering an updraft, as found between two buildings, resulting in a change in lift^21,23,50,51. The magnitude of gust-generated lift that was rejected by the active morphing wing was termed the gust rejection percentage (GRP) defined as:

$${{{{{\rm{GRP}}}}}}\left(t\right)=\left(1-\frac{{{{{{{\rm{|}}}}}}\varDelta L}_{C}\left(t\right){{{{{\rm{|}}}}}}}{{{{{{\rm{|}}}}}}\frac{1}{T}\mathop{\sum }\nolimits_{t=0}^{T}{\varDelta L}_{B}\left(t\right){{{{{\rm{|}}}}}}}\right)\times 100 \% .$$

(1)

GRP was measured as a percentage difference between the change in lift during active morphing control, ΔL_C, and the baseline average change in lift, ΔL_B, produced by the wing when unactuated over the duration of the gust, T (Fig. 2a, b). To replicate common scenarios experienced during city flight, tests were conducted at three different flight conditions (low-lift, medium-lift, and high-lift) for three gust magnitudes (mild, moderate, and strong) in two directions (upward and downward) (Supplementary Table 1). Although the high-lift condition experienced smaller gust impact (5% change in lift), the medium-lift and low-lift conditions experienced much larger ranges and magnitudes of gust impacts (28% and 29% change in lift, respectively). To define the stability and robustness of the trained neural network policies, we trained a total of twenty (20) policies and repeated gust alleviation performance tests ten (10) times for each gust condition (6), resulting in 1200 gust rejection wind tunnel tests. We quantified a controller’s consistency between individual test iterations, gust conditions, and trained policies using the average standard deviation (STD) of the settled GRP between tests while holding all other factors constant. The settled GRP was consistent between test iterations for a single policy at each gust condition (high-lift: STD = 4.9%; medium-lift: STD = 2.3%; low-lift: STD = 2.5%) (Fig. 2c), but the average settled GRP performance of individual trained policies was less consistent between gust conditions (high-lift: STD = 10.5%; medium-lift: STD = 21.4%; low-lift: STD = 19.0%) (Fig. 2d). However, the average settled GRP was consistent between trained policies for each gust condition (high-lift: STD = 8.2%; medium-lift: STD = 7.5%; low-lift: STD = 5.7%) (Fig. 2e).

**Fig. 2: Gust Rejection Percentage (GRP) provides a metric for controller performance and consistency.**

We repeated the training and testing process described above to measure GRP for three sensor configurations: one, three, and six chordwise distributed pressure taps (Fig. 3a). This resulted in 3600 gust rejection wind tunnel tests in total. We found the number of pressure taps used for state observation significantly affected the trained GA controller performance.

**Fig. 3: The number of pressure taps significantly affected gust rejection performance.**

Diminishing effect of rearward sensors

We used the settled GRP from each test to calculate the mean gust rejection percentage for each pressure tap configuration and gust condition (Fig. 3b–d). Controllers using all six pressure taps consistently achieved large mean gust rejections for each flight condition (high-lift: 84%; medium-lift: 84%; low-lift: 86%) relative to the respective gust-generated change in lift. When we reduced the number of signals informing the DRL algorithm to only use one pressure tap, we found a significant reduction in the gust rejection performance (high-lift: P = 0.006; medium-lift: P < 0.001; low-lift: P < 0.001). However, when using only three pressure taps, we found an insignificant effect on the gust rejection compared to the six-tap case for all tested flight conditions (high-lift: P = 0.40; medium-lift: P = 0.32; low-lift: P = 0.67). This result indicates that the increased complexity of the six-tap input did not yield additional improvements in gust rejection performance beyond the three-tap construction. In fact, for the medium-lift flight condition, the three-tap configuration achieved greater, although not significantly greater, mean gust rejection.

The mean GRP is only part of the puzzle. Performance consistency is important if this approach is to provide safe and reliable flight control for future UAVs. Therefore, we directly considered the uncertainty of our gust rejection metric using the standard deviation of the settled GRP distributions (Supplementary Fig. 3a–c). We found that the one-tap configuration was significantly less consistent than the controllers using more pressure taps (high-lift: P = 0.001; medium-lift: P < 0.001; low-lift: P < 0.001). Like the mean results, we found no significant difference between the consistency of the six-tap and three-tap configurations (high-lift: P = 0.20; medium-lift: P = 0.91; low-lift: P = 0.46). Note that the standard deviations were small relative to the gust-generated change in lift (one tap: 15%, three taps: 14%, six taps: 12%), suggesting that the active morphing gust rejection was overall quite consistent for our implementation.

Timing is also a crucial component of perturbation response since a slower reaction would negate much of the benefit offered by the correction. The instantaneous change in lift produced by the sharp-edged gusting environment neglected the buildup in gust intensity typically found in nature, creating a challenging environment for controller response. Still, using rise time, we quantified the controllers’ speed to comment on how reducing sensor count impacted the active responsiveness of the system (Fig. 3e). We found that the controller speed was not significantly affected by the pressure tap configurations (P > 0.05) for all flight conditions and was consistent with rise times established in previous work where DRL controllers showed to be faster than traditional feedback control methods for an MFC morphing wing⁴² (Fig. 3f–h). However, the higher intensity gusts resulted in greater rise times, which suggests the limited discrete action space likely restricted controller speeds. Rise time uncertainty was considered using standard deviation, as done previously with gust rejection (Supplementary Fig. 3d–f).

Next, we explored the functional differences between the number of taps used and found that sensitivity of the pressure taps decreased towards the trailing edge of the wing (Fig. 4a), explaining the insignificant difference in performance between using three sensors and six sensors. The leading-edge pressure taps showed the greatest sensitivity for both positive and negative gust deflections, which is consistent with expectations as this region is usually responsible for the largest suction peak on lift producing airfoils. Comparing upward and downward gusts in the high-lift flight condition, the second pressure tap showed less sensitivity (27% reduction) during the downward gust than during the upward gusts. The third tap, however, showed a steep reduction in sensitivity (83%) when experiencing a downward gust as opposed to an upward gust. Similar effects occurred in the other flight conditions as well (Supplementary Fig. 4).

**Fig. 4: The third pressure tap lost sensitivity during downward gusts for the high-lift flight condition.**

Downward gusts challenge sensing

Despite the overall success, we found situations in which the controllers underperformed relative to the other tested gust conditions, including the mild downward gust during high-lift flight (Fig. 3b). For this condition, the wing morphing controller overcompensated by actuating the trailing edge to a magnitude appropriate for a larger change in lift (Fig. 4b). However, this effect did not occur for the mild upwards gust in the same flight condition. These results suggested that the controllers were less effective at differentiating between the magnitudes of downward gusts in this flight condition.

To investigate further, we used particle image velocimetry (PIV) to quantify the change in local flow velocity across the top surface of the morphing wing at each tested gust condition compared to the baseline neutral gust condition during high-lift flight (Fig. 4c). The mild upward gust condition (7.5° gust generator deflection) increased the flow velocity over the first three pressure taps. The mild downward gust (−7.5° gust generator deflection) reduced velocity at the leading edge of the wing. However, the change in velocity shifted from negative to positive near the third pressure tap, producing a minimal pressure change. For the strong downward gust (−12.5° gust generator deflection) there was a larger reduction of velocity at the leading edge of the wing, but the velocity change near the third pressure tap was still weak. Despite this, the trained controllers still achieved mean GRP values of above 73% for the three-tap and six-tap configurations in this challenging gust condition.

The strong downward gust during low-lift flight also produced disproportionately low performance relative to the other gusts within the same flight condition. In this case, the controller undershot the target, again suggesting it was difficult to distinguish between downward gust magnitudes. Interestingly, this gust was generated by a similar deflection angle (−8°) to that of the other challenging gust condition. This may provide insight into a challenging characteristic specific to our gust generating mechanism as opposed to a deficiency in the gust rejection controller design. The wake behind a deflecting wing produced changes in lift similar to those experienced during a vertical gust but generated additional streamwise aerodynamic effects (Fig. 1e) that are absent in traditional gust models.

Discussion

Here we showed a FBF controller that does not require many sensors to effectively reject gusts. The learned controllers consistently achieved greater than 80% gust rejection without the computational and mechanical complexities associated with expansive distributed sensing networks. This suggests that the success of FBF aircraft need not depend on our ability to implement highly complex large scale distributed networks if we can effectively identify a reduced set of sensors that provides comparable performance. These results run counter to the big data mentality that is pervasive in deep learning and has recently driven sensor network design in machine learning based distributed sensing applications, including FBF^30,31,52. Like intelligent feature selection in deep learning, intelligent controller and sensor design can achieve reduced-sensor FBF, providing an efficient alternative to large-scale distributed sensing networks⁵³. This reduces mechanical complexity and cost during fabrication as well as weight and computational requirements during operation. In addition, where human pilots naturally have a delayed initial reaction (0.4 s to 1.3 s) to gusts, FBF can begin changing shape in as little as a single timestep (0.05 s), and we showed that the controller speed was not impacted by reduced sensor input²⁴. Further, we expect that optimizing the controller action space would provide a more rapid response. Our findings suggest that these cost-effective solutions can expand the mission scope of small, fixed-wing UAVs to increasingly dynamic environments. This creates the opportunity for numerous critical applications^8,54.

Incorporating reduced-sensor FBF UAVs for surveillance and disaster response will drastically improve safety for those living in large cities⁴. The range offered by fixed wing designs will provide greater coverage than that achieved by quadrotor designs, allowing them to provide broader surveillance or survey fire and earthquake scenes across the city for extended periods of time. This technology may prove particularly useful to first responders impeded by street traffic, communicating crucial information to improve efficiency and safety. Similarly, we can apply these methods to long range urban reconnaissance for soldiers encountering potentially dangerous situations.

Finally, the success of this model-free method promotes future intelligent aircraft designs for other complex maneuvers and environments where accurate models are not readily available. For example, similar hardware-based learning may produce controllers for morphing UAVs with alternative shape changes to achieve avian-like aerobatics. Banking, diving, and perching in obstacle-dense environments, such as forests, opens the door to mission performance in natural disaster scenarios such as flooding, hurricanes, and wildfires^54,55. The extended range offered by adaptive FBF morphing UAVs will greatly improve survey coverage and search and rescue response by increasing the distance covered and time in flight between charges.

Methods

Morphing wing construction

We designed the morphing wing with three 42 mm wide active sections separated by two 51 mm wide passive sections to form a 228 mm wide wing with a 320 mm chord. To construct the active sections, we followed the methods established in previous work, which combine a NACA0012 leading edge with an antagonistic double macro-fiber composite (MFC) unimorph trailing edge¹⁷. We used multi-material 3D printing to include a flexure box design at the interface between the rigid and morphing portion of our active wing section to maximize deflection potential. Unlike in the previous work, we used narrower M8528-P1 MFCs to allow for three active sections to fit within our wind tunnel. Using epoxy, we bonded each MFC to a 0.025 mm stainless steel shim to produce a bending shape change when actuated. We also used epoxy to attach the active trailing edge section to the flexure box interface at the rear of the rigid leading edge.

We constructed the passive sections following methods established by Pankonien et al. for a spanwise morphing wing¹⁷. The passive sections contain a rigid NACA0012 leading section, but don’t have a rigidly structured trailing end. Instead, structure was provided by the spanwise skin extending across the full wing. Bonding a soft 3D-printed mixed cruciform honeycomb to the elastic silicon skin provided additional strength to the trailing edge of the passive sections^56,57. This allowed the passive sections to smoothly morph with the active sections while maintaining structural integrity under out of plane aerodynamic loading.

Within each passive section of the wing, we installed six 0.5 mm pressure taps for state observation. The pressure taps were located at positions of 0%, 1.5%, 5%, 10%, 40%, and 50% of the chord length measured from the leading edge. We offset the front four pressure taps at an angle of 30° from the leading tap to mitigate the effect of upstream pressure taps on the flow⁵⁸. Due to the large separation between the front four and rear two pressure taps, we installed the two rearmost pressure taps at a separate 30° angle, not including the front four taps to allow all taps to fit within the passive wing section. Each 1.5 mm pressure tap hole was included in the 3D-printed NACA0012 leading section of the airfoil. We used epoxy to fasten ethyl vinyl acetate tubing into the pressure tap locations. After installation, we used a razorblade to cut the end of each pressure tap to be flush with the surface of the morphing wing to avoid disrupting the flow over the wing.

Experiment setup

The final morphing wing design was installed 30 cm behind a gust generator (measured at quarter chord positions) in the 30 cm × 30 cm wind tunnel at the University of Michigan (Fig. 5). We created a gusting environment for three flight configurations (high-lift, medium-lift, low-lift) by using various combinations of morphing wing angles of attack (α = 10 ± 1°, 4 ± 1°, 4 ± 1°) and flow speeds (U = 10 m s⁻¹, 15 m s⁻¹, 10 m s⁻¹) as measured ahead of the gust generator (Supplementary Table 1). We included elliptical endplates on the wing to prevent wing tip vortices from forming, limiting this analysis to 2D airfoil effects. We measured the morphing wing’s lift using a six-axis ATI Delta load cell mounted at the quarter-chord. Six compact differential low-pressure transducers measured the pressures experienced by the six pressure taps in comparison to the static pressure located at the front of the test section of the wind tunnel, as measured using a pitot-tube. The gust generator consisted of a 15 cm chord NACA0012 rigid wing with a 25 cm span. We used a stepper motor operated turntable to vary the gust generator’s angle of attack and create the desired gust deflection.

**Fig. 5: Data flow structure of our gusting wind tunnel experiment for controller training and testing.**

The gust generator’s deflection angle produced different gust intensities depending on the wind tunnel flight condition (high-lift, medium-lift, low-lift). We found the effect of the gust generator setup was sensitive to the angle of attack of our morphing wing. At the highest tested angle of attack (10 ± 1°), the gust generator produced the smallest effect, even when using larger deflections. We limited our gust generator deflections to a range between positive and negative 12.5˚ during tests to prevent stall and avoid highly variable wake effects. Training included maximum deflections up to 13.5˚ to allow for the randomized training exploration to include states around the maximum testing conditions. The generated gusts had greater effect with flight configurations at the lower angle of attack (4 ± 1°) and gained an even stronger effect at the higher flow speed (15 m s⁻¹). Therefore, we used gust generator deflection ranges that produced changes in lift that were recoverable within the structural morphing capabilities of the wing (Supplementary Table 1).

To create learned controllers capable of reacting to the changing environment, we adapted an open-source implementation of proximal policy optimization (PPO) in Pytorch to develop policies for the camber morphing wing⁵⁹. The deep reinforcement learning (DRL) environment included a discrete action space. The first testing configuration (high-lift) used a symmetric action space of 7 voltage signal changes. For the subsequent flight conditions (medium-lift and low-lift), we reduced the action space to 3 voltage signal changes, sacrificing potential controller speed for a smaller action space. This compromise required less exploration and potentially improved variability between trained controllers (Supplementary Table 1). Each flight configuration used the same continuous state space, including normalized change in pressure signals and normalized MFC voltage signals.

The actor and critic network structures included a one-dimensional convolutional neural network input layer with the ten most recent state measurements for state observation, resulting in input dimensions of 2 × 10, 4 × 10, and 7 × 10 for the one-tap, three-tap, and six-tap configurations, respectively (Fig. 6). This layer included convolutions with kernel lengths of three and a stride length of one. The two subsequent hidden layers were structured linearly with 512 nodes each and rectified linear unit (ReLU) activation functions^60,61. Due to challenges and time constraints associated with DRL training in hardware environments, many hyperparameters were selected based on previous work performed in a similar MFC morphing environment⁴² (Supplementary Table 2). However, we tuned the learning rate manually, determining a value of 3 × 10⁻⁵ to be suitable for Adam optimization⁶². We used change in lift as our optimization parameter, using real-time load cell measurements to provide a reward to the learning algorithm. The goal of the learning algorithm was to develop a controller that minimized the change in lift experienced during a gust using the reward function,

$$R\left(t\right)=-10\times {\varDelta L}_{C}^{2}\left(t\right).$$

(2)

**Fig. 6: The neural network structure for the actor and critic models in the proximal policy optimization (PPO) algorithm.**

Although lift measurements were used for the reward structure during training, the controllers did not use lift information for action selection. The learned policies only used pressure and MFC voltage signals for action selection. During testing, the load cell provided information to judge controller performance.

A Python script in Jupyter Notebooks orchestrated controller training and testing (Fig. 5). For this work, we defined a gust as a change in effective wind velocity, including speed and direction. Due to electromagnetic interference, the load-cell and pressure sensors were unable to provide accurate signals during step-motor operation. During training and testing, our script paused timestep progression, policy updates, and data collection during gust generator rotation, then resumed training and testing after the gust generator achieved the desired deflection. Due to this full computational pause during rotation, gusts appeared as immediate changes in lift (Fig. 2a). This resulted in perturbations, as viewed by the controller, that are analogous to the sharp-edged updrafts and downdrafts that are used to model changes in lift experienced by small UAVs in gusty city environments^{21,23,50,51,63,64}.

Training was formatted in a pseudo-episodic manner, alternating between baseline episodes and gusting episodes to facilitate autonomy during training⁴⁹. Each episode began after rotating the gust generator to a specified location depending on the episode’s function. Baseline episodes began at zero degrees and gusting episodes began with the gust generator rotated to a random deflection within the specified training gust range (Supplementary Table 1). The MFC actuators began baseline episodes without camber morphing in either direction. From this neutral position, the pressure taps provided a base signal for comparative pressure observations throughout the episode. After initialization was completed, the episode began, including policy action selection and learning updates. The initialized pressure and goal lift values were recorded and carried into the following gusting episode to maintain the same base signals for calculating comparative pressure and reward values. In addition, the MFC sections began gusting episodes actuated to the same position in which they ended the prior baseline episode. Gusting episode action selection and training began after the gust generator was deflected to a randomized position where it was held for the length of the episode, 200 timesteps, representing an extended 10 s gust. Terminating the gust operation represented the completion of a training episode pair (baseline and gusting), returning the gust generator to zero degrees and the morphing wing MFCs to a neutral deflection position to begin a new initialization and subsequent baseline episode.

Training included 1000 total episodes consisting of 200 timesteps of 0.05 s. Learning updates occurred after every 20 timesteps from four minibatches of five state-action samples in series, resulting in a maximum sample size of 2 × 10⁵ for policy training. Progress was observed using a running average reward earned over 100 consecutive episodes, from which the highest performing policy was evaluated for testing (Supplementary Fig. 1). We used this procedure to train controllers (high-lift: n = 10; medium-lift: n = 5; low-lift: n = 5) for each of three different pressure tap configurations, including: using all six pressure taps, the front three pressure taps, and a single pressure tap on the leading edge of the morphing wing. We selected these pressure tap configurations based on the pressure distribution expected for the top surface of a symmetric airfoil and the sensitivity of the respective tap locations⁵⁸. In all, this approach resulted in 60 trained controllers.

Testing

We tested each of the 60 controllers at their trained flight condition (high-lift, medium-lift, low-lift) for three gust magnitudes (mild, moderate, strong) in two directions (upwards and downwards). Upward gusts were denoted as positive and downward as negative (Supplementary Table 1). This resulted in 360 independent testing conditions. Like the baseline training episodes, each testing episode began with an initialization period to reset the base pressure tap signals during neutral airflow. After initialization, the test episode timestep count and controller action selection began. The first quarter of the testing episode consisted of neutral airflow, followed by the gust generator deflecting to a specified gust condition for the following two-quarters of the testing episode. Finally, the gust generator returned to a deflection of zero, concluding the discrete gust and remaining at neutral for the final quarter of the test (Fig. 2a). For each test, we measured controller performance as a gust rejection percentage (GRP), comparing the change in lift experienced by the active camber morphing wing, ΔL_C, to the baseline change in lift measured when the same wing remained unactuated during the gust, ΔL_B (Eqn. 1) (Fig. 2a).

Due to the black-box nature of neural networks, and the policies developed using such methods, we accounted for stability and robustness of control through repetition. For the initial flight condition (high-lift), we repeated gust alleviation performance tests ten (10) times for each combination of trained controller (10), gust condition (6), and pressure tap configuration (3). This amounted to 1800 gust rejection tests. We measured consistency in performance between test iterations (Supplementary Fig. 5), gust conditions (Supplementary Fig. 6), and training iterations (Supplementary Fig. 7) while all other factors were held constant. Following the completion of testing at the high-lift flight condition, we repeated the process for five (5) trained controllers at both additional flight configurations (low-lift and medium-lift) to test the robustness of our methods and results for different angles of attack and airflow speeds (Supplementary Table 1). This doubled our previous count of test data, resulting in 3600 gust rejection tests in total (Supplementary Figs. 8–13).

We calculated settled GRP for each gust response test by averaging the GRP achieved during the last half of the gust alleviation test,

$${{{{{\rm{settled\; GRP}}}}}}=\frac{2}{T}\mathop{\sum }\limits_{t=T/2}^{T}{{{{{\rm{GRP}}}}}}\left(t\right).$$

(3)

Therefore, a higher settled GRP represented greater gust rejection performance. We calculated the settled GRP values for each individual test, providing distributions of n = 100 GRP values for each gust and pressure tap configuration at the high-lift flight condition, and n = 50 for each gust and pressure tap configuration at the medium-lift and low-lift flight conditions. Due to the maximum bounded nature of this metric, many distributions were skewed to varying degrees (Supplementary Figs. 14–16). Although the median is traditionally used to represent central tendency for highly skewed distributions, since the distributions were predominantly skewed away from superior performance and there was a large variation in skew between testing conditions, we used the mean as a conservative estimate of central tendency for our primary performance metrics. Further, we use statistical methods to comment on the significance when comparing performances between controllers using different pressure tap configurations. Initially we used a linear mixed effects model to determine the relationship between GRP and the number of pressure taps while considering the random effects of the tested gust conditions and the individual trained controllers. However, we found that the residuals were not normally distributed and therefore broke linear assumptions. Therefore, we trained generalized linear mixed effects models using Markov chain Monte Carlo to provide statistical analyses that were more robust to the variably skewed distributions offered by our tests.

We also considered performance consistency by measuring the absolute difference between the settled GRP of an individual test to the average settled GRP for the associated test condition (flight configuration, gust condition, and number of used pressure taps). This provided a metric for each individual test from which we used another generalized linear mixed effects model to determine significance when comparing gust rejection consistency between controllers using one, three, and six pressure taps.

Finally, we measured the speed of our controllers using rise time, measured as the time required for the learned controllers to increase GRP from 10% to 90% of the settled GRP. Therefore, a lower rise time represented a faster response. Rise times were measured for each test. Although many of these test distributions were highly skewed, because the distributions were predominantly skewed toward slower rise times and there was a large variance in skew between distributions, we again used the mean as a conservative estimate of central tendency (Supplementary Figs. 17–19). Again, we used a generalized linear mixed effects model to analyze the significance between the speed of controllers using one, three, and six pressure taps.

When investigating the sensor signal degradation that occurred during the downward gusts, we used a LaVision particle image velocimetry (PIV) system with DaVis 10 intelligent imaging software to characterize the various aerodynamic effects developed by the gust generator (Fig. 1e). Oil-based smoke particles were accelerated through the open-loop wind tunnel. An EverGreen double-pulse quantel laser mounted outside the wind tunnel illuminated a two-dimensional sheet of particles in the longitudinal dimensions. Above the wind tunnel, two Imager sCMOS cameras in a stereo configuration captured 50 sets of paired images with 15-µs intervals. From this, we captured the mean velocity profiles in the x and z directions of the wind frame of reference up stream of and around the morphing wing, including the locations where pressure taps were installed (Fig. 4c).

Data availability

All data gathered from experimentation and used for analysis are available to be viewed on the corresponding author’s GitHub repository at: https://github.com/kevpatha/few_sensor_gust_alleviation/.

Code availability

All code used for experimentation and analysis are available to be viewed on the corresponding author’s GitHub repository at: https://github.com/kevpatha/few_sensor_gust_alleviation/.

References

Geng, L., Zhang, Y. F., Wang, J. J., Fuh, J. Y. H. & Teo, S. H. in 2013 10th IEEE International Conference on Control and Automation (ICCA) 828–833 (2013).
Dutt, A. J. Wind flow in an urban environment. Environ. Monit. Assess. 19, 495–506 (1991).
Article Google Scholar
Hertwig, D. et al. Wake characteristics of tall buildings in a realistic urban canopy. Bound. Layer. Meteorol. 172, 239–270 (2019).
Article Google Scholar
Giyenko, A. & Cho, Y. I. in 2016 Joint 8th International Conference on Soft Computing and Intelligent Systems (SCIS) and 17th International Symposium on Advanced Intelligent Systems (ISIS). 729–733 (2016).
Kang, K., Belkhale, S., Kahn, G., Abbeel, P. & Levine, S. in 2019 International Conference on Robotics and Automation (ICRA). 6008–6014 (2019).
Mandel, N., Milford, M. & Gonzalez, F. A method for evaluating and selecting suitable hardware for deployment of embedded system on UAVs. Sensors 20, 4420 (2020).
Article Google Scholar
Zhao, Y., Zheng, Z. & Liu, Y. Survey on computational-intelligence-based UAV path planning. Knowl.-Based Syst. 158, 54–64 (2018).
Article Google Scholar
Russell, L., Goubran, R. & Kwamena, F. in 2019 15th International Conference on Distributed Computing in Sensor Systems (DCOSS) 546–553 (2019).
Harvey, C., de Croon, G., Taylor, G. K. & Bomphrey, R. J. Lessons from natural flight for aviation: then, now and tomorrow. J. Exp. Biol. 226, jeb245409 (2023).
Article Google Scholar
Harvey, C. et al. A review of avian-inspired morphing for UAV flight control. Prog. Aerosp. Sci. 132, 100825 (2022).
Article Google Scholar
Pagel, J. E. et al. in Urban Raptors: Ecology and Conservation of Birds of Prey in Cities (eds Boal, C. W. & Dykstra, C. R.) 180–195 (Island Press/Center for Resource Economics, 2018).
Cheney, J. A. et al. Bird wings act as a suspension system that rejects gusts. Proc. R. Soc. B: Biol. Sci. 287, 20201748 (2020).
Article Google Scholar
Reynolds, K. V., Thomas, A. L. R. & Taylor, G. K. Wing tucks are a response to atmospheric turbulence in the soaring flight of the steppe eagle Aquila nipalensis. J. R. Soc. Interface 11, 20140645 (2014).
Article Google Scholar
Bilgen, O., Kochersberger, K. B., Inman, D. J. & Ohanian, O. J. III. Novel, bidirectional, variable-camber airfoil via macro-fiber composite actuators. J. Aircr. 47, 303–314 (2010).
Article Google Scholar
Sun, J., Guan, Q., Liu, Y. & Leng, J. Morphing aircraft based on smart materials and structures: A state-of-the-art review. J. Intell. Mater. Syst. Struct. 27, 2289–2312 (2016).
Article Google Scholar
Gamble, L. L. & Inman, D. J. A tale of two tails: developing an avian inspired morphing actuator for yaw control and stability. Bioinspiration Biomim. 13, 026008 (2018).
Article Google Scholar
Pankonien, A. & Inman, D. J. in Active and Passive Smart Structures and Integrated Systems 2013. Vol. 8688, 352–364 (SPIE, 2013).
Pankonien, A. M. Smart Material Wing Morphing for Unmanned Aerial Vehicles. University of Michigan, Ann Arbor, MI, PhD diss.,(2015).
Gamble, L. L., Pankonien, A. M. & Inman, D. J. Stall recovery of a morphing wing via extended nonlinear lifting-line theory. AIAA J. 55, 2956–2963 (2017).
Article Google Scholar
Nathan, D. et al. Si-based self-programming neuromorphic integrated circuits for intelligent morphing wings. J. Compos. Mater. 56, 4561–4575 (2022).
Article Google Scholar
Wu, Z., Cao, Y. & Ismail, M. Gust loads on aircraft. Aeronaut. J. 123, 1216–1274 (2019).
Article Google Scholar
Hunsaker, J. C. & Wilson, E. B. Report on behavior of aeroplanes in gusts. No. NACA-TR−1 (1917).
Regan, C. D. & Jutte, C. V. Survey of Applications of Active Control Technology for Gust Alleviation and New Challenges for Lighter-weight Aircraft. Report No. DFRC-E-DAA-TN4736 (2012).
Binias, B., Myszor, D., Palus, H. & Cyran, K. A. Prediction of pilot’s reaction time based on EEG signals. Front. Neuroinform. 14, 6 (2020).
Article Google Scholar
Cheng, V. H. L. & Sridhar, B. Considerations for automated nap-of-the-earth rotorcraft flight. in 1988 American Control Conference 967–976 (1988).
Hamada, Y., Saitoh, K. & Kobiki, N. Gust alleviation control using prior gust information: wind tunnel test results. IFAC-PapersOnLine 52, 128–133 (2019).
Article MathSciNet Google Scholar
Giesseler, H.-G., Kopf, M., Varutti, P., Faulwasser, T. & Findeisen, R. Model predictive control for gust load alleviation. IFAC Proc. Vol. 45, 27–32 (2012).
Article Google Scholar
Haughn, K. P., Gamble, L. L. & Inman, D. J. MFC Morphing Aileron Control With Intelligent Sensing. Vol. 86274, V001T03A013 (American Society of Mechanical Engineers, 2022).
Pankonien, A. M., Magar, K. S. T., Beblo, R.V. & Reich, G. W. Gust prediction via artificial hair sensor array and neural network. in A Tribute Conference Honoring Daniel Inman Vol. 10172, 55–64 (SPIE, 2017).
Hollenbeck, A. C., Grandhi, R., Hansen, J. H. & Pankonien, A. M. Bioinspired artificial hair sensors for flight-by-feel of unmanned aerial vehicles: a review. AIAA J. 1–26 (2023).
Topac, O. T. et al. Hybrid models for situational awareness of an aerial vehicle from multimodal sensing. AIAA J. 61, 305–314 (2023).
Article Google Scholar
Armanious, G. & Lind, R. Fly-by-feel control of an aeroelastic aircraft using distributed multirate Kalman filtering. J. Guid. Control Dyn. 40, 2323–2329 (2017).
Article Google Scholar
Araujo-Estrada, S. A. & Windsor, S. P. Aerodynamic state and loads estimation using bioinspired distributed sensing. J. Aircr. 58, 704–716 (2021).
Article Google Scholar
Huang, Y. et al. Flexible smart sensing skin for “Fly-by-Feel” morphing aircraft. Sci. China Technol. Sci. 65, 1–29 (2022).
Article Google Scholar
Wang, X., Mkhoyan, T., Mkhoyan, I. & De Breuker, R. Seamless active morphing wing simultaneous gust and maneuver load alleviation. J. Guid. Control Dyn. 44, 1649–1662 (2021).
Article Google Scholar
Maraj, J. J., Haughn, K. P., Inman, D. J. & Sarles, S. A. Sensory adaptation in biomolecular memristors improves reservoir computing performance. Adv. Intell. Syst. 5, 2300049 (2023).
Zeng, J., Moulin, B., de Callafon, R. & Brenner, M. J. Adaptive feedforward control for gust load alleviation. J. Guid. Control Dyn. 33, 862–872 (2010).
Article Google Scholar
Wu, Z., Chen, L., Yang, C. & Tang, C. Gust response modeling and alleviation scheme design for an elastic aircraft. Sci. China Technol. Sci. 53, 3110–3118 (2010).
Article Google Scholar
Thapa Magar, K. S., Pankonien, A. M., Reich, G. W. & Beblo, R. Optimal control framework for gust load alleviation using real time aerodynamic force prediction from artificial hair sensor array. in 2018 AIAA Guidance, Navigation, and Control Conference (American Institute of Aeronautics and Astronautics, 2018).
Mnih, V. et al. Playing Atari with deep reinforcement learning. Preprint at https://arxiv.org/abs/1312.5602 (2013).
Sutton, R. S. & Barto, A. G. Reinforcement Learning: an Introduction (MIT Press, 2018).
Haughn, K. P., Gamble, L. L. & Inman, D. J. Deep reinforcement learning achieves multifunctional morphing airfoil control. J. Compos. Mater. 57, 721–736 (2023).
Article Google Scholar
Schulman, J., Wolski, F., Dhariwal, P., Radford, A. & Klimov, O. Proximal policy optimization algorithms. Preprint at https://arxiv.org/abs/1707.06347 (2017).
Guerra-Langan, A., Estrada, S. A. & Windsor, S. Reinforcement learning to control lift coefficient using distributed sensors on a wind tunnel model. in AIAA SCITECH 2022 Forum (American Institute of Aeronautics and Astronautics, 2022).
Wada, D., Araujo-Estrada, S. & Windsor, S. Sim-to-real transfer for fixed-wing uncrewed aerial vehicle: pitch control by high-fidelity modelling and domain randomization. IEEE Robot. Autom. Lett. 7, 11735–11742 (2022).
Article Google Scholar
Beck, A. & Kurz, M. A perspective on machine learning methods in turbulence modeling. GAMM-Mitteilungen 44, e202100002 (2021).
Article MathSciNet Google Scholar
Duraisamy, K., Iaccarino, G. & Xiao, H. Turbulence modeling in the age of data. Annu. Rev. Fluid Mech. 51, 357–377 (2019).
Article MathSciNet Google Scholar
Dulac-Arnold, G. et al. Challenges of real-world reinforcement learning: definitions, benchmarks and analysis. Mach. Learn. 110, 2419–2468 (2021).
Article MathSciNet Google Scholar
Haughn, K. P. T. & Inman, D. J. Autonomous learning in a pseudo-episodic physical environment. J. Intell. Robot. Syst. 104, 32 (2022).
Article Google Scholar
Rhode, R. V. & Lundquist, E. E. Preliminary Study of Applied Load Factors in Bumpy Air (National Advisory Committee for Aeronautics, 1931).
Badrya, C., Jones, A. R. & Baeder, J. D. Unsteady aerodynamic response of a flat plate encountering large-amplitude sharp-edged gust. AIAA J. 60, 1549–1564 (2022).
Article Google Scholar
Zhou, L., Pan, S., Wang, J. & Vasilakos, A. V. Machine learning on big data: opportunities and challenges. Neurocomputing 237, 350–361 (2017).
Article Google Scholar
Bolón-Canedo, V., Sánchez-Maroño, N. & Alonso-Betanzos, A. Recent advances and emerging challenges of feature selection in the context of big data. Knowl. Based Syst. 86, 33–45 (2015).
Article Google Scholar
Mohammed, F., Idries, A., Mohamed, N., Al-Jaroodi, J. & Jawhar, I. UAVs for smart cities: opportunities and challenges. in 2014 International Conference on Unmanned Aircraft Systems (ICUAS) 267–273 (2014).
Karaca, Y. et al. The potential use of unmanned aircraft systems (drones) in mountain search and rescue operations. Am. J. Emerg. Med. 36, 583–588 (2018).
Article Google Scholar
Zou, T. & Zhou, L. Mechanical property analysis and experimental demonstration of zero Poisson’s ratio mixed cruciform honeycomb. Mater. Res. Express 4, 045702 (2017).
Article Google Scholar
Haughn, K. P. T., Gamble, L. L. & Inman, D. J. Horizontal planform morphing tail for an avian inspired UAV using shape memory alloys. in ASME 2018 Conference on Smart Materials, Adaptive Structures and Intelligent Systems (American Society of Mechanical Engineers Digital Collection, 2018).
Kuester, M. S., Borgoltz, A. & Devenport, W. J. Pressure tap effects on the lift measurement of an airfoil section. in 32nd AIAA Aerodynamic Measurement Technology and Ground Testing Conference (American Institute of Aeronautics and Astronautics, 2016).
Tabor, P. ppo in pytorch. https://github.com/philtabor/Youtube-Code-Repository/tree/master/ReinforcementLearning/PolicyGradient/PPO/torch (2020).
Kiranyaz, S. et al. 1D convolutional neural networks and applications: a survey. Mech. Syst. Signal Process. 151, 107398 (2021).
Article Google Scholar
Xu, B., Wang, N., Chen, T. & Li, M. Empirical evaluation of rectified activations in convolutional network. Preprint at https://arxiv.org/abs/1505.00853 (2015).
Kingma, D. P. & Ba, J. Adam: a method for stochastic optimization. Preprint at https://arxiv.org/abs/1412.6980 (2017).
Golubev, V. V. & Visbal, M. R. Modeling MAV response in gusty urban environment. Int. J. Micro Air Veh. 4, 79–92 (2012).
Article Google Scholar
Zhou, Y., Wu, Z. & Yang, C. Gust alleviation and wind tunnel test by using combined feedforward control and feedback control. Aerospace 9, 225 (2022).
Article Google Scholar

Download references

Acknowledgements

This work was supported in part by the National Science Foundation under grant 1935216, as well as the US Air Force Office of Scientific Research under grants numbered FA9550-16-1-0087 and FA9550-21-1-0325.

Author information

Authors and Affiliations

U.S. Army Research Laboratory; Aberdeen Proving Ground, Aberdeen Proving Ground, MD, USA
Kevin P. T. Haughn
Department of Mechanical and Aerospace Engineering, University of California Davis, Davis, CA, USA
Christina Harvey
Department of Aerospace Engineering, University of Michigan, Ann Arbor, MI, USA
Daniel J. Inman

Authors

Kevin P. T. Haughn
View author publications
You can also search for this author in PubMed Google Scholar
Christina Harvey
View author publications
You can also search for this author in PubMed Google Scholar
Daniel J. Inman
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

K.P.T.H., C.H. and D.J.I. conceived original research idea. K.P.T.H. and C.H. designed the research methodology. K.P.T.H. performed design, fabrication, experimental testing, and data analysis. K.P.T.H. and C.H. organized paper structure and data visualization. D.J.I. performed funding acquisition, project administration, and supervision. K.P.T.H. wrote the original manuscript draft. K.P.T.H., C.H. and D.J.I. revised and edited manuscript.

Corresponding author

Correspondence to Kevin P. T. Haughn.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Communications Engineering thanks the anonymous reviewers for their contribution to the peer review of this work. Primary Handling Editors: Mengying Su. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Peer Review File

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Haughn, K.P.T., Harvey, C. & Inman, D.J. Deep learning reduces sensor requirements for gust rejection on a small uncrewed aerial vehicle morphing wing. Commun Eng 3, 53 (2024). https://doi.org/10.1038/s44172-024-00201-8

Download citation

Received: 09 June 2023
Accepted: 12 March 2024
Published: 21 March 2024
DOI: https://doi.org/10.1038/s44172-024-00201-8