Memory load differentially influences younger and older users’ learning curve of touchscreen gestures

Li, Bingxin; Yang, Tong; Liu, Yanfang; Du, Feng

doi:10.1038/s41598-022-15092-y

Download PDF

Article
Open access
Published: 25 June 2022

Memory load differentially influences younger and older users’ learning curve of touchscreen gestures

Bingxin Li^1,2,
Tong Yang³,
Yanfang Liu³ &
…
Feng Du^1,2

Scientific Reports volume 12, Article number: 10814 (2022) Cite this article

1036 Accesses
1 Altmetric
Metrics details

Subjects

Abstract

In this study, we employed a recall test to investigate how memory load affects the learning curve of gesture-letter pairs for younger and older users. The gesture-letter pairs were carefully designed to mimic real-world gesture-function/command associations on a touchscreen mobile phone. Both younger and older user groups showed lower recall accuracy as the memory load of gesture-letter pairs increased, and recall performance improved with repeated memory training. More specifically, younger users improved rapidly over repeated memory sessions under all memory loads, whereas older users benefited much less from repeated memory sessions except the lowest memory load of 6 gesture-letter pairs. These results reveal that the memory load differentially modulated younger and older users’ learning curves of gesture-letter pairs. Thus, our work suggests an upper limit when adding new gesture-function associations on mobile phones and special attention should be devoted to old users.

Analysis of postures for handwriting on touch screens without using tools

Article Open access 07 January 2022

Capturing complex hand movements and object interactions using machine learning-powered stretchable smart textile gloves

Article 12 January 2024

Coming in handy: CeTI-Age — A comprehensive database of kinematic hand movements across the lifespan

Article Open access 25 November 2023

Introduction

Touchscreen mobile phones have become a pivotal device for people’s daily life because of its powerful and flexible connection with various function tools, with engaging users from a young age until the old. It has been shown that in 2020 the average smartphone user had 40 applications installed on the phone¹. Users usually spend hours a day on their phones, and often find it difficult to execute core system commands (e.g., phone language setting, mobile data/WiFi/Bluetooth on/off) and access various applications (e.g., starting a timer, browsing a website, and opening a favorite contact). They need to navigate through a deep interface hierarchy, which is time-consuming. Thus designers hope to provide as many navigational gestures as possible to help users perform tasks. There are already some navigational gestures available on the existing operating systems. For example, users can return to home page by swiping up quickly from the bottom of the screen on Android or iOS systems. Similarly, by holding a finger on the screen after swiping up from the bottom edge in both systems, users can open the recent application view displaying swipeable cards of all running applications.

Android system has introduced many more complex navigational gestures than iOS. For example, knocking firmly with knuckle while sliding horizontally across the screen will enable split-screen mode. Moreover, a double knock or drawing a letter “S” with a knuckle anywhere on the screen can take a screenshot. However, new users and elderly users may find it difficult to memorize so many complex gestures. Users may not want to have a heavy burden of gesture learning. However, it is unknown that how many pairs of gestures and the related action can be memorized and how efficiently users of different ages can learn those pairs with repeated practice.

There are a few empirical studies that evaluated the memorability of touchscreen gestures and commands using recall tests^2,3,4. For example, Nacenta et al.² found that user-defined gestures are easier to be memorized by users when compared with pre-designed gesture set and random set. Researchers argued that user-defined gestures may be more memorable than system-defined gestures, because users have established a strong association between gestures and actions when defining the gestures. On average the recall rate of self-defined gesture shortcuts was above 90%³. In addition, user-defined gestures were more memorable compared to system-defined letter gestures (i.e., draw a letter to evoke function)³. Indeed, associating a specific command with semantic gestures such as a letter or multiple letters can be a challenging task, especially for users who use symbolic language. Thus, instead of studying letter gestures, in the present work we employed unistroke swipes (a single swiping movement) which mimicked the existing set of fullscreen navigational gestures. Moreover, every swipe gesture was randomly assigned with a unique letter representing a command/action to be executed (rather than applying letter gestures) in order to understand the memorability of pair between gestures and the associated actions as the pair number increases. According to previous results of the list length effect on memory performance^5,6,7, the mean proportion of items recalled decreases as the length of the memory list increases. Supporting evidence also comes from change-detection tasks, in which detection accuracy also declined when memory load increased^8,9. Thus, we predicted lower memory accuracy for gesture-letter pairs as the memory load became higher.

It is well-known that repeated learning and testing can significantly improve recall performance^{10,11,12,13,14,15}. For example, a study showed that, compared to one presentation, five-fold repetition of word pairs increased both the familiarity of every individual word in the pair and the associative strength of the word pairs, leading to higher memory accuracy¹³. In addition, repeated learning and testing of the same words contributed to a higher proportion of recall of words and its category¹⁴. Thus, in two experiments by applying five sessions of repeated learning and testing method, we predicted that users would on average show increased accuracy in recalling pairs of gestures and the associated letter, but the learning rate of gesture-letter pairs would be slower as the memory load increased.

Furthermore, in the second experiment we planned to examine how fast younger and older users would learn the gesture-letter pairs under different memory loads. Aging and memory have been elaborated in the past research^{16,17,18,19,20,21,22}. Studies consistently show a significant age decline in episodic memory. Performance on memory tests requiring association is particularly vulnerable to aging^18,20,21,22. In a classic study on aging and memory for relationship among items, researchers found that older adults performed poorly in recognition of various types of associative information (e.g., word-nonword pairs, word-font pairs), suggesting greater difficulty in encoding and retrieving specific associations between units of information in the aged group²¹. With a rather different context using a gesture learning test, we predicted better recall performance in the younger than in older users, and their differences would be larger under high memory load.

In sum, the present study contains reports of two experiments. Experiment 1 aimed to examine the learning curves for two different memory loads of gesture-letter pairs. Some of the gesture-letter pairs in the first experiment were designed to imitate the existing navigational gestures and the command to be executed. Experiment 2 was conducted, by employing a wider range of levels of memory loads, to investigate the memory performance in users of different ages under different memory loads and after repeated practice. The present study may provide important information for UI designers on how many gestures are easy for users of different ages to memorize.

Experiment 1

In the present experiment, we asked users to recall the letter associated with each gesture (unistroke swipe gestures) after every memorizing session for five times. We planned to examine the learning curve (memory performance across memorizing sessions). Subjective data including perceived learning effectiveness, general evaluations, fatigue and emotional experience for two different memory loads of gesture-letter pairs were also assessed.

Experiment 1 methods

Users

The study was approved by the Ethics Committee of Human Experimentation at the Institute of Psychology, Chinese Academy of Sciences. All procedures were performed in accordance with relevant guidelines and regulations.

Sample size was estimated based on prior studies on memory load/list length (η²_p > 0.75)^5,6 and repetition/testing effect on recall performance (η²_p > 0.30)¹³. A statistical power analysis using G*power²³ indicated an optimal sample size of N = 9 if the effect size η²_p = 0.30 with α = 0.05, and power = 0.90.

Eighteen users who had experience of touchscreen mobile phones (10 females; mean age = 26.89 years, SD = 10.47, age ranged between 21 and 38 years) took the experiment in exchange for monetary compensation (see Appendix A for demographic details). All users reported having normal or corrected-normal vision. They signed an informed consent form before the experiment.

Apparatus, materials and procedure

As Fig. 1 showed, pictures of swiping gestures (pointing vertically, horizontally or in diagonal directions from screen edges or middle-lower part) were presented on a 14-inch ThinkPad laptop. Every gesture appeared in pair with a unique letter presented at the center of the screen (see examples in Fig. 1 and all gesture-letter pairs in Appendix B). The gesture-letter pairs were prespecified by E-prime 2.0.10 (Psychology Software Tools, Pittsburgh, PA, USA), showing in a pseudorandom order.

There are two memory loads of gesture-letter pairs: 15 and 22 pairs (see Appendix B). For the memory load 15 condition, users needed to learn gestures swiping from the bottom edge, middle lower and left/right sides of the screen and the associated letters. In this condition, some of the gestures and the associated letter were designed to imitate the existing fullscreen navigational gestures and the related action, e.g., back to home screen by swiping up from the very bottom of the screen. For the memory load 22 condition, we required users to swipe from a narrower starting region in order to have more complex gesture-letter pairs. In this condition, a limited number of directions were included if swiping from an area that was close to the four corners of the screen; and in this condition some of the gesture-letter pairs imitated the existing complex navigational gestures, e.g., the corner swipe gesture to invoke Google assistant on Android. In both memory load conditions, symmetric gesture directions on the left and right sides of the screen were always shown simultaneously, assigned with the same letter.

In order to make sure that users understood the task, they were given a few pairs of swiping gestures and digits for practice before the real experiment. Note that gesture-digit pairs were only used in the practice and did not appear in the experiment to avoid confusion. In the critical experiment, users were given five memorizing sessions repeatedly, each of which was followed by a recall test of gesture-letters pairs—the standard condition in free recall learning²⁴. In every memorizing session, gesture-letter pairs were presented in random order. Each pair appeared on screen for 5 s, then a 500 ms blank interval before the onset of the next pair. In the recall test users had to retrieve the corresponding letter for a specific gesture just learned, without feedback. Users had as much time as they needed to provide a response. After five sessions of gesture memorizing and testing of every memory load condition, a survey of easiness to learn and memorize, general evaluation of gestures, perceived fatigue and emotional experience was applied (Appendix C). Perceived fatigue and emotional experience were assessed because it has been suggested that fatigue and emotional response are important aspects for evaluating user experience^25,26. For all users memory load 15 was performed before 22 so that they would not be too frustrated at the beginning of the study. Breaks were allowed after every recall test. The whole experiment lasted about 60 min (including break time).

Design

The memory load (15 or 22 gesture-letter pairs) and memorizing session (S1, S2, S3, S4, S5) were two within-subject variables. The recall accuracy, fatigue and emotions rating, perceived easiness of learning and memorizing as well as general evaluations of gestures were key dependent variables.

Experiment 1 results

Recall accuracy

Recall accuracy was analysed using a repeated measures ANOVA with memory load and memorizing session as within-subject factors, though the data were nonnormally distributed for three reasons. First, most studies on recall memory used ANOVA to assess memory loads’ effect (also the effects of repeated learning^13,14 or age groups^15,16,17,18) on recall accuracy. Secondly, Norman²⁷ had shown that ANOVA can also be used for data with non-normal distributions and data of Likert scale. Lastly, only a few non-parametric tests (e.g., Scheirer–Ray–Hare test, aligned ranks transformation ANOVA) can be used for two-way factorial design. However, they are not suited for detecting an interaction.

Accuracy for each condition is illustrated in Fig. 2 and shown in Appendix D. There was a significant main effect of memory load, F(1, 17) = 5.75, p = 0.028, η²_p = 0.25, with higher recall accuracy in the memory load 15 (mean accuracy of 0.52) compared to the memory load 22 (0.41). The effect of memorizing session was significant, F(4, 68) = 49.84, p < 0.001, η²_p = 0.75, with higher accuracy with more memory training. In addition, memory load interacted significantly with memorizing session, F(4, 68) = 3.23, p = 0.017, η²_p = 0.16. Post hoc analysis revealed that the improvement in the memory load 15 (accuracy in S5—accuracy in S1 = + 0.54, p < 0.001) appeared to be slightly larger than in the memory load 22 (accuracy in S5—accuracy in S1 = + 0.43, p < 0.001). In particular, recall accuracy of first memorizing session in the memory load 15 was not different from that of memory load 22, t(26.31) = − 0.18, p = 0.861, Cohen’s d = 0.06.

Gesture evaluation scores

Seven subjective rating scores for two sets of gesture-letter pairs (e.g., ease of learning, likelihood to use et al.) were compared using paired sample t-tests. Mean rating scores and the corresponding SEs for different memory loads are illustrated in Fig. 3 and shown in Appendix E.

Perceived easiness of learning and memorizing in the memory load 15 was not different from that of memory load 22, t(17) = 1.60, p = 0.128, Cohen’s d = 0.31; neither was overall satisfaction, t(17) = 1.05, p = 0.310, Cohen’s d = 0.23. However, users indicated higher likelihood to use the memory load of 15 gesture-letter pairs than the memory load 22, t(17) = 2.82, p = 0.012, Cohen’s d = 0.53; they were also more likely to recommend the 15 pairs of gesture set than 22 pairs, t(17) = 1.61, p = 0.003, Cohen’s d = 0.86.

Fatigue and emotional experience

Users also reported significantly higher positive emotion in the memory load 15 compared to 22, t(17) = 2.93, p = 0.009, Cohen’s d = 0.48; lower rating of negative emotion, t(17) = − 2.15, p = 0.046, Cohen’s d = 0.47; and lower levels of fatigue, t(17) = − 3.65, p = 0.002, Cohen’s d = 0.48.

Summary of experiment 1

Experiment 1 showed lower recall accuracy in memory load of 22 pairs (mean accuracy of 41%) than in memory load of 15 pairs (52%). Moreover, recall performance significantly improved by repeated learning and testing of the same set of gesture-letter pairs. Especially there were larger and faster improvements for the memory load of 15 pairs than the memory load of 22 pairs, revealing users can learn associative binding of gestures and letters more rapidly under low memory load.

Users were also more likely to use and to recommend the memory load 15 compared to 22, with better emotional experience and lower level of fatigue for the memory load 15. However, users reported no significant differences in the easiness of learning and memorizing and ratings of overall satisfaction between the two memory loads.

Experiment 2

Experiment 2 aimed to examine whether memory load differentially influences the younger and older users’ gesture learning curve over memorizing sessions. Different from Experiment 1 where memory loads of 15 and 22 gesture-letter pairs were tested, in this experiment we applied a wider range of memory load levels varying between 6 and 22 gesture-letter pairs to investigate the upper limit of gesture learning in the users of different ages.