Emergence of perceptuomotor relationships during paleolithic stone toolmaking learning: intersections of observation and practice

Bayani, Kristel Yu Tiamco; Natraj, Nikhilesh; Khresdish, Nada; Pargeter, Justin; Stout, Dietrich; Wheaton, Lewis A.

doi:10.1038/s42003-021-02768-w

Download PDF

Article
Open access
Published: 11 November 2021

Emergence of perceptuomotor relationships during paleolithic stone toolmaking learning: intersections of observation and practice

Communications Biology volume 4, Article number: 1278 (2021) Cite this article

1617 Accesses
8 Citations
23 Altmetric
Metrics details

Subjects

Abstract

Stone toolmaking is a human motor skill which provides the earliest archeological evidence motor skill and social learning. Intentionally shaping a stone into a functional tool relies on the interaction of action observation and practice to support motor skill acquisition. The emergence of adaptive and efficient visuomotor processes during motor learning of such a novel motor skill requiring complex semantic understanding, like stone toolmaking, is not understood. Through the examination of eye movements and motor skill, the current study sought to evaluate the changes and relationship in perceptuomotor processes during motor learning and performance over 90 h of training. Participants’ gaze and motor performance were assessed before, during and following training. Gaze patterns reveal a transition from initially high gaze variability during initial observation to lower gaze variability after training. Perceptual changes were strongly associated with motor performance improvements suggesting a coupling of perceptual and motor processes during motor learning.

Perceptography unveils the causal contribution of inferior temporal cortex to visual perception

Article Open access 18 April 2024

Motor neurons generate pose-targeted movements via proprioceptive sculpting

Article 20 March 2024

Bumblebees socially learn behaviour too complex to innovate alone

Article Open access 06 March 2024

Introduction

Linnaeus chose to name our species “wise,” but it may be the ability to learn from others that makes Homo sapiens unique¹. Humans live in a cultural niche of artificial landscapes, structures, artifacts, skills, practices, and beliefs constructed over generations and far exceeding the understanding or creative potential of any individual. Paleolithic stone tools provide our earliest evidence of this emerging niche and the social learning of increasingly complex skills and knowledge that supports it^2,3,4. Acquisition of such demanding technical skills requires the generative interaction of observation and practice⁵. Still, relatively little is known about the interaction of observation and practice generally⁶ or for stone tool making specifically^7,8. To address this, we used eye-tracking to investigate gaze patterns during stone tool making action observation. Our sample comprises 11 research participants with no prior stone tool making experience learning to make Paleolithic stone “handaxes”³ comparable to those made by Homo heidelbergensis ca. 500,000 years ago^9,10. Pargeter et al. (2019, 2020) have published details of training and learning outcomes elsewhere. Here we address variability in gaze both at the group level over time and in relation to individual technological success and motor performance metrics.

Motor variability is an important part of exploratory behavior^6,11,12 that supports individual skill acquisition^13,14. Assessment of motor variability has figured prominently in attempts to understand the interaction of individual and social learning in the intergenerational reproduction of stone tool making techniques^5,15,16. Researchers know much less about perceptual variability, exploration, and learning of stone tool making¹⁷, thus the contribution of demonstration and observation to social transmission remains unclear^2,16,18.

Stone tool making through controlled fracture (“knapping”) requires the sequential detachment of flakes from a stone core using precise strikes with a handheld hammer (typically stone, bone, or antler). Each removal leaves traces that archeologists can use to reconstruct discrete actions and sequential strategies. These strategies range in complexity from simple iteration to multi-level goal hierarchies needed to produce later Paleolithic forms¹⁷, such as the stone “handaxes” studied here. Executing such strategies requires reliable control over individual flake removals, including the sensitive adaptation of kinematics to variable core morphology, composition, and positioning^7,15. Therefore, learning to knap requires mastery of subtle interactions between bodily kinematics, object properties, action outcomes, and technological goals^7,17,19. These factors might be facilitated by observation of expert performance⁵ and reflected in the spatiotemporal variation of gaze patterns.

Attentional processes drive eye movements that facilitate foveation on salient and meaningful visual features^20,21, thus providing a window on cognitive processes²² that we exploit in order to better understand processes of observational learning. Work on the role of gaze in motor control and learning has largely focused on action planning and execution^12,22,23 or goal inference (e.g.²⁴), At least one recent study²⁵ has reported gaze variation’s effects on observational learning of a novel task (prosthesis use). Bayani et al found that eye movements tracking action path and bodily effector rather than the action start and endpoints during demonstration was associated with more efficient kinematics during execution. This provides support for a role in gaze in maximizing observational learning through motor resonance. Arbib (2011) has advanced similar proposals for the role of observation in knapping skill acquisition. It remains unknown to what extent useful information about the rapid and finely tuned kinematics and stone fracture mechanics^7,15 underlying knapping skill can be extracted by observing others⁵.

To investigate this issue, we evaluated changes in (1) gaze variability and (2) the relationship between gaze patterns and knapping performance over training. Participants (n = 11) received up to 90 h of practice and direct active teaching²⁶ from an expert knapper. At three time points (Pre = before training, Post 1 = 50 h training, Post 2 = after completion), we invited participants to observe a 17-min video of their instructor producing a handaxe while we recorded participant gaze behavior. A small number of prior studies have examined gaze variability during motor learning, although not of entirely novel tasks like knapping. These studies find that the standard deviation of raw eye velocity demonstrated a general reduction in gaze variability with training²⁷. Gaze entropy, a value that utilizes the standard deviation and fixation distribution, provided an additional measure of exploratory behavior²⁸. We thus hypothesized that if participants were extracting useful technical information from knapping observation, we should observe: 1) a reduction in gaze variability with training and 2) greater spatial allocation towards more meaningful features of the task. We further predicted stronger associations between perceptual processes and motor performance with practice.

Results

Determination of meaningful action sequences

Participants watched a 17-min video of an expert knapper performing the stone toolmaking task before, during, and after training. The same expert flintknapper that trained all participants (NK) categorized each second in the video. This allowed for the temporal categorization of action phases based on one of the six action phases: core reposition, core move, light percussion, percussion, grinding, and tool change (see Methods). During video observation, we instructed participants to press a button whenever they judged the knapper to have “completed a meaningful unit of action in pursuit of [the] goal” of making a handaxe (Fig. 1a). We matched these button presses to specific time points and motor actions produced by the actor (Fig. 1c, Fig. 2). This allowed us to identify action phases important to the participants used for our analysis without bias from the investigators. Meaningful button presses across all participants and training conditions revealed a high degree of responses corresponding to core move time points.

**Fig. 1: Experimental design and task.**

**Fig. 2: Data preprocessing pipeline.**

The study used participants’ reactions to meaningful time points in the video to generate a group-level temporal probability plot by isolating a 10-s window with a 5-s overlap. Within each window, we calculated the probability by dividing the number of participants that indicated a particular time segment contained a meaningful event and dividing it by the total number of participants (Fig. 3a, black trace). We determined time points for more in-depth analysis by assigning the 90th percentile as the threshold for meaningful time points across all participants (Fig. 3b). The meaningful time points were also matched to specific action phases with greater meaningful button presses associated with core moves (χ2(5) = 131.24, p < 0.01, Fig. 3c) compared to core reposition (t(5) = 9.31, p < 0.001, g = 1.22), light percussion (t(5) = 9.33, p < 0.001, g = 1.17), grinding (t(5) = 10.03, p < 0.001, g = 1.50), tool change (t(5) = 11.46, p < 0.001, g = 1.65), and percussion (t(5) = 10.23, p < 0.001, g = 1.44). Together, core move time points that met the 90th percentile probability threshold or higher were isolated and extracted for continuous gaze analysis (Fig. 2, Fig. 3b, c).

**Fig. 3: Data-driven approach to identify salient action phases and time points.**

Gaze variability

We sought to understand any patterns of dispersion of gaze data across a given core move time point to address gaze variability. Temporal dispersion of gaze enables us to identify how the continuous gaze data vary over training. Spatial dispersion will determine where gaze falls during observation.

Temporal dispersion of gaze

We evaluated group-level changes in gaze patterns with the highest probability of meaningful button presses by implementing a hierarchical clustering algorithm of gaze positions over core move time points and calculating cluster variance for each meaningful core move time point. Implementation of hierarchical clustering (see Methods) across time points revealed changes in variability of gaze within a cluster (within-class variance) and variability of gaze between different clusters (between-class variance) across training. While within-class gaze variance decreased from Pre to Post 1 and increased from Post 1 to Post 2, a one-way ANOVA did not reach statistical threshold across any of the training phases. Dissimilarity demonstrated no main effect of training but displayed a stepwise reduction as participants progressed through training.

These non-significant changes with training may also reflect the substantial participant-level differences in learning documented by Pargeter et al (2019,2020), which likely contributed to the large variability seen in group-level data. We thus measured gaze coefficient of variation to capture the variability of gaze data for each participants’ continuous gaze data for core move time points across training. One-way ANOVA revealed a main effect of Training (F_2,30 = 3.74, p < 0.036, ƞ2 = 0.20). Pairwise comparison using TukeyHSD indicated gaze variability decreased from Pre compared to Post 1 (p < 0.01, g = 0.87) and Post 2 (p < 0.01, g = 0.949, Fig. 4a). This reduction indicates that participants focused on a smaller area of the visual scene during core moves due to training.

**Fig. 4: Perceptual-motor changes after 0 (Pre), 50 (Post 1), and 100 (Post 2) hours of training.**

Spatial dispersion of gaze

Eye movements involve top-down processes that direct gaze to information-rich and meaningful areas in the visual scene^20,21. It is vital to assess whether the gaze changes are associated with task-specific features in the visual scene by understanding the spatial aspects of this shifting variability over training. We sought to understand the spatial dispersion of gaze by determining if participants directed foveation towards the most relevant part of the visual scene, the working edge (i.e., a portion of the tool from which toolmakers remove flakes). We extracted the working edge’s continuous location using a MATLAB-based computer vision machine learning algorithm, random sampling consensus (RANSAC). This algorithm allowed us to detect the core’s x-axis edge in each image frame (“Computer Vision ToolboxTM User’s Guide R2020a” 2004^29,30. The output from the hierarchical cluster of gaze data identified cluster centroids, a data-driven representation of the most statistically distinct gaze position and their associated time points within a core move for all participants (see Methods). We calculated gaze distance relative from the working edge by subtracting the working edge coordinates from each participants’ gaze centroid coordinate at each frame. Results show a decrease in gaze distance from the working edge. However, this decrease across training did not reach statistical significance (F_2,27 = 0.73, p = 0.49). The reduced coefficient of variation along with the proclivity for the gaze to fall closer to the working edge may reveal that training encourages gaze to fall in information-rich and meaningful areas of the visual scene^20,31. We attributed the lack of statistical significance to the high degree of variability, which prompted more in-depth spatial analysis.

We evaluated the variability of gaze along the working edge by calculating the gaze distance from the working edge across all clusters’ time centroids. The standard deviation of those distances was extracted and used as a measure of gaze variability relative to the working edge. Quantification of gaze variability using this metric not only reduced the amount of noise inherent in gaze data but provides a statistical backing for utilizing specific time points within a single core move time point. Our results revealed a main effect of Training (χ²(2) = 8.74, p = 0.013, ε2 = 0.21, Fig. 4a) with significantly higher gaze variability relative to the working edge in Post 2 compared to Pre (V = 49, p = 0.03, g = −1.00) and Post 1 (V = 53, p < 0.01, g = −0.93). Training thus led to an increase in gaze variability relative to the core, especially comparing Post 1 and Post 2 gaze variability.

Performance measures

Motor skill performance over training was quantified using two primary measures: handaxe score and percussion error. The handaxe score is a multivariate ensemble measure (higher is better) of tool quality for handaxes produced during periodic skill evaluations throughout training³. Percussion error measures the difference between knappers’ intended and actual impact points (i.e., lower is better) during a controlled flake production task included in the skill assessments⁷.

For handaxe score, there was a main effect of Training (χ²(2) = 622.36, p < 2.2e-16, ∈2 = 0.89). Pairwise comparisons using Wilcoxon signed-rank test revealed significantly lower handaxe scores in Post 1 vs Pre (V = 0, p < 2.2e-16, g = −19.60) and Post 2 vs Pre (V = 820, p < 2.2e-16, g = −14.97) (Fig. 4a). For percussion error, a Kruskal-Wallis rank sum test revealed a main effect of Training (χ²(2) = 25.81, p < 0.001, ∈2 = 0.91). Post hoc comparisons using Wilcoxon signed rank test demonstrated a statistically significant decrease from Pre to Post 1 (p < 0.01, g = 3.87), from Post 1 to Post 2 (p < 0.01, g = 3.11). Post 2 was also statistically lower than Pre (p < 0.01, g = 5.87, Fig. 4a). In agreement with previous analyses^3,7, both measures reveal an improvement in general skill level with training, following a classic “power curve”³² of rapid initial progress followed by asymptotic leveling off as learners reach local performance optima. We quantified participants’ learning rates as the slope of a linear regression of all a participant’s handaxe scores against practice hours’ square root. Because training caused participants to converge on similar performance levels after 30 h, variation in learning rate is largely driven by initial aptitude with better initial performance resulting in a flatter slope.

Relationship between gaze patterns and knapping performance

While the above results suggest changes in gaze patterns and motor performance driven by training, further analysis seeks to understand the extent to which perceptual (gaze) and motor performance changes may be related to each other through training.

Perceptual-motor shifts with learning

We observed that gaze variability and handaxe quality showed a complex relationship, revealing change in motor performance followed by gaze variability changes through training (Fig. 4b–d). These visualizations prompted additional analyses to quantify the changes through shift functions, a technique implemented in neuroscience to quantify how two distributions differ³³. We described the data using deciles wherein each condition can be separated into nine quantiles leading to the segmentation of data into ten different segments (see Methods). We divided the data into two shift matrices. Shift1 is the perceptual and performance changes from Pre to Post 1 (Fig. 4b, c), while Shift2 defines the perceptual and performance changes from Post 1 to Post 2 (Fig. 4b, d). This method allows us to quantify which specific variables undergo the most changes from one training stage to another and if the effect is applicable across all of the data. The top panel in Fig. 4c, d represents the deciles (dark black lines) in each training condition, with colored lines connecting the two conditions’ deciles representing the direction and magnitude by which that specific decile has to shift. We plotted the quantile differences to create a shift function with 0 y-axis values representing no shift (or no difference between training within a specific decile), positive (orange) representing a shift to higher values, and negative (purple) representing a shift to lower values. The error bars represent a 95% confidence interval percentile bootstrap. Handaxe score demonstrated the strongest, positive shift during the Shift1 (quantile differences: 1.16, 1.45, 1.52, 1.44, 1.32, 1.20, 1.11, 0.96, 0.68), while gaze variability relative to the working edge did not exhibit a strong shift (Fig. 5b). However, gaze variability relative to the working edge showed greater shifts in Shift2 (quantile differences: −0.013, −0.0099, −0.0090, −0.0092, −0.0097, −0.011, −0.012, −0.014, −0.015), while handaxe score was relatively consistent. This staggered relationship between changes in execution (handaxe score) and observation (gaze variability) lends support to a cyclical model of skill acquisition^34,35. In this model, observation of expert models guides the consolidation of motor skills through individual practice, which produces enhanced perception and understanding of demonstrated actions, leading to further learning.

**Fig. 5: Shift function of perceptuo-motor variables.**

Perceptual-motor canonical correlation analysis

To gain a deeper understanding of the perceptual and motor relationships with training, we utilized canonical correlation analyses (CCA) to quantify the strength of the relationship between gaze and performance outcomes within a single training condition, how the relationship between variables evolved with training using Pearson’s R, and the relative contribution of each variable using canonical weights (Fig. 6). The gaze variate consisted of gaze coefficient of variation, mean distance from the core, and gaze variability from the core. The performance variate consisted of learning slope, handaxe score, and percussion error. In the Pre condition, the relationship between motor performance and gaze variables exhibited two modes that demonstrate a strong (r² = 0.81) and moderate relationship (r² = 0.44). The variables contributing the greatest weights in the first mode were learning slope (ω = −1.3762) and gaze variability from the core (ω = 0.7972). The relative weights of the remaining performance and gaze variables were relatively low compared to the variables mentioned above. However, the second mode demonstrated greater weights between the handaxe score (ω = −0.8678) and the gaze coefficient of variation (ω = 0.1535).

**Fig. 6: Canonical correlation analyses of perceptual and motor variables.**

Similarly, the Post1 condition demonstrated a strong canonical correlation between the performance and gaze variables (r² = 0.87). However, there was a single dominant mode that explained the relationship between these variables. While the highest contribution to this relationship was learning slope (ω = −0.9380) and mean gaze distance from core (ω = 0.8499), the relative contribution of handaxe score (ω = 0.7048), percussion error (ω = −0.6665), and gaze variability relative to the core (ω = 0.7665) was comparable. Gaze coefficient of variation had the lowest weight contributing to the correlation (ω = 0.1457). In the Post1 condition, we did not find a single dominant variable in each set. Instead, most of the variables in both the gaze and performance variates contributed equally to the correlation. In the Post2 condition, the canonical correlation between the gaze and performance variate decreased (r² = 0.6618). The highest contributing variables were handaxe score (ω = 0.6567) and percussion error (ω = 0.6353) as well as mean gaze distance from core (ω = 0.8571).

Discussion

Researchers have proposed that observational learning of body movement¹⁸ is a uniquely human capacity enabling our species’ accumulation of increasingly complex cultural and technological practices over time³⁶. This relationship is likely true of ritual and communicative gestures that depend on accurate replication of arbitrary body movements³⁷. There is debate on whether the learning of technical skills is more reliant on the mastery of physical affordances through individual practice¹⁸ and how efficiently observed movements translate effectively to different bodies^38,39,40 and contexts^41,42. This debate raises questions about the role of action observation in modern skill learning⁴³ and the importance of putative neurocognitive specializations for action observation in human evolution⁵. To investigate the interaction of observation and practice in real-world technological learning, we evaluated the development of gaze patterns and motor performance during the acquisition of the completely novel, evolutionarily relevant stone tool-making skill.

Consistent with expectations, we found that training resulted in rapid initial improvements in tool making action execution accompanied by a reduction in gaze coefficient of variation (CV) during action observation. This CV reduction marks putative emergence of “quiet eye” behavior (fewer, longer fixations with fewer saccades) often associated with the development of perceptual-motor expertise^21,31,44,45. More specifically, this eye movement pattern indicates a possible transition from exploration (sampling the visual scene) to exploitation (foveation of specific regions of the visual scene) for more in-depth processing^22,28,46 and complex mental representation⁴⁴. Learning and expertise are associated with the transition of attention towards exploitation due to the predictive value learned through exposure⁴⁷. The fact that experience with tool making action execution rapidly leads participants to focus on particular aspects of observed tool making indicates a transition to the exploitation of (perceived to be) relevant technical information. It supports a potential role for action observation in learning about stone tool making object affordances and/or behavioral techniques⁵.

To further explore this potential, we examined variability in the spatial allocation of gaze over training. In contrast to decreasing eye movement variability related to the emergence of quiet eye behavior, the spatial allocation of gaze relative to the core edge became more variable over training. Furthermore, this increase in spatial variability occurred after significant performance increases observed from Pre to Post1 timepoints instead of coinciding with the Post1-Post2 period of more modest performance changes. This pattern is contrary to our predictions of decreasing variability and tighter correlation with performance derived from static visual scene studies²⁸ and simple motor tasks²⁷. Instead, it reflects a more complicated gaze-performance relationship characteristic of real-world skills.

Complex skills like stone tool making often involve simultaneously tracking multiple moving areas of interest (e.g., the hammerstone, anticipated point of percussion on the core edge, and core surface topography behind the edge). They also require multifocal strategies that decouple attentional mechanisms from gaze behavior^6,48,49. Humans achieve such “visuomotor parsimony”⁴⁸ through increased reliance on peripheral vision that allows for reduced foveation of the targets. In turn, this strategy relies on the development of detailed sensorimotor mappings sufficient to enable the extraction of information from non-foveated targets⁵⁰. For this reason, visuomotor parsimony has been associated with late-stage skill acquisition and refinement¹², as also seen in our study. More broadly, the complex relationships between training, gaze behavior, and performance evident in our study lend support to a cyclical model of real-world skill acquisition^34,35. In Whiten’s (2015) model, observation of expert models guides consolidation of motor skills through individual practice⁵¹ leading to enhanced perception and understanding of demonstrated actions⁵². Enhanced action-perception then supports further observational learning, continuing the cycle⁵.

Finally, the canonical correlation analysis allowed us to test putative relationships between action observation and execution in individuals. Consistent with our group-level results, this demonstrated strong gaze–performance correlations during early exploration and skill acquisition (Pre and Post1) that deteriorated during later stage refinement and consolidation (Post2). Before training, a relationship between tool making aptitude coming into the study (high initial performance reflected in a relatively flat learning slope) and gaze spatial variability dominated the primary canonical mode. Thus, higher aptitude individuals anticipate to some degree the visuomotor parsimony that developed in the group as a whole over training. This anticipation suggests pre-existing sensorimotor competence in some participants. The presence of sensorimotor knowledge of a specialized skill has roots in the mirror neuron system (MNS), a frontoparietal neural network that is implicated in learning through execution and observation⁵³. For example, classical ballet dancers or athletes show MNS activation during observation of their respective crafts while naïve participants failed to show MNS activation^53,54,55. Furthermore, participants with prior knowledge of the task showed greater goal-directed and anticipatory gaze towards the goal of the task while naïve participants failed to show these perceptual patterns²⁴. These studies provide neural and perceptual evidence for the role of observation in skill learning and expertise. Indeed, we subsequently found that initial performance correlated with self-reported years of prior experience in gross-motor crafts like carpentry and sculpture (n = 17, r² = 0.39, p = 0.007) among all starting participants reported by Pargeter et al. (2019, 2020)^3,7. A second pre-training canonical mode was dominated by a negative relationship between handaxe score and gaze coefficient of variation, again anticipating later group-level decreases in gaze CV associated with rising performance. Two canonical modes in the pretraining condition suggest complex influences of pre-existing sensorimotor competencies and perceptual strategies on initial stone tool making aptitude.

After 50 h of training, variation in participant performance converged at a higher level, and a single canonical mode emerged, showing a strong correlation between motor performance and gaze. This mode differs from the primary pre-training mode, most notably in the increased weight assigned to mean gaze distance from the core edge. The effect suggests that particularly successful participants adopted stable fixation points offset some distance from the moving core edge, much as expert jugglers employ a stable “gaze through” strategy fixated at a central location rather than tracking individual balls⁴⁸. The post-training condition showed more evenly weighted performance and gaze variables, but the canonical correlation weakened from strong to moderate. This loosening of gaze-performance coupling suggests that other factors, such as differences in practice habits³ or the timing transitions between learning plateaus⁵⁶, may be increasingly important in driving observed performance variation. More generally, this pattern again speaks to the complex interplay of action observation and individual practice in acquiring real-world skills⁵.

Results show that persons learning the highly skilled stone tool knapping process rapidly learn to perceive and focus on technologically informative aspects of observed tool making. Furthermore, gaze behavior during observation is strongly associated with actual tool making performance. Our study’s complex, asynchronous development of eye movement strategies, gaze spatial allocation, and toolmaking performance is consistent with a cyclical model of real-world skill acquisition. This finding has implications for debates over the role of individual vs. social learning in early stone toolmaking, and thus for interpreting the archeological evidence used to support evolutionary accounts of the origins of human culture^1,4,8,18. It highlights the need for more real-world skill learning studies to understand perceptual-motor interactions during human skill learning.

Methods

Subjects

Seventeen experimental participants (10 female/7 male; 21–48 years of age, median 27.25) were recruited from Emory University (students and staff) and the surrounding community with no prior flint knapping experience. Full participation in the study amounted to ~90 h; of which, ~80 h involved training in handaxe production. Six participants left the study before the sfinal assessment with the remaining 11 achieving between 74 and 89 h training (median = 84 h). All participants were right-handed, had no prior knapping experience, and provided written informed consent. The study was approved by Emory University’s Internal Review Board (IRB study no: 00067237).

Experimental set up

Workspace

Subjects were seated in front of a raised 24” computer monitor with a computer keyboard situated directly in front of the subject (Fig. 1A). Since the eye tracker is sensitive to direct sunlight and may alter pupil detection reliability, subjects faced away from the windows to avoid direct sunlight on their face and pupils. The Gazepoint control system (GP3), a 60-Hz high-performance eye tracking platform, was placed below the computer monitor approximately 65 cm or an arms distance away from the subject. The eye tracker was stabilized with a mini tripod that allowed researchers to adjust the position of the eye tracker based on each individual subjects’ height. A laptop was used to power and obtain data from the eye tracker through two USB cables.

Eye tracking

To ensure subjects’ distance from the computer monitor and eye tracker was consistent across all sessions and optimal for gaze data acquisition, the eye tracker is equipped with distance monitor and a 5-point calibration test. The distance monitor tracked subjects’ pupil distance from the eye tracker. A moving dot on the Gazepoint control system displayed the participants’ distance using a red or green dot indicator. A red dot indicator informs researchers if the participant was too far or too close. A green dot indicates when the participant is an optimal distance (~65 cm away) from the eye tracker. Researchers adjusted subjects’ seated position to ensure that the distance monitor’s green dot was always situated green prior to data collection. The 5-point calibration test was administered after optimal distance was determined. Subjects were instructed to follow a white dot that traveled across the computer monitor with their gaze while keeping their head completely still. Following the calibration period, five calibration markers appeared on the screen. Subjects were instructed to make saccades to 5 calibration points. Successful calibration was confirmed if gaze position fell within the bullseye of each of the 5 points. The combination of the distance monitor and 5-point calibration ensured an accuracy between 0.5 to 1 degree of the visual angle.

Action observation

The paradigm was executed using a Windows laptop containing the Gazepoint control and analysis system software as well as MATLAB to coordinate video replay, gaze data collection, and behavioral responses from the subjects. During all sessions, subjects viewed the same 17 min and 4 s video of an expert stone knapper creating a tear drop shaped hand axe in an egocentric viewing perspective. The video was presented using a 1680 ×1050 viewing dimension. During video observation subjects were instructed to “press the left button on the keyboard for events that seem natural and meaningful to you.” MATLAB was initiated prior to the start of the video and continued processing during video observation in order to record the timing of subjects’ meaningful and error event button presses.

Experimental design

The behavioral and eye-tracking testing sessions were divided into three different sessions: Pre, Post 1, and Post 2 (Fig. 1b). The Pre condition is the baseline condition wherein subjects had no prior to training on the task. Their first exposure to the task was the aforementioned 17 min and 4 s video of an expert stone knapper performing the task in an egocentric perspective. Following the first session, subjects completed stone knapping training by an expert stone knapper and returned for the second session after 50 h of training for the Post 1 assessment. During this Post 1 condition, subjects observed the same video and indicated their perception of meaningful and error events with a keyboard button press. The last 50 h (100 h total) of motor training was completed before subjects came back for their third and last assessment, Post 2. During this time, subjects observed the same video and indicated their perception of meaningful and error events during video observation.

Task

In between each training condition, subjects were trained and assessed on the flint knapping task. The goal of the task is to create a tear-dropped shaped hand-axe from a larger stone, which we will refer to as the “core” for the rest of the manuscript. All participants were instructed in basic knapping techniques including how to select appropriate percussors, initiate flaking on a nodule, maintain the correct flaking gestures and angles, prepare flake platforms, visualize outcomes, deal with raw material imperfections, and correct mistakes. Handaxe-specific instruction included establishment and maintenance of a bifacial plane, cross-sectional thinning, and overall shaping. The importance of producing thin, symmetrical pieces with centered edges was emphasized throughout the training as these are key components in successful handaxe making.

Data preprocessing

Determination of meaningful action phases

While watching the videos, during Pre, Post 1 and Post 2, participants indicated their perception of meaningful actions using the left and right arrows on the computer keyboard. Keyboard button presses for meaningful events were used to pinpoint the action phases and time points that were deemed most salient to subjects. Button press latencies are time points that correspond to specific time points in the video and were collected from the start until the end of the video. Latencies were stored matrix wherein columns represented each subject and each row corresponds to a specific time the subject responded to a meaningful event. All meaningful event button press latencies were preprocessed using MALAB2015b. Latencies were rounded to the nearest second and transferred into a discrete subject x time matrix with 1 representing a button press and 0 representing no response. More specifically, a one was placed in a column (i.e., time point) within a specific row (i.e., subject) where the button press occurred. Figure 1c shows the visualization of the subject x time matrix of button press across the duration of the video for each subject (y axis) across the entire video duration (x axis) in Pre, Post 1, and Post 2 training conditions. Figure 2c depicts a histogram of button press data across time within the Post 2 condition. Together, this reveals salient time points that subjects perceived as meaningful events in the Post 2 training condition.

To pinpoint the action phases that were most salient across all subjects, each second in the video was categorized by an expert flintknapper (NK) and independently confirmed (DS). This allowed for the temporal categorization of action phases based on one of the six action phases: core reposition, core move, light percussion, percussion, grinding, and tool change. Action phase time points were used to segment the subject x time matrix and categorize them based on the six action phases. The number of subjects that responded with a button press was summed for each specific action phase. The average number of button presses for each action phase was determined across all subjects within a training condition to identify the action phase that subjects’ perceived to be the most meaningful and salient. Using the shapiro.test in R, action count violated assumptions of normality (W = 0.60, p-value < 0.001). A linear mixed effect model using lmer in R was used to determine statistical significance.

Determination of meaningful time points

Due to the hesitation and delayed reaction to an observed evident, the probability of a meaningful event button press was calculated by isolating a 10-s window with a 5-s slide. Using the subject x time matrix, 10 columns (i.e., 10 s) across all subjects were isolated a time, the probability meaningful event button presses were calculated by determining the number of subjects (all the rows) that responded (indicated with a 1) and dividing it by the total number of subjects. The window was shifted by 5-s. The subsequent probability for this 10-s window was calculated using the same metric. The sliding window intervals were applied throughout the entire video resulting in a time series of probability for each training condition (overlaid black lines in Fig. 3a). Time points for analysis were determined by assigning a 90^th percentile as the threshold percentile and only used video times points with probabilities at or greater than the 90^th percentile (Fig. 3c) indicate probability number value. Together, core move time points that meet the 90^th percentile probability threshold or higher were isolated and extracted for continuous gaze analysis (Figure S1). Based on these criteria, we identified 40 core move time points for gaze analysis. Gaze behavior was analyzed for these 40 core move time points in Pre, Post 1, and Post 2 for all subjects for a total of 1,320 data points for each given gaze parameter that was measured.

Gaze & perceptual analysis

Temporal dispersion of gaze using agglomerative hierarchical cluster analysis

We sought to understand the relationship between the dispersion of gaze data across a given core move time point using gaze coefficient of variation and agglomerative hierarchical cluster analysis of gaze position along the x-axis. For the first approach, gaze coefficient of variation was calculated by using continuous gaze data for each of the 40 core move time points and dividing the standard deviation by the mean gaze position across the entire core move continuous data segment. For the second approach, agglomerative hierarchical cluster (AHC) analysis, a bottom-up approach wherein individual data points are clustered based on their similarity, was implemented on continuous x-axis gaze data that has been organized into a time x subject matrix. Data was clustered using XLSTATS 2017.03.44550 with Euclidean distance as the distance measure and Ward’s method as the agglomeration method (Manning et al., 2009;⁵⁷ Tan et al., 2019⁵⁸).

We focused on two classes of parameters: group-level variance and subject-level gaze positions. For the group-level variance, cluster output included within- and between-class variance, which is a percentage value that represents how variable gaze positions are within a cluster and between different clusters respectively. The average within-class and between-class gaze variance across all core move time points that exceeded the probability threshold was used to determine if there was an effect of training on group-level gaze variance. The primary focus of subject-level parameters from the AHC output was central object, which is an actual value for each subject that is closest to the cluster centroid. On average, AHC on continuous core move time points resulted in 3 time clusters with each time cluster containing a time cluster centroid, which is the most statistically representative data point (time point and gaze position) within every identified cluster. Utilization of this approach for all core move time points reduces and summarizes continuous gaze data based on cluster-based descriptive statistics.

Spatial dispersion of gaze

To understand where gaze fell in the visual scene during observation, we focused on gaze dispersion. Since core moves are the focal point of subjects’ attention, we set the core itself as the primary area of interest (AOI). More specifically, we deemed the working edge, which is the right most edge of the core, as the subarea of interest because most of the manipulative, tool-use actions applied to mold the core itself occurs within this vicinity. The specific coordinates of the core was determined by an MATLAB2015b-based computer vision object tracking algorithm using machine learning code that utilizes Random Sampling Consensus (RANSAC) for the detection of object features (i.e., core) in a training image and identifying it on a larger visual frame (Computer Vision Toolbox^TM User’s Guide R2020a, 2004; “Computer Vision Toolbox - MATLAB & Simulink²⁹,” 2020; Bahraini et al., 2018³⁰). Video time points that contained core move time points that meet and exceed the 90^th percentile probability threshold or higher were cropped and used for frame-by-frame object tracking analysis. A screenshot of the core in various time points within the cropped video were used as training images for the RANSAC machine learning algorithm. Each frame within the cropped video was resized to 1680 × 1050 to match the exact viewing dimensions subjects experienced during video observation. The algorithm extracts features within the training images and applies it to the larger 1680 × 1050 video frame. A box was fitted around the core for each video frame to visually validate proper identification of core coordinates. The x-axis maximum of these coordinates were deemed to the subarea of interest where most of the manipulative, tool-use actions were applied to the core. This procedure was repeated across all 40 core move time points.

These core coordinates were used to calculate the mean gaze distance from the working edge and variance of gaze from the core edge. Subject-level central objects for each identified cluster from the AHC output was used to calculate gaze distance from core coordinate by subtracting the subject-level central object from the working edge coordinates at the corresponding time point of the central object. Gaze variance relative to the core was calculated by calculating the standard deviation of the distance for each time centroid. This procedure was repeated across all 40 core move time points and for each training condition.

Statistical analysis

Statistical analysis were performed using R version 3.6.2 GUI 1.70 El Capitan build (7735) & RStudio version 1.2.5033. For all values, normality and equality of variances was determined using Shapiro-Wilks and the Levene’s test respectively. Effect sizes for ANOVA was calculated using eta-squared (Ƞ²) through the lsr R package with type II sum of squares while effect sizes for the Kruskal–Wallis test was calculating using epsilon-squared (ϵ). Effect size for pairwise comparisons was calculated using the effsize R package by implementing Cohen’s d with Hedge’s g correction, which is utilized for studies with n < 20. For statistical test for significance, a p-value <0.05 was considered significant for all variables. The data structure and statistical analysis will be discussed in each individual section.

Action phase

To determine what action phase subjects deemed meaningful, a linear mixed effect model with maximum likelihood was used to determine the contribution of the fixed effects (action phase and training). A null model, containing only the random effect of Subject, was created using the lmer R function to account for any subject-level and baseline differences in behavior. The main effect of action phase and training were examined by creating models for each independent variable, and the interaction effect was determined by the combination of action and training. The chi-square value from the anova R function was used to quantify the comparison between the models that incorporated these factors and the baseline model. The multcomp and lsmeans R packages were used for pairwise comparisons with a Bonferroni adjustment.

Temporal dispersion of gaze statistical analysis

Within-class variance is a parameter output from the agglomerative hierarchical clustering that was used to assess gaze dispersion during core moves. Within-class variance was normally distributed (W = 0.99, p = 0.26) and exhibited homogeneity of variance (F(2) = 0.72 p = 0.49). Dissimilarity, a parameter that is calculated using Euclidean distance and used to quantify gaze distance, was not normally distributed (W = 0.69, p < 0.001) but exhibited homogeneity of variance (F(2) = 0.75, p = 0.47). Significance was assessed using one-way ANOVA with a TukeyHSD pairwise comparison and a Kruskal-Wallis rank sum test with a Wilcoxon signed ranked post-hoc for within-class variance and dissimilarity respectively.

Gaze & motor performance statistical analysis

Each row in the data matrix represents a particular subject, in a specific condition, and during a specific core move time. Due to high variability in the data, a data permutation within a specific condition was conducted. In essence, the subject identifier was shuffled within a given condition. Finally, we took the mean of all dependent variables across all core move time points. Each condition should have a mean coefficient of variation for each subject.

Coefficient of variation and mean distance from core

Coefficient of variation (W = 0.94, p = 0.084; F(2) = 0.54, p = 0.59) and mean distance from the core (W = 0.98, p = 0.74; F(2) = 0.23, p = 0.80) were both normally distributed and had equal variances. A one-way ANOVA with a TukeyHSD pairwise comparison adjustment was used to determine the effect of training.

Gaze variability relative to the core

Gaze variability violated assumptions of normality (W = 0.92, p = 0.031) but had equal variances (F(2) = 2.02, p = 0.15). Model score also violated assumptions of normality (W = 15.022, p < 0.001) but exhibited homogeneity of variance (F(2) = 2.020, p = 0.15). For both parameters, a Kruskal-Wallis rank sum test with a Wilcoxon signed ranked post-hoc was conducted to determine significant difference across training conditions.

Handaxe score

Model score violated assumptions of normality (W = 0.85, p < 0.001) and homogeneity of variance (F(2) = 64.77, p < 0.001). A Kruskal-Wallis rank sum test with a Wilcoxon signed ranked post-hoc was conducted to determine significant difference across training conditions.

Percussion error

Percussion score violated assumptions of normality (W = 0.90, p = 0.008) and exhibited homogeneity of variance (F_2,27 = 3.17, p = 0.06).

Shift functions

To quantify the extent to which perceptual (i.e., gaze) and motor performance changed from one training condition to another, we utilized shift function originally proposed by Rouselett & colleagues (2017). A shift function is another technique that quantifies how distribution differ by quantifying the amount one distribution must be shifted relative to another distribution (Rousselet et al., 2016, 2017^33,59). Data are described using deciles wherein each condition is described using 9 quantiles leading to the segmentation of data into 10 different segments. Deciles are calculated using the Harrell-Davis quantile estimator (Wilcox & Erceg-Hurn, 2012). Deciles for one group (e.g., Pre) is compared to a second group (e.g., Post 1) by quantifying the decile differences between the two groups (e.g., Post 1 – Pre). A 95% confidence interval is calculated using percentile bootstrap, and the significance for each decile is quantified using Hochberg’s method (Wilcox & Erceg-Hurn, 2012). In the current study, data was separated into two shifts: Shift1 and Shift2. Shift1 compares gaze and motor performance scores in Pre & Post 1 while Shift2 compares Post 1 and Post 2 data. Each shift data frame was passed through the rogme package in R, which implements the Harrell-Davis quantile estimator, 95% confidence interval, and Hochberg’s method.

Canonical correlation analysis (CCA)

To quantify the strength of the relationships between performance and gaze through training, we conducted canonical correlation analysis (CCA), a multivariate tool that evaluates the relationship between two variables (Wang et al., 2020⁶⁰). Gaze and performance dependent variables were separated into two different variable sets or variates. The performance variate is a subject x performance data matrix with each row representing subjects and the columns representing the three performance variables: learning slope, model score, and percussion score. Similarly, the gaze variate is a subject x gaze matrix with each row representing a subject and each column representing gaze variables: gaze coefficient of variation, mean gaze distance from core, and gaze variability from core. Each training condition (i.e., Pre, Post1, and Post2) consisted of these two performance and gaze matrices yielding a total of six matrices.

For each condition, the performance variate (Xa) and the gaze variate (Xb) were passed through the cca function in MATLAB. Within the cca function, all variables within each variate was z-scored normalized, a common procedure implemented to the data prior to CCA implementation. A sample covariance matrix was created within each variate, which allowed for the implementation of Cholesky factorization using the chol function in MATLAB. Canonical correlations were computed using singular value decomposition through the svd function in MATLAB. The output of the function consisted of canonical weights (Wa, Wb), a numerical value quantifying how a single variable within a variate contributes to the overall trend. The canonical variate (Za, Zb) is a computed weighted sum of original variables obtained by a linear combination of the variables in Xa and another linear combination of variables in Xb. The computed canonical correlation (S), a quantification of the strength of the relationship between the two variates (i.e., performance measures and gaze variables), is obtained by quantifying the Pearson’s R correlation of the canonical variates, Za and Zb. A comparison of weights (Wa, Wb) as well as the changes in correlation between Za and Zb will be assessed from Pre, Post1, and Post2.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The datasets generated during and/or analyzed during the current study are available in Georgia Tech’s SMARTech Respository: (web link: https://smartech.gatech.edu/handle/1853/64500).

References

Henrich, J. The Secret of Our Success | Princeton University Press. (Princeton University Press, 2016).
Morgan, T. J. H. et al. Experimental evidence for the co-evolution of hominin tool-making teaching and language. Nat. Commun. 6, 1–8 (2015).
Article Google Scholar
Pargeter, J., Khreisheh, N. & Stout, D. Understanding stone tool-making skill acquisition: experimental methods and evolutionary implications. J. Hum. Evol. 133, 146–166 (2019).
Article PubMed Google Scholar
Stout, D., Rogers, M. J., Jaeggi, A. V. & Semaw, S. Archaeology and the origins of human cumulative culture: a case study from the earliest oldowan at gona, Ethiopia. Curr. Anthropol. 60, 309–340 (2019).
Article Google Scholar
Stout, D. & Hecht, E. E. Evolutionary neuroscience of cumulative culture. Source 114, 7861–7868 (2017).
CAS Google Scholar
Dicks, M., Button, C., Davids, K., Chow, J. Y. & van der Kamp, J. Keeping an eye on noisy movements: on different approaches to perceptual-motor skill research and training. Sport. Med. 47, 575–581 (2017).
Article Google Scholar
Pargeter, J., Kreisheh, N., Shea, J. J. & Stout, D. Knowledge vs. know-how? Dissecting the foundations of stone knapping skill. J. Hum. Evol. 145, 102807 (2020).
Article PubMed Google Scholar
Tennie, C., Premo, L. S., Braun, D. R. & McPherron, S. P. Early stone tools and cultural transmission: resetting the null hypothesis. Curr. Anthropol. 58, 652–672 (2017).
Article Google Scholar
García-Medrano, P., Ollé, A., Ashton, N. & Roberts, M. B. The mental template in handaxe manufacture: new insights into acheulean lithic technological behavior at Boxgrove, Sussex, UK. J. Archaeol. Method Theory 26, 396–422 (2019).
Article Google Scholar
Stout, D., Apel, J., Commander, J. & Roberts, M. Late Acheulean technology and cognition at Boxgrove, UK. J. Archaeol. Sci. 41 (2014).
Sternad, D. It’s not (only) the mean that matters: variability, noise and exploration in skill learning. Curr. Opin. Behav. Sci. 20, 183–195 (2018).
Article PubMed PubMed Central Google Scholar
Sailer, U., Flanagan, J. R. & Johansson, R. S. Eye-hand coordination during learning of a novel visuomotor task. J. Neurosci. 25, 8833–8842 (2005).
Article CAS PubMed PubMed Central Google Scholar
Dhawale, A. K., Smith, M. A. & Ölveczky, B. P. The Role of Variability in Motor Learning. Annu Rev Neurosci. 40, 479–498 (2017). https://doi.org/10.1146/annurev-neuro-072116-031548
Caballero, C., Moreno, F. J., Reina, R., Roldan, A., Coves, A. & Barbado, D. The role of motor variability in motor control and learning depends on the nature of the task and in individual’s capabilities. European Journal of Human Movement 38, 12–26 (2017).
Nonaka, T., Bril, B. & Rein, R. How do stone knappers predict and control the outcome of flaking? Implications for understanding early stone tool technology. J. Hum. Evol. 59, 155–167 (2010).
Article PubMed Google Scholar
Rein, R., Nonaka, T. & Bril, B. Movement pattern variability in stone knapping: implications for the development of percussive traditions. (2014). https://doi.org/10.1371/journal.pone.0113567
Stout, D. Stone toolmaking and the evolution of human culture and cognition. Philos. Trans. Biol. Sci. 366, 1050–1059 (2011).
Article Google Scholar
Heyes, C. Imitation. Curr. Biol. 31, R228–R232 (2021).
Article CAS PubMed Google Scholar
Orban, G. A. & Caruana, F. The neural basis of human tool use. Front. Psychol. 5, 1–12 (2014).
Article Google Scholar
Henderson, J. M. & Hayes, T. R. Meaning-based guidance of attention in scenes as revealed by meaning maps. Nat. Hum. Behav. 1, 743–747 (2017).
Article PubMed PubMed Central Google Scholar
Cronin, D. A., Hall, E. H., Goold, J. E., Hayes, T. R. & Henderson, J. M. Eye movements in real-world scene photographs: general characteristics and effects of viewing task. Front. Psychol. 10, 1–12 (2020).
Article Google Scholar
König, P. et al. Eye movements as a window to cognitive processes. J. Eye Mov. Res 9, 1–16 (2016).
Google Scholar
Lohse, K. R., Jones, M., Healy, A. F. & Sherwood, D. E. The role of attention in motor control. J. Exp. Psychol. Gen. 143, 930–948 (2014).
Article PubMed Google Scholar
Green, D., Li, Q., Lockman, J. J. & Gredebäck, G. Culture influences action understanding in infancy: prediction of actions performed with chopsticks and spoons in Chinese and Swedish infants. Child Dev. 87, 736–746 (2016).
Article PubMed Google Scholar
Bayani, K. Y. et al. Implicit development of gaze strategies support motor improvements during action encoding training of prosthesis use. Neuropsychologia 127, 75–83 (2019).
Article PubMed PubMed Central Google Scholar
Kline, M. A. How to learn about teaching: An evolutionary framework for the study of teaching behavior in humans and other animals. (2014). https://doi.org/10.1017/S0140525X14000090
Guo, C. C. & Raymond, J. L. Motor learning reduces eye movement variability through reweighting of sensory inputs. J. Neurosci. 30, 16241–16248 (2010).
Article CAS PubMed PubMed Central Google Scholar
Gameiro, R. R., Kaspar, K., König, S. U., Nordholt, S. & König, P. Exploration and exploitation in natural viewing behavior. Sci. Rep. 7, 1–24 (2017).
Google Scholar
Computer Vision Toolbox - MATLAB & Simulink [WWW Document] (2020). URL https://www.mathworks.com/products/computer-vision.html Computer Vision Toolbox^TM User’s Guide R2020a (2004).
Bahraini, M. S., Bozorg, M. & Rad, A. B. SLAM in dynamic environments via ML-RANSAC. Mechatronics 49, 105–118 (2018).
Article Google Scholar
Milazzo, N., Farrow, D. & Fournier, J. F. Effect of implicit perceptual-motor training on decision-making skills and underpinning gaze behavior in combat athletes. Percept. Mot. Skills 123, 300–323 (2016).
Article PubMed Google Scholar
William, L. B. & Harter, N. Studies on the telegraphic language: the acquisition of a hierarchy of habits. Psychol. Rev. 6, 345–375 (1899).
Article Google Scholar
Rousselet, G. A., Foxe, J. J. & Bolam, J. P. A few simple steps to improve the description of group results in neuroscience. Eur. J. Neurosci. 44, 2647–2651 (2016).
Article PubMed Google Scholar
Stout, D. Neuroscience of Technology. Cultural Evolution: Society, Technology, Language, and Religion (MIT Press, 2013). https://doi.org/10.7551/mitpress/9780262019750.001.0001
Whiten, A. Experimental studies illuminate the cultural transmission of percussive technologies in Homo and Pan. Philosophical Transactions of the Royal Society B: Biological Sciences 370, 1-13 (2015). https://doi.org/10.1098/rstb.2014.0359
Tennie, C., Call, J. & Tomasello, M. Ratcheting up the ratchet: on the evolution of cumulative culture. Philos. Trans. R. Soc. B Biol. Sci. 364, 2405–2415 (2009).
Article Google Scholar
Legare, C. H. & Nielsen, M. Imitation and innovation: the dual engines of cultural learning. Trends Cogn. Sci. 19, 688–699 (2015).
Article PubMed Google Scholar
Cusack, W. F. et al. Neural activation differences in amputees during imitation of intact versus amputee movements. (2012). https://doi.org/10.3389/fnhum.2012.00182
Cusack, W. F., Patterson, R., Thach, S., Kistenberg, R. S. & Wheaton, L. A. Motor performance benefits of matched limb imitation in prosthesis users. Exp. Brain Res 232, 2143–2154 (2014).
Article PubMed Google Scholar
Cusack, W. et al. Enhanced neurobehavioral outcomes of action observation prosthesis training. Neurorehabil. Neural Repair 30, 573–582 (2016).
Article PubMed Google Scholar
De Vignemont, F. & Haggard, P. Social neuroscience action observation and execution: what is shared? Soc. Neurosci. 3, 421–433 (2008).
Article PubMed Google Scholar
Mizelle, J. C. & Wheaton, L. A. How can we improve our understanding of skillful motor control and apraxia? Insights from theories of ‘affordances’. Front. Hum. Neurosci. 8, 1–2 (2014).
Article Google Scholar
Kardas, M. & O’Brien, E. Easier seen than done: merely watching others perform can foster an illusion of skill acquisition. Psychol. Sci. 29, 521–536 (2018).
Article PubMed Google Scholar
Frank, C., Land, W. M. & Schack, T. Perceptual-cognitive changes during motor learning: the influence of mental and physical practice on mental representation, gaze behavior, and performance of a complex action. Front. Psychol. 6, 1–14 (2016).
Article Google Scholar
Mann, D. T. Y., Williams, A. M., Ward, P. & Janelle, C. M. Perceptual-cognitive expertise in sport: a meta-analysis. J. Sport Exerc. Psychol. 29, 457–478 (2007).
Article PubMed Google Scholar
Wittek, P., Liu, Y. H., Darányi, S., Gedeon, T. & Lim, I. S. Risk and ambiguity in information seeking: eye gaze patterns reveal contextual behavior in dealing with uncertainty. Front. Psychol. 7, 1–10 (2016).
Article Google Scholar
Stojić, H., Orquin, J. L., Dayan, P., Dolan, R. J. & Speekenbrink, M. Uncertainty in learning, choice, and visual fixation. Proc. Natl Acad. Sci. USA 117, 3291–3300 (2020).
Article PubMed PubMed Central Google Scholar
Dessing, J. C., Rey, F. P. & Beek, P. J. Gaze fixation improves the stability of expert juggling. Exp. Brain Res. 216, 635–644 (2012).
Article PubMed Google Scholar
Cavanagh, P. & Alvarez, G. A. Tracking multiple targets with multifocal attention. Trends Cogn. Sci. 9, 349–354 (2005).
Article PubMed Google Scholar
Natraj, N., Alterman, B., Basunia, S. & Wheaton, L. A. The role of attention and saccades on parietofrontal encoding of contextual and Grasp-specific affordances of tools: an ERP study. Neuroscience 394, 243–266 (2018).
Article CAS PubMed Google Scholar
Ramsey, R., Cross, E. S. & de Hamilton, A. F. C. Predicting others’ actions via grasp and gaze: evidence for distinct brain networks. Psychol. Res. 76, 494–502 (2012).
Article PubMed Google Scholar
Kirsch, L. P. & Cross, E. S. Additive Routes to Action Learning: Layering Experience Shapes Engagement of the Action Observation Network. (2015). https://doi.org/10.1093/cercor/bhv167
Ramsey, R., Kaplan, D. & Cross, E. Watch and learn: the cognitive neuroscience of learning from others’ actions | Elsevier Enhanced Reader. Trends Neurosci. 1–14 (2021). https://doi.org/10.1016/j.tins.2021.01.007.
Calvo-Merino, B., Glaser, D. E., Grè Zes, J., Passingham, R. E. & Haggard, P. Action observation and acquired motor skills: an fMRI study with expert dancers. Cereb. CortexCerebral Cortex 15, 1243–1249 (2005).
Article CAS Google Scholar
Wohlschläger, A. M. et al. Prefrontal involvement in imitation learning of hand actions: effects of practice and expertise. Neuroimage 37, 1371–1383 (2007).
Article PubMed Google Scholar
Gray, W. D. & Lindstedt, J. K. Plateaus, dips, and leaps: where to look for inventions and discoveries during skilled performance. Cogn. Sci. 41, 1838–1870 (2017).
Article PubMed Google Scholar
Manning, C., Raghaven, P., & Schütze, H. (2009) Hierarchical clustering. In Introduction to Information Retrieval. Cambridge University Press, Cambridge, England, pp. 377–401.
Tan, P., Steinbach, M., Anuj, K., & Vipin, K. (2019) Cluster Analysis: Basic Concepts and Algorithms. In Introduction to Data Mining, 2nd edn. Pearson.
Rousselet, G. A., Pernet, C. R. & Wilcox, R. R. Beyond differences in means: robust graphical methods to compare two groups in neuroscience. Eur. J. Neurosci. 46, 1738–1748 (2017).
Article PubMed Google Scholar
Wang, H. T. et al. Finding the needle in a high-dimensional haystack: canonical correlation analysis for neuroscientists. Neuroimage 216, 116745 (2020).
Article PubMed Google Scholar

Download references

Acknowledgements

We thank Bennett Alterman for technical support, Carson Topping for his expertise in Excel macros and XLStat that made the gaze distance from core calculation efficient, and Neel Atawala for his technical support in the machine learning code. This work was financially supported by the National Science Foundation, 1328567.

Author information

Authors and Affiliations

School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA
Kristel Yu Tiamco Bayani, Nikhilesh Natraj & Lewis A. Wheaton
Division of Neurology, UCSF Weill Institute for Neurosciences, San Francisco, CA, USA
Nikhilesh Natraj
Anthropology Department, Emory University, Atlanta, GA, USA
Nada Khresdish, Justin Pargeter & Dietrich Stout
Department of Anthropology, New York University, New York, NY, USA
Justin Pargeter

Authors

Kristel Yu Tiamco Bayani
View author publications
You can also search for this author in PubMed Google Scholar
Nikhilesh Natraj
View author publications
You can also search for this author in PubMed Google Scholar
Nada Khresdish
View author publications
You can also search for this author in PubMed Google Scholar
Justin Pargeter
View author publications
You can also search for this author in PubMed Google Scholar
Dietrich Stout
View author publications
You can also search for this author in PubMed Google Scholar
Lewis A. Wheaton
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conceptualization: L.W., D.S., N.N., N.K. Data curation: N.N., K.B., N.K., J.P. Formal analysis: K.B., N.N., J.P. Funding acquisition: L.W., D.S. Investigation: L.W., D.S. Methodology: L.W., D.S., N.N., N.K. Project administration: L.W., D.S., N.N., K.B. Resources: L.W., D.S. Software: K.B., N.N., N.K., J.P. Supervision: L.W., D.S. Validation: K.B., N.N., J.P. Visualization: K.B., N.N., J.P. Writing – original draft: K.B., L.W. Writing – review & editing: L.W., D.S., N.N., K.B., J.P.

Corresponding author

Correspondence to Lewis A. Wheaton.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Peer review information Communications Biology thanks Grzegorz Króliczak and the other, anonymous, reviewers for their contribution to the peer review of this work. Primary Handling Editors: Stefano Palminteri and Karli Montague-Cardoso.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Bayani, K.Y.T., Natraj, N., Khresdish, N. et al. Emergence of perceptuomotor relationships during paleolithic stone toolmaking learning: intersections of observation and practice. Commun Biol 4, 1278 (2021). https://doi.org/10.1038/s42003-021-02768-w

Download citation

Received: 04 May 2021
Accepted: 11 October 2021
Published: 11 November 2021
DOI: https://doi.org/10.1038/s42003-021-02768-w

This article is cited by

Neuroplasticity enables bio-cultural feedback in Paleolithic stone-tool making
- Erin Elisabeth Hecht
- Justin Pargeter
- Dietrich Stout
Scientific Reports (2023)
Testing the Effect of Learning Conditions and Individual Motor/Cognitive Differences on Knapping Skill Acquisition
- Justin Pargeter
- Cheng Liu
- Dietrich Stout
Journal of Archaeological Method and Theory (2022)
The role of vision during Lower Palaeolithic tool-making
- María Silva-Gago
- Marcos Terradillos-Bernal
- Emiliano Bruner
Journal of Paleolithic Archaeology (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Perceptography unveils the causal contribution of inferior temporal cortex to visual perception

Motor neurons generate pose-targeted movements via proprioceptive sculpting

Bumblebees socially learn behaviour too complex to innovate alone

Introduction

Results

Determination of meaningful action sequences

Gaze variability

Temporal dispersion of gaze

Spatial dispersion of gaze

Performance measures

Relationship between gaze patterns and knapping performance

Perceptual-motor shifts with learning

Perceptual-motor canonical correlation analysis

Discussion

Methods

Subjects

Experimental set up

Workspace

Eye tracking

Action observation

Experimental design

Task

Data preprocessing

Determination of meaningful action phases

Determination of meaningful time points

Gaze & perceptual analysis

Temporal dispersion of gaze using agglomerative hierarchical cluster analysis

Spatial dispersion of gaze

Statistical analysis

Action phase

Temporal dispersion of gaze statistical analysis

Gaze & motor performance statistical analysis

Coefficient of variation and mean distance from core

Gaze variability relative to the core

Handaxe score

Percussion error

Shift functions

Canonical correlation analysis (CCA)

Reporting summary

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Reporting Summary

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Neuroplasticity enables bio-cultural feedback in Paleolithic stone-tool making

Testing the Effect of Learning Conditions and Individual Motor/Cognitive Differences on Knapping Skill Acquisition

The role of vision during Lower Palaeolithic tool-making

Comments

Search

Quick links