Computerized handwriting evaluation and statistical reports for children in the age of primary school

Chang, Shao-Hsia; Yu, Nan-Ying

doi:10.1038/s41598-022-19913-y

Download PDF

Article
Open access
Published: 19 September 2022

Computerized handwriting evaluation and statistical reports for children in the age of primary school

Shao-Hsia Chang¹ &
Nan-Ying Yu²

Scientific Reports volume 12, Article number: 15675 (2022) Cite this article

1704 Accesses
3 Citations
Metrics details

Subjects

Abstract

This study proposed a novel computational method for evaluating logographic handwriting. It can precisely evaluate both the handwriting product and the process. The measures included handwriting performance as well as the temporospatial, kinematics, and kinetics features. For examining the psychometrics of this comprehensive evaluation system, typical development children aged 6 to 9 years old (grade 1 to grade 3) (n = 641) were involved in the study of factor analysis. From twelve measuring variables, the exploratory factor analysis extracted five factors (handwriting performance, motor control, speed and automation, halt and exertion, and “in air” events). The test reliability was confirmed by further recruitment of typically developing children (n = 242). The internal consistency mostly demonstrated good to excellent results for every measure. This study further recruited children with handwriting difficulties (n = 33) for testing the discriminative validity of the evaluation system. A series of two-way ANOVA tests was conducted to test the significance of the main effects of the groups (typical development and handwriting deficit) and grades (1, 2, and 3) and their interaction effects on the handwriting measures. All the measures showed significant differences between the two groups, indicating the discriminative validity for identifying handwriting deficits. Seven of twelve measures showed significant interaction effects, indicating the different trends across the grades between the two groups. Typically-developing children demonstrated ongoing progress from grade 1 to grade 3, suggesting a developmental trend during their early school age. Implications for motor development and clinical evaluation are discussed herein in relation to the five dimensions.

Extending the Spectrum of Dysgraphia: A Data Driven Strategy to Estimate Handwriting Quality

Article Open access 21 February 2020

Phenotyping features in the genesis of pre-scriptural gestures in children to assess handwriting developmental levels

Article Open access 12 January 2021

The transferability of handwriting skills: from the Cyrillic to the Latin alphabet

Article Open access 23 February 2021

Introduction

Many researchers argued that handwriting had become a past art since a significant number of digital devices are used in daily life. However, it might be true only in the adult world. For children of primary school, it is impossible for an academic learning process without handwriting. Handwriting is indeed an essential fine motor skill in school-age children. In children with handwriting problems, the learning in classes may become difficult and may lead to low academic performance^1,2. Such frustration is likely to further lead to their perception of insufficient handwriting skills, consequently they are reluctant to the writing work, and the negative emotions may affect the development of their writing ability. The scrawls of schoolchildren may impact their marks evaluated by teachers. Therefore, handwriting skill is related not only to language learning but affects the development of the self-concept of students³.

Handwriting is a complicated process of fine motor movements, which allows for parallel activation of cognitive and motor processes⁴. Poor fine motor control may lead to strenuous or even illegible handwriting. Assessment of fine motor control in handwriting movement is thus important in any comprehensive evaluation of handwriting dysfunction. However, it is difficult to evaluate or analyze the subtle handwriting movements by traditional clinical observation. With the help of a digital tablet, the computerized evaluation provides a quantitative method for measuring specific fine motor control variables during the actual handwriting process^5,6,7,8,9,10. For measuring handwriting proficiency, it has been used to document the frequency or duration of pauses in the stroke movement. The poor handwriters can be identified by the increased frequency of pauses or longer pause time^11,12,13. Using the Minnesota Handwriting Assessment (MHA) test sheet on a digital tablet, Dirlikov et al. revealed standard (manual derived) MHA and comparable automated handwriting scores were highly correlated. They found elevated letter-form scores (worse performance) in children with ASD and ADHD groups compared to TD children. Children with ASD showed the greatest letter-form impairment¹⁰. Li‐Tsang et al. (2018) successfully used computerized assessment of temporospatial features and kinetics to distinguish comorbid learning difficulties from attention deficit hyperactivity disorder in Chinese adolescents. They found ADHD and ADHD-LD groups also showed larger variations in either handwriting speed or pen pressure than their controls¹⁴.

In addition to pauses of stroke movement, the time spent holding the writing instrument above the paper was thought to correspond to the time needed to initiate the handwriting movement to execute the character⁹. The time and trajectory length of ‘‘in air’’ movements have been used to characterize handwriting deficit in fine motor control. As compared with the typically developed group, the non-proficient handwriters showed significantly longer ‘‘in air’’ time and path length^13,15.

From the aspect of motor control theory, Plamondon defined rapid human movements (automation) in an open-loop paradigm showing a smooth velocity curve with ideally a single peak value¹¹. In one of our previous studies, the average number of velocity peaks of a stroke has been applied to characterize the handwriting deficit of children with developmental coordination disorder (DCD). The study showed promising results for using this parameter to characterize motor control problems in children with handwriting difficulties¹⁶.

Our previous study compared the handwriting process in children with non-proficient characteristics to those of typically-developing children in Grades 1 and 2. We found the attainment of automated handwriting was markedly slower in children with handwriting deficits. The number of velocity peaks was found a valid parameter to monitor the automation of stroke movement in handwriting. The study also revealed the poor handwriters used a faster stroke velocity than typically-developing group to write simple characters but not complex characters, parameters of paramount significance in graphomotor function were adopted to evaluate the children with non-proficient characteristics¹⁶.

In the recent decade, the advance of artificial intelligence has been implemented on handwriting evaluation for the identification of poor handwriting^17,18. Wu et al. found that the Support Vector Machine model had the best performance in detecting Chinese handwriting deficit¹⁸. A total of 34 handwriting features were extracted from the data recorded, which can be generally divided into five categories: spatial, temporal, velocity, acceleration, and pen tilt. Gargot et al. performed K-means clustering to define a new classification of dysgraphia. Twelve digital features describing handwriting through different aspects (static, kinematic, pressure, and tilt) were extracted and used to create linear models to investigate handwriting acquisition throughout education. They found that three features (two kinematic and one static) showed a significant association to predict the change of handwriting quality in control children. In the two clusters exhibiting severe dysgraphia, one presented abnormality in terms of kinematics and pressure whilst the other one showed mainly pen tilt problems¹⁷. These studies gained new insights into which handwriting features are predictive of handwriting deficits in children and propose a method that can help the clinical and educational professionals to automatically detect children at risk of handwriting deficit. To further the implication of these studies, it is necessary to relate these features to that clinical intervention or educational ecology in handwriting lessons.

Different from the alphabetical language system, a Chinese character is a kind of logogram (or logograph) which means a written character that represents a word or morpheme. In this logographic system, there are many hieroglyphic and cuneiform characters. The difference between the oriental and western language systems should be considered in the design of the evaluation method. There are more than 4000 distinct characters needed for the basic components to acquire the written language skills. The sophisticated character structure forms the major problems in Chinese handwriting. The complicated process of Chinese handwriting has become a complex motor learning and motor control paradigm.

There were few studies or evaluations on legibility, especially in Chinese handwriting that the requirement of legible handwriting is the correct placement of distinct strokes. Different from the alphabetical system, every Chinese character has usually more than ten strokes in a character. The important component of Chinese handwriting is the correct stroke placement with a certain length, the direction of the turn of the strokes in the handwriting. For handwriting accuracy and legibility evaluation, most evaluations of handwriting legibility still relied on eye-ball check. Few studies implemented computer applications for the automated assessment of legibility and accuracy^19,20,21. In the work of Li-Tsang et al., they proposed an assessment for calculating accuracy and the total number of characters with stroke errors²¹. However, it is not available how they get the accuracy or legibility data pertaining to the number of characters with added strokes, omitted strokes or concatenated strokes, mirrored strokes and crossing-over strokes. Based on offline image processing, our preliminary system proposed a template matching method for the assessment of legibility, alignment, and size control of Chinese handwriting¹⁹. For the accuracy evaluation of Chinese handwriting, the work of Hu et al. (2009) was originally aiming at handwriting tutoring. They have proposed a method being able to automatically check handwriting errors. In interactive handwriting learning, attributed relational graph matching is used to locate the handwriting errors, such as the stroke production errors, stroke sequence errors, and stroke relationship errors²⁰. All the results of the above studies showed the relevance of combining error detection and legibility assessment of individual strokes with the traditional kinetic and kinematic handwriting evaluation. A more scientific definition with clinical meaning and the computing of legibility and accuracy are needed in a comprehensive evaluation.

In a classroom setting, school teachers need a standardized practical tool for the measurement of legibility to help them identify those with difficulties and determine whether the child needs support to develop their skills²². In the clinical setting, therapists need an evaluation of the issues related to motor learning and motor control aspects. The present system has addressed legibility analysis and the temporospatial features of the handwriting process. This paper proposed a novel computational method for the evaluation of logographic handwriting. It can precisely evaluate both the handwriting product and the process. The aim of the current study was therefore to develop a reliable and valid computerized assessment of the temporospatial features and performance (speed and legibility) of handwriting. The discriminative validity was examined by the comparison between children with typical development and handwriting difficulties. For constructing a reference for clinical implementation, the differences were also examined between genders and across the school years.

Methods

Participants

In this study, 655 children aged 6–9 from elementary school were recruited to participate in the experiments for verifying the construct of evaluation and the reliability of the testing measures. After excluding children with neuromotor or related medical diagnoses (such as cerebral palsy, muscle atrophy, developmental retardation, and intellectual disability), a total of 641 children were chosen. As eight/six children were diagnosed with developmental retardation disorder, their data have been excluded from the statistical analysis. The final subject number in factor analysis was 342 girls and 299 boys.

A second sample from another 242 children (129 girls and 113 boys) was collected for the reliability test and statistical analysis to construct a reference for future clinical implementation of this evaluation system.

This study further recruited children with handwriting difficulties to check whether the tests can discriminate against them from children with typical development. The handwriting deficit has been confirmed by the administration of the Chinese Handwriting Evaluation Form (CHEF)²³. The CHEF has 29 items that measure six handwriting dimensions, including the construction of characters, major mistakes in writing the components of characters, developmental delays, use of a pencil, a gross motor function, and emotional reaction to the problem. According to the test manual, the cut-off criterion for the identification of a handwriting deficit is the presence of difficulties in two or more of the six dimensions, with a median score larger than, or equal to three²³. Of 395 children, thirty-three children with handwriting difficulties were identified and then recruited into the study for the validity analysis of the tests.

All participants and their parents gave written informed consents. The Institutional Review Board of the E-DA Hospital approved the study (EDAH IRB No. EMRP14107N). All participants and their parents gave written informed consents. All the consents were documented and witnessed by the Institutional Review Board of the E-DA Hospital. Table 1 shows the basic information and descriptive statistics of the participants recruited for reliability and discriminative validity tests.

Table 1 Basic information and descriptive statistics of participants recruited for reliability and discriminative validity tests.

Full size table

Instruments and apparatus

Participants were seated on a chair in front of a desk on which a digitizer tablet (487 × 318 × 12 mm, PH-1820-A, PendoTech, ShangHai) was positioned so that the tablet’s lower edge lined up with the edge of the table at which the participant was seated. On the digitizer tablet, an A4-sized piece of paper was positioned with the vertical and horizontal edges parallel to the horizontal and vertical edges of the digitizer. A standard-sized wireless electronic inking pen with a force-sensitive tip (2048 levels) was used to collect the movement data on the digitizer tablet. The axial pen force and X (horizontal) and Y (vertical) positions of the pen tip were sampled at a frequency of 200 Hz with a spatial resolution of 0.01 mm. This data acquisition setup was adopted from our previous study for neurodegenerative disease with the exception of digital tablets used²⁴.

Copying target

Figure 1A shows the characters used in this study as copying targets. According to the stroke number of most used Chinese characters, thirty Chinese characters were selected and printed on the gridded A4-size white paper. The characters were selected from the Chinese textbooks for schoolchildren in Grades 1 and 2 to ensure that the children have learned the characters. Figure 1B shows the configuration of the components in every Chinese character. According to the configuration, every character can be classified into one of the eleven categories in the copying target. The number shown on the corner of the grid denotes their attribute of the categories. During the test, the writing space was blank grids with the same format as shown in Fig. 1A.

Measuring parameters

The measures of computerized handwriting evaluation.

Performance measures

(1)
Writing speed (WS) According to the number of characters written in ten minutes, the speed is computed then expressed as characters per minute. If the time to complete all the tasks is less than ten minutes, the test time is modified according to the time they really took.
(2)
Correct writing speed (CWS): Correct writing of Chinese characters is defined as all the strokes are correctly placed with their corresponding positions, lengths, widths, heights, and directions. Following these rules, the evaluation system defined the criteria of correctness that all strokes being correctly placed as demonstrated on the left of Fig. 2. A sample of twenty-two test sheets including 566 characters was used for the validation of this categorization with an eye-ball check. High agreement (Kappa = 0.92, p < 0.001) was found between automatic check and eye-ball check by an experienced elementary school teacher. According to the number of characters written correctly in ten minutes, the correct writing speed is computed and then expressed as the accurate number of characters written per minute.

Geometric measures (legibility)

(1)
Intra-character configuration (ICC) This measures the ability to control stroke placement within a character. A Chinese character is composed of strokes up to tens or twenties, and even more than thirty strokes. In legible handwriting, every stroke should be appropriately placed to correctly express the configuration of the character. For every stroke, five measures were used to evaluate the appropriateness of stroke placement. The evaluation of intra-character configuration includes the measures of location, width, height, length, and slope of the stroke. Figure 2 shows an example of a comparison between the target and writing scripts. Given the second stroke of the target task and the fifth stroke of writing script as a comparable example, the differences in the five measures are computed to show the deviation from standard character configuration. The layout of a standard character is in a square grid (20 × 20 mm). Every written character was rescaled to fit in the square grid. The maximum differences between standard and handwritten scripts are 28.28 mm in a central location and stroke length, 20 mm in width and height, and 90 degrees in the slope. With a normalization to the maximal values, the proportion of differences would be deducted from the ideal score of 100 (%), then the average value of the total 30 characters was calculated.
(2)
Character size (CS) Character size is defined by the area of a virtual grid that the written character can exactly fit. The area is the product of the character width and height. The unit of this measure is mm².
(3)
Character Size Consistency (CSC) The standard deviation of the character size (area) is computed to indicate the ability to control a character with the same size. The unit of this measure is mm².

Temporospatial measures

(1)
Accuracy of Stroke order (ASO) Every Chinese character has its habitual order to write down the strokes. There are general rules of stroke order that have been defined and taught by the education system of countries with Chinese as their official language. Coming from ergonomic analysis of the context in Chinese handwriting, the general rule of stroke order includes (1) from top to bottom, (2) from left to right, and (3) from outside to inside. Using the metrics of mean absolute error (MAE)²⁵, the accuracy of stroke order was calculated by subtracting the stroke order of the sample characters from the stroke order of the written script, and the absolute value represents the error degree. For example, in writing “尖”, the proper sequence (denoted by number on Fig. 2) should be the upper component (3 strokes: middle → left → right) then the lower one (3 strokes: horizontal → center to left bottom → center to right bottom). When children write the lower component first, the fourth stroke they write is the first stroke in the correct order, so the error value of the first stroke was |4–1| = 3, the second stroke was also 3 (|5–2|) and so on. If two strokes were completed by one stroke, the second stroke of the order would be accumulated as an error value. For example, the xth and (x + 1)th strokes were written as a single stroke. The (x + 1)th stroke accumulates an error value |(x + 1) − x|. The same as in one stroke was written as two distinct strokes. An error value was accumulated |(x + 1) − x|. The completely reverse stroke order was taken as the maximum value of the wrong stroke order. The sum was calculated by subtracting the maximum value with the absolute value of error degree, the proportion of this value to the maximum value of wrong stroke order represented the proportion of correct stroke order, which was presented in percentage.
(2)
Pen pause time ratio (PPTR) This measures the dysfluency of movement from the temporal data. If two consecutive sampling points had the same registered coordinate, the period was cumulated and referred to as pause time. This parameter is derived by dividing the cumulative pause time by the total stroke number.
(3)
On-paper time ratio (OPTR) This parameter measures the temporal proportion of the pen-tip contacts on paper. It is derived by dividing the total on paper time spent on the task by the total time including the halting time in the air.
(4)
On-paper length ratio (OPLR) This parameter measures the proportion of stroke trajectory length on the paper. It is derived by dividing the total on paper trajectory length by the total pen tip trajectory length including the wandering length in the air.
(5)
The number of velocity peaks per stroke (NVP) This measure is one of the kinematic features. The number of vertical (or horizontal) velocity peaks for every vertical (or horizontal) stroke was determined as quantification of automation of handwriting movement. The NVP per stroke decreases as the movement becomes automatic; otherwise, the movement remains deliberate. When movement is fully automated, the ideal NVP per stroke is 1 per vertical or horizontal stroke; a decrease in the number indicates a switch from closed feedback control to open-loop control^7,26.
(6)
Mean peak velocity (MPV) This measures one of the kinematic features. The maximum value of the tangential velocity per stroke was first determined, after which the mean peak velocity per trial was determined by averaging the values across the strokes. The mean peak velocity per condition was determined for every participant by averaging the values of all trials per condition.
(7)
Axial pen force (APF) This measures the kinetic feature. Since force recorded at the start and end of a stroke has large variation, it cannot represent the valid exerted force. For avoiding the effect of this variation, the axial pen force only in the middle 80% of a stroke (ex. middle 4 mm of a 5-mm stroke) was recorded¹⁶. The mean value was determined by dividing the sum of these values by the total number of sampling points in a task.

Data analyses

The collected data were loaded into the computer for further analysis. SPSS for Windows EC (Version 18.0) was applied in the analysis of these data. The test data were analyzed by exploratory factor analysis. Principal component analysis (PCA) with varimax orthogonal rotation was used to explore the main dimensions these 12 measures converge. For realizing the internal struct of the measure items, Pearson correlation analyses were conducted to examine the intercorrelation among the twelve measures.

Reliability tests

We examined the consistency of the measurement variables using Cronbach’s α²⁷. Cronbach’s α was calculated as an index of the internal reliability of the measurement system. To test the measuring results' consistency, every copy test trial (copying 30 characters) was split into three sections. Every section contained the process of copying 10 characters. A reliability coefficient across the three sections was obtained for each variable from the 242 typically developing children. High inter-trial coefficients would indicate that the trials are measuring the same underlying construct.

Statistical analyses

For the discriminative validity of the evaluation system, a series of two-way analysis of variance (ANOVA) tests were conducted to test the significance of main effects of the groups (typical development and handwriting deficit) and grades (1, 2, and 3) and their interaction effect on the handwriting measures. A series of independent t-tests were also performed to compare the measures between two genders. The statistical significance level is set at 0.05.

Ethical approval

All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards. The Institutional Review Board of the E-DA Hospital approved the study (EDAH IRB No. EMRP14107N).

Informed consent

All participants and their parents gave written informed consents. All the consents were documented and witnessed by the Institutional Review Board of the E-DA Hospital. All possible harms or unintended effects in each group were depicted in the consent.

Results

Factor analysis

Factor analyses were conducted on the 12 test measures. With a large sample size (N = 611) and a high subjects-to-variables ratio (53.4), it is acceptable in predicting important outcomes in exploratory factor analysis²⁷. The KMO value is 0.528 (> 0.5). It supports the sampling adequacy for factorability as suggested by Kaiser²⁸. Bartlett's test of sphericity reaches statistical significance (p < 0.001). It indicates the correlation matrix is not an identity and supports the factor analysis²⁹. After varimax orthogonal rotation, five major components with eigenvalues larger than 1 were extracted (Table 2). From the largest to the smallest, the eigenvalues were 3.303, 2.036, 1.831, 1.432, and 1.158. Factor loadings larger than 0.5 were observed. The five factors accounted for 81.33% of the total variance. The first dimension contains 3 measures (Nos. 2, 3, 6) relevant to handwriting performance (correct writing speed, legibility, and accuracy of stroke order). The second dimension contains the measures (Nos. 4 and 5) relevant to the character size control and the average of peak velocity (No. 11). It is related to motor control in handwriting. The third dimension contains two measures relevant to the speed to complete writing characters (No. 1 and 2) and the movement automation (No. 10) in stroke production. The fourth dimension contains the time measure of the pen halting on paper (No. 7) and the axial pen tip force (No. 12). The fifth dimension contains two measures (Nos. 8–9) relevant to the proportion of time spent and trajectory length really on paper.

Table 2 Factor analysis: measures and factor loadings (N = 641).

Full size table

Table 3 shows the intercorrelation among the testing measures. Some of the measures were found to correlate with others indicating the interrelated metrics. The measures of writing speed (nos. 1 and 2) were found to significantly correlate with all the others except for character size consistency. The measure in the proportion of trajectory length on paper was also found to significantly correlate with all others except the intra-character configuration. With the least other measures, the character size consistency was found to correlate with only five other measures. A similar result in the intra-character configuration was also found to correlate with six other measures.

Table 3 Cross correlation among the computerized measures (N = 641).

Full size table

Instrument reliability

Table 4 shows the results of the instrument reliability testing where each of the replications is considered an independent test. All of the computerized handwriting measures exhibited high repeatability (with Cronbach’s α coefficients ranging from 0.751 to 0.968). The relatively low coefficient for the On-paper length ratio (0.751) indicates that different parts of the copying target with different complexities may exhibit varied trajectory lengths of pen movement in the air.

Table 4 Reliability estimates across trial replications for the evaluation variables for 242 children.

Full size table

Discriminative validity

Table 5 shows the result of comparisons between children with typical development and those with handwriting difficulties among the three grades. All measures show significant differences between the two groups indicating the discriminative validity for identifying handwriting deficits. All measures except mean peak velocity showed significant main effects from the grade factor. Seven measures showed significant interaction effects indicating the different trends across the grades between TD and HWD groups. Further post hoc tests on the TD group found significant differences between adjacent grades in the measures pertaining to performance, intra- and inter-character configuration (legibility), and the temporospatial variables. The other six measures did not display significant differences between grade 2 and grade 3. It includes the measures of stroke placement, stroke order accuracy, and the temporospatial features related to movement automation and motor control. However, all these measures display the ongoing progress of these skills from grade 1 through grade 3 in children with typical development.

Table 5 The comparison of testing measures among three grades and two groups.

Full size table

Table 6 shows the result of comparisons between girls and boys. The girls show better performance in all the measures. Five measures were found to reach statistical significance, it includes correct writing speed, character control, stroke order, and the axial pen force. It includes the measures of stroke placement, stroke order accuracy, and the temporospatial features related to movement automation and motor control.

Table 6 The comparison of testing measures between genders.

Full size table

Discussion

The result of this study confirmed the reliability and validity of the comprehensive computerized evaluation of Chinese handwriting. There are five major dimensions revealed for the evaluation of children from grades 1 to 3. The first three dimensions display the measures of handwriting performance. Dimension 1 includes the measures directly related to academic learning (correctness, legibility, and stroke order). Dimension 2 includes the measures related to motor control (character size control and peak stroke velocity). Dimension 3 is composed of measures displaying the automation of handwriting (copying speed and number of velocity peaks). Dimension 4 shows the interrelation between kinetic and kinematic measures. A greater force applied results in much slower pen tip stroke movement (as reflected by a much greater pause ratio). Dimension 5 is the measure of the on-paper to in-air ratio. It measures the temporal and spatial efficiency of handwriting.

For measuring the general handwriting performance, this study presented descriptive statistics of computerized handwriting measures in school children from grade 1 to grade 3 and the discriminative validity for identifying handwriting deficit. Children with handwriting difficulties showed significant impairments in all measures. In addition to the traditionally kinematic and kinetics results, this study showed the result of structural deficits in children with handwriting difficulties. The strength of this evaluation system is not only the performance deficit can be identified but the process of handwriting can also be provided to clinicians or parents of the children. The general handwriting performance was found significantly improved across the three grades. Gender differences can be found in five measures. Girls showed significantly better performance than boys in all measures of Dimension 1 (CWS, ICC, and ASO) and CSC in Dimension 2. They also showed increased force exerted on the paper (APF) than boys. With reference to the norm of these data, the differences between genders should be considered.

Up to date, pen-and-paper tests and observation scales are the most commonly used handwriting evaluation in school or clinical settings. These evaluations are conducted by eye-ball check which is subjective and difficult to be quantified. For providing quantitative measures, the present computerized system is not only able to count the number of completed characters but also the number of characters being correctly written. This is the most challenging process for a computerized system since there are no standard criteria available even for eye-ball checks. For example, the character “夫” is different from “天” because the middle strokes start from different heights. For correct writing of the later character, what is the minimum height of the starting position is acceptable? In addition, all the geometric relations of any stroke with other ones are related to the judgment of accurate writing. In the present system, the criteria for judgment are the correctness of all strokes in a character.

Since Chinese characters are composed of distinct strokes. Correct writing is defined as all the strokes are correctly placed with their corresponding positions, lengths, widths, heights, and directions. In a previous study for the evaluation of fine motor function, there was a new attempt to score the legibility of Chinese handwriting. It has been used for detecting the early sign of fine motor deficit in persons with mild cognitive impairment²⁴. The evaluation tool proposed in this study is a novel implementation on the legibility assessment for children in early school age. From the result of this study, children in grade 2 seem to reach a certain level of the stroke placement control whereas the character size and its variation seem to decrease persistently to grade 3.

To evaluate the motor control of handwriting, the present system includes the assessment specific to movement automation and movement control. Our previous study revealed slower stroke velocities in children with DCD than in those without DCD. The mean peak stroke velocity (MPV) can reflect the fluency of pen movement¹⁶. Another previous study also found increased pause time per stroke and an increased number of velocity peaks in children with dysgraphia. These two measures (PPTR and NVP) can reflect the automation of stroke formation in clinical handwriting evaluation¹³.

With a standardized process, the pen pause time ratio of pen-tip movement (PPTR) reflects the movement smoothness. Pauses during handwriting have been proposed to be greatly affected by linguistic processes⁴. As revealed from the result of this study, the pen-tip movement in strokes of handwriting becomes more mature as the increase of age. In the sample of typically-developing children in this study, the significant difference was only found between grades 1 and 2 but not 2 and 3. It indicates that the change of this measure may reach a plateau around the age of grade 2.

In the measure of legibility, different from the alphabetical language system, the key point of Chinese character is the spatial relationship of strokes within a character. The tempo and rhythm of stroke placement with an ideal stroke sequence form fluent and legible handwriting. The present system is the first evaluation including stroke order evaluation. This is a particular evaluation that may be only found in the logographic language system. Being included in Dimension 1, this measure shows its relevance in handwriting performance evaluation. As revealed from the results (Table 3), it is found highly correlated with intra-character configuration (r = 0.815). Children with higher ratios of correct stroke sequence exhibited higher scores in intra-character configuration. It implicates an important factor determining Chinese handwriting legibility.

For the progress of handwriting, there are two parameters showing the halt of the progress in the air: the accumulation of “In air” time and the “In air” length. These two parameters show different meanings. The “In air” length indicates the seeking path but the “In air” time may just reflect the halting time. A previous study found that the “In air” time measure may supply information about the perceptual aspect of the motor act^4,30,31. By comparing with normal controls, the “in air” information in handwriting shows the applicability in the diagnosis of neurodegenerative disease (Parkinson’s and Alzheimer’s disease)³¹. It has been considered to relate to the difficulty with motor memory for letter formation or difficulty in visualizing the letters as needed to form them rapidly⁹. This study reveals these two parameters form the fifth dimension of the evaluation system. They were also found significantly correlated with the other dynamic measures. The significant differences across the three grades indicate the progressive improvement in the early school age (grades 1 to 3).

In the kinetic analysis of handwriting movement, poor writers have been observed to have higher pressure exerted on the writing surface during their writing tasks⁶. However, different results were reported by Rosenblum and Livneh-Zirinski in both the name writing and the paragraph copying tasks, the children with DCD exerted significantly lower mean pressure in comparison to the control children³². Our past work also found a dramatic decrease in the axial pen pressure in children with DCD when they were copying complex characters¹⁶. The present study showed a similar result to our past work and that of Rosenblum and Livneh-Zirinski³². Children with handwriting difficulties were found to exert less pressure than typically-developing children. In the typically-developing group, the axial pen force was found to increase with the increase of age but only between grades 1 and 2. Further study is needed to confirm at what age the increase of pen tip pressure stops.

Suggestions for the application of this system

This study proposed a novel computerized handwriting evaluation system for screening children with handwriting difficulties. Since significant differences between genders were found in correct writing speed, intra-character configuration, character size consistency, the accuracy of stroke order, and axial pen force, the score conversion of these measures are suggested to be different between girls and boys. However, as standardized norm references for future clinical applications, more data on participants such as information about cognitive abilities, socioeconomic status, and other relevant variables should be provided. In the present stage, it can be utilized as a screening for handwriting deficits or as a test to add valuable information during clinical or educational assessments.

Limitations

Some limitations should be noted. The first is the pen used cannot be their usually used pencil. For this limitation, future study is suggested to develop a system equipped with pencils used in primary school instead of the currently used ballpoint pen.

The second limitation is that the evaluation of characters or paragraph alignment control is not available in the present study. There are grid lines on the test sheet. It may refrain to detect poor alignment control when no grid lines are provided. Further study and the evaluation design may have tasks without grid lines.

Conclusion

This computerized test provides an on-site realistic evaluation and a quantitative report of the quality of the handwriting product. The focus was on the assessment of a naturalistic (and therefore ecologically valid) handwriting task, which is easy to gather from large numbers of children in the usual classroom setting. Through the computerized analysis, the accuracy and legibility can be scored by the computer instead of the eye-ball check. The development of a computerized evaluation presented in this paper shows the relevance of providing objective assessment results to the related team members for the plan or discussion of an effective intervention. For educators, clinicians and researchers, this evaluation system can provide greater insight into the motor, perceptual and cognitive components underlying poor handwriting. The current results open new opportunities for screening and detection of children with handwriting difficulty in the classroom.

Data availability

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

References

Karlsdottir, R. & Stefansson, T. Problems in developing functional handwriting. Percept. Mot. Skills 94(2), 623–662 (2002).
Article Google Scholar
Van Waelvelde, H. et al. Effectiveness of a self-regulated remedial program for handwriting difficulties. Scand. J. Occup. Ther. 24(5), 311–319 (2017).
Article Google Scholar
Graham, S. et al. Development of handwriting speed and legibility in grades 1–9. J. Educ. Res. 92(1), 42–52 (1998).
Article Google Scholar
Afonso, O. & Álvarez, C. J. Measuring Writing Durations in Handwriting Research: What Do They Tell Us About the Spelling Process? Spelling and Writing Words 151–162 (Brill, 2019).
Google Scholar
Lam, S. S. et al. Chinese handwriting performance of primary school children with dyslexia. Res. Dev. Disabil. 32(5), 1745–1756 (2011).
Article ADS Google Scholar
Di Brina, C. et al. Dynamic time warping: A new method in the study of poor handwriting. Hum. Mov. Sci. 27(2), 242–255 (2008).
Article Google Scholar
Mergl, R. et al. Digitized analysis of handwriting and drawing movements in healthy subjects: Methods, results and perspectives. J. Neurosci. Methods 90(2), 157–169 (1999).
Article CAS Google Scholar
Marquardt, C. & Mai, N. A computational procedure for movement analysis in handwriting. J. Neurosci. Methods 52(1), 39–45 (1994).
Article CAS Google Scholar
Rosenblum, S., Parush, S. & Weiss, P. L. Computerized temporal handwriting characteristics of proficient and non-proficient handwriters. Am. J. Occup. Ther. 57(2), 129–138 (2003).
Article Google Scholar
Dirlikov, B. et al. Novel automated morphometric and kinematic handwriting assessment: A validity study in children with ASD and ADHD. J. Occup. Ther. Schools Early Interv. 10(2), 185–201 (2017).
Article Google Scholar
Plamondon, R. A kinematic theory of rapid human movements: Part I Movement representation and generation. Biol. Cybern. 72(4), 295–307 (1995).
Article CAS Google Scholar
Wann, J. & Jones, J. Space-time invariance in handwriting: Contrasts between primary school children displaying advanced or retarded handwriting acquisition. Hum. Mov. Sci. 5(3), 275–296 (1986).
Article Google Scholar
Chang, S.-H. & Yu, N.-Y. Handwriting movement analyses comparing first and second graders with normal or dysgraphic characteristics. Res. Dev. Disabil. 34(9), 2433–2441 (2013).
Article Google Scholar
Li-Tsang, C. W. et al. Handwriting assessment to distinguish comorbid learning difficulties from attention deficit hyperactivity disorder in C hinese adolescents: A case–control study. Int. J. Methods Psychiatr. Res. 27(4), e1718 (2018).
Article Google Scholar
Li-Tsang, C. W. et al. Handwriting characteristics among secondary students with and without physical disabilities: A study with a computerized tool. Res. Dev. Disabil. 32(1), 207–216 (2011).
Article Google Scholar
Chang, S.-H. & Yu, N. Y. Characterization of motor control in handwriting difficulties in children with or without developmental coordination disorder. Dev. Med. Child Neurol. 52(3), 244–250 (2010).
Article MathSciNet Google Scholar
Gargot, T. et al. Acquisition of handwriting in children with and without dysgraphia: A computational approach. PLoS ONE 15(9), e0237575 (2020).
Article CAS Google Scholar
Wu, Z., Lin, T. & Li, M. Automated detection of children at risk of chinese handwriting difficulties using handwriting process information: An exploratory study. IEICE Trans. Inf. Syst. 102(1), 147–155 (2019).
Article MathSciNet Google Scholar
Chang, S.-H., Yu, N.-Y. & Shie, J.-J. The preliminary development of computer-assisted assessment of Chinese handwriting performance. Percept. Mot. Skills 108(3), 887–904 (2009).
Article Google Scholar
Hu, Z. et al. A Chinese handwriting education system with automatic error detection. J. Softw. 4(2), 101–107 (2009).
Article Google Scholar
Li-Tsang, C. W. et al. Validation of the Chinese handwriting analysis system (CHAS) for primary school students in Hong Kong. Res. Dev. Disabil. 34(9), 2872–2883 (2013).
Article Google Scholar
Hammerschmidt, S. L. & Sudsawad, P. Teachers’ survey on problems with handwriting: Referral, evaluation, and outcomes. Am. J. Occup. Ther. 58(2), 185–192 (2004).
Article Google Scholar
Chang, S.-H. & Yu, N.-Y. Evaluation and classification of types of Chinese handwriting deficits in elementary schoolchildren. Percept. Mot. Skills 101(2), 631–647 (2005).
Article Google Scholar
Yu, N.-Y. & Chang, S.-H. Characterization of the fine motor problems in patients with cognitive dysfunction: A computerized handwriting analysis. Hum. Mov. Sci. 65, 71–79 (2019).
Article Google Scholar
Willmott, C. J. & Matsuura, K. Advantages of the mean absolute error (MAE) over the root mean square error (RMSE) in assessing average model performance. Clim. Res. 30(1), 79–82 (2005).
Article Google Scholar
Khalid, P. I., Yunus, J. & Adnan, R. Extraction of dynamic features from hand drawn data for the identification of children with handwriting difficulty. Res. Dev. Disabil. 31(1), 256–262 (2010).
Article Google Scholar
Osborne, J. W. & Costello, A. B. Sample size and subject to item ratio in principal components analysis. Pract. Assess. Res. Eval. 9(1), 11 (2004).
Google Scholar
Kaiser, H. F. An index of factorial simplicity. Psychometrika 39(1), 31–36 (1974).
Article Google Scholar
Stewart, D. W. The application and misapplication of factor analysis in marketing research. J. Mark. Res. 18(1), 51–62 (1981).
Article Google Scholar
Werner, P. et al. Handwriting process variables discriminating mild Alzheimer’s disease and mild cognitive impairment. J. Gerontol. B Psychol. Sci. Soc. Sci. 61(4), P228–P236 (2006).
Article Google Scholar
Alonso-Martinez, C., Faundez-Zanuy, M. & Mekyska, J. A comparative study of in-air trajectories at short and long distances in online handwriting. Cogn. Comput. 9(5), 712–720 (2017).
Article Google Scholar
Rosenblum, S. & Livneh-Zirinski, M. Handwriting process and product characteristics of children diagnosed with developmental coordination disorder. Hum. Mov. Sci. 27(2), 200–214 (2008).
Article Google Scholar

Download references

Acknowledgements

The authors would like to thank the Minister of Science and Technology of the Republic of China and I-Shou University for supporting this work financially under Contract Nos. MOST-106-2221-E-21-022 and ISU-108-01-15A.

Author information

Authors and Affiliations

Department of Occupational Therapy, I-Shou University, Kaohsiung City, Taiwan
Shao-Hsia Chang
Department of Physical Therapy, I-Shou University, No. 8, Yida Rd., Jiaosu Village Yanchao District, Kaohsiung City, 82445, Taiwan
Nan-Ying Yu

Authors

Shao-Hsia Chang
View author publications
You can also search for this author in PubMed Google Scholar
Nan-Ying Yu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.C. planned the research project, conducted the experiment, and wrote the main part of the manuscript text. N.Y. collected data, run the statistics, and wrote part of the results and discussion.

Corresponding author

Correspondence to Nan-Ying Yu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Chang, SH., Yu, NY. Computerized handwriting evaluation and statistical reports for children in the age of primary school. Sci Rep 12, 15675 (2022). https://doi.org/10.1038/s41598-022-19913-y

Download citation

Received: 04 December 2021
Accepted: 06 September 2022
Published: 19 September 2022
DOI: https://doi.org/10.1038/s41598-022-19913-y

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Extending the Spectrum of Dysgraphia: A Data Driven Strategy to Estimate Handwriting Quality

Phenotyping features in the genesis of pre-scriptural gestures in children to assess handwriting developmental levels

The transferability of handwriting skills: from the Cyrillic to the Latin alphabet

Introduction

Methods

Participants

Instruments and apparatus

Copying target

Measuring parameters

Performance measures

Geometric measures (legibility)

Temporospatial measures

Data analyses

Reliability tests

Statistical analyses

Ethical approval

Informed consent

Results

Factor analysis

Instrument reliability

Discriminative validity

Discussion

Suggestions for the application of this system

Limitations

Conclusion

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links