Perpetrator pose reinstatement during a lineup test increases discrimination accuracy

Colloff, Melissa F.; Seale-Carlisle, Travis M.; Karoğlu, Nilda; Rockey, James C.; Smith, Harriet M. J.; Smith, Lisa; Maltby, John; Yaremenko, Sergii; Flowe, Heather D.

doi:10.1038/s41598-021-92509-0

Download PDF

Article
Open access
Published: 09 July 2021

Perpetrator pose reinstatement during a lineup test increases discrimination accuracy

Melissa F. Colloff¹,
Travis M. Seale-Carlisle^1,2,
Nilda Karoğlu³,
James C. Rockey⁴,
Harriet M. J. Smith⁵,
Lisa Smith⁶,
John Maltby⁷,
Sergii Yaremenko^1,8 &
…
Heather D. Flowe¹

Scientific Reports volume 11, Article number: 13830 (2021) Cite this article

2620 Accesses
5 Citations
130 Altmetric
Metrics details

Subjects

Abstract

We examined how encoding view influences the information that is stored in and retrieved from memory during an eyewitness identification task. Participants watched a mock crime and we varied the angle from which they viewed the perpetrator. In Experiment 1, participants (N = 2904) were tested with a static photo lineup; the viewing angle of the lineup members was the same or different from the perpetrator at encoding. In Experiment 2, participants (N = 1430) were tested with a novel interactive lineup in which they could rotate the lineup faces into any angle. In both experiments, discrimination accuracy was greater when the viewing angle at encoding and test matched. Participants reinstated the angle of the interactive faces to match their encoding angle. Our results highlight the importance of encoding specificity for eyewitness identification, and show that people actively seek out information in the testing environment that matches the study environment to aid memory retrieval.

A validation of the two-high threshold eyewitness identification model by reanalyzing published data

Article Open access 04 August 2022

A perceptual scaling approach to eyewitness identification

Article Open access 14 July 2020

Eyewitness identification performance is not affected by time-of-day optimality

Article Open access 10 February 2021

Worldwide, witnesses are given lineups to help the police identify criminal perpetrators. A lineup contains the police suspect—who may or may not be the perpetrator—embedded among ‘fillers’, who are individuals who look similar to the police suspect and are known by the police to be innocent. The goal of the eyewitness is to identify the perpetrator if the perpetrator is present in the lineup (known as a correct identification) or to identify no one if the perpetrator is absent from the lineup (known as a correct rejection). The ability of the eyewitness to distinguish the presence or absence of the perpetrator is known as discrimination accuracy. In many countries (e.g., the US, Germany, Canada, Australia), lineups consist of static photographs^{1, 2}. Lineup members are shown from the shoulders up, facing forward, even if a witness viewed the perpetrator from a different angle at the time of the crime (e.g., saw only their profile view). The National Academy of Sciences recently called for the development of new technology to improve lineup identification accuracy³. In this paper, we heed this call and examine whether discrimination accuracy can be improved by enabling witnesses to see the lineup faces from the same angle that the perpetrator’s face was seen during the crime. We also introduce a novel interactive lineup procedure to test whether during the lineup people spontaneously reinstate the angle at which they saw the perpetrator, and if so, whether pose-reinstatement is associated with increased discrimination accuracy.

There are good reasons to predict that discrimination accuracy will be higher if witnesses can view the lineup faces from the same angle that they studied the perpetrator. One of the most influential principles of human memory—encoding specificity—holds that the correspondence in the context in which memories are acquired and retrieved is a powerful determinant of memory accuracy⁴. One example is that divers, who learnt words either on dry land or in water, were better able to recall words if they were tested in the same environment as they had studied the words compared to the alternative environment⁵. The notion that the match between encoding and retrieval is important is also central to other key concepts in memory theory, such as the transfer-appropriate processing framework⁶, and the proceduralist approach, which assumes that encoding operations are re-enacted during remembering⁷. These principles predict that greater overlap of cues present at encoding and test, such as a high correspondence between viewing angle at encoding and test, lead to better memory performance.

In accordance with these memory principles, a wealth of face recognition research shows that similarity across study-test viewing angle improves recognition accuracy^8,9,10. At a 30-degree study-test difference, the performance cost plateaus, and at 45 degrees, recognition performance is impaired significantly¹¹. Neurophysiological studies also indicate pose-sensitivity in cortical regions¹². In the context of police lineups, overlapping cues at learning and test, such as the quantity of facial information available at encoding versus test (i.e., internal portion of faces versus full faces)¹³, clothing¹⁴, and disguises¹⁵ boost discrimination accuracy. Context reinstatement¹⁶ and alcohol state-dependent learning¹⁷ effects have also been reported in the eyewitness literature. Together, this research supports the encoding specificity principle, whereby overlapping cues at learning and test facilitate accurate memory retrieval.

What is not yet clear, however, is whether the encoding specificity principle generalises to conditions in which people encode viewpoint information and use it during a lineup identification test. Face recognition paradigms employ numerous study-test trials. Each study trial presents an individual face, with pose typically varying across faces. In these studies, pose is therefore a key distinguishing feature, so participants may attend to and encode pose information more than they otherwise would in a more naturalistic context¹⁸. Indeed, some research suggests that in more naturalistic contexts, the correspondence between study and test pose does not always enhance recognition accuracy, and faces at certain angles (i.e., the profile view) are difficult to learn and recognize^{10, 19}. Null findings regarding the overlap of pose at learning and test have also been reported in the eyewitness literature. A study that varied whether a perpetrator was studied at eye level versus from overhead found that recognition accuracy was not increased by having matching information at the lineup test. However, this study was likely underpowered²⁰. Therefore, it is important to test if people benefit from consistent viewpoint information at study and at test, and under conditions that are akin to real life, such as an eyewitness identification task.

Moreover, research has not yet determined if, during memory retrieval, humans actively seek out information at test that matches the study environment as an aide-mémoire. Most memory paradigms experimentally manipulate the degree of overlap between the cues present at encoding and at test, with some participants allocated to experience overlap and others not⁵. But would, for example, divers who studied underwater be more inclined to jump back into the pool at test to reinstate study context to aid their memory retrieval compared to those who studied on land? Here, we test if participants naturally seek out cues at test that correspond with information learned at study, using an eyewitness identification task.

To do so, we developed an interactive lineup procedure, wherein each lineup face can be rotated along the vertical axis. This enables the witness to dynamically view the lineup faces from − 90° to 90° and hold the faces in any pose desired (see https://tinyurl.com/t4nc9gp). If participants reinstate pose at test—by naturally rotating the lineup faces into the same angle from which they saw the perpetrator commit the crime, then this suggests people encode viewpoint information, and seek (consciously or unconsciously) to make use of overlapping cues gleaned through pose reinstatement. Further, if accurate participants reinstate perpetrator pose to a greater degree than inaccurate participants, this suggests pose information is a valuable retrieval cue for eyewitnesses.

To summarise, we ask (1) whether consistent viewpoint information at study and at test is associated with higher discrimination accuracy; and (2) whether people naturally reinstate at test the pose in which they had viewed a perpetrator if given the opportunity to do so with an interactive lineup.

Experiment 1

In Experiment 1, we tested whether consistent viewpoint information at study and at test improved accuracy on a lineup identification task. Based on the encoding specificity principle and the existing face recognition literature, we predicted that discrimination accuracy would be higher when participants had available at test the same pose information that they encoded compared to when they had different pose information (pose-reinstatement hypothesis). We were also interested whether discrimination accuracy would be higher when participants had available at test the same pose information that they encoded as opposed to different pose information, and the highest when they had the same pose information plus an additional unstudied pose. Specifically, a geometric representation of the face can be constructed from the frontal and profile views of the face²¹. Indeed, the first police system to catalogue faces for criminal identification was developed by Alphonse Bertillon, and it photographically described arrestees using the profile and frontal view. Seeing a face from more than one angle is thought to be useful for building a representation of the face’s three-dimensional structure, and knowing the three-dimensional structure of a face can provide additional cues that boost discrimination accuracy^{9, 22}. In Experiment 1, we tested whether seeing the lineup faces from the angle in which the perpetrator was studied improved discrimination (pose reinstatement hypothesis) compared to when such information was not available, and (b) whether having the same pose information plus additional unstudied information about the face at test boosted discrimination accuracy the most. We pre-registered our hypotheses and analysis plan before we collected data (https://osf.io/vs48c).

Methods

All methods were carried out in accordance with relevant guidelines and regulations.

Design

We used a 2 (perpetrator encoding pose: front, right-profile) × 3 (lineup member pose: front, right-profile, front and right-profile) × 2 (target: present, absent) between-subjects design.

Participants

Participants (N = 3021) were recruited using Amazon Mechanical Turk; they were remunerated 0.40 cents to take part in the experiment, which took 5 min to complete. Participants who incorrectly answered the validation question or experienced a technical issue (n = 117) were excluded, leaving a total of 2,904 participants (42% female; 18–76 years old, M = 37.46, SD = 11.95 years; 72% Caucasian, 10% Black or African American, 6% Hispanic, 5% East Asian, 2% South Asian, 2% Other, and 3% prefer not to say). Our data-collection stopping rule was to recruit 3,000 participants—250 in each of the between-subjects conditions. Using the mean difference and SDs observed in Mickes, Flowe, and Wixted, 2012²³ as a guide, a power analysis indicated that, with 250 subjects per between-subjects condition, power would exceed 80%. The research was reviewed according to the University of Birmingham Science, Technology, Engineering and Mathematics Ethical Review Committee. Informed consent was obtained from all subjects.

Materials

We filmed a mock crime of a Caucasian male perpetrator aged in his 30 s stealing a handbag from behind a female victim. We filmed three versions of the crime: the perpetrator was shown from the front, left-profile, or the right-profile. The whole crime was 14 s in length and the perpetrator’s face was in view for approximately 8 s. For Experiment 1, we used only the front and right-profile videos.

We recruited 9 members of the public to be the lineup fillers. Following recommended practice, the fillers were individuals who matched the physical appearance (i.e., age, build, gender, skin tone, hair color, eye color, facial hair, hairstyle) of the perpetrator from the video^{24, 25}. We took static photographs of each lineup member from the shoulders up showing him facing directly towards the camera in frontal view, and showing him turning away from the camera in profile view (see Fig. 1). Target-absent lineups contained all 9 fillers. Target-absent lineups are akin to the real-life scenario in which the police have apprehended a suspect who matches the description of the perpetrator, but is innocent. Target-present lineups contained the perpetrator and 8 fillers.

To check that our lineup fillers were plausible alternatives to the suspect, we conducted a mock-witness test. First, we asked 10 independent observers to describe the appearance of the perpetrator while looking at the perpetrator’s photograph to create a modal description. We presented a different group of participants acting as mock-witnesses (N = 80) with the description of the perpetrator followed by a frontal target-present lineup or target-absent lineup. The mock-witnesses did not view a crime or a to-be-remembered face; they simply had to select the lineup member who best fit the modal description that we provided for them²⁵. The perpetrator was identified 7% of the time in the target-present lineup, a rate that did not exceed chance (11%, p > 0.05) and the most frequently chosen lineup member was identified 20% of the time in the target-absent lineup, a rate that did not significantly exceed chance expectation. We also calculated E’²⁶, a measure of effective size, which assesses the number of lineup members that are effective at drawing mock-witness choices²⁷. Effective size was 6.50 (95% CI [4.92, 9.61]) in the target-present lineup and 6.76 (95% CI [5.57, 8.60]) in the target-absent lineup. These values are appreciably larger than the effective size of lineups reported in field research on UK police lineups (e.g., Valentine & Heaton, 1999, reported a mean effective size of 4.24 (SE = 0.31) for photo lineups, and 4.46 (SE = 0.32) for video lineups)²⁸. Together, the mock-witness test illustrated that our lineups were perceptually fair, based on the description of the perpetrator.

Procedure

At the start of the experiment, participants were asked a number of demographic questions (age, sex, ethnicity/race). Then participants watched either the front or right-profile mock-crime video. We told participants to pay attention because they would be asked questions about the video later. Next, participants watched a distractor cartoon for 1 min 11 s and attempted to solve anagrams for a further 2 min. We asked participants if they had experienced any technical problems when viewing the video. Following this, participants were told that they would view a lineup and their task was to try and recognize the perpetrator from the mock-crime video. In line with recommended police practice, participants were told that the perpetrator may or may not be present in the lineup²⁴.

Next, the lineup was displayed. The lineup was administered sequentially (i.e., one lineup member was shown at a time). We experimentally manipulated the angle from which the lineup members were shown (see Fig. 1): In the front condition, the lineup members were shown exclusively from the front, whereas in the right-profile condition, they were shown exclusive in right-profile. In the front and right-profile condition, the lineup members were shown from the front as well as in the right-profile, and for each lineup member, the front and right-profile images were shown simultaneously. The order in which the lineup members were presented was randomly determined for each participant. Each lineup member was accompanied by a number corresponding to their position in the lineup (1–9). We asked participants to write down the number of the lineup member they believed to be the perpetrator, if they believed the perpetrator was present in the lineup. Participants saw each face only once and could not review previously seen faces after they had advanced to the next lineup member. Participants could view each lineup member for any length of time desired and once they were finished viewing each member, they pressed a “next” button. After viewing the nine lineup members, participants made an identification decision by selecting a number (1–9), or indicating that the perpetrator was “Not Present.” All participants were asked to rate their confidence in their response on an 11-point Likert-type scale ranging from 0% (not at all sure) to 100% (completely certain). Finally, at the end of the experiment, participants were asked if they had experienced any technical problems while viewing the lineup images, and to select from a drop-down menu the type of crime shown in the video as a manipulation check.

Conference presentation

Sections of these data were presented by Heather D. Flowe at the Society for Applied Research in Memory and Cognition (June, 2019), Cape Cod, Massachusetts, The United States.

Results

Our data are available (https://osf.io/jm2k9/). Recall that participants were randomly assigned to encode the perpetrator from the front (n = 1449) or the right-profile (n = 1455). After watching the mock crime video, participants were randomly assigned to a lineup condition, with 969 viewing the front lineup (480 viewed a target-present lineup and 489 viewed a target-absent lineup), 975 the right-profile lineup (487 viewed a target-present lineup and 488 viewed a target-absent lineup), and 960 the front + right-profile lineup (493 viewed a target-present lineup and 467 viewed a target-absent lineup). For analysis, we combined over perpetrator encoding pose and lineup member pose conditions to create same-pose, different-pose, and same + additional pose conditions, as per our OSF pre-registration. This allowed us to test the pose-reinstatement hypothesis, and whether having the same pose information plus additional unstudied information about the face boosted discrimination accuracy the most. The numbers of participants in each pose condition by experimental group are given in Table 1.

Table 1 Frequencies of participants in each pose condition by encoding group and lineup type in Experiment 1.

Full size table

Response frequencies for perpetrator, filler and reject (i.e., “Not Present”) decisions at every level of confidence for the same-pose, different-pose, and same + additional pose conditions are displayed in Table 2. The overall correct ID rate of the perpetrator (displayed in the proportion row in Table 2) is equal to the total number of perpetrator IDs from target-present lineups divided by the number of target-present lineups, in each condition. Because there was not a designated innocent suspect, the number of innocent suspect IDs in target-absent lineups was estimated using the total number of filler IDs from target-absent lineups. Following standard practice²⁹, this estimate was obtained by dividing the number of target-absent filler IDs by the number of lineup members (i.e., 9). That estimated value was then divided by the number of target-absent lineups to estimate the false ID rate in each condition. The overall correct ID rates were 0.66, 0.49, and 0.65 for the same-pose, different-pose, and same + additional pose conditions, respectively. The corresponding overall false ID rates were 0.04, 0.05, and 0.04 for the same-pose, different-pose, and same + additional pose conditions, respectively. Thus, even without performing any additional analyses, it is clear that those in the same and same + additional pose conditions performed better than those in the different pose condition, as predicted by the pose-reinstatement hypothesis.

Table 2 Frequencies of perpetrator, filler and reject identification decisions by pose condition in Experiment 1.

Full size table

ROC analysis

We conducted ROC analysis to measure participants’ collective ability to discriminate between perpetrators and innocent suspects. Figure 2 shows the ROC curves for the same-pose, different-pose, and same + additional-pose conditions (see Mickes et al., 2012, for a tutorial)²³. It is apparent that those in the same-pose and same + additional pose conditions discriminated perpetrators from innocent suspects better than those in the different-pose condition. Partial area under the curve (pAUC) values were computed using a target-absent filler ID cut-off (i.e., specificity) of 0.618 with the statistical package pROC³⁰. We used a Bonferroni-corrected alpha level of 0.017. As predicted by the pose-reinstatement hypothesis, the pAUC for the same-pose condition (0.186) was significantly greater than the pAUC for the different-pose condition (0.117), D = 5.43, p < 0.001. The pAUC for the same + additional pose condition (0.194) was also greater than the pAUC for the different-pose condition (0.117), D = 6.29, p < 0.001. Although the same + additional pose condition yielded a slightly higher pAUC (0.194) than the same-pose condition (0.186), that difference was not statistically significant, D = 0.639, p = 0.523.

We conducted additional ROC analyses to study pose reinstatement effects in each perpetrator encoding pose condition (see Supplementary Appendix A). The results indicated that discrimination accuracy was higher when the lineup members could be seen in the same pose compared to when they were seen in a different pose in all encoding conditions, albeit the effect was more reliable in the frontal pose than in the right-profile condition. The increase in discrimination accuracy appears larger in the frontal compared to the profile encoding condition (see Supplementary Fig. A1).

We also fit a signal-detection model to the data which accounts for all identification decisions (see Supplementary Appendix B) and the results agreed with the pAUC results reported here. Compared to when participants had different pose information at test (d′ = 1.67), discrimination accuracy was better when they had the same pose information that they encoded (d′ = 2.17; χ² (1) = 35.37, p < 0.001) and when they had the same pose information plus additional unstudied information (d′ = 2.02; χ² (1) = 39.51, p < 0.001). Discrimination accuracy was not boosted further when participants had available at test additional unstudied information about the face (d′ = 2.20; χ² (1) = 0.10, p = 0.752). The same was true when the model was fit to each perpetrator encoding pose condition separately, and the effect appears larger in the frontal compared to the profile encoding condition (see Supplementary Appendix B). Taken together, these supplementary results support the pose-reinstatement hypothesis.

Analyses of the confidence accuracy relationship by lineup member pose condition are presented in Supplementary Appendix C.

In sum, the results of Experiment 1 support the pose-reinstatement hypothesis: consistent viewpoint information at study and at test improved lineup discrimination accuracy. These findings conceptually replicate face recognition research using an episodic memory paradigm and an eyewitness identification task.

Experiment 2

In Experiment 2, we extended Experiment 1 by examining whether participants would actively seek to reinstate during a lineup test the pose in which they had encoded a perpetrator commit a mock crime (i.e., left-profile or right-profile). Relatedly, an interactive virtual reality eyewitness memory study with avatars reported that having multiple face viewpoints available at test increased accuracy, but only when the perpetrator was present in the lineup³². However, this study was likely underpowered, and did not vary encoding pose or measure discrimination accuracy. We developed an interactive lineup procedure that enabled participants to rotate the lineup faces into any pose desired. The procedure recorded the length of time participants spend viewing the faces at different angles. We predicted that participants would naturally reinstate the lineup faces, particularly the perpetrator’s face, into the same pose as they had encoded the perpetrator, and that greater pose reinstatement would be associated with better discrimination accuracy. We pre-registered our hypotheses and analysis plan before we collected data (https://osf.io/ezsxg).