Showup identification decisions for multiple perpetrator crimes: Testing for sequential dependencies

Nina Tupper; Melanie Sauerland; James D. Sauer; Nick J. Broers; Steve D. Charman; Lorraine Hope

doi:10.1371/journal.pone.0208403

Abstract

Research in perception and recognition demonstrates that a current decision (i) can be influenced by previous ones (i–j), meaning that subsequent responses are not always independent. Experiments 1 and 2 tested whether initial showup identification decisions impact choosing behavior for subsequent showup identification responses. Participants watched a mock crime film involving three perpetrators and later made three showup identification decisions, one showup for each perpetrator. Across both experiments, evidence for sequential dependencies for choosing behavior was not consistently predictable. In Experiment 1, responses on the third, target-present showup assimilated towards previous choosing. In Experiment 2, responses on the second showup contrasted previous choosing regardless of target-presence. Experiment 3 examined whether differences in number of test trials in the eyewitness (vs. basic recognition) paradigm could account for the absence of hypothesized ability to predict patterns of sequential dependencies in Experiments 1 and 2. Sequential dependencies were detected in recognition decisions over many trials, including recognition for faces: the probability of a yes response on the current trial increased if the previous response was also yes (vs. no). However, choosing behavior on previous trials did not predict individual recognition decisions on the current trial. Thus, while sequential dependencies did arise to some extent, results suggest that the integrity of identification and recognition decisions are not likely to be impacted by making multiple decisions in a row.

Citation: Tupper N, Sauerland M, Sauer JD, Broers NJ, Charman SD, Hope L (2018) Showup identification decisions for multiple perpetrator crimes: Testing for sequential dependencies. PLoS ONE 13(12): e0208403. https://doi.org/10.1371/journal.pone.0208403

Editor: Guido Hesselmann, Psychologische Hochschule Berlin, GERMANY

Received: January 20, 2018; Accepted: November 16, 2018; Published: December 6, 2018

Copyright: © 2018 Tupper et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All data files are available in the Open Science Framework Database: https://osf.io/bt3fa/?view_only=c7495be4710847b9ab40ff710ee2c554.

Funding: This paper has been supported by a fellowship from the House of Legal Psychology/ Erasmus Mundus Joint Doctorate Programme in Legal Psychology ((FPA) 2013-0036/ (SGA) 2014-0678 /001-001-EMII-EMJD) (http://legalpsychology.eu/) to Nina Tupper and by a Grants-in-Aid for students from the American Psychology-Law Society. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Introduction

In October 2015, news outlets [1] featured security footage of an unresolved case: the attempted abduction of a truck driver on the French-Belgian border. As the truck driver walked around the rear of his truck, two men appeared and attacked him. While the two perpetrators struggled to force the driver into the back of a waiting car, an elderly passerby intervened, pulling at the perpetrators’ jackets and trying to place himself between them and the truck driver. Following the failed abduction and a hurried, but equally unfruitful search for the truck driver’s keys, the two men fled the scene by car.

This case is just one example of the many violent crimes that are committed by multiple perpetrators. Gang violence [2], hate crimes [3], rapes [4] and assaults [5] are often committed by perpetrators working together. In fact, the rising rate of such crimes appears to be a global phenomenon. For example, in Finland, Sweden, and the Netherlands, 13–17% of homicides between 2003 and 2006 involved two or more perpetrators [6] while the proportion of homicides with multiple perpetrators in the U.S. reached 20% in 2008 (nearly double that reported in 1980 [7]). These crimes often involve victims or bystanders as eyewitnesses—like the driver and the passerby above—who may be asked to identify multiple suspects related to the multiple perpetrators. Yet, the decades of research focused on uncovering and understanding factors that affect accuracy in eyewitness identification procedures typically considers only identifications of a single perpetrator, providing little empirical evidence to support or oppose recommendations in protocols specific to the context of multiple perpetrator crime. Should police departments, for instance, follow the example of the U.K. and multiply “best practice” by creating a new lineup for each suspect of a different perpetrator [8]? If so, does the order of presentation of identification tests affect the reliability of the evidence obtained? Or does the act of making multiple identification decisions affect the decisions themselves?

In this paper, we address this last question, examining the consequences of testing memory for multiple perpetrators [9]. We present three experiments examining whether current showup identification decisions are associated with witness choosing behavior on previous showup decisions. We aimed to determine whether sequential dependencies (i.e., whether choosing behavior on previous tests influences choosing on a current test) should be considered in cases when eyewitnesses are asked to make multiple identification decisions, specifically when those decisions pertain to the different suspects in a multiple perpetrator crime.

Identification of multiple perpetrators

Clifford and Hollin [10] first revealed the difficulty of eyewitness identification in the context of multiple perpetrator crimes when they had participants view a non-violent event with one, three, or five perpetrators. Despite only having to select the main perpetrator from a target-present lineup immediately following the crime, only 30% of participants in the three-perpetrator condition and 20% in the five-perpetrator condition made accurate identifications (compared with 40% in the one-perpetrator condition). More recently, Megreya and Bindemann [11] demonstrated a similar drop in accuracy with as few as two unfamiliar faces to be encoded. Participants viewed a mock crime with one perpetrator alone or with an accomplice and were subsequently asked to identify the perpetrator. The presence of a second person at encoding was associated with decreased identification accuracy in target-present lineups (lower hit rates and higher miss rates). Approximately 54% of participants who saw the perpetrator alone were able to accurately identify him/her, compared with only 29% of participants who saw the perpetrator with an accomplice.

To date, three procedures have been proposed to address the applied issue of the multiple perpetrator identification disadvantage. The two-person serial lineup [12], the elimination lineup [13, 14], and an adapted sequential identification procedure [15] were each tested against traditional simultaneous lineups, sequential lineups, or both. The results were mixed, and any improvements associated with these methods depended upon which target identity was being presented (i.e., accomplice vs. perpetrator), the presence or absence of the target in the lineups, or both. Unfortunately, when these new methods fall short, we do not know if it is because the proposed adaptations did not address the mechanisms they intended to, or if the theories used to justify these adaptations are ultimately not relevant to the multiple perpetrator identification disadvantage. For example, the two-person serial lineup is intended to provide context to aid memory by presenting the sequential lineups of the culprit and of the accomplice at the same time [12]. Although the lineups for each are flashed side-by-side on the screen, the two suspects are never shown simultaneously, but always paired with a filler. In theory, the context of one face should aid our ability to recognize or reject the other face. But when this lineup does not improve identification accuracy, is it because contextual cuing is not useful for faces in a lineup context? Or is it because the suspects are never shown together, and thus are not cuing memory? Perhaps it is difficult to interpret their results because they are premature attempts to fix problems that are still not well understood, meaning the adapted lineups amount to trial-and-error solutions.

Shallow encoding [16] and increased memorial demand [17] have more recently been explored as reasons for the decreased identification performance for multiple perpetrator crimes, and both appear to play a role. However, there is another independent factor that is unique to multiple perpetrator identification that has yet to be considered: the decisional structure of making multiple identifications. Below, we explore how the act of making multiple identifications may undermine the integrity of those decisions.

Sequential dependencies in perception and recognition

An individual police lineup has been likened by researchers to a real-world signal detection decision, but with the modification to include filler (i.e., non-suspect) misidentifications [18, 19]. The signal detection model, however, mathematically assumes independence of trials, for which a decision is based solely on the evidence present in that trial. In contrast, research in perception and memory demonstrates that a current decision (i) can be influenced by a previous one (i–j), so that a current response may favor (assimilation) or disfavor (contrast) the preceding responses [20]. In other words, in a series of trials presented one-after-another, the responses, although separate, are not independent. These sequential dependencies appear in perception, classification, and recognition tasks where participants make multiple, sequential decisions—tasks that present a theoretical overlap with making multiple eyewitness identification decisions.

Sequential dependency can be demonstrated in its simplest form in a traditional detection experiment. Howarth and Bulmer [21] seated participants in a dark room with a flash-bulb set at a 50% detection rate at a given intensity, meaning that the light was bright enough to be detected, but dim enough that participants only reported seeing it half of the time. The momentary flashes were accompanied by the sound of a bell, so that when participants heard the bell ring, they indicated whether or not they had seen the flash of light (yes vs. no). At 50% detection, participants will make errors half of the time; errors that should theoretically display natural fluctuations and therefore appear randomly throughout the hundreds of trials. However, participants demonstrated a tendency to assimilate responses towards previous ones, meaning that a no response was more likely to be followed by another no response than a yes response. Further still, at some points, the light signal was omitted so that the bell rang without the accompanying light flash. When the experimenters forced a sequence of three of these blank trials (no-no-no), they found the same degree of assimilation for the subsequent fourth response as for three natural occurring negative responses. Such sequential dependencies are found in a variety of tasks, including absolute judgments of sound [22] and the perceptual classification of facial expressions [23].

The mechanism underlying sequential dependencies remains a subject of debate, with attempts to model sequential dependencies favoring one of the two systems involved in a perception task: decisional processes and the cognitive system. Some models consider sequential dependencies to arise from biased decision-making [20]. According to these models, assimilation results from the observer’s short-term assumption that the most recent stimulus is also the most likely to occur again. However, patterns of contrasting answers are the result of the observer attempting to correct decisional criteria to a desirable level in the long-term. These fluctuations in response bias purport to explain why judgments show assimilation immediately following trial i, but revert to contrast after a few trials. On the other side of the debate are models arguing that sequential dependencies arise either entirely, or at least in part, from the cognitive system [24, 25, 26]. In these models, sequential dependencies arise as a result of inappropriate information being carried forward from the previous trial, affecting the perception of the current stimulus.

Malmberg and Annis [27] were the first to demonstrate sequential dependencies in recognition memory. They presented a series of experiments using traditional recognition paradigms and judgments of frequency recognition tasks to approximate the perception and categorization tasks that routinely demonstrate sequential dependencies. For example, in one experiment, participants studied 40 word pairs and were later tested on their recognition for those words among never-studied words. As with Howarth and Bulmer’s [21] light-detection experiment, participants were more likely to respond old if they had responded old (rather than new) on the previous trial, regardless of whether the previous response was correct (hit) or incorrect (false alarm). The appearance of sequential dependencies was consistent across several replications with different stimuli, including landscape images, and picture-word pairs.

The current research

Studies investigating the cause of the multi-face recognition disadvantage [16, 17] tend to focus on the encoding conditions: how factors that affect perception and attention interfere with encoding, and thus damage chances of identification from the outset. Consequently, studies adapting lineups that were originally designed for single-perpetrator crimes so far considered these encoding difficulties and adjusted methodology in attempts to compensate for the resulting impoverished memory [12, 15]. While this is a reasonable starting point to investigate multiple perpetrator identifications, it is also important to explore other factors that may affect identification decisions. In this vein, we investigated the possibility of sequential dependencies within the eyewitness paradigm. Specifically, how does the act of making multiple identification decisions for unique perpetrators affect the validity of those decisions? Some research has considered the impact of making multiple showup identification decisions for a single-perpetrator crime [28, 29], the non-independence of multiple identification decision remains untested in the context of eyewitness identification for multiple suspects related to the multiple perpetrators of the same crime.

Multiple perpetrator crimes present a framework in which relatively few sequential decisions are made, and in which these decisions have serious consequences. Sequential dependencies measured in the recognition paradigm have little substantial effect on overall recognition accuracy because the beneficial and detrimental sequences of dependencies will typically balance out over the many trials, reducing its impact on the overall accuracy for recognition [27]. Considering identification paradigms lack the many trials needed to balance out recognition accuracy, the appearance of sequential dependencies in this context would be a matter of substantial impact and cause for concern. Therefore, we tested for sequential dependency effects within the eyewitness identification context by having participants make multiple, consecutive showup decisions.

Show ups were chosen because they are particularly well-suited for an initial test for sequential dependencies within the eyewitness context for three reasons. First, although lineups are advisable because they reduce the probability of misidentification by random chance, showups (live or photographic) are still a common identification procedure around the world [28, 30, 31]. Second, forced-report showup decisions (Is this the perpetrator? Yes vs. no) emulate the binary-decision tasks in which sequential dependencies have been consistently observed. Third, showups permit a controlled investigation of sequential dependencies on identification decision-making free from the influence of lineup construction variables (e.g., filler similarity, lineup presentation method) or from the statistical noise of making comparisons between fillers. If sequential dependencies are found to affect showup decision-making, subsequent investigations can determine how these effects interact with lineup composition and presentation variables.

Across two initial experiments, we examined the relation of previous identification decisions to subsequent choosing behavior in the context of the multiple showup identification decisions for a multiple perpetrator crime. If it is possible to predict current choosing on a showup identification decision from previous choosing, it provides initial evidence that sequential effects may be present in multiple showup identification decisions. Given that research has previously demonstrated that sequential dependencies in recognition are a result of interference from previous trials [27], Experiments 1 and 2 consider both previous signal (target-presence: present vs. absent) and previous response (Choosing: yes vs. no) as predictors of current choosing behavior [25]. We expected that initial showup responses would predict choosing for subsequent showup responses. In other words, choosing on a previous showup identification would be associated with choosing on subsequent ones, and rejecting on a previous showup identification would be associated with rejecting on subsequent ones. We also expected previous target-presence to be separately associated with the current identification decision, such that the previous target being present would predict current choosing and the previous target being absent would predict current rejecting [25].

We also considered the possibility of an interaction between current target-presence and previous choosing on current choosing, such that being confronted with a target-absent trial would further raise the probability of rejection given a previous rejection. Non-memorial factors tend to exert stronger effects on recognition memory tasks when the target stimulus is absent and there is no opportunity for genuine recognition [32]. In other words, if memory is not able to provide the answer, people look for other cues to influence their decision. In this way, sequential dependencies might represent an attempt to use imperfect cues to guide decision-making under conditions of uncertainty [33]. Straight forward sequential dependencies should arise regardless of target presence, but it is possible that the strength of the effect will vary depending on whether the target is present or not.

Although Experiments 1 and 2 were conducted separately, they used similar methodologies and analyses to answer the same question. Thus, although the data are not collapsed across experiments, the methods and results are presented together.

Experiments 1 and 2

Ethics statement

These studies were approved by the ethical review board of the Faculty of Psychology and Neuroscience of Maastricht University. Written consent was obtained from participants in Experiment 1. Participants in Experiment 2 provided consent by clicking the button to continue the experiment.

Participants and design

A total of 411 participants were tested, 404 of which were included in analyses. Participants either completed the experiment in the lab (Experiment 1, N = 120) or online (Experiment 2, N = 291). The average age of participants was 21 years (M = 20.77, SD = 3.64). They were compensated with a €5 gift voucher (Experiment 1) or participation credit (Experiments 1 and 2).

Participants viewed a three-person mock crime video and were subsequently presented with three photographic showups, one for each of the three perpetrators. In Experiment 1, we aimed to provide an initial test of sequential dependency in facial identification. Four conditions were chosen to optimize conditions for sequential dependencies through an established pattern of target-present and target-absent showup photographs [21]. The first and second showups were always consistent in target-presence; they were either both target-absent (TA) or both target-present (TP), while the third showup was either consistent or different, leading to four conditions with targets: (1) TA/TA/TP, (2) TA/TA/TA, (3) TP/TP/TA, and (4) TP/TP/TP. In retrospect, we realized this also meant that we were not able to disentangle the effect of target-presence between showups 1 and 2 on showup 2. Thus, Experiment 2 implemented all combinations of target presence by adding four additional conditions with targets: (5) TA/TA/TP, (6) TA/TP/TA, (7) TP/TA/TP, and (8) TP/TA/TA. Presentation order of targets (i.e., 123, 132, 231, 213, 312, 321) was counterbalanced for both experiments.

General method for Experiments 1 and 2

Materials.

Crime video. In the 2:45 min mock crime video, the male victim arrives by bike and locks it against a railing with other bikes. Three target people, one woman and two men, are shown in the background gesturing towards the victim. When the victim walks into a nearby building, the thieves use a hand-saw to break the locks of two bikes, including the victim’s, and walk away with the bikes. Each target actor in the video has approximately 15–20 s of close-up shots in which their faces are clearly visible.

Showups. Three target-absent and three target-present showups were constructed, one for each of the three perpetrators. The showups consisted of color photographs 4.39 x 5.89 cm in size. The targets were photographed on the same day as the stimulus event was filmed, but wore different clothing. One innocent suspect was selected as a replacement for each target in the target-absent showups. The replacements were chosen based on similarity to the actual target, as established by a pilot study with N = 22 participants (age: M = 27.45, SD = 12.14). Specifically, replacements were rated as statistically similar to the perpetrator with regard to memorability, distinctiveness, and typicality [34; 35]. Participants were also asked to judge the similarity of the target faces paired with each of their possible replacements. This comparison score was used to match for similarity across the three target-replacement pairings, so that each of the three target-absent showups would be equally difficult for participants to judge. Results of the pilot study are available in supplementary materials (S1 Table).

Procedure.

Participants arrived at the lab for individual testing sessions (Experiment 1) or received a Qualtrics [36] link to complete the experiment online (Experiment 2). Participants were informed that the experiment would be administered using a self-paced computer task. After giving informed consent, participants were told that they would be shown a video and were instructed to pay close attention as they would be asked questions about it later. After watching the mock crime video, participants completed a 20–30 min filler task by answering a series of questionnaires (Experiment 1 and 2) or by completing a combination of search tasks and word-generation games (Experiment 2). Next, participants were reminded that they had seen a film of three thieves stealing a bike, and were now considered eyewitnesses. They were instructed: You will be shown a series of three photographs. Each photograph is one suspect for each of the three bike thieves. For each photograph, please decide whether or not the person shown was one of the perpetrators. Once you make a decision, you will move on to the next photo. A subsequent screen displayed a one-time warning that the persons in the photographs may or may not be the actual perpetrators.

Participants were then shown a photo for one of the perpetrators (Suspect 1: Present or absent). A forced-report question asked if the person shown was one of the perpetrators (yes or no), after which they were asked to indicate how certain they were in their decision (0–100%). The procedure was repeated for Suspects 2 and 3. Although suspects are numbered here for convenience, presentation order of targets was counterbalanced; meaning Suspect 1 for the eyewitness could correspond to any of the three perpetrators. Following all identification decisions and confidence ratings, participants were shown the photos of those they had positively identified and asked to name the role each played in the crime. However, role assignment and confidence are outside of the scope of the current research and are therefore not addressed further. Finally, participants were thanked for their time and debriefed.

Experiment 2 differs from Experiment 1 in two ways. In order to determine whether participants had watched the entire video, a still image of a white arrow and the text “This is a white arrow, please remember this arrow as you will be asked about it later” was added for the last 7 s of the video (after the target event). Following the filler task, participants were asked to name the shape and color presented at the end of the video. This section of the computer task was timed so that the task advanced automatically after 2:52 min regardless of whether or not the video was paused. Therefore, participants who could not correctly name the shape and color (n = 4) were assumed to have not completed the video and were removed from all analyses. A final question prompted participants to describe the environment in which they completed the experiment (e.g., time of day, location, presence of others).

Results

In Experiment 1, all 120 participants were retained for data analysis. In Experiment 2, seven participants were removed from data analysis for answering the control questions incorrectly (4), not completing the filler task (2), or because Qualtrics recorded their experiment duration time as exceeding four hours and the participant did not respond to requests to elaborate (1), leaving 284 participants.

Descriptive statistics for choosing on showups.

Across the three showup identification decisions in both Experiments 1 and 2, choosing rates were low, at 34–42%. Overall, only 4–12% of participants chose on all three showups. Meanwhile, 15–22% of participants rejected all decisions. Less than half of participants (26–47%) chose on at least two showups. See Table 1 for choosing rates for each experiment.

Download:

Table 1. Experiments 1 and 2: Proportion (frequency) of choosing across showups and overall.

https://doi.org/10.1371/journal.pone.0208403.t001

Experiment 1: Testing for sequential dependencies.

In order to establish the association of previous identification decisions and both previous and current target-presence with current identification decisions, we performed separate binary logistic regressions for choosing on the second and third showup. For example, for choosing on the second showup, we entered previous target-presence (absent vs. present on Showup 1), current target-presence (absent vs. present on Showup 2) and previous choosing (yes vs. no on Showup 1) as predictors. For choosing on the third showup, we used previous target-presence, current target-presence (Showup 3), and previous choosing (yes vs. no on Showup 1 and Showup 2) as predictors. Because target-presence for the first and second showups did not vary in Experiment 1, target-presence for Showups 1 and 2 were included as a single predictor.

In the initial analyses for Showup 2, we included all main effects in the equation. In the initial analyses for Showup 3, we included all main effects and the current target-presence by previous response (selection vs. rejection) interaction. We then sequentially excluded the interaction if non-significant and any non-significant main effects by order of distance from the current decision. However, given our theoretical predictions, previous choosing was always included in the final model. Although we present the results descriptively here, relevant statistics for full models can be found in Table 2 and relevant statistics for final models can be found in Table 3.

Download:

Table 2. Experiments 1 and 2: Complete models of logistic regressions predicting choosing on showups 2 and 3 based on previous choosing and target-presence.

https://doi.org/10.1371/journal.pone.0208403.t002

Download:

Table 3. Experiments 1 and 2: Final models of logistic regressions predicting choosing on showups 2 and 3 based on previous choosing and target-presence.

https://doi.org/10.1371/journal.pone.0208403.t003

Choosing behavior on the second showup. Only target-presence was a significant predictor in the final model. Participants were more likely to choose when the target was present. However, due to the fact that target-presence for Showups 1 and 2 did not vary within subjects, it is unclear if it is current target-presence, previous target-presence, or both that are associated with choosing behavior for Showup 2.

Choosing behavior on the third showup. The current target-presence by previous choosing interaction was significant. Simple effects were examined by reverse-coding target-presence [37]. Results revealed that only when the current trial was target-present, choosing on Showup 2 predicted choosing on Showup 3: the odds of choosing on the third target-present showup were 5.88 times more likely for those who chose on the second showup compared with those who rejected the second showup. In other words, 79% of those who chose on Showup 2 also chose on a target-present Showup 3, while only 39% of those who rejected Showup 2 subsequently chose on a target-present Showup 3.

Experiment 2: Testing for sequential dependencies.

Analyses for Experiment 2 were analogous to Experiment 1 with the exception that all initial models included the current target-presence by previous response interaction. The analyses presented here include data from all eight target-presence conditions. We additionally re-ran analyses in Experiment 2 using only the data from the four target-presence conditions in Experiment 1 (TA/TA/TA; TA/TA/TP; TP/TP/TP; TP/TP/TA). Isolating these four conditions did not significantly change the results, and we therefore report only the fully-randomised results. Interested persons can the data available on the OSF. See Table 2 for the relevant statistics for full models including all predictors, and Table 3 for the relevant statistics for final models.

Choosing behavior on the second showup. As expected, choosing on Showup 1 was a significant predictor of choosing on Showup 2. However, current choosing contrasted previous choosing, so that if participants chose on the first showup, the odds of choosing were 1.72 times less likely than the odds of not choosing. In other words, 72% who chose on Showup 1 subsequently rejected Showup 2. Meanwhile 62% of participants who rejected Showup 1 went on to reject Showup 2. The lack of significant interaction for current target-presence by previous choosing indicates that this sequential dependency was not affected by the current presence of the target. However, current target-presence was also a significant predictor for choosing.

Choosing behavior on the third showup. For choosing on Showup 3, only current target-presence was a significant predictor.

Discussion

Experiments 1 and 2 were initial tests for sequential dependencies across multiple showup identification decisions in the context of multiple perpetrator crimes. We expected previous responses (choosing) and previous target-presence to be related to current decisions. While we did find some evidence for sequential dependencies in both experiments, effects were not consistently predictable. Namely, in Experiment 1, we could only predict choosing behavior between the second and third showups if the third showup was target-present and in Experiment 2 we could only predict choosing behavior between the first and second showups. In Experiment 1, when the current trial was target-present, participants who chose on the second showup were more likely to also choose on the third showup compared with those who had rejected the second showup (assimilation). Although we did expect to find an interaction between current target-presence and previous choosing, the interaction operated counter to expectations. In Experiment 2, regardless of target-presence, participants who chose on the first showup, were more likely to not choose on the second showup (contrast). Taken together, results from both Experiments 1 and 2 provide inconsistent evidence for the capacity to predict choosing behavior from previous choosing. This inconsistency is surprising given the theoretical overlap to fields that have robustly produced sequential dependencies, including perception, absolute identification, and, most pertinently, recognition.

In recognition tests, Malmberg and Annis [27] found sequential dependencies between previous and current responses: A hit on a previous trial increased the probability of a hit on a current trial, but previous hits and false alarms also increased the probability of false alarms on a current trial. In essence, participants were more likely to choose on a current trial if they had chosen on a previous one. This effect was replicated with a variety of paired stimuli (e.g., landscape photo pairs, non-word pairs), as well as with a single-item classic recognition test. While the current research retains similarities to these basic recognition paradigms, as well as other contexts in which sequential dependencies have robustly appeared (i.e., perception, categorization tasks [22, 23]), the eyewitness paradigm also presents differences that may explain the inconsistent results reported here.

Consequently, we considered potentially important differences that may explain the inconsistent results reported here. First, the number of stimuli in our experiment differs greatly from a basic recognition paradigm. In a typical recognition experiment, participants are presented with long lists of words or images, given little time to study these items, and are then tested on those items along with never-before-seen items. Conversely, our experiment only included three perpetrators to study over the course of a 2.5 min mock crime video. Although we cannot ignore the possibility that there are simply not enough stimuli being studied, and therefore participants are not uncertain enough to rely on previous responses, the maximum average participant accuracy rates of 65% do suggest that our filler task allowed for sufficient memory decay to induce uncertainty. Meanwhile, sequential dependencies in recognition are thought to be a result of interference from previous trials that affect mnemonic processing during testing. Therefore, it seems more likely that our results reveal a difference during testing rather than a difference during encoding.

A second difference lies in the number of trials during the testing phases. While recognition experiments may have tens or hundreds of test trials, our participants encountered only three. Perhaps this is not a sufficient number of trials for sequential dependencies to arise. Sequential dependencies have been explained through accumulator models, which predict shifts over time based on criterion placement or accumulation starting points (e.g., Selective Attention, Mapping and Ballistic Accumulation; SAMBA; [24, 25]. The SAMBA model, for example, posits that a participant classifying the loudness of a sound (i.e., soft vs. loud) uses the sound on initial trials to generate a range between which the subsequent sounds are expected to fall. This range establishes how soft the participants can expect a soft sound to be and how loud they can expect a loud sound to be. When confronted with the task of classifying the sound on the current trial, the observer will compare the sound to the upper and lower range in relation to the loudness of the previous response. Their response will depend upon the strength of the evidence for each of these answers. When a soft response is given on the current trial, it is hypothesized that this biases the perception of the sound on the subsequent trial by temporarily reducing the strength of evidence needed to favor another soft response. Thus, assimilation arises from the decision making: because the soft response now has the advantage, the following trial is more likely to reach the threshold to be classified as soft. Contrast, however, arises from the perceptual mechanisms: Because observers are comparing the current sound to the previous one, any change louder or softer can lead to over- and underestimation of the strength of that sound. In this model, assimilation and contrast both occur because the stronger effect (assimilation) eventually decays to give way to the weaker one (contrast [24]). It is possible that such models require an adjustment period over multiple trials in order to calibrate the upper and lower range of perceptual (and in the case of recognition, mnemonic) processing. As a result, the small number of trials present in our experiment might be insufficient for sequential dependencies to arise.

To address the issues outlined above, Experiment 3 used the recognition paradigm in an attempt to replicate and extend the work of Malmberg and Annis ([27] near-pairs condition) using three different categories of stimuli: photos of faces, photos of landscapes (places) and words. Although we did not expect recognition of faces to explain the lack of predictable sequential effects in the above experiments, it is important to note that sequential dependencies have not yet been tested using face recognition stimuli. For the sake of completeness, we compared the new condition of face stimuli to two conditions with stimuli for which sequential dependencies have been detected during recognition tasks (i.e., places and words).

Accordingly, these concerns were translated into three goals: (1) to extend previous research by testing for sequential dependencies on overall responding in face image recognition, (2) if found, to determine if these sequential dependencies translate to predictable choosing behavior, and (3) to examine whether the strength of these effects vary across the testing phase.

We predicted sequential effects would arise across all three sets of stimuli. If sequential dependencies were observed for responses overall, we predicted that sequential effects would be stronger in the second half compared to the first half of testing blocks and thus also expect to be able to predict choosing behavior in late, but not early, test trials.