The impact of hyperlinks on reading text

Gemma Fitzsimmons; Mark J. Weal; Denis Drieghe

doi:10.1371/journal.pone.0210900

Abstract

There has been debate about whether blue hyperlinks on the Web cause disruption to reading. A series of eye tracking experiments were conducted to explore if coloured words in black text had any impact on reading behaviour outside and inside a Web environment. Experiments 1 and 2 explored the saliency of coloured words embedded in single sentences and the impact on reading behaviour. In Experiment 3, the effects of coloured words/hyperlinks in passages of text in a Web-like environment were explored. Experiments 1 and 2 showed that multiple coloured words in text had no negative impact on reading behaviour. However, if the sentence featured only a single coloured word, a reduction in skipping rates was observed. This suggests that the visual saliency associated with a single coloured word may signal to the reader that the word is important, whereas this signalling is reduced when multiple words are coloured. In Experiment 3, when reading passages of text containing hyperlinks in a Web environment, participants showed a tendency to re-read sentences that contained hyperlinked, uncommon words compared to hyperlinked, common words. Hyperlinks highlight important information and suggest additional content, which, for more difficult concepts, invites rereading of the preceding text.

Citation: Fitzsimmons G, Weal MJ, Drieghe D (2019) The impact of hyperlinks on reading text. PLoS ONE 14(2): e0210900. https://doi.org/10.1371/journal.pone.0210900

Editor: Veronica Whitford, University of Texas at El Paso, UNITED STATES

Received: September 27, 2018; Accepted: January 3, 2019; Published: February 6, 2019

Copyright: © 2019 Fitzsimmons et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: The data underlying the results presented in the experiments in this manuscript are available from the UK Data Service. The DOI is: 10.5255/UKDA-SN-853342.

Funding: This research was funded by an EPSRC grant for the Doctoral Training Centre in Web Science: EP/G036926/1. This work formed a part of a PhD completed in the Web Science DTC.

Competing interests: The authors have declared that no competing interests exist.

Introduction

One of the main differences between reading on and off the Web is that the materials that are being read on the Web contain hyperlinks embedded within the text. Two differences between reading hyperlinked words and reading plain words are explored in this study. First, hyperlinks are typically coloured and therefore salient compared to the rest of the text. Second, a hyperlink links one piece of information to another, perhaps on a separate page of the same website, or on a different website altogether. Hyperlinks are a tool to navigate the Web and the word chosen to be hyperlinked often represents the page the hyperlink is linking to. This paper systematically explores these features in order to understand the impact of hyperlinks on reading text on the Web.

Starting with saliency, hyperlinks are salient items that stand out from the rest of the text in some way. Visual saliency is a stimulus-driven signal that announces to us that a certain item or location is different to the rest of the visual field and is worthy of attention. For example, a lone red item in a field of green items will stand out to us, be salient compared to the rest of the items and draw our attention [1]. By convention, hyperlinks are usually displayed in blue with the rest of the text in black. It is this colour difference which makes the hyperlinks stand out. However, Nielsen [2] claimed that it was a bad decision to make hypertext links blue because only 2% of the cones on the retina are sensitive to blue, making it a poor choice in terms of usability [3]. Nevertheless, Nielsen admits that the convention of the blue hyperlink should remain because users know that blue text denotes a hyperlink, allowing them to recognise hyperlinks more rapidly. This is supported by research on automatic attention which suggests that when a user consistently searches the same environment for the same information which is consistently represented in the same way, the processing becomes automatic [4,5]. This could also be true for hyperlinks because blue text in a webpage context almost always represents a hyperlink. Indeed, Campbell and Maglio [6] found that participants were quicker when searching a webpage for a target word that was blue and underlined than for a target word that was black and underlined.

Very little research explores the impact of the saliency of words when reading for comprehension. Simola, Kuisma, Oorni, Uusitalo and Hyönä [7] explored reading in a Web environment and found that salient advertisements can distract attention and disrupt reading. If salient adverts can distract readers, it is conceivable that salient words may as well. White and Filik [8] examined bold words in passages of normal text. They found that bold text received shorter fixation durations, suggesting that saliency in text can affect information processing, and proposed that this finding reflects the improved visual discriminability of the target words, making them easier to identify. There is also evidence suggesting that saliency can affect not just when we move our eyes, but also where we move them. Leyland, Kirkby, Juhasz, Pollatsek and Liversedge [9] examined eye movement behaviour during a reading experiment on fully or partially shaded words within the text and found that when a word was shaded it had an effect on saccadic targeting, influencing where the eyes move to. If only the first half of the word was shaded, the targeting was closer to the beginning of the word compared to when the entire word was shaded. Furthermore, partially shaded words were fixated for longer than fully shaded words, or non-shaded words, suggesting that visual non-uniformity (in that the shading was inconsistent with word boundaries) also affects when we move our eyes.

Recently, Gagl [10] asked participants to read text that featured target words that were either not highlighted or highlighted by being coloured in blue or by being underlined. Gagl found that highlighting a word by colouring it or underlining it had no negative or positive impact on reading during first pass. However, in total viewing times (which includes re-reading time of the word), there was an effect of whether the target was highlighted. The un-highlighted black words showed a reduced viewing time in comparison to the other conditions. This suggests that highlighting with colour or underlining increased re-reading. Gagl proposed that having hyperlinks coloured in blue is a good choice because it does not disrupt first pass reading, but attention is drawn to the highlighted words as is evident from re-reading so it serves the function of highlighting important information.

There has also been research into learning from electronic texts that suggests that hyperlinks do attract attention to them and that this attention actually assists in the retention of the hyperlinked word. The saliency of the hyperlinked words would ensure better acquisition and retention [11] and this idea is also compatible with the classic phenomenon called the Von Restorff effect [12], where items that ‘stand out’ are more likely to be remembered.

Turning to the linking function of hyperlinks, this information can be considered to be, in terms of cognitive processes, more high-level compared to the information that is exclusively contained in the lexical representation of the word that is hyperlinked. Hyperlinks denote a connection to other content somewhere else on the Web. Carr [13] suggested that hyperlinks within the text are a distraction and therefore hinder comprehension of the text. Having to evaluate hyperlinks and navigating a path through them is demanding and is an extraneous task to the act of reading itself. This means that having a hyperlink in the text that links to other content renders the act of reading more laborious and so from this perspective we expect this higher-level processing to be reflected in the eye movement measures during reading of the text.

In terms of a prediction for a high-level factor on reading, the current models of eye movements during reading do not make direct predictions for the impact of hyperlinks, but the closest to a prediction that can be derived originates from the E-Z Reader model of eye movements during reading [14]. The E-Z Reader model suggests that higher-level processes intervene in eye movement control only when “something is wrong” and either send a signal to stop moving forward or to execute a regression. As a result, higher-level processes would exclusively impact the later eye movement measures (regressions and re-reading), so based on this model we hypothesise seeing effects of reading hyperlinks exclusively in the later eye movement measures.

Typographical cues have been shown to improve memory for the signalled content [15–18]. However, simply bolding or underlining the text does not automatically mean it will be remembered; the signal needs to be useful to the reader. Golding and Fowler [19] found that when the reader expected questions on specific details, underlining sections of text facilitated cued recall for those sections. The important information that the reader needed for the task was highlighted and this helped them find the information easily. However, when the reader was expected to provide an outline of text or a list of solutions to the problem discussed in the text, the readers did not experience any benefits from the signalling as it was not useful for the task at hand. Thus, signals need to be relevant to the reader to assist them in their task.

There is also the question of whether, even if some signals are useful, adding (even) more signals is more useful for the reader. If most of the text has some form of signal to cue the importance of the information, then the signal might not be as effective or as informative compared to when only the most important text is signalled. This “over-signalling” can reduce the effectiveness of typographical cues. For example, Lorch, Lorch and Klusewitz [20] asked individuals to read a four-page text after which they were tested on memory for specific target sentences. The text either contained no underlining (control), underlining of the target sentences (light signalling) or underlining of the target sentences and half of the non-target sentences (heavy signalling). Recall was improved when the text had light signalling, but performance was not different from the control condition when there was heavy signalling. If the signalling is not useful for the task, for instance when the signalling is seemingly meaningless, the reader will ignore it. Lorch et al. [20] went on to replicate the control and light signalling conditions, but using capitalisation as the signalling tool instead of underlining. They found that reading was slower for the light signalling condition, but memory recall was improved. Upon further examination, they also observed that the readers slowed down on the signalled content alone and speeded up again when reading non-signalled content. This suggests that the reader may have thought the signalled content was important so decided to spend more time on it. During reading the reader needs to discriminate important and unimportant information and signals in the text can be used to assist the reader.

In terms of reading on the Web, hyperlinks could be said to be a typographical signal due to the fact that hyperlinks are a single word or short phrase that is salient from the rest of the text. Hyperlinked words could also be considered important by the reader and as such the presence of the hyperlink may add emphasis to that section of text.

The experiments here focus on how we read hyperlinked text and whether the links influence reading behaviour. In order to examine how links affect reading behaviour, we will first examine outside of a Web context any potential disruption of reading exclusively due to the target word being a salient colour compared to the rest of the text, before examining in a Web context whether this is due to the link being perceived as important due to the additional information that it can link to.

Three experiments were conducted to explore this issue. Experiment 1 explored whether a salient, coloured word negatively impacts reading behaviour outside of a hypertext context. It used a single coloured word in a single-line sentence, read for comprehension, to explore the impact of saliency and to investigate whether there was a difference between colours or whether simply being a salient word had an impact on reading. A follow-on experiment (Experiment 2) built upon Experiment 1 by exploring whether multiple coloured words in a sentence had an impact on reading. It also included a word frequency manipulation to see whether the difficulty of the word interacts with the fact that the word is coloured. A robust finding in eye movements during reading is that a high-frequency word receives shorter and fewer fixations than a low-frequency word [21,22]. This manipulation allowed us to examine whether there would be an additional cost of the colouring for words that are more difficult. By first exploring the impact of coloured words in plain text, we can compare it with the impact of coloured words shown in a hyperlinked environment, such as in Experiment 3. Experiment 3 explored whether perceiving the words as links influences reading behaviour by presenting the coloured words in text that can be perceived as hypertext. We also included a word frequency manipulation in this experiment in order to explore whether common lexical effects are present in hyperlinked text and to investigate if they are modulated by the word being hyperlinked.

Together, these experiments assessed whether there is a difference between reading coloured words (embedded in words of a different colour) and reading hyperlinks and how this affects reading behaviour. In other words, Experiments 1 and 2 will help us to separate whether any observed effects seen in Experiment 3 are exclusively due to the saliency of a blue word or due to the fact that the blue words are hyperlinks in a hypertext environment.

As previously mentioned, only 2% of the cones on the retina are sensitive to blue, making it supposedly a poor choice in terms of usability [3], and this could impact reading behaviour as well if it is more difficult to read text in a certain colour. In line with previous research which has suggested that hyperlinks disrupt reading behaviour [2,13], we predicted that the coloured words would be fixated for longer because of the saliency of the coloured word. In Experiment 1, several colours were used for the target word to investigate whether blue was indeed particularly disruptive. We also predicted that grey target words specifically would be fixated for longer due to their reduced contrast, which makes them visually more difficult to process [23].

Experiment 1

Method

Participants.

Thirty native English speakers (2 male, 28 female) with an average age of 19.80 years participated in exchange for course credits. All had normal or corrected-to-normal vision and no known reading disabilities.

Apparatus.

Eye movements were measured with an SR-Research Eyelink 1000 eye tracker operating at 1000 Hz (1 sample every millisecond). Participants viewed the stimuli binocularly, but only the right eye was tracked. Words were presented in 14pt mono-spaced Courier font. The participant’s eye was 73 cm from the display; at this distance three characters equalled 1° of visual angle.
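The relationship between viewing distance and visual angle stated above can be checked with a short calculation. The following is a minimal sketch, not part of the original study: the function name and constants are illustrative, and the resulting on-screen character width is only implied by the reported distance and characters-per-degree figures, not reported by the authors.

```python
import math


def size_for_visual_angle(distance_cm: float, angle_deg: float) -> float:
    """Physical extent (cm) that subtends angle_deg at a viewing distance of distance_cm."""
    return 2.0 * distance_cm * math.tan(math.radians(angle_deg) / 2.0)


VIEWING_DISTANCE_CM = 73.0  # reported in the Apparatus section
CHARS_PER_DEGREE = 3        # reported in the Apparatus section

# Width on screen covered by 1 degree of visual angle, and by one character.
one_degree_cm = size_for_visual_angle(VIEWING_DISTANCE_CM, 1.0)
char_width_cm = one_degree_cm / CHARS_PER_DEGREE

print(f"1 degree spans about {one_degree_cm:.2f} cm on screen")
print(f"one character is about {char_width_cm:.2f} cm wide")
```

Under these assumptions, each mono-spaced character would occupy roughly a third of a degree, which is why three characters correspond to 1° of visual angle.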

Materials and design.

Thirty sentences were used and a single target word in each sentence would appear in one of five colours, which correspond to the five experimental conditions (black (RGB: 0,0,0), blue (RGB: 0,0,255), green (RGB: 0,255,0), red (RGB: 255,0,0) or grey (RGB: 192,192,192); see Fig 1). The rest of the sentence was always rendered in black. A counterbalanced design was used in which each participant read one version of each of the thirty sentences with an equal number from each condition. Participants were instructed to read for comprehension and told that they would occasionally have to answer comprehension questions about the sentences. Comprehension questions were presented randomly in 25% of trials. They were simple yes/no questions and the accuracy of answering these was high (97.5% accuracy), indicating that participants were reading the text correctly.
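One common way to realise a counterbalanced design of this kind is a Latin-square rotation of conditions across presentation lists. The sketch below is an assumption about how such lists could be built, not a description of the authors' actual procedure; the function name and the rotation rule are hypothetical.

```python
from collections import Counter

CONDITIONS = ["black", "blue", "green", "red", "grey"]  # five colour conditions
N_SENTENCES = 30


def build_lists(n_sentences, conditions):
    """Return one presentation list per condition (Latin-square rotation).

    In list k, sentence i is shown in condition (i + k) mod n_conditions, so
    every sentence appears in every condition exactly once across the lists.
    """
    n = len(conditions)
    return [
        [(i, conditions[(i + k) % n]) for i in range(n_sentences)]
        for k in range(n)
    ]


lists = build_lists(N_SENTENCES, CONDITIONS)

# Each list contains an equal number of sentences per condition.
for k, presentation_list in enumerate(lists):
    counts = Counter(condition for _, condition in presentation_list)
    print(f"list {k}: {dict(counts)}")
```

With thirty sentences and five conditions, each list contains six sentences per condition. Assigning participants evenly to the five lists (for example six of thirty participants per list, which is an assumption here) would balance items and conditions across the sample.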

Fig 1. Example stimulus from Experiment 1 for the 5 different conditions.

https://doi.org/10.1371/journal.pone.0210900.g001

Procedure.

Before any of the experiments in this article took place, ethics approval for all experiments was applied for, peer-reviewed and granted by the University of Southampton Psychology Department Ethics Committee. Participants were given an information sheet and a verbal description of the experimental procedure and informed that they would be reading sentences on a monitor while their eyes were being tracked. They were told to read for comprehension and that they were to respond to comprehension questions presented after the sentences. If participants enquired about the colour of the words they were told the words were coloured at random and did not correspond to the comprehension questions. The participants were seated in front of the monitor and their heads were stabilised using a head and chin rest to reduce head movements. The initial calibration required approximately five minutes before the actual experiment began. At the beginning of each trial a fixation point was presented on the screen where the beginning of the text was set to appear. The participants were required to fixate this point before the sentence was presented to ensure that the first fixation fell on the first word of the sentence. When the participant had finished reading the text on the screen, they pressed a button to continue to the next trial. Each participant first read three practice trials to become familiar with the procedure. The experiment lasted approximately 15 minutes.

Results

Trials where there was tracking loss were removed prior to the analyses. Fixations shorter than 80 ms that were within one character of the previous or following fixation were merged and all remaining fixations shorter than 80 ms or longer than 800 ms were removed (resulting in the removal of 4.87% of the total dataset). Finally, when calculating the eye movement measures, data that were more than 2.5 standard deviations from the mean for a participant within a specific condition were removed (<1% of dataset). Data loss affected all conditions similarly.
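The cleaning steps described above can be illustrated with a minimal sketch. Several details are assumptions that the text does not specify: a short fixation is merged into the previous neighbour in preference to the following one, the merged fixation keeps the neighbour's position, and the sample standard deviation is used for trimming. The class and function names are hypothetical.

```python
from dataclasses import dataclass
from statistics import mean, stdev


@dataclass
class Fixation:
    x_char: float    # horizontal position in character units
    duration: float  # duration in ms


MIN_MS, MAX_MS, MERGE_DIST = 80, 800, 1.0


def clean_fixations(fixations):
    """Merge short fixations into a nearby neighbour, then drop out-of-range ones."""
    fixes = [Fixation(f.x_char, f.duration) for f in fixations]  # work on a copy
    i = 0
    while i < len(fixes):
        f = fixes[i]
        if f.duration < MIN_MS:
            # Assumption: prefer the previous fixation, then the following one.
            neighbours = [
                j for j in (i - 1, i + 1)
                if 0 <= j < len(fixes)
                and abs(fixes[j].x_char - f.x_char) <= MERGE_DIST
            ]
            if neighbours:
                fixes[neighbours[0]].duration += f.duration
                del fixes[i]
                continue  # re-examine whatever now sits at index i
        i += 1
    # Remove remaining fixations that are too short or too long.
    return [f for f in fixes if MIN_MS <= f.duration <= MAX_MS]


def trim_outliers(values, n_sd=2.5):
    """Drop values more than n_sd sample standard deviations from the mean."""
    if len(values) < 2:
        return list(values)
    m, s = mean(values), stdev(values)
    return [v for v in values if abs(v - m) <= n_sd * s]


raw = [Fixation(5.0, 210), Fixation(5.6, 60), Fixation(12.0, 40),
       Fixation(18.0, 250), Fixation(25.0, 900)]
for f in clean_fixations(raw):
    print(f)
```

In this illustrative trial, the 60 ms fixation is absorbed by its neighbour, whereas the isolated 40 ms fixation and the 900 ms fixation are discarded. In the study, the trimming step would be applied separately to each participant within each condition.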

Several eye-movement measures were calculated based on the target word. Skipping probability is the probability that a target word does not receive a direct fixation during the first-pass, first-fixation duration is the duration of the initial first-pass fixation on the target word, single fixation duration is the duration of the fixation if the reader made exactly one first-pass fixation on the target word, gaze duration is the sum of all first-pass fixations on the target word, go past time is the time between first fixating the word and moving past it to the right (including regressions that originate from the target word), and total time is the total amount of time spent on the target word during the whole trial, including any re-reading that might occur.
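The definitions above can be made concrete with a minimal sketch that derives each measure from an ordered fixation sequence. The data layout (a list of word-index and duration pairs) and the function name are assumptions for illustration. The sketch also assumes, as is conventional, that a target counts as skipped if a word to its right was fixated before the target itself.

```python
def target_measures(fixations, target):
    """Compute target-word measures from an ordered list of (word_index, duration_ms)."""
    # Total time: every fixation on the target during the whole trial.
    total_time = sum(d for w, d in fixations if w == target)

    # Index of the first fixation on the target, if any.
    first = next((i for i, (w, _) in enumerate(fixations) if w == target), None)

    # Skipped: never fixated, or only fixated after the eyes had already passed it.
    skipped = first is None or any(w > target for w, _ in fixations[:first])

    measures = {"skipped": skipped, "first_fixation": None,
                "single_fixation": None, "gaze_duration": None,
                "go_past": None, "total_time": total_time}
    if skipped:
        return measures

    # First-pass fixations: consecutive fixations on the target before leaving it.
    first_pass = []
    for w, d in fixations[first:]:
        if w != target:
            break
        first_pass.append(d)

    # Go-past time: everything from first entering the target until moving to
    # its right, including regressions to earlier words.
    go_past = 0
    for w, d in fixations[first:]:
        if w > target:
            break
        go_past += d

    measures["first_fixation"] = first_pass[0]
    measures["single_fixation"] = first_pass[0] if len(first_pass) == 1 else None
    measures["gaze_duration"] = sum(first_pass)
    measures["go_past"] = go_past
    return measures


# Illustrative trial: the target is word 3, refixated once, followed by a
# regression to word 2 and a return to the target before moving on.
trial = [(1, 200), (3, 220), (3, 180), (2, 150), (3, 190), (4, 210)]
print(target_measures(trial, target=3))
```

For this illustrative trial, gaze duration is the sum of the two first-pass fixations, go-past time additionally includes the regression and the return to the target, and total time includes all three fixations on the target.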

We ran Linear Mixed Models (LMMs) using the lme4 package (Version 1.1-12) [24] in R (Version 3.3.1) [25] to explore the impact of the colour of the target words on fixation times. Binomial models were used for the skipping probability measure.

The colour of the target word was included as a fixed factor, with treatment contrasts specifying black as the baseline in order to be able to compare reading of the target word embedded in a plain sentence without having a different colour with the reading of the target word rendered in another colour. Participants and items were included as random effects variables in a so-called maximal random model which included both intercepts and slopes for the colour factor [26]. If a model did not converge, the random effect structure was reduced first by removing the random effect correlations and then the interactions between the slopes and finally by successively removing the random effects explaining the least variance until the maximal converging model was identified. All the patterns observed in the models were identical whether they were run on log-transformed or untransformed fixation durations, allowing us to present the data run on the untransformed fixation durations in order to increase transparency. Absolute values of t equal to or bigger than 1.96 were interpreted as significant because, for high degrees of freedom, as is typically the case in LMMs, the t statistic approximates the z statistic.
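Two ingredients of this analysis can be illustrated without fitting a model: the treatment (dummy) coding that makes black the baseline, and the origin of the 1.96 criterion. The sketch below is illustrative only. The original analysis was run in R with lme4, and the helper name here is hypothetical.

```python
from statistics import NormalDist

# Black is listed first so that it serves as the baseline level.
COLOURS = ["black", "blue", "green", "red", "grey"]


def treatment_code(level, levels=COLOURS):
    """Dummy-code a factor level against the first level (the baseline).

    The baseline maps to all zeros; every other level gets a single 1, so each
    coefficient estimates the difference between that colour and black.
    """
    return [1 if level == other else 0 for other in levels[1:]]


for colour in COLOURS:
    print(f"{colour:>5}: {treatment_code(colour)}")

# Two-tailed 5% criterion of the standard normal distribution, which the
# t distribution approaches as the degrees of freedom grow large.
z_crit = NormalDist().inv_cdf(0.975)
print(f"two-tailed 5% criterion: |z| >= {z_crit:.2f}")
```

Under this coding, the intercept corresponds to the black condition and each of the four remaining coefficients is a contrast of one colour against black.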

The means for all of the eye movement measures for Experiment 1 are listed in Table 1. Participants were significantly less likely to skip a target word when it was not in black (see Table 2 for the LMM output). This suggests that the saliency of the coloured target word draws attention to it, making it more likely that participants will fixate on it.

Table 1. Means of eye movement measures for Experiment 1.

https://doi.org/10.1371/journal.pone.0210900.t001

Table 2. Fixed effect estimates for all eye movement measures for Experiment 1.

https://doi.org/10.1371/journal.pone.0210900.t002

However, there were no statistically significant differences in any of the fixation time measures across the conditions except when the target word was grey. The reduced contrast of the target word in this condition increased the fixation time on that word both in early and late eye movement measures compared to any other condition because it was more difficult to visually process (e.g., [23]). There was also a significant difference in the later eye movement measures when the target word was shown in green: participants spent longer on the green target word in total reading time and were more likely to regress back to re-read the preceding text, as shown by the increased go past times. This suggests that the participants also found the green word a bit more difficult to process and we suggest this is also due to the reduced contrast of the green text compared to the other colours used besides grey (see Fig 1). To verify this, we determined the luminance of the colours used on the screen during the experiment. Luminance is measured in candela per square metre (cd/m²). The grey and green are similar in luminance (grey: 80.0 cd/m²; green: 73.2 cd/m²) and closer to the luminance of the white background (103.0 cd/m²) than any of the other colours: blue (10.2 cd/m²), red (18.6 cd/m²) and black (0.7 cd/m²). However, it is not clear why this effect of luminance would manifest itself exclusively in later eye movement measures for the green colour.
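The reported luminance values can be turned into a contrast index against the white background. The sketch below uses Michelson contrast, which is a choice made here for illustration and not a measure reported in the article. Only the luminance values themselves come from the text.

```python
# Luminance values (cd/m^2) as reported in the text.
LUMINANCE = {"black": 0.7, "blue": 10.2, "red": 18.6, "green": 73.2, "grey": 80.0}
BACKGROUND = 103.0  # white background


def michelson_contrast(l_text, l_background):
    """Michelson contrast: (Lmax - Lmin) / (Lmax + Lmin), ranging from 0 to 1."""
    l_max, l_min = max(l_text, l_background), min(l_text, l_background)
    return (l_max - l_min) / (l_max + l_min)


contrast = {colour: michelson_contrast(lum, BACKGROUND)
            for colour, lum in LUMINANCE.items()}

# List the colours from lowest to highest contrast against the background.
for colour, value in sorted(contrast.items(), key=lambda kv: kv[1]):
    print(f"{colour:>5}: {value:.2f}")
```

On this index, grey and green have by far the lowest contrast against the background, with red, blue and black well separated from them, which is consistent with the interpretation that these two colours were harder to process visually.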

Discussion

Experiment 1 demonstrated that a coloured word is less likely to be skipped, perhaps because the reader thought the colour served as a signal that the word might be important in some way [19,20]. Alternatively, it could simply be because the coloured word was salient against the rest of the text and attracted the reader’s eye [7,9,27]. There was no negative impact on reading behaviour in terms of fixation times when a word was coloured, except when the colour was associated with reduced contrast, making it more difficult to read, as seen when the target word was grey or green.

Experiment 2

Experiment 2 follows on from Experiment 1 by including multiple coloured words in a sentence to investigate if additional salient words will have an impact on reading behaviour compared to the single coloured word presented in Experiment 1. If a single coloured word causes a reduction in word skipping for that word, will it occur for all salient words in a sentence when there are multiple? Or will an effect of “over-signalling” occur, where the signal of importance is reduced when words are coloured seemingly randomly [19,20]? Note that on the Web the presence of multiple hyperlinks across the screen will likely be the default as opposed to a single coloured word.

In Experiment 2 we only use the colour blue or black for our target words to feature in our coloured word condition. We choose to only use blue or black to represent the colours most often used in a Web environment, where black text tends to represent the unlinked text and the blue text represents the hyperlinked text. We also include a word frequency manipulation to explore whether word difficulty interacts with whether a word is presented in a salient colour (compared to the rest of the text) or not. A reader typically spends more time fixating a difficult or low frequency word than an easy or high frequency word [21,28,29]. We want to explore if a reader will spend even longer processing a more difficult word if it is also coloured/salient.