Facial Cosmetics and Attractiveness: Comparing the Effect Sizes of Professionally-Applied Cosmetics and Identity

Alex L. Jones; Robin S. S. Kramer

doi:10.1371/journal.pone.0164218

Abstract

Forms of body decoration exist in all human cultures. However, in Western societies, women are more likely to engage in appearance modification, especially through the use of facial cosmetics. How effective are cosmetics at altering attractiveness? Previous research has hinted that the effect is not large, especially when compared to the variation in attractiveness observed between individuals due to differences in identity. In order to build a fuller understanding of how cosmetics and identity affect attractiveness, here we examine how professionally-applied cosmetics alter attractiveness and compare this effect with the variation in attractiveness observed between individuals. In Study 1, 33 YouTube models were rated for attractiveness before and after the application of professionally-applied cosmetics. Cosmetics explained a larger proportion of the variation in attractiveness compared with previous studies, but this effect remained smaller than variation caused by differences in attractiveness between individuals. Study 2 replicated the results of the first study with a sample of 45 supermodels, with the aim of examining the effect of cosmetics in a sample of faces with low variation in attractiveness between individuals. While the effect size of cosmetics was generally large, between-person variability due to identity remained larger. Both studies also found interactions between cosmetics and identity–more attractive models received smaller increases when cosmetics were worn. Overall, we show that professionally-applied cosmetics produce a larger effect than self-applied cosmetics, an important theoretical consideration for the field. However, the effect of individual differences in facial appearance is ultimately more important in perceptions of attractiveness.

Citation: Jones AL, Kramer RSS (2016) Facial Cosmetics and Attractiveness: Comparing the Effect Sizes of Professionally-Applied Cosmetics and Identity. PLoS ONE 11(10): e0164218. https://doi.org/10.1371/journal.pone.0164218

Editor: Katsumi Watanabe, Tokyo Daigaku, JAPAN

Received: April 19, 2016; Accepted: September 21, 2016; Published: October 11, 2016

Copyright: © 2016 Jones, Kramer. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All relevant data are within the paper and its Supporting Information files.

Funding: The authors received no specific funding for this work.

Competing interests: The authors have declared that no competing interests exist.

Introduction

Modification of the body with dyes, paints, and other pigments is among the most universal of human behaviours, present in all cultures [1–3]. However, in Western society, women perform the majority of self-adornment [4], and perhaps the most prevalent behaviour of this kind is the use of facial cosmetics. This behaviour is served by the global cosmetics industry which is worth billions of pounds [5].

Women report using cosmetics for a variety of reasons, ranging from anxiety about facial appearance, conformity to social norms, and public self-consciousness [6–8], through to appearing more sociable and assertive to others [6]. Cosmetics are effective at improving social perceptions that the wearer may wish to modulate, with individuals appearing to be healthier and earning more [9], displaying greater competence, likeability and trustworthiness [10], as well as appearing more prestigious and dominant [11]. Cosmetics also influence the behaviour of others, especially men, who tip higher amounts and with greater frequency to waitresses wearing cosmetics [12], and are more likely to approach wearers in the environment [13]. It is likely that the effect of cosmetics on social perceptions is brought about by the increase in attractiveness it confers to faces, which is now a well documented effect [10,14–17]. Research has documented cosmetics function by altering sex-typical colouration in faces such as facial contrast [18–21], by increasing the homogeneity of facial skin [22,23], or by affecting colour cues to traits such as health [24] and age [25].

While the effect of cosmetics on perceived attractiveness seems clear [14,17], other research has revealed it is more nuanced than previously thought. Etcoff and colleagues [10] demonstrated that attractiveness increased linearly with the amount of cosmetics worn—simply, more cosmetics equates to appearing more attractive. Of the range of cosmetics that can be worn, the quantity of cosmetics applied to the eyes and mouth have been shown to be significant predictors of attractiveness [26], with more cosmetics on these features leading to higher ratings of attractiveness. However, other evidence suggests that the typical amount of cosmetics applied by a sample of young women is excessive, with observers preferring close to half the actual amount for optimal attractiveness [16], calling into question the linear relationship between cosmetics quantity and attractiveness.

One concern of facial attractiveness research is that it does not compare the effects of predictors of attractiveness (e.g., symmetry, averageness, sex typicality [27]; against other sources of variation [28]. Recent work has begun to address this by examining the importance of within-person variation in attractiveness (caused by the presence or absence of makeup, for example), compared with the between-person variation in attractiveness simply due to differences between identities [29]. Specifically, it has been previously shown that the effect of cosmetics on attractiveness, a source of within-person variation, is very small, explaining just 2% of the variance in ratings [15]. This is an especially small effect when compared with differences in attractiveness between individuals, a between-person variation in attractiveness, which explained 69% of the variance in judgements. More simply, while facial cosmetics do increase attractiveness, that contribution is small and does little to change an individual’s attractiveness standing in the population.

However, the use of cosmetics is an idiosyncratic and extremely varied practice [3], and its effect on attractiveness is more complex than previously thought. The use of a professional makeup artist is a common practice in almost all studies examining the effect of cosmetics on perceptions [9,10,12,17,30,31], and only a few utilise self-applied cosmetics [14,16,26]. An initial examination of the effect size of cosmetics on attractiveness also had models self-apply their cosmetics [15]. There are good reasons for using professionally-applied cosmetics, as it provides a clearer test of how cosmetics alter facial attractiveness. The increased variability in self-applied cosmetics, due, for example, to differences in application skill or the products used, could make it more difficult to detect an effect of cosmetics on attractiveness, and previous work has indeed found the effect to be small [15]. This distinction represents a trade-off between experimental control and ecological validity—the vast majority of women, if any, do not have a professional makeup artist apply their cosmetics daily, yet the majority of studies examining cosmetics and attractiveness draw conclusions based on professionally-applied cosmetics, which may only indirectly inform as to how cosmetics affect attractiveness in the real world.

We seek to address important theoretical points regarding how cosmetics influence attractiveness. How large is the effect size of cosmetics on attractiveness when cosmetics have been professionally-applied? If cosmetics in psychological experiments are applied with more skill than is typically achieved, then current knowledge of cosmetics and attractiveness likely overstates the relationship, given the reliance on professionally-applied cosmetics in the literature. Moreover, how does the ability of professionally-applied cosmetics compare to previous measures of the effect of cosmetics on attractiveness? In the following study, we examine the effect size of cosmetics on attractiveness in two sets of faces that have had cosmetics applied professionally, with the prediction that the effect will be substantially larger than the previous assessment that considered self-applied cosmetics [15]. In addition, by using a similar design to previous research, we can draw direct comparisons with current knowledge of how cosmetics and identity affect attractiveness.

A separate but related question regarding cosmetics concerns how it affects faces of different levels of attractiveness. Many studies in the literature on cosmetics and social perceptions have used models recruited from university or college [14,15,20]. How do cosmetics affect faces of a different population, specifically faces considered to be very attractive? Previous research found no interaction between cosmetics and identity [15], suggesting cosmetics affect each face’s attractiveness similarly. However, the models used were of a university-aged sample of population-typical attractiveness levels. The present studies, particularly Study 2, examine the effect cosmetics have on perceived attractiveness in a sample of women typically considered to be very attractive—models. Using a sample of faces that are already constrained in attractiveness enables us to manipulate another source of variation in attractiveness, specifically between-person variability. As such, we can observe the effects of cosmetics on attractiveness in a sample with a (hypothesised) lower effect of identity (differences between individuals) than elsewhere.

The present study has several aims. First, we examine how cosmetics affect attractiveness when cosmetics have been professionally-applied. We predict that cosmetics will have a notably larger effect size in this sample compared to the previous study examining this question [15]. Second, we consider the effect size of cosmetics in sets of faces that are considered highly attractive, where between-person variation (identity effect size) should be reduced. The relative effect size of cosmetics may therefore be increased, and may be more likely to overshadow the smaller between-person variation in attractiveness. Conversely, cosmetics may have less of an effect in these samples as the women are already at the higher end of attractiveness without cosmetics, leaving little room for judgements of attractiveness to increase when cosmetics are applied. Finally, by using an identical design to previous research [15], we will compare the findings obtained in these studies to those presented in previous research in order to build a fuller picture of the relative importance of cosmetics and identity in attractiveness perceptions.

Study 1

In the first study, we examine how cosmetics impact attractiveness when they are applied professionally. To do this, we take advantage of an Internet-based sample to acquire images of models whose cosmetics have been applied by high-profile makeup artists. Compared to previous work examining this question [15], we predict that the effect size due to cosmetics should be larger here. However, the effect size of identity may still overshadow it.

Method

Participants.

Ninety North American university students (age M = 18.57 years, SD = 0.75, 41 men) participated in the main study for course credit. Due to a software error, age data was not recorded for the first 50 participants, with the mean age being calculated from the remaining participants. However, all participants were within the same demographic and age range. A further 15 students (age M = 19.93 years, SD = 1.16, three men) rated the quantity of cosmetics worn by the models. Informed consent was obtained from all participants included in the study.

Ethics Statement.

Ethical approval for all studies was obtained from the Gettysburg College institutional review board (IRB). All participants gave written informed consent before beginning the study.

Stimuli.

From the YouTube website, we collected images of White British women (n = 33, age unknown but approximately 20–35 years), who acted as models while their cosmetics were applied by high-profile professional makeup artists from the United Kingdom. Twenty-three models were obtained from one artist’s channel (www.youtube.com/user/lisaeldridgedotcom) with a further ten collected from another (www.youtube.com/user/ctilburymakeup). We utilised all available videos at the time of writing that featured a model receiving a makeover where they were shown before and after an application of cosmetics. In addition, we included only videos where faces began free of cosmetics, and the artist had the intention of applying a particular cosmetics look, rather than with the aim of hiding blemishes or skin conditions (such as acne). Images were captured from video tutorials, which served to instruct viewers on a number of popular cosmetics styles for a range of scenarios. Both authors classified the cosmetics looks into categories using information provided by descriptions within the videos. Three categories were apparent—an everyday, natural look (n = 7), a ‘going out’ look (n = 14), and vintage or editorial looks based on cosmetics the makeup artist had applied during professional photo shoots in the past (n = 12). A third researcher, with extensive experience in this field, arrived at these three categories independently, providing further confirmation.

We captured a high-resolution screenshot of each model at the end of each video, where images of the models were presented before and after their application of cosmetics side-by-side. Models had a neutral expression and looked directly into the camera for the comparison. In addition, the two photographs were taken under the same lighting and camera conditions. From each comparison screenshot, we cropped the ‘before’ and ‘after’ versions of each model to produce two separate images. Final images were cropped just below the chin, at the hairline (or mid-forehead based on the limitations of the original), and tight to the widest part of the face (and so removing the ears). Given the variable nature of the images in terms of hairstyle, we chose models whose hair did not occlude their faces, and we masked loose hair in the lower portions of the images if it was not tied back. Images were resized to a height of 451 pixels. Given copyright restrictions, we present the average of models without cosmetics, and separately with cosmetics, in Fig 1 to illustrate. Averages were produced using JPsychomorph after landmarks were applied to the facial features in each image [32].

Download:

Fig 1. The average model without (left) and with cosmetics (right).

These averages are cropped mid-forehead because several of the YouTube videos presented individuals in this way, resulting in insufficient information above this point for generating averages.

https://doi.org/10.1371/journal.pone.0164218.g001

Procedure.

Participants rated the attractiveness of the models using custom PsychoPy software [33]. Images were presented in a random order, and each participant rated each model only once, in a randomly selected cosmetics condition (i.e., either with or without cosmetics). This design was specifically chosen to prevent carryover effects between conditions [15,29]. Participants rated the attractiveness of the models on a 1 (very unattractive) to 7 (very attractive) scale, indicating their response via mouse click. Stimuli remained onscreen until a judgement was made.

A separate sample of participants judged the quantity of cosmetics worn by the models. These participants saw the ‘without’ and ‘with cosmetics’ images onscreen next to each other, and were asked ‘how much makeup has been applied to this face?’ Participants indicated their responses via mouse click on a 1 (very light) to 7 (very heavy) scale. Trials were presented in a random order. Though this is only a perceived measure of quantity, rather than an actual quantity of cosmetics, we believe it to be suitable as it is the perceived quantity that would affect the perceptions of observers. Importantly, other studies have found general agreement in the quantity of cosmetics applied by a professional makeup artist and the perceived amount of cosmetics being worn [31].

Results

Each image was rated an average of 45 times (SD = 4.45). We examined agreement by calculating the pooled standard deviation for ratings in each cosmetics condition; without SD_p = 1.34; with cosmetics SD_p = 1.44. Responses were given on a 7-point scale, so the generally low variability indicates good agreement in ratings [15,34]. To examine effects of observer sex on ratings, the data were split by the sex of each observer before averaging. This resulted in four scores for each model—one in each cosmetics condition, as rated by men and women.

We also calculated the average amount of perceived cosmetics applied (M = 4.96, SD = 1.09), as judged by the separate sample of raters. These judgements of quantity were collected in order to be able to control for the varying amounts of cosmetics worn by each model in our analyses. However, this measure showed no relationship with the dependent variable (attractiveness) at all levels of observer sex and cosmetics, all rs < .25, ps > .160. As such, there was no reason to include quantity as a covariate, and we therefore analysed our results using a repeated measures ANOVA with model as the unit of analysis.

We focus here on the effect sizes of variables in order to estimate the real world effect of cosmetics on attractiveness. In particular, we utilise eta squared (η²) as a measure of effect size, which expresses how much each factor contributes to the total variance in attractiveness ratings as an interpretable percentage value, rather than partial eta squared, which does not sum across factors to one. We calculated η² effect sizes for both main effects (Cosmetics, Observer Sex) and the interaction by dividing the sums of squares (SS) attributable to each effect by the total SS, calculated by summing the SS attributable to each effect and their respective errors. We also gave special consideration to the variance attributable to differences between items. This variation is typically ignored in repeated measures analyses since it usually represents variation between participants on the measured dependent variable, which is generally unimportant for repeated measures designs (which instead focus on variation within participants). However, in this case, it takes on a useful property. By using the images of the models as the unit of analysis, the variation between models represents variation in attractiveness arising due to the fact that models have different facial identities or appearances. We were therefore able to calculate an effect size for this ‘identity’ measure. The full results of the ANOVA are reported in Table 1, illustrating the effect sizes, their associated SS, and other statistics. It should be noted that there is no error term for conducting an F test on differences between models, and as such, no F ratio is calculated interactions with the Identity measure can be interpreted as an error term for that variable [35].

Download:

Table 1. Results of the analysis of variance from Study 1.

https://doi.org/10.1371/journal.pone.0164218.t001

Men assigned lower ratings of attractiveness (M = 3.74, 95% CI [3.47, 4.00]) than women (M = 3.89, [3.66, 4.13]), a result consistent with previous literature [15,36,37] which we do not pursue further here. Importantly, models were rated as more attractive with cosmetics (M = 4.39, [4.11, 4.68]) than without (M = 3.23, [2.95, 3.51]). The Observer Sex x Cosmetics interaction was driven by men rating faces without cosmetics as less attractive than women rating those same faces, t(32) = 4.32, p < .001, d = 0.75, but both sexes assigned similar ratings for models with cosmetics, t(32) = 0.42, p = .676, d = 0.07, indicating a larger influence of cosmetics on attractiveness for men. However, the effect size of this interaction was very small (η² = 0.01), suggesting a relatively unimportant result.

Of more importance was the Cosmetics x Identity interaction (η² = 0.14), which indicates that the application of cosmetics altered the attractiveness of individual models differently. To examine this further, we computed a difference score for each model between their attractiveness with and without cosmetics, as rated by men and women. This difference illustrates the boost in attractiveness conferred by cosmetics, and we carried out a correlation between these values and the attractiveness of the models without cosmetics. Ratings assigned by both women and men showed a negative correlation between these values, r(31) = -.53, 95% CI [-.73, -.23], p = .001, and r(31) = -.48, [-.71, -.16], p = .005, respectively, indicating that the more attractive a model was, the less of an increase in attractiveness cosmetics conferred, a pattern which did not change when combining ratings given by men and women, r(31) = -.46, [-.69, -.14], p = .007 (see Fig 2).

Download:

Fig 2. An illustration of the average attractiveness (combining ratings made by men and women) of each model, both without cosmetics and with cosmetics.

Models are ordered in terms of increasing attractiveness without cosmetics. An upward pointing arrow indicates an increase in attractiveness with cosmetics, while a downward arrow indicates a decrease.

https://doi.org/10.1371/journal.pone.0164218.g002

Table 1 illustrates that the Identity effect size (η² = 0.45) is 1.36 times larger than the effect size attributed to Cosmetics (η² = 0.33). The differences in attractiveness between individuals explains more variance than an application of cosmetics, but the ratio of these effect sizes is much smaller than in previous accounts [15]. This suggests that a professional application of cosmetics (in comparison with self-application) is capable of producing a larger effect on attractiveness perceptions, although this remains smaller than the effect due to identity differences between women.

We conducted a final analysis to examine whether the cosmetics ‘look’ ascribed by the artist affected perceptions of attractiveness differently for men and women. The above analysis was repeated, but with the addition of ‘look’ as a source of variation between models. The three-way mixed model ANOVA revealed no significant main effects of cosmetics look or interactions with this factor, all Fs < 1.18, ps > .320. However, it is worth noting that the ‘cosmetics look’ variable had low power (ranging from .076 to .242 across main effects and interactions), so further study is required to investigate the role of cosmetics look in perceived attractiveness.

Study 2

The models used in Study 1 were women who had agreed to participate for the purposes of demonstration in a makeup tutorial. We have shown that the effect of cosmetics, when professionally-applied, results in a larger effect size compared with previous research [15]. Next, we investigate how cosmetics alter the attractiveness of a sample of women who are generally regarded as very attractive and earn a living based on their appearance—supermodels. We examine how much variation in attractiveness can be explained by cosmetics, and compare it with the effect size of identity, the differences in attractiveness between supermodels. Here, the effect size of identity should be smaller, given the potentially homogenous nature of the women in terms of attractiveness. How much of a benefit do cosmetics confer to highly attractive women, and in turn, do cosmetics overcome the differences in attractiveness between individuals?