Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Uncertainty in the association between socio-demographic characteristics and mental health

  • Nataliya Rybnikova,

    Roles Conceptualization, Formal analysis, Investigation, Methodology, Software, Validation, Visualization, Writing – original draft, Writing – review & editing

    Affiliations Faculty of Architecture and City Planning, Technion—Israel Institute of Technology, Haifa, Israel, Department of Geography and Environmental Studies, Haifa University, Haifa, Israel

  • Dani Broitman ,

    Roles Conceptualization, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing

    danib@technion.ac.il

    Affiliation Faculty of Architecture and City Planning, Technion—Israel Institute of Technology, Haifa, Israel

  • Murielle Mary-Krause,

    Roles Data curation, Validation, Writing – original draft, Writing – review & editing

    Affiliation Sorbonne Université, INSERM, Institut Pierre Louis d’Epidémiologie et de Santé Publique (IPLESP), Equipe de Recherche en Epidémiologie Sociale (ERES), Paris, France

  • Maria Melchior,

    Roles Data curation, Funding acquisition, Supervision, Validation, Writing – original draft, Writing – review & editing

    Affiliation Sorbonne Université, INSERM, Institut Pierre Louis d’Epidémiologie et de Santé Publique (IPLESP), Equipe de Recherche en Epidémiologie Sociale (ERES), Paris, France

  • Yakov Ben-Haim

    Roles Conceptualization, Formal analysis, Funding acquisition, Investigation, Methodology, Validation, Visualization, Writing – original draft, Writing – review & editing

    Affiliation Faculty of Mechanical Engineering, Technion—Israel Institute of Technology, Haifa, Israel

Abstract

Questionnaires are among the most basic and widespread tools to assess the mental health of a population in epidemiological and public health studies. Their most obvious advantage (firsthand self-report) is also the source of their main problems: the raw data requires interpretation, and are a snapshot of the specific sample’s status at a given time. Efforts to deal with both issues created a bi-dimensional space defined by two orthogonal axes, in which most of the quantitative mental health research can be located. Methods aimed to assure that mental health diagnoses are solidly grounded on existing raw data are part of the individual validity axis. Tools allowing the generalization of the results across the entire population compose the collective validity axis. This paper raises a different question. Since one goal of mental health assessments is to obtain results that can be generalized to some extent, an important question is how robust is a questionnaire result when applied to a different population or to the same population at a different time. In this case, there is deep uncertainty, without any a priori probabilistic information. The main claim of this paper is that this task requires the development of a new robustness to deep uncertainty axis, defining a three-dimensional research space. We demonstrate the analysis of deep uncertainty using the concept of robustness in info-gap decision theory. Based on data from questionnaires collected before and during the Covid-19 pandemic, we first locate a mental health assessment in the space defined by the individual validity axis and the collective validity axis. Then we develop a model of info-gap robustness to uncertainty in mental health assessment, showing how the robustness to deep uncertainty axis interacts with the other two axes, highlighting the contributions and the limitations of this approach. The ability to measure robustness to deep uncertainty in the mental health realm is important particularly in troubled and changing times. In this paper, we provide the basic methodological building blocks of the suggested approach using the outbreak of Covid-19 as a recent example.

1 Introduction

Health assessment is a contested topic in social and epidemiological sciences. In the past, the measurement of health was a combination of quantitative indicators at macro and micro levels (health care expenditure or mortality rates, as an example of the former, clinical data, of the latter). The concept gradually evolved towards the inclusion of qualitative or subjective indicators, such as self-reported health information [1]. The modern measurement of mental health is intrinsically linked to the progressive acceptance of self-reported indicators using standardized questions, which can be traced back to the broad definition of health in the constitution of the World Health Organization from 1946 [2].

However, this type of indicator entailed a range of new questions related to socio-cultural and personal biases [3]. Efforts to define strong links between symptom questions and the diagnosis of disorders are one of the two main axes of quantitative mental health assessment. We call it the individual validity axis. Part of the efforts along this axis advocate homogenization and standardization, focusing on the general population or even international comparisons [46]. Other studies, on the contrary, develop mental health screening tools tailored to specific populations [710]. Research methods located along the individual validity axis aim to assure that mental health diagnoses are solidly grounded on the questions included in the questionnaires and the collected responses to them. The second focus of quantitative mental health assessment is the collective validity axis. Its purpose is to allow the generalization of the results across the entire population, attenuating the effect of possible individual variations in the collected sample [11]. The range of variations in this axis is also wide, spanning from traditional statistical methods [12, 13] to modern data science approaches [14, 15].

The individual and collective validity axes are orthogonal in the sense that any quantitative mental health assessment is located somewhere along each one of them, but a change in one of the axes does not necessarily affect the other. Generalized estimation equations (GEE) models, a statistical tool used for the analysis of longitudinal data, can be applied to test the efficiency of multiple measures of Patient Health Questionnaires (PHQ) [16]. Alternatively, the same GEE models are used as well to analyze the mental health impacts of lockdowns through Adult Self Report (ASR), a well-validated instrument to assess adult psychopathology [17].

The individual and the collective validity axes span a wide and rich bi-dimensional space in which much quantitative mental health research thrives. Indeed, this bi-dimensional space grows steadily with the publication of new quantitative research using validated mental health instruments and analyzed with inference tools. The present paper is not designed to contribute to this type of research. Instead, the intention is to raise questions about uncertainty that cannot be answered in the context of the space spanned by the two mentioned axes. This is the motivation for the suggestion of a third axis, able to provide a response based on info-gap theory [18, 19].

Sample data designed using any of the validated mental health instruments are collected at a specific time, under specific conditions. These data can be analyzed proceeding exclusively from the sample itself. Alternatively, these data can be used to infer statistical results about the general population from which the sample was drawn, if certain probabilistic information is assumed. However, deep uncertainty arises in new and unfamiliar situations, such as different populations or times, and is characterized by the absence or serious deficiency of probabilistic information.

Knight distinguished between "Risk as a known chance and true Uncertainty" which is "unmeasurable" in any probabilistic sense [20, pp.20, 21]. Knightian uncertainty arises from a substantial absence of data, knowledge or understanding relevant to a decision at hand. Knightian uncertainty deals with situations for which probabilities are unknown, so likelihoods cannot be calculated and conventional probabilistic insurance against failure is impossible. Nonetheless, facing Knightian uncertainty, one can still ask: to what extent can the data change (e.g. by considering a different time or location) and the original conclusions are still valid? The question is not probabilistic. We are not asking: What is the likelihood of validity? The question is: How robust are the conclusions to variation in the data themselves? The concept of robustness developed here is non-probabilistic and is based on info-gap decision theory.

This question of robustness to Knightian uncertainty cannot be answered in the space defined by individual/collective validity. This question requires an additional perspective: robustness to non-probabilistic uncertainty. This perspective can be viewed as an orthogonal axis to the two previous axes in the following sense. Analysis of the robustness of the results does not explicitly influence either the individual or collective validity of the methodology used in the analysis. Instead, analysis of robustness to non-probabilistic uncertainty can be applied to any methodological context composed of the individual/collective validity components. Fig 1 schematically shows a 3D space of the three components.

thumbnail
Fig 1. Geometric metaphor.

A 3D space of research methodology–orthogonal composition of individual validity, collective validity, and robustness of the results.

https://doi.org/10.1371/journal.pone.0294664.g001

In epidemiological research, examining the robustness of results can prevent both under- and over-estimating public health risks and help produce more informed public health policies. To the best of our knowledge, research in this field is focused solely on individual validity or collective validity. The present study aims to fill this gap. The present paper aims at developing the methodology of robustness analysis for a pre-chosen research task–examining the associations between socio-demographic characteristics and mental health disorders in an adult French population. We examine these associations for two time periods: before and during the COVID-19 pandemic. By doing so, we aim to examine the robustness of the derived conclusions about the presence or absence of the examined associations both in normal and extreme conditions. We believe that this would contribute to developing more informed differential policies in the field of mental health protection, which would suit specific situations the best.

The paper is structured as follows: Section 2 –the core section of the paper–describes each of the methodological components, with emphasis on the robustness analysis. Section 3 explains the data set and the substantive questions used for the demonstration of the methodology. Section 4 reports the results of the application of the methodology to the substantive questions. Finally, section 5 discusses and summarizes the results, and proposes directions for further research.

2 Methodology: Individual, collective, and robustness components

We proceed in three stages. (i) We use the Adult Self Report (ASR) scale evaluating mental health [21, 22] to identify the presence of mental health difficulties before and during the COVID-19 pandemic among study participants (individual validity component). (ii) We use χ2-analysis to test for statistically significant differences in mental health among different groups of participants (collective validity component) based on estimated probabilities of membership in the groups. (iii) Finally, we employ info-gap theory to assess the robustness of the conclusions to Knightian uncertainty in the data.

2.1. Individual validity: ASR-based estimates of mental health and socio-demographic characteristics

In this study, we assume that the individual validity component is given. We use the ASR, which is a validated measure of mental health based on symptoms of depression, anxiety, and somatic disorder. Therefore, we begin by studying the association between socio-demographic characteristics and mental health difficulties. The estimates of mental health are given as dichotomized variables (implying either the absence or presence of certain symptoms) according to the 85th percentile of the ASR scale, representing substantial presence of adverse mental health symptoms. We count the individuals with and without a particular mental disorder among different socio-demographic groups, also defined as dichotomous. In section 3 we describe the data in detail, but for the purpose of the individual validity description it is enough to stress that in all cases we use two dichotomous variables. Therefore, we obtain a set of 2x2 matrices for further analyses (see Table 1).

2.2. Collective validity: χ2 test for assessing the ‘mental health–socio-demographic characteristics’ associations

To identify the absence or presence of an association between a socio-demographic characteristic and mental health, we utilize the χ2 test to compare the marginal probability distributions of the two populations. For each 2x2 matrix (see Table 1 in subsection 2.1), we consider two different populations (the 2 rows); in each population, there are two different types of members (the two columns). Let pij denote the probability that an observation chosen randomly from the ith population is of type j. This probability distribution is normalized over the two types, so pi1 + pi2 = 1 for i = 1, 2. The null hypothesis is that, for each type, j, the probabilities pij are the same in each population, and that the groups and types are all statistically independent. The alternative hypothesis is that this is false. Specifically: (1)

Thus H0 is the assertion that the probability of population type j = 1 is the same for both populations, i = 1 and i = 2. Because pij is normalized over j, hypothesis H0 likewise asserts that the probability of population type j = 2 is the same for both populations, i = 1 and i = 2. In other words, we are assessing the association between mental health and socio-demographic characteristics in terms of the probability distributions of these properties.

The emphasis of this paper is primarily methodological, focusing on extending the analysis to consideration of Knightian uncertainty. As a result, we consider only three pairs of population-type and socio-demographic variables, even though there are many possible pairs and the analysis could be extended to larger groups of these variables. Again, the emphasis is on the methodology for extending the analysis to Knightian uncertainty, rather than on a detailed substantive analysis of relations between populations and demographics.

Let Nij denote the number of observations in the sample of type j from population i. Let Ni+ denote the total number of observations in the samples from population i for i = 1,2. Likewise, let N+j denote the total number of observations of type j for j = 1,2. Finally, let N denote the total number of observations. Specifically: (2)

Let denote the observed number of observations in the sample of type j from population i while Nij denotes the unknown true value at some future time or place. Let , and denote the observed values of Ni+, N+jand N, respectively.

The χ2 statistic is (3) where we have defined the following empirical approximation, conditioned on H0, for the expected number of observations of type j in group i: (4)

Conditioned on H0, Q will approximately have a χ2distribution (strictly true in the limit of infinite data) with (2–1)⋅(2–1) = 1 degree of freedom. Qc(p) represents the quantile of the χ2 distribution corresponding to a cumulative probability of p. Specifically, with one degree of freedom, Qc(0.95) = 3.84. If H0 holds, then the probability that the χ2 statistic exceeds 3.84, just by chance, equals 0.05. Therefore, one rejects the hypothesis H0 at level of significance p = 0.95 if Q > 3.84. That is, H0 is rejected at level of significance p if: (5)

Otherwise, H0 is not rejected, and one concludes that there is no significant variation in mental health outcomes among different socio-demographic groups of the population.

2.3. Robustness to uncertainty: Info-gapping the χ2 test

Our analysis is based on the concept of robustness to uncertainty as developed in info-gap theory [18, 19, 23]. The assessment of robustness is based on three components: a system model, a performance requirement, and an info-gap model of uncertainty. Robustness is the greatest horizon of uncertainty (in the uncertainty model) up to which the system model satisfies the performance requirement. In the present application, the system model is the χ2 statistic, and the performance requirement is that the statistic must not exceed a critical value, for any realization of data in the uncertainty model. This analysis has three distinctive characteristics.

First, uncertainty is represented non-probabilistically. The data upon which the analysis is based are highly uncertain for several reasons in addition to standard statistical variability. The data reflect a specific population, while we may be interested in a different population at some other time or location, or perhaps the original population at a different time. In other words, the data entail what Knight [20] called “true” uncertainty for which probability distributions are lacking. An info-gap model of uncertainty is an unbounded family of nested sets of an uncertain entity. An info-gap model is non-probabilistic and represents Knightian uncertainty.

The second distinctive characteristic of the info-gap analysis of robustness is that the performance is satisfied—satisfying a critical requirement—rather than optimized.

Third, the robustness to uncertainty is optimized. That is, we optimize robustness to uncertainty, to achieve an acceptable but possibly sub-optimal outcome. We will see subsequently that robustness trades off against performance: better performance has worse robustness to Knightian uncertainty. This motivates satisfying the performance and optimizing the robustness.

Recall that denotes the observed number of observations in the sample of type j from population i while Nij denotes the unknown true value in some future time or place. Recall also that , and denote the observed values of Ni+, N+jand N, respectively. We have insufficient information to choose a probability distribution for these uncertain variables in possible future situations. Nonetheless, a rough first approximation is that the observations are statistically independent and identically distributed. This approximation, were it true, would imply that the observations are Poisson random variables. If we treat as an estimate of the mean for that category, then the standard deviation of the observations would be approximately under the Poisson approximation. We adopt this as an approximation of the relative range of variability of the observations, without assuming that the uncertain future observations are actually Poisson distributed. More precisely, we adopt a fractional-error info-gap model for uncertainty in the observations, in which the range of variation of the unknown true values are centered on the observations, and normalized by . Specifically, the info-gap model for uncertainty in the observations becomes: (6)

The Nij only take discrete values, but we are treating them as continuous variables for simplicity. The maximum distance between a continuous variable and the nearest discrete variable is 0.5. Thus, the continuous approximation of the discrete Nij values entails an error no larger than plus or minus 0.5. The values of the observed variables are typically in the range from 70 to 150, with the smallest value of 16 and the largest value of 506. The fractional-error normalization in the info-gap model of Eq (6) is whose smallest value is 4 and with typical values from 8 to 12. The errors resulting from the continuous approximation are thus small, and this approximation has negligible impact on the evaluation of the robustness to uncertainty of this data.

The set U(h) contains all data sets, Nij, where values deviate fractionally from the corresponding observed values, , by no more than the value h. We make no assumptions about the relative likelihood of possible realizations of Nij. The info-gap model in Eq (6) presumes no specific probability distribution. We used the Poisson concept to choose the normalization of the uncertainty-range for each Nij. Furthermore, the value of h is unknown, and the sets U(h) become more inclusive as h grows, and there is no probabilistic normalization of the likelihood of the elements of the sets U(h). This unboundedness endows h with its meaning as an horizon of uncertainty. The info-gap model is thus not a single set, but rather a family of nested sets that is unbounded in the space of possible data sets. The info-gap model represents uncertainty (in the data) without any probabilistic model for likelihood of future observations. In other words, the info-gap model of uncertainty is a quantification of Knightian true uncertainty.

Let us suppose that the observations are such that H0 is not rejected at significance level p based on the observations. That is, the observed value of the χ2 statistic, Q in Eqs (3) and (4) based on the observed values , and , does not exceed the value Qc(p). The robustness for not rejecting H0 is the greatest horizon of uncertainty h, up to which Eq (5) does not hold for all values of the χ2 statistic, Q, based on all observations in the info-gap model of Eq (6): (7) is defined as zero if this set of h values is empty. Reading this relation from left to right: the robustness for accepting H0 at level of significance p is the greatest horizon of uncertainty, h, up to which the χ2 statistic, Q, is no greater than the critical value, Qc(p), for all possible realizations of the data in the uncertainty set U(h).

An analogous definition of robustness holds when H0 is rejected at level of significance p based on the observations. Specifically, the robustness for rejecting H0 at level of significance p is the greatest horizon of uncertainty, h, up to which Eq (5) holds for at least one value of the χ2 statistic, Q, based on observations in the info-gap model of Eq (6): (8) is defined as zero if this set of h values is empty. It is evident that and cannot both be greater than zero, though both are non-negative.

Consider the constraint on Q in the inequality in Eq (7), this constraint is relaxed as the value of the critical statistic, Qc(p), increases. The robustness is the greatest horizon of uncertainty up to which this constraint is not violated. As a result, the robustness function in Eq (7), thought of as a function of the critical statistic Qc(p), increases as Qc(p) increases. In other words, the robustness is a monotonic function: The robustness increases as Qc(p) increases. The robustness function is thus fully characterized by its inverse function because the robustness is monotonic. Consequently, the robustness function in Eq (7) can be evaluated by evaluating its inverse function.

Let m0(h) denote the inner maximum in Eq (7). A plot of h vs. m0(h) is identical to a plot of vs. Qc(p). That is, m0(h) is the inverse function of . This inverse function is evaluated as a constrained optimization. Specifically, maximize Q, as defined in Eqs (3) and (4), subject to the following constraints: (9)

Similarly, the robustness function in Eq (8) is evaluated by evaluating its inverse function. Let m1(h) denote the inner minimum in Eq (8). A plot of h vs. m1(h) is identical to a plot of vs. Qc(p). That is, m1(h) is the inverse function of . This inverse function is evaluated also as a constrained optimization, that is, minimize Q, as defined in Eqs (3) and (4), subject to the constraints defined in Eq (9).

Finally, consider the specific cases of Eqs (7) and (8) in which Qc(p) = 3.84 which is the χ2 threshold for p = 0.95. In these cases, and are called “critical robustnesses”. This means that if either critical robustness is large, then the corresponding hypothesis can be confidently adopted for a wide range of Knightian uncertainty. Conversely, a small value of a critical robustness implies that the corresponding hypothesis is vulnerable to even a low level of Knightian uncertainty in the data. The values of the critical robustness are specified in the caption of Table 3, and represented by vertical dotted lines in the graphs presented below.

3 Substantive mental health questions

3.1. Motivation

Past research on the long-term psychological consequences of an epidemic outbreak was based on infectious disease spreads which were concentrated in specific geographic areas, and where quarantine measures, when applied, were limited to specific groups [24, 25]. Moreover, with important exceptions, most past studies were cross-sectional or qualitative, and the role of pre-existing psychological difficulties could not be fully explored [26]. It is difficult to know the extent to which the results of these previous studies apply to the Covid-19 virus outbreak, where entire populations of many countries were required to implement social distancing measures for preventive purposes. Short-term effects of the pandemic are only starting to be unveiled, both regarding mental health assessments [27], and the impacts on mental health care [28, 29]. The impact of the Covid-19 pandemic and of universal quarantine and social distancing measures that lasted several weeks, on long-term patterns of mental health, are yet unknown.

A variety of studies examining the influence of the pandemic on mental health across different countries and population groups has emerged since 2020. Table 2 summarizes the findings on the associations between individual characteristics and mental health problems revealed by these studies.

thumbnail
Table 2. A summary of several recent studies on individual characteristics and mental health during the course of the COVID-19 pandemic (selected references retrieved from a search in google scholar using the keywords “Covid-19” and “mental health” in December 2022).

https://doi.org/10.1371/journal.pone.0294664.t002

As can be seen in Table 2, some individual characteristics appear to be associated with higher rates of mental health problems. However, the results vary greatly. As shown by a recent review, the prevalence of mental health problems in the context of the COVID-19 pandemic varied from 6.33% to 50.9% for symptoms of anxiety, from 14.6% to 48.3% for symptoms of depression, and from 8.1% to 81.9% for symptoms of stress [39]. This means that the findings of a selected study, conducted on a certain population, might not be applicable to other populations (even within the same country), let alone other countries and regions.

Therefore, we apply the proposed methodology to the analysis of associations between mental health disorders and individual characteristics in the adult population in France. First, we are interested in assessing the degree to which associations between socio-demographic characteristics and mental health disorders have changed as a result of the COVID-19 pandemic. Then we calculate the robustness-to-uncertainty of these conclusions using an info-gap analysis of all the associations under study.

3.2. Data set

Data used in the present analysis are derived from the TEMPO (Trajectoires EpideMiologiques en POpulation) and TEMPO COVID-19 project. The latter is an extension of the existing cohort study, TEMPO, which aims to evaluate individual, familial, and social determinants of mental health difficulties, addictive behaviors, and their trajectories over time [40]. The TEMPO COVID-19 project comprises 9 waves of data collection, online or via postal questionnaires, performed between March 2020 and May 2021.

For the purpose of this analysis, we used mental health outcome variables assessed using the Adult Self Report (ASR) Achenbach System [21, 22]. This scale is an adult extension of the widely used Child Behavioral Checklist (CBCL). Subscales in the present analysis include 18 items to evaluate symptoms of anxiety and depression. Each item was scored 0, 1 or 2, and the level of difficulty was calculated by summing up all relevant items. Afterward, for inter-questionnaire comparison, ASR scores were standardized from 0 to 100. Finally, the scores were dichotomized using the 85th percentile, and only participants with higher scores were considered as having mental health difficulties (anxiety, depression, and somatic disorders).

To assess participants’ socio-demographic characteristics we used respondents’ gender (male or female), marital status (single or non-single), and living together with at least one child (yes or no) available from the study questionnaires.

In the study, we used a prior TEMPO questionnaire performed in 2018 for the pre-pandemic analysis. To analyze the associations during the pandemic period we have aggregated nine waves of data collection within TEMPO COVID-19, performed between March 2020 and May 2021. The sizes of the aggregated pandemic-time sample emerged comparable with those of the pre-pandemic samples. Specifically, the sample sizes for pre-pandemic-time varied from 829 to 858 respondents, depending on the combinations of mental health difficulty and socio-demographic characteristic. For the pandemic-time questionnaires, the corresponding numbers were from 654 to 731.

4 Results

4.1. Chi-square analyses

Table 3 reports selected χ2 statistic values for testing associations between participants’ socio-demographic characteristics prior to the COVID-19 pandemic (questionnaire Q0) and during the COVID-19 pandemic (aggregated questionnaire Q1*). The selection was based on two types of cases: switching conclusions between pre-pandemic and pandemic times (absence of association before but presence afterwards, or the opposite) or cases in which H1 is accepted in both time periods. We selected only one socio-demographic characteristic that exemplifies each type of case. Other results, namely, lack of association during both periods, are not reported.

thumbnail
Table 3. Associations between selected participants’ socio-demographic characteristics and mental health difficulties that change between pre- and pandemic periods.

The first row reports χ2 statistics (values exceeding the critical value of 3.84 are marked in bold font), and the 2nd row the critical robustness for accepting/rejecting H0.

https://doi.org/10.1371/journal.pone.0294664.t003

The relation between gender and depression is an example of a stable association regardless of the Covid-19 pandemic. This means that both before and after the pandemic there is a significantly different outcome regarding depression in men and women. This is demonstrated by consistent χ2 statistic values well above the critical value of 3.84 as seen in row 1.

But there are also associations that changed during the pandemic. An example of this switch is the ‘marital status–anxiety disorder’ reported in row 2: The χ2 statistic switches from a value below the 3.84 threshold before the pandemic, to a value above the threshold during the pandemic, so the decision switches from H0 to H1. This means that, even if before the pandemic there were no significant differences between married or single persons regarding the level of reported anxiety, significant differences emerged after the Covid-19 outbreak.

A third type of pre- and post-pandemic mental health change is exemplified by the association ‘living with a child–somatic disorder’ (row 3 in Table 3). An opposite trend is observed here: Before the pandemic, somatic disorders had significantly different impact among persons living with children or in households composed of adults only (χ2 well above 3.84). This supports hypothesis H1. However, during the pandemic period, these different impacts seem to vanish, as evidenced by a drop of the χ2 statistic to 0.74. This result leads to the conclusion that H0 (no significant difference) should be accepted during the pandemic.

4.2. Info-gap analysis of robustness

These conclusions are drawn from data obtained from specific populations at specific times. We use the concept of robustness to Knightian uncertainty, as developed in info-gap theory, to assess the degree of confidence one has in applying these conclusions to other populations or other times [18, 19]. The analysis is summarized in Fig 2 (below), that corresponds to each of the cases described in Table 3. We now discuss those graphs.

thumbnail
Fig 2. Robustness functions.

Robustness functions of the associations between the Socio-demographic characteristics and mental health conditions described in Table 3. (A) Gender–Depression. (B) Marital status–Anxiety. (C) Living with a child–Somatic.

https://doi.org/10.1371/journal.pone.0294664.g002

Fig 2(1) depicts the association between gender and depression. The χ2 statistic reported in Table 3 indicates that the chance of experiencing depression is not the same for persons of different gender. H0 is rejected with χ2 value of 18.52 before Covid-19, and 22.57 afterwards. These values are represented in the graph by the intersection of the black and red curves, respectively, with the horizontal axis (the intersection at 22.57 lies outside the graph). The robustness on the horizontal axis is precisely zero. This means that the observed values, 18.52 and 22.57, have no robustness against change in the population. In other words, with zero robustness, we have no confidence in the generality of these specific numerical results. This is the zeroing property of robustness.

But let’s look at the overall behaviour of both curves in Fig 2(1). The question that the info-gap analysis addresses is the following: To what extent would we be confident in the conclusions (i.e., the presence of the examined association both before and after the pandemics) if these questionnaires were used in different locations or at different times? In other words, to what extent can the data underlying row 1 in Table 3 vary while also yielding the same conclusions? If the same conclusions are inferred even from very different data sets, then the inference is plausible for very different circumstances. Conversely, if even small data variations alter the conclusions, then the inference is implausible under different circumstances.

The black and red lines in Fig 2(1) represent the info-gap robustness of the observed results to Knightian uncertainty for the two χ2 tests discussed above. The horizontal axes in the figure depict the critical value of χ2, namely Qc(p), while the vertical axes show the robustness to uncertainty, or . The negative slopes of these robustness curves express the irrevocable trade off between robustness to uncertainty and critical value: greater critical value (more definitive statistical decision) entails lower robustness against Knightian uncertainty.

The vertical grey line (Qc(p) = 3.84) is the statistical decision threshold for p = 0.95: Cases to the left indicate accepting H0 at 0.95 significance, while cases to the right indicate rejecting H0 at 0.95 significance. Both robustness curves in Fig 2(1) increase from right to left. The black curve (before the pandemic) intersects the threshold at robustness of 1.40 and the red curve (during the pandemic) at 1.75. These values of robustness mean that, for variation of the data up to 1.40 standard deviations in the pre-pandemic data (black curve), or up to 1.75 standard deviations for the pandemic data (red curve), the upper boundary of the range of the calculated χ2 statistic would not exceed the critical value of 3.84 (vertical grey line): One can maintain that the rejection of H0 holds within these intervals. But if the divergence of the data goes further and exceeds 1.40 or 1.75 standard deviations (for the black and red curves respectively), the maximal value of the calculated χ2 can exceed the threshold (Qc(p) = 3.84) and the rejection of H0 is no longer justified for populations or times other than those observed. In both cases, the confidence in generalizing these results to other populations or times is moderate.

Fig 2(2) displays the robustness curves for the relation between marital status and anxiety. These curves show a switch in the relation between pre-pandemic and pandemic times. The association between marital status and anxiety was not significant before the pandemic (χ2 statistic was 3.23; horizontal axis of the black curve) but becomes significant during the pandemic (χ2 statistic of 5.92; red horizontal axis). However, these horizontal intercepts are subject to the zeroing property of robustness: We have no confidence in obtaining precisely the same results in different populations or times. As we allow the values to diverge from the observed values, the black curve (accepting H0, like in Fig 2(1)) increases from left to right, and the red curve increases from right to left (rejecting H0). This time the question is to what extent would we be confident in these conclusions (i.e., on the absence of the examined association before the pandemic and the presence of it afterwards) if these questionnaires were used in different locations or at different times. The response is given, again, by the intersection of both curves with the threshold value of 3.84. The conclusion that there is no association between marital status and anxiety before the pandemic (i.e., black curve, acceptance of H0) holds only if the sample variation does not exceed 0.05 standard deviations. On the other hand, the conclusion that there is association between marital status and anxiety during the pandemic (i.e., red curve, rejection of H0) holds only if the sample variation does not exceed 0.20 standard deviations. In both cases, this is a quite low robustness and we have little confidence in the generalizability of these results to other circumstances.

The black and red lines in Fig 2(3) are info-gap robustness curves for the observed results to Knightian uncertainty in the second case of switching χ2 statistics. Living with a child was significantly associated with somatic symptoms before the pandemic, but the association vanished during the pandemic. Again, the horizontal axis in Fig 2(3) depicts the critical value of χ2, while the vertical axis shows the robustness to uncertainty. The vertical grey line (Qc(p) = 3.84) is, as previously, the statistical decision threshold: Cases to the left indicate accepting H0, while cases to the right indicate rejecting H0. The black curve (before the pandemic) intersects the horizontal axis at 9.82 (rejecting H0) while the red curve (during the pandemic) intersects it at 0.74 (accepting H0). This is the third example of the zeroing property of robustness: these observed values have no robustness to change in the data. We also see trade off between the robustness and the critical value, as expressed in the slopes of the robustness curves. The black robustness curve increases from right to left indicating that less definitive rejection of H0 has greater robustness to Knightian uncertainty. The red curve increases from left to right indicating that less definition acceptance of H0 has greater robustness. In this particular example both curves intersect the threshold (Q = 3.84) at the same robustness of 0.55. This is interpreted as in the previous cases: If the sample before the pandemic varies up to 0.55 standard deviations (but not more), the conclusion of a significant association still holds. Likewise, if the sample after the pandemic varies up to 0.55 standard deviations, the lack of association between living with a child and somatic symptoms (i.e., accepting H0) still holds. In both cases the robustness to Knightian uncertainty is low.

5 Discussion and Conclusions

5.1. Results related to the mental health impact of the Covid-19 pandemic

The present study develops a methodology for assessing the robustness-to-uncertainty of results from a selected epidemiological analysis. As a case study, we used a data set of self-reports on mental health and socio-demographic characteristics of an adult population in France before and during the COVID-19 pandemic. χ2 analysis was chosen for examining the absence or presence of the associations between different mental health disorders and socio-demographic characteristics of the respondents. The focus of the paper is the development of a methodology of assessing the robustness of the conclusions to non-random variation in the data, specifically, assessing the confidence in applying the conclusions to other populations or times.

The analysis revealed three types of relevant associations between socio-demographic characteristics and mental health when comparing the period before the pandemic and after the outbreak. The first type refers to stable associations both in pre-pandemic and pandemic periods. This is the case for the association between gender and depression which occurs both before the pandemic and after its outbreak (row 1, Table 3). This finding is in line with previous studies reporting the existence of associations between mental health problems and gender during the pandemic [17, 3038]. Compared with these studies, our results indicate that these associations hold for pre-pandemic times too.

The second type of association between socio-demographic characteristics and mental health refers to a lack of association before the pandemic and the emergence of association after the Covid-19 breakout for some characteristics under study. This is the case for associations between marital status and anxiety (row 2, Table 3). These results support previous findings about the existence of an association between living alone and higher rates of anxiety disorder [31, 33, 3638].

Finally, the third (and perhaps least expected) type of association between socio-demographic characteristics and mental health refers to the disappearance of significant associations during the pandemic, despite their existence in the previous period. This is the case for ‘living with a child–somatic disorder’ (row 3, Table 3). We can only speculate about the meaning of the fading out of these associations. One possible explanation may be the “democratic effect” of the pandemic: Certain characteristics that in normal times sharply differentiate between population groups (e.g. having or not having children) may be less significant in times of emergency [3638, 41, 42]. An additional “democratic effect” may be the temporary resurgence of the welfare state and grassroots social involvement during the pandemic [4345].

5.2. Results related to the robustness to deep uncertainty

The paper adds a new dimension to traditional quantitative mental health research. In addition to the individual and the collective validity axes, we introduce the robustness to deep uncertainty axis. Implementing this concept by means of info-gap decision theory, we are able to quantify deep Knightian uncertainty without any a priori probabilistic information. A three-dimensional research space is defined by introducing this new axis. Within this three-dimensional space we explore to what extent the conclusions drawn from the data can be reliably applied to future populations in different situations.

The results indicate that in most of the cases the robustness to uncertainty is quite low. Only in a few cases the robustness surpasses the value of 1, and the maximal observed robustness value is 1.75, which is still only moderate robustness as explained earlier. Low robustness values such as these indicate that relatively small changes in numerical results (deriving from samples of different populations or times) are sufficient to invalidate the conclusions of the analysis. For instance, robustness of 1 means that the conclusions are extendable to other situations provided that the data vary no more than 1 standard deviation. A variation of this magnitude can be expected just by chance, so one can have little confidence in asserting the conclusions for other situations.

However, greater robustness might be reached with larger sample sizes. This is because the χ2 test is sensitive to the sample size, as evidenced by Eqs (3) and (4). As an hypothetical experiment, let us proportionally double the observed Nij values in Table 3, recalculating both the χ2 test and the robustness analysis. The results are shown in Table 4.

thumbnail
Table 4. Associations between mental health disorders and different socio-demographic characteristics of simulated enlarged samples.

The real values appear without parentheses, while the values obtained using enlarged samples are in parentheses.

https://doi.org/10.1371/journal.pone.0294664.t004

As can be seen from the table, with doubled sample sizes, the conclusions are substantially robust in some but not all cases. For example, the association between gender and depression increases from robustness of less than 2, to 2.45 and 3.00 for pre-pandemic and pandemic-time periods, correspondingly. These values correspond to variation of the data up to 2.45 and 3.00 standard deviations, respectively. This implies reasonable confidence in applying these conclusions to other situations in this hypothetical case. The other two associations do not show large robustness for the hypothetically enlarged data set, suggesting that much larger samples would be needed. The conclusion from this hypothetical experiment is, in broad terms, that much larger samples are needed in order to obtain results that can be confidently generalized to other times or populations. Only much larger samples would have substantial robustness to non-probabilistic uncertainty in the data.

5.3. Summary

The study of quantitative mental health assessments is traditionally conceptualized with two orthogonal axes. The first is focused on the relationship between questions about symptoms and diagnoses: We call it the individual validity axis. The second is the collective validity axis, solidly grounded in statistics and probability. Its primary concern is to allow generalization of results obtained by a sample, to the entire population. Much quantitative mental health research can be positioned in a virtual map by defining the utilized validity methods on one axis, and the inferential tools on the other.

We explored the concept of non-probabilistic Knightian uncertainty arising in other possible samples. We employed info-gap analysis of robustness to assess the impact of this Knightian uncertainty. Info-gap robustness is ontologically different from statistical uncertainty because it entails no probabilistic or statistical assumptions. Robustness to this type of deep Knightian uncertainty enables one to assess the degree of confidence in inference from a known situation to a hypothetical new situation that may differ fundamentally. Robustness to deep uncertainty is orthogonal both to the individual validity axis and the collective validity axis, in the sense that it does not necessarily affect either of these axes. For example, info-gap analysis is indifferent to the implemented validation method. In addition, when examining statistical inference, robustness analysis is indifferent to whether the null hypothesis is accepted or rejected.

The introduction of info-gap theory as a tool to assess deep uncertainty creates an additional axis that expands the domain of analysis in quantitative mental health assessments. The new opportunities opened by this approach were shown by using a χ2 test followed by an info-gap analysis of the results. The limitations of the approach were highlighted also, with suggestions about ways to overcome them.

The ability to measure robustness to deep uncertainty in the mental health realm is vital because the research area is based mainly on questionnaires that reflect the conditions of a specific population and time. The ability to measure robustness is important particularly in troubled and changing times. A recent example is the outbreak of Covid-19, an unexpected, traumatic, and worldwide event that may have changed human behavior in several ways, including its impact on mental health. The analysis of robustness to deep uncertainty, implemented using info-gap theory, seems to be particularly suitable for this type of situation. In the present paper, we provide the basic methodological building blocks of the suggested approach through a simple example.

References

  1. 1. Bullinger M. International comparability of health interview surveys: An overview of methods and approaches. BIOMEDICAL AND HEALTH RESEARCH-COMMISSION OF THE EUROPEAN COMMUNITIES THEN IOS PRESS. 2003;57:1–2.
  2. 2. World Health Organization. Constitution. World Health Organization; 1989.
  3. 3. Kohrt BA, Rasmussen A, Kaiser BN, Haroz EE, Maharjan SM, Mutamba BB, De Jong JT, Hinton DE. Cultural concepts of distress and psychiatric disorders: literature review and research recommendations for global mental health epidemiology. International journal of epidemiology. 2014 Apr 1;43(2):365–406. pmid:24366490
  4. 4. Meltzer H. Development of a common instrument for mental health. In Nosikov A, Gudex C. Development of a common instrument for mental health. EUROHIS: Developing common instruments for health surveys. 2003;57:35.
  5. 5. Haro JM, Arbabzadeh‐Bouchez S, Brugha TS, De Girolamo G, Guyer ME, Jin R, Lepine JP, Mazzi F, Reneses B, Vilagut G, Sampson NA. Concordance of the Composite International Diagnostic Interview Version 3.0 (CIDI 3.0) with standardized clinical assessments in the WHO World Mental Health surveys. International journal of methods in psychiatric research. 2006 Dec;15(4):167–80. pmid:17266013
  6. 6. Auerbach RP, Mortier P, Bruffaerts R, Alonso J, Benjet C, Cuijpers P, Demyttenaere K, Ebert DD, Green JG, Hasking P, Murray E. WHO world mental health surveys international college student project: prevalence and distribution of mental disorders. Journal of abnormal psychology. 2018 Oct;127(7):623. pmid:30211576
  7. 7. Kroenke K, Spitzer RL, Williams JB, Löwe B. The patient health questionnaire somatic, anxiety, and depressive symptom scales: a systematic review. General hospital psychiatry. 2010 Jul 1;32(4):345–59. pmid:20633738
  8. 8. Glover TA, Albers CA. Considerations for evaluating universal screening assessments. Journal of school psychology. 2007 Apr 1;45(2):117–35.
  9. 9. Ali GC, Ryan G, De Silva MJ. Validated screening tools for common mental disorders in low and middle income countries: a systematic review. PloS one. 2016 Jun 16;11(6):e0156939. pmid:27310297
  10. 10. Shields RE, Korol S, Carleton RN, McElheran M, Stelnicki AM, Groll D, Anderson GS. Brief mental health disorder screening questionnaires and use with public safety personnel: a review. International journal of environmental research and public health. 2021 Apr 3;18(7):3743. pmid:33916659
  11. 11. Brown C, Lloyd K. Qualitative methods in psychiatric research. Advances in psychiatric treatment. 2001 Sep;7(5):350–6.
  12. 12. Aoun S, Pennebaker D, Wood C. Assessing population need for mental health care: a review of approaches and predictors. Mental Health Services Research. 2004 Mar;6:33–46. pmid:15002679
  13. 13. Elhai JD, Calhoun PS, Ford JD. Statistical procedures for analyzing mental health services data. Psychiatry research. 2008 Aug 15;160(2):129–36. pmid:18585790
  14. 14. Silva C, Saraee M, Saraee M. Data science in public mental health: a new analytic framework. In2019 IEEE Symposium on Computers and Communications (ISCC) 2019 Jun 29 (pp. 1123–1128). IEEE.
  15. 15. Russ TC, Woelbert E, Davis KA, Hafferty JD, Ibrahim Z, Inkster B, John A, Lee W, Maxwell M, McIntosh AM, Stewart R. How data science can advance mental health research. Nature human behaviour. 2019 Jan;3(1):24–32. pmid:30932051
  16. 16. Staples LG, Dear BF, Gandy M, Fogliati V, Fogliati R, Karin E, Nielssen O, Titov N. Psychometric properties and clinical utility of brief measures of depression, anxiety, and general distress: The PHQ-2, GAD-2, and K-6. General hospital psychiatry. 2019 Jan 1;56:13–8. pmid:30508772
  17. 17. Andersen AJ, Mary-Krause M, Bustamante JJ, Héron M, El Aarbaoui T, Melchior M. Symptoms of anxiety/depression during the COVID-19 pandemic and associated lockdown in the community: longitudinal data from the TEMPO cohort in France. BMC psychiatry. 2021 Dec;21(1):1–9.
  18. 18. Ben-Haim Y. Info-gap decision theory: decisions under severe uncertainty. Elsevier; 2006 Oct 11.
  19. 19. Ben-Haim Y. Info-gap economics: an operational introduction. Basingstoke, UK: Palgrave Macmillan; 2010 Apr 9.
  20. 20. Knight FH. Risk, uncertainty and profit. Houghton Mifflin; 1921.
  21. 21. Achenbach TM, Rescorla LA. Manual for the ASEBA adult forms & profiles;2003.
  22. 22. Rescorla LA. The Achenbach System of Em irically Based Assessment (ASEBA) or Ages 18 to 90+ Years. The use of psychological testing for treatment planning and outcomes assessment. 2004 Jul 21;3:115–52.
  23. 23. Ben-Haim Y. The dilemmas of wonderland: decisions in the age of innovation. Oxford University Press; 2018 Aug 30.
  24. 24. Chong MY, Wang WC, Hsieh WC, Lee CY, Chiu NM, Yeh WC, Huang TL, Wen JK, Chen CL. Psychological impact of severe acute respiratory syndrome on health workers in a tertiary hospital. The British journal of psychiatry. 2004 Aug;185(2):127–33. pmid:15286063
  25. 25. Marvaldi M, Mallet J, Dubertret C, Moro MR, Guessoum SB. Anxiety, depression, trauma-related, and sleep disorders among healthcare workers during the COVID-19 pandemic: A systematic review and meta-analysis. Neuroscience & Biobehavioral Reviews. 2021 Jul 1;126:252–64. pmid:33774085
  26. 26. Brooks SK, Webster RK, Smith LE, Woodland L, Wessely S, Greenberg N, Rubin GJ. The psychological impact of quarantine and how to reduce it: rapid review of the evidence. The lancet. 2020 Mar 14;395(10227):912–20. pmid:32112714
  27. 27. Holman EA, Thompson RR, Garfin DR, Silver RC. The unfolding COVID-19 pandemic: A probability-based, nationally representative study of mental health in the United States. Science advances. 2020 Oct 14;6(42):eabd5390. pmid:32948511
  28. 28. Moreno C, Wykes T, Galderisi S, Nordentoft M, Crossley N, Jones N, Cannon M, Correll CU, Byrne L, Carr S, Chen EY. How mental health care should change as a consequence of the COVID-19 pandemic. The lancet psychiatry. 2020 Sep 1;7(9):813–24. pmid:32682460
  29. 29. Hervalejo D, Carcedo RJ, Fernández-Rouco N. Family and mental health during the confinement due to the COVID-19 pandemic in Spain: the perspective of the counselors participating in psychological helpline services. Journal of Comparative Family Studies. 2020 Oct;51(3–4):399–416.
  30. 30. Pierce M, Hope H, Ford T, Hatch S, Hotopf M, John A, Kontopantelis E, Webb R, Wessely S, McManus S, Abel KM. Mental health before and during the COVID-19 pandemic: a longitudinal probability sample survey of the UK population. The Lancet Psychiatry. 2020 Oct 1;7(10):883–92. pmid:32707037
  31. 31. Essadek A, Rabeyron T. Mental health of French students during the Covid-19 pandemic. Journal of affective disorders. 2020 Dec 1;277:392–3. pmid:32861840
  32. 32. Wang C, Pan R, Wan X, Tan Y, Xu L, Ho CS, Ho RC. Immediate psychological responses and associated factors during the initial stage of the 2019 coronavirus disease (COVID-19) epidemic among the general population in China. International journal of environmental research and public health. 2020 Mar;17(5):1729. pmid:32155789
  33. 33. Fancourt D, Steptoe A, Bu F. Trajectories of anxiety and depressive symptoms during enforced isolation due to COVID-19 in England: a longitudinal observational study. The Lancet Psychiatry. 2021 Feb 1;8(2):141–9. pmid:33308420
  34. 34. Pedersen MT, Andersen TO, Clotworthy A, Jensen AK, Strandberg-Larsen K, Rod NH, Varga TV. Time trends in mental health indicators during the initial 16 months of the COVID-19 pandemic in Denmark. BMC psychiatry. 2022 Dec;22(1):1–3.
  35. 35. Keller A, Groot J, Matta J, Bu F, El Aarbaoui T, Melchior M, Fancourt D, Zins M, Goldberg M, Nybo Andersen AM, Rod NH. Housing environment and mental health of Europeans during the COVID-19 pandemic: A cross-country comparison. Scientific reports. 2022 Apr 4;12(1):5612. pmid:35379838
  36. 36. Msherghi A, Alsuyihili A, Alsoufi A, Ashini A, Alkshik Z, Alshareea E, Idheiraj H, Nagib T, Abusriwel M, Mustafa N, Mohammed F. Mental health consequences of lockdown during the COVID-19 pandemic: a cross-sectional study. Frontiers in Psychology. 2021 Feb 26;12:605279. pmid:33732181
  37. 37. Ahmed MZ, Ahmed O, Aibao Z, Hanbin S, Siyu L, Ahmad A. Epidemic of COVID-19 in China and associated psychological problems. Asian journal of psychiatry. 2020 Jun 1;51:102092. pmid:32315963
  38. 38. Bonnini S, Borghesi M. Relationship between Mental Health and Socio-Economic, Demographic and Environmental Factors in the COVID-19 Lockdown Period—A Multivariate Regression Analysis. Mathematics. 2022 Sep 6;10(18):3237.
  39. 39. Xiong J, Lipsitz O, Nasri F, Lui LM, Gill H, Phan L, Chen-Li D, Iacobucci M, Ho R, Majeed A, McIntyre RS. Impact of COVID-19 pandemic on mental health in the general population: A systematic review. Journal of affective disorders. 2020 Dec 1;277:55–64. pmid:32799105
  40. 40. Mary-Krause M, Herranz Bustamante JJ, Héron M, Andersen AJ, El Aarbaoui T, Melchior M. Impact of COVID-19-like symptoms on occurrence of anxiety/depression during lockdown among the French general population. PLoS One. 2021 Jul 26;16(7):e0255158. pmid:34310661
  41. 41. Amat F, Arenas A, Falcó-Gimeno A, Muñoz J. Pandemics meet democracy. Experimental evidence from the COVID-19 crisis in Spain.
  42. 42. Engler S, Brunner P, Loviat R, Abou-Chadi T, Leemann L, Glaser A, Kübler D. Democracy in times of the pandemic: explaining the variation of COVID-19 policies across European democracies. West European Politics. 2021 Sep 19;44(5–6):1077–102.
  43. 43. Bhat M, Qadri M, Kundroo M, Ahanger N, Agarwal B. Sentiment analysis of social media response on the Covid19 outbreak. Brain, behavior, and immunity. 2020 Jul;87:136. pmid:32418721
  44. 44. Béland D, Dinan S, Rocco P, Waddan A. Social policy responses to COVID‐19 in Canada and the United States: Explaining policy variations between two liberal welfare state regimes. Social Policy & Administration. 2021 Mar;55(2):280–94.
  45. 45. Cantillon B, Seeleib‐Kaiser M, Van der Veen R. The COVID‐19 crisis and policy responses by continental European welfare states. Social Policy & Administration. 2021 Mar;55(2):326–38. pmid:34230722