Abstract
Social network extensions of Heider’s balance theory have led to a plethora of adaptations, often inconsistent with Heider and each other. We present a general model that permits the description and testing of specific balance theoretic predictions as Heider had originally proposed them. We formulate balance statements as a comparison of two conditional probabilities of a tie: , conditioned on 2-path relations
vs. their counterfactual negation,
. Here q, r and s, are indices that identify elements in a set of mutually exclusive and exhaustive relations in a multigraph (their sum produces a complete graph). This relaxes the dichotomous assumption of a signed graph. We identify neutral ties distinctive from negative and positive ties and convert the underlying signed graph into a restricted multigraph. The point-biserial correlations of relation q with the count of 2-paths (through relations r and s) describe the difference in conditional probabilities, or the prevalence for any stipulated balance configuration. In total 18 such unique balance correlations are identified for any trichotomous multigraph, although 27 theoretical statements exist. Two major advantages of balance correlations are: direct comparison between any two correlations over time, even if network sizes and densities differ; and the evaluation of specific balanced and unbalanced behaviors and predictions. We apply the model to a large data set consisting of friendly vs hostile relations between countries from 1816 to 2007. We find strong evidence for one of the balance statements, and virtually no evidence for unbalanced predictions. However, there is ample stable evidence that “neutral” ties are important in balancing the relations among nations. Results suggest that prevalence of balance-driven behavior varies over time. Indeed, other behaviors may prevail in certain eras.
Figures
Citation: Dekker D, Krackhardt D, Doreian P, Krivitsky PN (2024) Balance correlations, agentic zeros, and networks: The structure of 192 years of war and peace. PLoS ONE 19(12): e0315088. https://doi.org/10.1371/journal.pone.0315088
Editor: Abdullah Mohammad Ghazi Al khatib, Damascus University, SYRIAN ARAB REPUBLIC
Received: February 28, 2024; Accepted: November 19, 2024; Published: December 20, 2024
Copyright: © 2024 Dekker et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All data files are available from the Correlates of War database. https://correlatesofwar.org/data-sets/.
Funding: The author(s) received no specific funding for this work.
Competing interests: The authors have declared that no competing interests exist.
1 Introduction
Balance theory prevails as one of the most widespread theories of social behavior to emerge in the past 70 years. With its roots in psychology [1], it is widely used in sociology [2, 3], political science [4], social psychology [5], and anthropology [6]. But, perhaps no field has invested more in balance theory than social networks. Whether these networks are studied by social scientists [7, 8], statisticians [9, 10], or physicists [11], all appeal to balance theory as an explanation or motivation for actor behavior in their research.
One problem with its ubiquitous presence, however, would be that balance theory has taken on different meanings and forms as it has been adapted by the various branches of the study of human behavior, especially those studying social networks. Different network scholars have reinterpreted balance theory in such different ways that it is almost unrecognizable by those who founded the theory itself.
Most notably we want to address the gap that has risen since [12] proved a result on network partitioning as a consequence of balance seeking behavior of nodes. This shifted research away from certain crucial elements in balance theory as it was originally conceived. Already, [13] noticed different strands of balance theory had come about. Where social network theory mostly focuses on structural balance theory as an explanation for group formation and partitioning in networks, the social psychology strand considers diversity in nodal affect and behavior underlying balance processes [13–15]. What is not considered in neither branch is the prevalence of such behavior in the whole signed network. This is fundamental to understanding the micro-macro link, and this paper explores this issue for the network of international relations over nearly 200 years. Measuring and testing behavioral assumptions of balance theory requires to release the dichotomous relation restriction, and hence requires another more general model.
The purpose of this paper is to provide a formal approach to the study of balance in networks that is precise yet flexible, simple yet sophisticated, and innovative yet hones even more closely to Heider’s original theoretical logic. In a major departure from the dominant research agenda, we look beyond the traditional dichotomous perspective that the only ties of interest are the positive and negative ones. Specifically, we ask the question, what role do agentic zeros (neutral relations) play in the consideration of balanced states in a network? Our approach permits researchers in all fields to better specify and understand balance theory’s implications and predictions under such expanded conditions.
2 Balance in a new theoretical frame
2.1 Background
Heider [1, 16, 17] originally defined balance as a function of the relationships between three entities: a person (P), another person (O), and an inanimate object (X). Each relationship between these three entities was characterized as either positive or negative. If the product of the signs is positive, the triad is deemed to be balanced; if the product is negative, the triad is deemed to be unbalanced. This yields eight combinations of possible tie patterns. The central premise in Heider’s balance theory is that P strives to attain balance due to adapting the relation to O, such that the triad becomes in any of the ‘Balanced’-states shown in Table 1.
Newcomb [5] was among the first to apply Heider’s theory exclusively to a triad of people. By identifying “X” as a third individual, rather than an object as Heider had done, he demonstrated how we could use balance among three people to explain many relational social dynamics. An important extension of this idea was developed by Cartwright and Harary [12], who proposed formal mathematical statements about balance properties of the network as a whole, not just isolated triples of actors. The triple was considered an unordered set without any special consideration given to one node over another—i.e., it was an unlabeled triple. However, this has obfuscated the focus on the role of “P” as the actor/agent of interest in Heider’s work. Indeed, the labels in “P − O − X” were replaced with far less discriminating labels i, j, k. This became the standard for one branch of balance theory [13, 14, 18] (see those for description of different branches), the concept and measure of triad census as introduced by [19] and continues to this day [15, 20–23].
Rapaport [24] provided an important theoretical contribution by translating these technical and mathematical elements into simple statements of social relations that has inspired much of the subsequent work in this area. Specifically, each of these eight configurations has a social interpretation that could be construed to give these balance rules an intuition in concrete, everyday terms like friendship and enemy. These eight theoretical statements corresponding to Heider’s P − O − X model are in Table 2.
While such statements make the Heiderian predictions easy to understand, these also are inherently overly demanding. For example, no one really believes that everytime someone encounters a friend of a friend that she will establish a friendship with that person. Rather, a more reasonable prediction from this first Heiderian principle is that a friend of a friend is more likely to be a friend than if that person is not a friend of a friend. That is, the 2-path relation, friend-of-a-friend, creates a favorable context in which a friendship is more likely to form than without that context. Unfortunately, measures of the Degree of Balance, usually don’t allow to distinguish between these different types of (un)balanced behaviors [12, 20, 21, 23, 25].
2.2 Balance theory as a set of probability statements
We will throughout this paper replace Heider’s P − O − X terms with corresponding network terms. We start by noting that the only relationship that Heider specifically designated as interpersonal was the relationship between P and O. And the question posed by the model is what relation P would choose to have with O, given the association both have with X. This X provides a contextual element that influences P’s choice. Consistent with the social network literature, we refer to this P–O relationship as an Ego –Alter relationship, wherein Ego (E) is choosing a relationship with Alter (A). We retain the label “X” to refer to a contextual third party. With this minor amendment, we can formalize the first Heiderian “a friend of a friend is a friend” principle with the following conditional inequality:
(1)
or, equivalently,
(2)
where
indicates a friendship tie. Similar conditional inequalities can be made of each of the other three primary Heiderian predictions about balanced relations. In cases of unbalanced relations, such as “a friend of an enemy is a friend,” then the prediction is that the 2-path relation, being a friend of an enemy, should lead to relatively infrequent friendships. Such unbalanced statements would be similarly described as in Eq 1, except that the inequality is reversed:
(3)
or, equivalently,
(4)
where
indicates a symmetric friendship tie and
indicates a symmetric enemy relation.
At the network level, we can frame this prediction as a statement about the relative likelihood that the predicted tie would appear given that it is facing a contextual 2-path relation. That is, in the network as a whole, we can cross-tabulate the number of occurrences of a tie type with the number of occurrences of the specified 2-path relation, shown in Table 3.
If we assume a uniform and random draw from all {i, j, k} triples in the network, then we can directly calculate the relevant conditional probabilities from this 2x2 table. The first conditional in Eqs 1 and 2, the probability that a friendship between Ego and Alter exists, given that there is an X who is also a friend of Ego and a friend of Alter, is calculated as a/(a + c). The second conditional in each equation is calculated as b/(b + d). Thus, the difference between these two fractions (a/(a + c)) − (b/(b + d)) represents the extent to which the particular balance theory prediction is true. The larger the difference, the stronger the association given the conditional 2-path. If the relation is unbalanced, such as described in Eqs 3 and 4, then the difference between these two fractions would be negative. The larger the negative value, the stronger the effect of the 2-path in suppressing the existence of the Ego − Alter tie.
Conversely, if (a/(a + c)) − (b/(b + d)) is positive, then it will also be the case that the difference between the corresponding row percentages will be positive (i.e., (a/(a + b)) − (c/(c + d)) > 0). This represents the symmetric claim that if the Ego − Alter tie exists, then the probability of the designated 2-path Ego − X − Alter tie will be greater than if the Ego − Alter tie does not exist. While these differences will not in general exactly equal each other, they will always be of the same sign. Moreover, the size of these differences will correspond to the strength of the particular Heiderian balance principle.
Putting all this together, we can describe the size of the balance theory effect by taking the geometric mean of these two differences in probabilities:
Finally, we draw on earlier work [26] to note that this geometric mean is equal to the Pearson correlation between the vector of their corresponding Ego − X − Alter 2-path ties for all {i, j, k} triples and the corresponding vector of their associated Ego − Alter ties. That is,
where
(5)
The sign function in Eq 5 differentiates between balanced and unbalanced predictions. If a/(a + c) > b/(b + d), then the correlation between the presence of the predicted Ego–Alter tie and its conditional 2-path will be positive; thus, the positive root of this square root function is used, indicating that the Ego − Alter tie probability is enhanced in the presence of the Ego − X − Alter 2-path. If a/(a + c) < b/(b + d), then this correlation will be negative; thus, the negative root of this square root function is used, indicating that the Ego − Alter tie probability is suppressed in the presence of the Ego − X − Alter 2-path.
In other words, the Pearson product-moment correlation between observed Ego − Alter ties and their associated Ego − X − Alter 2-path ties directly measures the observed prevalence of balance (or unbalance) in the network as a whole.
2.3 The critical role of (agentic) non-ties
Another restriction in Heider’s model is that it assumes all relations are either positive or negative, not neutral. On the other hand, in reality many people are going to have neutral sentiments (neither positive nor negative) about each other. While such non-ties are often seen as part of any natural network, they are often treated as “missing data” and are excluded from any predictions [12]. Or, by some conventions, as noted by [3], “… the absence of a positive sentiment implies the presence of a negative sentiment,” and as such a lack of a tie may be deemed to represent a negative tie.
In a major departure from this tradition, we explicitly consider neutral relations as agentic or social choices, just as much as positive and negative relations. There are many reasons that neutral or non-relationship ties should be considered a choice within a balance theory context. For example, in a highly polarized and conflict-ridden environment, Ego may have positive sentiments toward X, but Ego also observes that Alter and X have an intense dislike for each other. If Ego were to develop a positive relation with Alter, even if it were to serve Ego’s interests to do so, this would create an unbalanced triple. One way of escaping this uncomfortable position is to establish a neutral or non-relationship with Alter, rather than taking sides in the dispute. That is, Ego may take the “Switzerland” option of staying neutral in the spat, in order to preserve her options to work with Alter in the future.
Strategic neutrality is not the only reason for considering neutral relations in balance theory. Within the blockmodeling perspective it has been noted frequently how zero blocks (groups of people who are discouraged from relating to each other or to other groups) can be found as a way to preserve social order [27, 28]. The cost of forming ties under such conditions could be substantial in a social situation. Understanding how these non-ties play a role in balancing triadic relations is explicitly part of our model.
Yet another reason for observing null ties, even in cases where relationships to all other actors are evaluated is a limitation in resources. Such restrictions prevent engagement in positive or negative ties. For example, in international relations smaller countries may not have sufficient budget or human capital to invest in embassies in each other country. Also, an isolationist approach to foreign affairs may prevent joining collective security agreements. Or negatively, countries cannot afford to go at war in case of resource dependencies. Although different ideologies, internal policies and choices of countries may lead to neutrality, the behavioral consequence is uniform, and distinct from positive and negative ties.
Thus, Ego may develop a neutral attitude towards Alter. Moreover, Ego’s observation that X may have a neutral relation with either Ego or Alter (or both) could affect Ego’s calculations in how to consider her tie with Alter. We therefore maintain that any prediction stemming from balance theory about a real network must recognize this inherent character of the network and include positive, negative and neutral ties as part of the balance processes.
2.4 Triad signature
Our claim, then, is that all signed graphs can be decomposed into three (sentiment) subnetworks: Positive ties, Negative ties, and Zero ties (neutral or non-ties). These three types of relations define a restricted multigraph on any network :
- {P} ≔ the set of (i, j) unordered pairs designating all ties where i has a positive tie with j;
- {N} ≔ the set of (i, j) unordered pairs designating all ties where i has a negative tie with j; and
- {Z} ≔ the set of (i, j) unordered pairs designating all ties where i does not have a positive tie nor a negative tie with j, which we indicate as a neutral or “zero” tie.
We ignore self-ties (i, i) and hence these will not be in these three sets. By convention, we represent these three sets in square matrix form as follows:
These sets are, by definition, mutually exclusive and exhaustive in the network; thus, their sum, as well as their union, forms a complete and simple graph on the full set of nodes in . These three networks (P, N and Z) henceforth will be referred to as the set of primitive relations in
.
Given the three possible relations in any (i, j) pair, then there are 3 × 3 = 9 possible configurations in any 2-path i-k-j. For each 2-path configuration, there are three relations between i and j that complete the triple, yielding 33 = 27 configurations possible. Each of these may be designated with a 3-tuple identifier. We will use the following rules to identify each unique possible configuration:
Given any ordered triple of actors in graph , we designate three roles, one ascribed to each of the three actors: Ego, the focal person to whom the balance consideration is paramount; Alter, the person whom Ego is potentially establishing a relationship with; and Context, the person with whom both Ego and Alter have a relation that defines the context that determines whether the i-j relation is in balance or not. We will heretofore abbreviate these role references as “E” for Ego, “A” for Alter, and we retain Heider’s “X” for the Contextual third party. Any labeled triple, then, has three tie values, designated by the three pairs of the three nodes. Each pair will be accorded a value corresponding to one of the three types of ties (Positive, Negative, or Zero–from hereon out referred to as p ties, n ties and z ties, respectively). Using this notation, any and all configurations of ties amongst any three nodes may be designated by assigning the letter p, n or z to each of the three pairs. The first letter will designate the relation between Ego and Alter; the second will designate the relation between Ego and X; and the third letter will designate the relation between Alter and X (see Table 4).
For reasons that will become clear later, we will separate the first letter in this 3-tuple from the remaining two letters with a period. Thus, for example, the 3-tuple <p.nz> would indicate that Ego has a positive tie with Alter, Ego has a negative tie with X, and Alter has a zero tie (non-tie) with X. Heider’s “the enemy of an enemy is a friend” prediction would be represented as <p.nn>. These three letters, in the designated order, provide a triad signature, which permits the unique designation of each of the 33 = 27 possible configurations that Ego might face.
While any ordering of these letters is possible, we have chosen this arbitrary ordering because it allows one to see some of the principles behind Heider’s theoretical predictions. In fact, the consistent application of this order of these three elemental relations in the signature provide three important features: 1) The order identifies a specific structural configuration, one of the possible 27; 2) The order represents specific and formal predictive statements from balance theory; and 3) The order naturally leads to a precise calculation of the balance correlation measure of the network-level of balance specifically designated by the signature. We will discuss each of these features in turn below.
2.4.1 Identifying the 27 triad signatures.
The 27 possible labeled configurations are provided in Fig 1. Positive ties (p) are represented in black; negative ties (n) are represented in dashed red lines; zero-ties (z) or neutral ties are blank, and are indicated with a letter “z” to emphasize they are treated as ties. Node labels refer to the Ego (E), Alter (A), and Context (X) roles in each triad. As we will discuss later, at the network level some of these triads will be mathematically redundant. Nonetheless, in order to keep the theoretical stories clear, it is still important to retain all 27 triad signatures.
Note: Each signature is given by any combination of p, n and z, in angle brackets. A solid line indicates a positive tie; a dashed line indicates a negative tie; no line indicates a zero-tie. Node labels indicate roles of Ego (E), Alter (A), and Contextual (X).
As an interesting extension of this last point, we note that many of these labeled configurations are isomorphic, yet their theoretical claims are still distinct. For example, <p.pz> is isomorphic with <z.pp>, but each is a different claim on the experience that Ego faces. In the former case, Ego is the central character, the bridge between two friends who are neutral toward each other. In the <z.pp> case, however, Ego is neutral to Alter, even though his friend (X) likes Alter. How each of these two configurations may play out in Ego’s experience from a balance theory perspective could be quite different, depending on the options Ego experiences. In the <z.pp> case, Ego could actively close the triple with a positive tie to Alter, creating a <p.pp> triple, which would be attractively balanced in Heider’s view. But, in the <p.pz> case, Ego has less agency, since the neutral tie is between Alter and X. Thus, these two isomorphic structures represent entirely different balancing experiences and options from Ego’s viewpoint.
2.4.1 Triad signature as a predictive statement.
Each triadic signature also operates as a unique theoretical predictive statement regarding balance processes. To see how this is so, we return to the predictive statement that Heiderian balance can make: If balance theory holds, then Ego should be motivated to create balance in its ties and to alter relations that create unbalanced situations. In a triadic context, the probability that Ego will actively choose to engage in a tie (either p, n or z) with Alter will depend on the 2-path relation through X. In the present model, the first letter in the triad signature represents this choice faced by Ego. This separate role is visually enhanced by the placement of a period between it and the latter two letters in the signature.
To illustrate this feature, we begin with a list of all eight classic Heiderian balance-theory statements, along with our triadic signature notation in Table 5. The core prediction in Heider’s model is that if a triple is unbalanced, then this creates psychological tension and discomfort. Thus, an unbalanced triple should not occur, or if it does occur, an attempt will be made by Ego to balance it by reconsidering her relations with Alter to make it balanced. On the other hand, if a triple is balanced, it will induce a sense of equanimity on Ego’s part and is more likely to be stable and remain in place. Viewed in this way, each balanced signature easily translates to a corresponding prediction from balance theory. Consider the following decomposition of the triadic elements.
Given any set of relational ties (q, r, and s) among Ego, Alter and X, such that Ego has a q-tie with Alter, Ego has an r-tie with X, and Alter has an s-tie with X, the signature of this triad <q.rs> can be decomposed into two parts: the q-tie, and the 2-path rs-tie. Now all balance theory statements can be framed by treating the q-tie as a dependent variable, predicted by the contextual 2-path, the rs component (i.e., P(q) = f(rs)).
As a general principle, if the triple <q.rs> is balanced, then the prediction is
To illustrate, the Heiderian balance prediction that “the enemy of an enemy is a friend” can be restated more formally as “the probability that a specific Alter will be a friend is enhanced if I have an enemy who is also the enemy of this Alter.” This statement immediately corresponds to and is represented by the configuration <p.nn>, where p is the dependent variable and nn is the 2-path conditional. The prediction is that p is more likely to occur given the presence of the 2-path nn than if nn is not present:
Take as an example the second unbalanced statement in Table 2, “A friend of an enemy is a friend,” represented by the configuration <p.np>. Since it is theoretically unbalanced, it is predicted to create an uncomfortable situation for Ego. Given that the 2-path relation “a friend of an enemy” exists between Ego and Alter, then it is predicted that Ego will not, or at least be less likely to, establish a positive friendship tie to Alter. The specific prediction, then, is that a p-tie is less likely to occur given the presence of the 2-path np than if np is not present.
All balance theory statements can be framed this way. Evidence for or against these predictions, stemming directly from triad signatures, can be brought to bear on each statement to see which specific predictions hold up under scrutiny. While this sounds straightforward, we will soon see that the issue of balance is considerably more complicated than this simple interpretation.
2.4.3 Triad signature-induced measure of network balance: The balance correlation.
The embeddedness principle. Before we introduce this third, and most critical, function of the Triad Signature, we note one final difference between Heider’s original theoretical formulation and applications to social network analysis. Heider himself focused on isolated and ideational triples of entities and made predictions based solely on these three entities. In a network context, however, it is important to recognize that no labeled Ego − X − Alter combination occurs in isolation but rather is embedded in a complex structure of triples. In fact, changes in one tie can imply balance changes elsewhere in the network. Consider the case depicted in Fig 2. The three nodes in the middle of the graph (Ego, 1, and 2), constitute an unbalanced triple: Ego has a positive tie with Node 1, and Ego also has a positive tie with Node 2; but, Nodes 1 and 2 have a negative tie. Consider the specific ordered triple where node 1 is Alter and node 2 is the contextual X. Ego is faced with a dilemma. In considering whether to change her relationship with Alter (node 1), Ego finds herself embedded in several critical triples with node 1 as an Alter ((Ego,1,3), (Ego,1,4), (Ego,1,5)). Each of these triples has the signature of <p.pp>, and therefore these are balanced. If Ego were to balance the (Ego,1,2) triple by switching her relationship with node 1 from <p> to <n>, then that would distort the balance that currently exists in these three other <p.pp> triples. In Heiderian terms, the balance cost would far outweigh the balance benefits for switching.
Note: The black edges represent positive ties; the gray dashed edges represent negative ties. The focal labeled triple of interest here is {Ego,1,2}, which is unbalanced (<p.pn>). Since the Ego-2 tie is embedded in more unbalanced triples than the Ego-1 tie is, Ego is more likely to change her relationship with node 2 as an Alter than node 1 as an Alter in trying to resolve this unbalance.
On the other hand, consider the alternate ordered triple where Ego is now taking stock of her relationship with 2 as the Alter, in light of node 1’s new role as Context (X). With node 2 as Alter, Ego also has to consider that there are, again, several contexts ((Ego,2,6), (Ego,2,7), and (Ego,2,8)), to take into account, not just the triple with node 1 (Ego,2,1). In this case, however, all the pertinent triples are currently unbalanced with a <p.pn> structure. If Ego were to switch her relation to Alter (node 2) from <p> to <n>, then all four of these ordered triples would suddenly become a balanced triple <n.pn>. Therefore, in this case, it is easy to see that the balance forces greatly favor Ego’s choice of changing her tie to Node 2 from positive to negative to fix the unbalance in her embedded network.
Not all complex contexts are as easily resolved as in this artificial example. What needs to be underscored here, though, is that in a network context, the Ego − Alter pair does not have one sole X to consider in any balance calculation. And, the larger the number of contextual 2-paths that constitute pre-conditions for balance, the greater the nudging force on the Ego − Alter pair to make a relationship that obeys the rules of balance. Indeed, each Ego − Alter pair faces (n-2) triples of some configuration; some of these may be balanced, some unbalanced, and perhaps some undecided. The Embeddedness Principle simply states that all of these triples must be considered in assessing what the balance forces are on any Ego − Alter pair.
Taking this Embeddedness Principle into account, we now make Heider’s prediction more concrete. Since social networks provide many observations, even around a specific dyad, this affords us the opportunity to turn the prediction into a measurable probabilistic statement, rather than simply “the network is balanced” or “the network is unbalanced”, as [12] originally suggested. We propose that balance statements are better framed with words like “tends to” or “is more likely to”. Consider the “friend of a friend is a friend” prediction. Rather than stating “In every case, a friend of a friend is a friend”, we argue that a better, empirically testable extension of this claim would be “A friend of a friend will tend to be a friend.” And the more friend-of-friend contexts one is embedded in, the more likely Ego will choose Alter as a friend, assuming balance theory is correct. This insight will help us explore Heider’s model more precisely.
One of the basic predictions stemming from balance theory is that, if individuals are prone to attaining and maintaining balance, then at any given time we should find that balanced relations are more prevalent in a network. That is, if balance theory actually works to influence overall structures, then we should find that the 2-paths that exist in a network should be associated with a higher probability of the balance-confirming direct tie from Ego to Alter. As we observed in Eq 1, the more a balance-based 2-path conditional exists around any particular Ego − Alter pair, the higher the probability we would observe the predicted direct tie from Ego to Alter.
A key advantage of formalizing Heiderian predictions using triad signatures is that they make it easy to extend this correlational measure to include this embeddedness principle. [26] created a correlation coefficient, the Transitivity Correlation, that assesses the extent to which the 2-path pre-transitive condition in a digraph (i-k-j) is correlated with the presence of a direct tie i-j. Their solution to this problem was simply to correlate the off-diagonal elements in the matrix form of with the corresponding elements in matrix
. Since
contains the number of directed 2-path ties of the form i-k-j for every 〈i, j〉 dyad, this point biserial correlation indicates the extent to which there is an overall tendency in the digraph
to close a directed 2-path with a directed tie. If the correlation is strong and positive, that would indicate that the tendency towards transitive closure is increased as a function of the number of directed 2-path paths surrounding any dyad. A negative correlation would indicate that the more directed 2-path paths that surround a dyad, the less likely it is that i will be directly tied to j, indicating that
has an intransitivity propensity (such as one would find in a star-shaped graph). A zero correlation would indicate that, controlling for the density in
and the number of directed 2-paths in
, there is no overall propensity towards transitivity in
.
We can use this same approach to look at balance in a graph. While the solution is not as simple as correlating with
, we can still create a matrix of the number of contextual 2-paths that would be specified by the balance theory prediction; and we can just as easily create another matrix of ties that would be predicted by this same balance theory statement. If these matrices are positively correlated, then that would constitute evidence that the specific balance predictions tend to hold in
.
This approach requires calculating for any given signed graph, , two matrices, M1 and M2, where M1 contains the predicted Ego − Alter relation, and M2 contains the number of conditional 2-paths around each dyad that are being tested. These two matrices are straightforward derivatives of the signature notation we have used thus far. In the general case, where the triad signature describing the predicted balance relation is given by the 3-tuple <q.rs>, then
where RS is the inner product of the relation R and the relation S.
A balance correlation, then, is simply the product-moment correlation between off-diagonal elements in M1 with the corresponding elements in M2. Moreover, as just demonstrated, these correlations are easily generated from the signatures themselves. For example, the <n.pn> signature generates a balance statement that negative n-ties are induced from an abundance of 2-path pn-ties; and the balance correlation is ρ(N, PN), where N is the matrix of n-ties, and PN is the compound word [27] created by the matrix product of the P matrix and the N matrix of ties.
Balance theory predicts a positive correlation on balanced signatures and a negative correlation on unbalanced signatures. Whether a network is balanced overall, in this view, depends on which of the possible balanced or unbalanced signatures you are looking at. For example, within the same network, there could be a strong positive balance correlation for the predictive <p.nn> statement but a negative correlation for the predictive <n.pn>. Or, perhaps we observe a positive correlation for <p.pp> but also a positive correlation for <p.pn> (which, according to balance theory, should be negative). That is, balance theory may hold for some configurations but not others.
Phenotypical twins. As we have already noted above, there are 33 = 27 triad signatures. And, each of these signatures doubles as a possible prediction and generates a correlation that permits us to evaluate this prediction. It should be recognized, while these predictive statements and measured correlations can be generated for any of the 27 signatures, many of these signatures do not correspond to anything that Heider would claim actually stems from balance theory. Yet, it is still possible that these empirical correlations might yield insights into structural patterns that could lead to new theoretical extensions, akin to what balance theory has produced over the past half century [19, 29]. A thorough exploration of such patterns in any dataset would result in a 9 × 3 correlation matrix, where each of the three dependent relations (Q, R, and S) is potentially predicted by each of the 9 possible 2-path relations.
While such an exploration sounds inviting, an important caveat is required here. Each of these signatures represents a different theoretical claim/prediction, but it is not the case that each is empirically distinguishable from all other predictions at the aggregated network level. For example, consider the balanced triad signature <n.np>, Ego sends a negative tie to Alter and a negative tie to X; Alter sends a positive tie to X. The prediction represented by this signature is that, conditioned on Ego disliking a third party (X) and Alter likes that same third party, this promotes Ego to send a negative tie to Alter. In assessing this statement, the correlation takes into account how many of these particular conditional 2-paths there are between Ego and Alter.
Now consider a second configuration/prediction, also balanced, represented by the triad signature: <n.pn>. Here the primary difference is that the conditional 2-path represents Ego as having a positive relationship with X and Alter has a negative relationship with X (rather than vice versa). While the balance prediction is the same, the theoretical process is different. Indeed, in the first <n.pn> case, at least Ego has one positive tie, which she may wish to keep. In the second case, Ego is surrounded by negative ties, and perhaps is less committed to maintaining them.
These two predictions, even though both represent balanced states, are quite different from one another on the surface. Yet, while the theoretical mechanism for balance and adjustment might be arguably dissimilar in these two cases, a closer look reveals that they are empirically indistinguishable at the network level.
Consider the particular triple in the first case, <n.np>, where Ego sends a negative tie to Alter. We found in that particular Ego − Alter pair that there existed a certain number of 2-paths of the form <nz>. Let’s call that number knz. It is also the case that for every Ego − Alter pair where the relationship is <n> (negative), there is exactly one corresponding pair where Ego and Alter switch roles as Ego becomes Alter and Alter becomes Ego. For each original Ego − Alter pair, the mirrored Alter-Ego pair will also have the same set of knz 2-paths, except when they switch roles the <nz> 2-paths become <zn> 2-paths. Thus, knz for the original Ego − Alter pair will equal kzn for the mirrored Alter-Ego pair. This equality will repeat for each original Ego − Alter pair. These equivalent and commensurate frequency counts of dyads and 2-paths will be repeated for every Ego − Alter pair in the multigraph. The result is that the correlation between N and NP will equal the correlation between N and PN. They will necessarily be identical, even though they theoretically describe two different mechanisms.
We claim that these two configurations, these two triad signatures <n.pn> and <n.np> are behaviorally indistinguishable, or phenotypical twins, in that no evidence can be observed in any network that would allow one to know which of the two theoretical mechanisms is a better explanation of the tendencies in the network toward tie formation or maintenance. While their genotypes are different (they have different fundamental signatures and theoretical underpinnings), their observable manifestations and influences, at least as indicated by exact correlations between the predicted relation and the conditional 2-paths, are not.
Not all signatures have phenotypical twins (hereafter referred to simply as “twins”). Indeed, of the 27 signature triads, nine have no twin. But the remaining 18 do have exactly one twin. It is important to know which signatures have a twin, and what the twin’s signature is, because it will be impossible to tell which theoretical explanation deserves credit for the observed strength in predictive power as indicated by the balance correlation. It will call for a mandatory dose of humility on all our parts.
We provide the following theorem for uncovering twins in triad signatures (a general proof of this theorem on multi-digraphs is provided in S1 Appendix):
Theorem 2.1. For any triad signature <q.rs>, if it has a twin, it will consist of the signature <q.sr>. If and only if s = r (that is, s and r are the same relation), then <q.rs> will not have a twin.
Thus, <n.pn> and <n.np> are twins. Note, however, that <n.pn> and <p.nn> are not twins, even though they are isomorphic in their graph structures. It is important to note that non-twin signatures can, in any given graph, have identical balance correlations. There may be several balance correlations equal to 0, for example, indicating that there is no tendency toward balance or unbalance as predicted in those signatures. The critical issue with twin signatures is that their Balance correlations will be equal by construction in any and all possible graphs of any size composed of any arrangement of <p>, <n> and <z> ties. As we will see below in our analysis, isomorphic triads but which are not twin signatures, can have dramatically different balance correlations. Indeed, in some cases, we will observe isomorphic triadic signatures that will reveal opposite signs; that is, one signature will exhibit balance while its isomorphic counterpart will be unbalanced.
3 Empirical example
3.1 Background
The importance of Balance Theory for international relations has been widely recognized [30–34]. Current models of (structural) balance theory allow for consideration of either two types of relations (positive, or negative) (see [22, 25]). Any further refinement in strength, or nominal categorization of relationships is impossible within the current framework. The proposed approach in this paper does allow to investigate many more refinements, which occur in international relations. Note however, that although we are studying data over time, this is not a study into the dynamics of international relations. Rather we identify balance correlations per year that subsequently substantiate a time series. In other words we study embedded triadic balance behaviour at a system level, not the feedback of system level changes into the considerations of countries when making international relational choices, although this is a further extension we would highly support, it is not studied in this paper.
3.2 Data
To demonstrate the usefulness and flexibility of this model, we will use the Correlates of War (CoW) data. These data are the result of an ambitious project started in 1969 by a political scientist at the University of Michigan, David Singer [35, 36] (for a current and exhaustive review). This data set includes a comprehensive accounting of both formal defense alliances and military acts of aggression that occurred each year between all pairs of countries from 1816 to 2007. We considered any formal defense alliance between the pair of countries as a positive tie. Acts of aggression recorded in the data set included many categories, ranging from threats of military action against another country to all out wars or invasions (see Table 6 taken from [37]). We coded any aggressive act by one country against another as a negative tie. In the rare case where both a defense alliance and an act of aggression occurred within the same year, we coded that tie as a negative tie for that year.
As detailed in [37], there were 22 CoW codes regarding negative ties between nations. They clustered these categories into five buckets, each representing a level of hostility between nations, where 1 represented no military action and 5 represented a full scale war. Instances of no military action (level 1 per their coding scheme) were not coded as a negative tie in our analysis. The remaining 21 codes, however, were deemed a negative tie in this paper. See Table 6 for a full listing of these codes.
Under the assumption of limited relational resources it is reasonable to argue that countries will consider relations with the whole set of other countries. They are limited in developing ties due to resources restrictions, which differ per country. It is recognized that the implicit or explicit neutrality choice is an embedded triad choice driven by a set of possible motives including geographical distance, but render this a political decision.
3.3 Testing balance correlations
Testing balance correlation raises the question: against which null distribution? We rely on a Monte Carlo-simulation to generate a sample of random networks from a data generating process (DGP) assumed for the population. This DGP preserves the marginal distribution of negative, positive and zero ties observed in the data for each year. For example, if a country has a defense alliance with 12 other countries, and engages in an act of aggression with 5 other countries, then the DGP would generate sample networks that would preserve this number of positive and negative ties with other countries in expectation.
To construct such a DGP, we make use of the result that the maximum-entropy model with specific expectations for some functions of the random variable of interest is (to use modern statistical terminology) an exponential family model. Its sufficient statistic must comprise the functions whose expectations are to be fixed [38]. Exponential-family random graph models (ERGMs) have been developed both for scenarios in which positive and negative ties may co-occur and must thus be modeled jointly [39] and for scenarios in which they cannot [40], with the general polytomous relationship case first specified by [41]. Our application then requires the sufficient statistic vector to be the concatenation of positive and negative tie degrees of each node, resulting in a model of the form
where 1n is an n-vector of 1s and κ(η) is the normalising constant so that
sums to 1 over the sample space of signed networks of interest. Estimation and simulation are then possible using general techniques for valued ERGMs [42], or by a specialized implementation. In a parallel development, this and other null models for signed graphs were studied by [43], including an optimized implementation.
In addition, this null-generating process all preserves the observed covariance in distribution of negative, positive and zero ties in each year. This constitutes a conservative test, one that restricts the sampling space support taking the empirical marginals of the data matrices as expected degree. This degree based approach is more conservative than for example an Erdös-Renyi random network approach [43], because the latter would allow a wider range of possible random networks to be considered. In our data set of country ties, we assume that there is diversity between countries in their capacity to engage in positive and negative ties. Note that this doesn’t imply countries with less capacity are not making active choices. Their choice sets differ, but we assume they can and do consider all possible ties explicitly or implicitly. This assumption supports our choice to base the test on degree expectations, as it includes the investment distribution all countries make.
This DGP approach is based on the observed data for each given year. Thus, the set of probable null graphs can be seriously constrained, and these constraints can change for each balance correlation for each year. The resulting distribution of null graphs may exhibit unfamiliar shapes. For example, in Fig 3 we see the distribution of balance correlations for the set of <pp> 2-path balance correlations (<p.pp>, <n.pp>, and <z.pp>). The <p.pp> balance correlations are mostly positive, ranging from just below zero to.60. The <z.pp> balance correlations, however, are mostly negative, ranging from −.55 to.00 The distribution for the <n.pp> balance correlations, ranges from −.05 but a range from just below that up to just below +.40. The shape of each of these distributions depends on the number of countries in the data set that year, the density of each type of tie, the degree distributions of each type of tie, and the covariances among these degree distributions. Especially for the latter two distributions this leads to stron bi-modal distributions as can be seen from Fig 3. These peculiarities may differ per year. Therefore a separate DGP was used to test each balance correlation for each year.
Note: In this example, 1000 graphs were generated from a null-model that simulated networks that had expected p, n, and z degrees equal to those observed in 1916. The observed densities of the positive, negative and zero ties were.023, .036, and .940, respectively. This histogram displays the distribution of three balance correlations (<p.pp>, <n.pp>, and <z.pp>) calculated from these null graphs. The green-shaded part of the histogram, mostly on the right side of the chart, displays the frequency distribution of the <p.pp> correlations. The teal-shaded part, mostly on the left side of the chart, displays the frequency distribution of the <z.pp> correlations. The purple-shaded portion of the histogram, in the middle of the chart, displays the frequency distribution of the <n.pp> correlations. The dashed vertical colored lines represent the corresponding balance correlations for the observed data for that year. The observed balance correlations for both <p.pp> (the vertical green line) and <z.pp> (the vertical teal line) were outside the null region, suggesting they were statistically significant coefficients. The observed balance correlation <n.pp> (the vertical purple line), however, was within the null distribution region and therefore not statistically significant.
3.4 Results
Before we present the primary balance results, it is important to comment on the context for these data (see Fig 4). Overt acts of aggression (our source of negative ties) vary over the years for which we had data (1816 to 2007). In a few of the earlier years, there were no acts of aggression at all. For such years, we do not, indeed cannot, compute a balance correlation. Also, the number of affirmative defense agreements between nations varies considerably over this period of history. And, of course, this means the proportion of zero ties will also vary over this time.
In Fig 5, we plot the density of ties of each of the three types over this time period. First and foremost, it is clear that Z-ties are the most prevalent, with densities ranging from.75 to.99 over these 192 years. Negative ties are usually (but not always) less frequent than positive ties, with spikes in density most notably during WWI and WWII. Perhaps most interesting is the pattern of positive ties. In general, the density of international relations characterized by formal defense alliances of some type varied between almost none (1909) to almost 20% (at the conclusion of WWII). What is striking in this plot is that these alliances were most frequent before 1867 and after 1936. In between these two years, the density of inter-nation defense agreements dropped considerably. 1867 was the beginning of the dissolution of the Austro-Hungarian empire; 1936 was the onset of WWII with the re-militarization of the Rhineland. We will call this period the “Lack of Alliances” (or LOA) period. We use it here to refer to this specific span of years (1867 to 1936), one that we found to be critical in our exploration of balance, given the regularity with which patterns of balance were interrupted during this period. It can be characterized as a period not of relative calm as much as a period of disorder, where alliances were either not sought or not lasting.
Note: The three dotted plots show how the density of the positive, negative and zero ties varied over the 192 years studied. The green dots indicate the density of positive ties, the purple dots indicate the density of negative ties, and the teal dots indicate the density of the zero ties. These densities within any given year by definition sum to 1.0. Of special note is the period of time between 1867 and 1936, the LOA years, during which there was a precipitous decline in the density of positive ties (alliances) between nations.
We organize the presentation of the balance correlations into families. A family is defined by the 2-path conditional. For example, the <pn> family consists of the triad signatures <p.pn>, <n.pn>, and <z.pn>.
Thus, we present three results for each of six families (<pp>, <pn>, <pz>, <nn>, <nz>, and <zz>) below. We leave out the phenotypical twins (<np>, <zp> and <zn>) from our presentation of the results below since they would be redundant with <pn>, <pz> and <nz>, respectively.
In each of the following figures, we plot the balance correlations for each year across the 192 years for which we have data (1816 to 2007). In addition, we show the 95% interval from the DGP simulated null distribution in gray. Thus, points representing balance correlations outside that gray region can be considered statistically significant in that they would not be expected to appear under the null model. Balance correlations that are within the bounds of the null model, thus are not deemed statistically significant.
3.4.1 PP family.
Fig 6 contains the three plots of the <pp> family of balance correlations for <p.pp>, <n.pp>, and <z.pp>. In the first plot, we examine the pattern of the classic balance claim that a “friend of a friend is a friend.” We see that the <p.pp> balance correlations For the most part are persistently strong and significant, ranging mostly from.8 to.95, providing strong evidence for this balance condition. The primary exception to this lies within the LOA period we identified earlier. During this time, we can see that the country level behavior in the world tends to result in less balance on a global level, while it can also not be inferred that the observed balance structure is due to balance oriented behavior. It is clear, also, that the low number of positive alliances gives rise to a wider band of balance correlations under the null model. This suggests system wide turmoil.
Remarkably, there is much less evidence for a persistent pattern of unbalance represented by the <n.pp> triad signature. According the classic balance theory, a friend of a friend should not be an enemy, or at least is less likely to be an enemy. If this principle were true, then this should yield a strongly negative correlation. Indeed, there is little evidence that this condition lowers the probability of a negative tie between the two nations; these correlations hover around zero for almost the entire observed period.
While negative ties are not suppressed in the face of 2-path alliance relations, zero ties are clearly so. The null distribution (the gray region in the plot) suggests that the structure of the distribution of positive, negative and zero ties makes a randomly generated balance correlation more likely to be negative, it is still the case that zero ties are much less likely to appear than can be attributed to chance under this null model. Again, the primary exception to this general finding occurs during the LOA period. It appears for the <pp> family, countries are likely to form alliances with other countries where they have many common mutual alliances with third party countries. And, while they may or may not engage in a hostile stance against these countries, they appear to avoid a neutral or non-tie relation with them. That is, they are forced to either fish or cut bait, to choose sides.
3.4.2 PN (and twin NP) family.
Fig 7, showing the three plots of balance correlations in the PN family, provides an interesting extension of the results in the prior figure. While the balance correlations for <p.pn>, the first panel in Fig 7, show a predominent tendency towards correlations near zero up to the years just prior to WWII, the period subsequent to WWII contain mostly significantly positive correlations. This is true despite the fact that <p.pn> configuration theoretically represents an unbalanced condition: One should not consider an enemy of a friend (or its twin condition, a friend of an enemy) a friend. Yet it seems to be a consistent strategy since the late 1950’s. Establishing these structures is a tertius gaudens strategy [44].
On the other hand, the second plot in Fig 7 shows evidence for balance in that for a majority of years the <n.pn> configuration is also positive, significant and never negative over this period. And the strength of these findings, as represented by the size of the correlations, exceeds the size of the balance-disconfirming correlations in the <p.pn> plot. That is, during much of this period of time we examined, the propensity to engage in a hostile act against a country who is a friend of an enemy is substantially greater than if they are not a friend of an enemy. The fact that both of these conditions (<p.pn> and <n.pn>) exist at the same time speaks to the complexity of international relations. To presume that only one balance rule operates in any given temporal or network circumstance is belied by these results.
What is also clear is the z-ties tend to be significantly suppressed in the face of <pn> 2-path relations. Again, it suggests neutrality is less an option many times in the face of these 2-path conditions. This general tendency, though, does not always coincide with the years during which this was true for the <z.pp> balance correlations in Fig 6.
3.4.3 PZ (and twin ZP) family.
Fig 8 is our first case where all members of the family are excluded from any of Heider’s original predictions. Thus, it is difficult to say whether any of these correlations should be theoretically balanced. What we do observe, though, is whether they appear to be empirically balanced. That is, are there any general patterns that show persistently positive (balanced) correlations, or perhaps persistently negative (unbalanced) correlations?
The answer is, clearly, “yes” to both questions. We find that alliance relations (positive ties) are less likely in the face of many <pz> (or <zp>) 2-path relations, suggesting that <p.pz> is an empirically unbalanced configuration, at least outside the LOA era. And, by contrast we find that a neutral stance against another country is more likely in the face of many <pz> (or <zp>) 2-path relations, suggesting that <z.pz> appears to be an empirically balanced configuration (again, outside the LOA years).
3.4.4 NN family.
Fig 9 provides the balance correlations for the <nn> family. The first plot in this figure provides general support of the Heiderian balance proposition that “an enemy of an enemy is a friend” in that for over half of the years the <p.nn> correlation is positive and significant. In the post LOA period, since WWII, the <z.nn> correlation is almost always significantly negative, suggesting that neutral stances are perhaps discouraged with an enemy of an enemy.
Of interest in the <n.nn> plots is the large range of correlations resulting from the null model. In particular, in the <n.nn> plot, the null region often ranges from a correlation of zero up to.95. While most of the observed correlations are either 0 or close to it. This is largely because the density of n-ties is low to begin with, and thus the density of <nn> 2-path ties to be even less frequent. This scarcity leaves the correlation to be quite unstable under the null hypothesis.
3.4.5 NZ (and twin ZN) family.
Fig 10 shows the results for the <nz> (or <zn> twin) family. Almost all of these balance correlations are either zero of close to zero, with the minor exception of the significantly positive <n.nz> correlations during the two world wars. Not only are the preponderance of these balance correlations near zero, they also are either just barely border the null region or clearly not significant at all. Thus, we have no evidence that a country who is an enemy of a neutral partner is likely to differentially engage in any particular relation (p, n or z) with another nation.
3.4.6 ZZ family.
Our final set of results are provided in Fig 11. Z ties, and <zz> 2-path ties, are the most dense. Thus, as is displayed in the plots, the null regions are relatively thin. The <n.zz> balance correlations are still largely insignificant, either inside or bordering the null region.
It is worth deliberating on the meaning of these <zz> structures. These suggest that the Ego − Alter dyad is surrounded by non-ties or neutral ties to an inordinate number of other countries. It is as if Ego and Alter are in separate or separated worlds. Thus, a <p.zz> pattern would suggest that Ego and Alter tend to form a bridge between these relatively disconnected parties. A positive balance correlation for <p.zz> would indicate that an alliance is more likely if they are from these distant parts; a negative correlation would indicate that Ego and Alter are less likely to form an alliance if they are from these separated parts. What we see from the <p.zz> plot is that the balance correlations are generally if only slightly negative, suggesting alliances are discouraged across these chasms, but occur more frequently than expected under the null-model.
A <z.zz> triple tells a slightly different story. In this case, a positive correlation would indicate that Ego − Alter pairs across such gulfs are relatively more likely to stay neutral with each other. This would suggest that there are relatively disconnected clusters in the community, and that inside these clusters are where much of the action (both positive and negative) are happening. Indeed, the <z.zz> balance correlations, again outside the LOA period, are positive, if not exceptionally strong, but occur less frequently than expected. This supports the idea of a push towards global integration.
What is interesting in this family of plots, however, is that for the <p.zz> and <z.zz> balance correlations outside the LOA years, we find correlations that are significant (not contained in the null region) yet closer to zero than the null region itself. This pattern is particularly striking in the post LOA years, during and following WWII. The <p.zz> correlations are predominantly negative and yet they are significantly greater than we would expect by chance under the null, while the opposite holds for the <z.zz> correlations.
3.4.7 Overview of results.
Fig 12 provides a visual summary of the results across all years. Each point in the plot represents the average balance correlation across all 192 years for which there are data. They are divided into three groups: the “theoretically balanced” group (<p.pp>, <n.pn>, and <p.nn>), the “theoretically unbalanced” group (<n.pp>, <p.pn>, and <n.nn>), and the “indeterminate” group composed of the remaining 12 triad signature configurations that contain at least one z-tie.
Several overall findings are apparent from this figure. First, the strongest finding supports the original Heiderian balance prediction represented by the <p.pp> signature. That is, at this international level, countries who have alliances with common third countries tend to have alliances with each other: “A friend of a friend is a friend.” The other theoretically predicted balanced structures are weaker, but at least some of the <n.pn> structures are significant over time (see Fig 7). Interestingly, none of Heider’s theoretically predicted unbalanced structures receive empirical support. All of these correlations hover around zero. Indeed, if anything, the <n.nn> and <p.pn> unbalanced structures show relatively frequent positive correlations, just the opposite of what Heiderian theory would predict.
Perhaps the most interesting, and surprising, results involve those configurations with a z-tie. The most striking negative correlation does not emerge from any of the theoretically unbalanced triples but rather occurs in the <z.pp> configuration. That is, being an ally of an ally seems to carry no systematic information about hostile acts between two countries that differentiates it from other contexts. In contrast, a neutral tie is far less likely to occur in this context than in any other context. Other z-tie results also engage in unexpected patterns, although the strength of those findings is not as evident as is the <z.pp> case. We dilate these results and based on the correlations among Balance Correlations make the case that beside the theoretical arguments for treating z-ties as separate category, the empirical results strongly corroborate this idea.
3.4.8 Interpreting z-ties.
In our example on relations between countries, clusters of balance theoretical behaviors can be identified, i.e. clusters of balance correlations can be discerned that are positively correlated, irrespective of whether the balance correlations are significant or not. Furthermore, these clusters may show negative correlations with balance correlations in other clusters. Earlier we discussed the marking difference between <p.pp> and <z.pp>. Fig 13 shows a strong negative correlation (−.95) between these two signatures. So the mean values of these balance correlation are opposite, and this is consistent over time. An even stronger negative correlation occurs between <z.pz> and <p.pz> (−.99). Although the balance correlations themselves are not extreme, the observed signatures in our example seem to be highly dependent. It is stressed that this is an empirical effect not a priori determined.
Using a hierarchical clustering, looking at 7 clusters gives a first singleton <n.nn> (unbalanced), and 4 pairs (<n.pn> and <p.nn>, both balanced; <n.pp> and <p.pn>, both unbalanced; and two indeterminate pairs, <n.nz> and <p.zz>, and, <z.pn> and <z.nn>). The balanced and unbalanced pairs have much stronger correlations, .86 and .79, respectively, compared to .31 and .48 for the respective indeterminate pairs. There are two larger blocks with 4 and 5 signatures, respectively. The first contains signatures from 4 different families, and includes the balanced <p.pp>, which is most strongly correlated with <z.pz> (.75), while the other two, <z.nz> and <n.zz> are even stronger correlated (.91) the other correlations range from.24 to.39. In the largest cluster all signatures are theoretically indeterminate with highest correlations between the pair <n.pz> and <p.nz> (.93), and the pair <z.pp> and <p.pz> (.78). The other correlations in this cluster range from.23 and.53.
Now this result may helpful to assess whether the z-ties should also empirically be considered a separate type. To interpret the valence of z-ties as either positive or negative if we replace these z’s with either a p or n designation, such that it provides a balance theoretical prediction consistent with the observed mean balance correlation (Fig 12).
In Table 7 little support for the idea that z-ties can be considered either as positive or negative is found. Rather there is much support for what we call z-tie ambiguity. When z-labels are imputed with n or p labels, according to achieve a signature that is coherent with Heiderian balance theory multiple situations can be distinguished. First, in most cases coherence can be achieved by exclusive p or n imputation. Second, coherence can be brought about by either p or n imputation. Third, coherence can only be achieved by imputation of both p and n.
In the latter case there is obvious ambiguity, revoking the idea that z-ties can be considered either positive or negative. This occurs in 2 cases (<z.nz> =.01 and <p.zz> = −.11), providing a first logic to consider z-ties as a separate class on empirical grounds, besides the theoretical argument provided earlier. However, in all other cases a unique alternative labeling for z-ties exists that will result in a coherent mean balance correlation.
In the second case where a coherent mean balance correlation occurs whenever either an n or an p gets imputed there is a case for which no ambiguity can be detected. In the case of <z.pz> when z is imputed with the p label, a strong positive correlation (.75) between <z.pz> and <p.pp> supports the conjecture that z-ties are to be seen as positive ties. However, in the other indeterminate signature such support is not found as correlation of <n.zz> with both <n.pp> as well as with <n.nn> are not significant.
In the eight cases where one unique label creates a coherent balance correlation, only one signature has a significant correlation with the imputed signature (<n.pz> and <n.pp>, ρ =.32). In 4 cases there is no significant result, so we must conclude on ambiguity of z-ties. Certainly not on the whole set as there are 6 undisputed signatures where z-ties are to be treated as positive ties, and 3 signatures as negative ties. But, neither on the context specific level as only two indeterminate signatures are consistently correlated with imputed signatures. This warrants treating z-ties as a separate class based on empirical grounds.
4 Extensions
This paper deals with signed graphs, where edges between nation states are undirected symmetric ties. One of major advances in [19] was to recognize that networks are often, perhaps most often, composed of directed ties. The work presented here can be extended to directed ties, also, leading to new specific theoretical predictions and explorations of possible balance statements and commensurate balance correlations to assess them.
However, it should be noted that this increases substantially the number of possible balance statements. Instead of one relation for each pair, there is both an in-tie and an out-tie that would have to be specified; thus, for each pair we must specify two relations, not just one. For example, we could describe a case where Ego sent a positive tie to Alter, but Alter does not respond with any sentiment (i.e., send a z-tie back). Perhaps Ego sends a negative tie to X, and X reciprocates with a non-tie. Further, suppose Alter sends a negative tie to X and X responds with a similar negative tie back to Alter. This configuration would be described as <pz. nz nn>. Since there are six directed ties in any labeled triple, then the number of possible directed triad signatures is 36 = 729, not simply 27 as described in this paper. An interesting extension of our approach to digraphs is that of these 729 signatures, 702 have an indistinguishable twin and 27 do not (see the proof of the theorem 2.1 in S1 Appendix), resulting in 378 unique balance correlations. We expound on these structures, their balance statements, and their empirical results in a subsequent paper on balance correlations in directed graphs.
An extension that would even further reintegrate a structural approach with Heider’s original ideas is to replace an individual node as context with sets of attitudes, values, norms, or issues [13]. This would allow to test different versions of balance theory, and further establish the link to cognitive dissonance theory [18, 45, 46].
The null model we have chosen controls for both positive and negative degree heterogeneity of the nodes. The result of [38] is very general, and the exponential family framework can be used to control for other social forces, such as geographic proximity and language, by augmenting or replacing the sufficient statistic in Eq 6 with operationalizations of these effects.
5 Discussion and conclusion
Our intent here is to propose a new approach to the theorizing and study of balance processes in network contexts. By formally incorporating neutral or zero ties into the analysis, we extend balance theory in a significant way that better matches the real world of actors in a systemic relational structure. And, in an important way, we return to Heider’s original thinking by focusing on Ego’s choice of how to relate to a specific Alter when confronted with the concomitant relations with one or many third parties. We present a novel nomenclature, the triad signature, that assists in identifying and theorizing about (un)balance conditions.
Finally, we propose a simple, intuitive measure, the balance correlation, that directly reflects the extent to which Heider’s predictions (and its extensions in this paper) are supported by the evidence in an observed network. This measure captures the difference in probabilities of Ego’s choice of a tie with Alter, conditioned on whether the specific balance-theoretic 2-path configuration exists or does not exist. We also describe a reasonable null test of this statistical measure and apply it to a well-known data set employed by political scientists in international affairs, the Correlates of War data.
As powerful as this approach is, we also note that the 27 different theoretical predictions that can be identified using the triad signature are not always uniquely identifiable empirically. We show that, indeed, 18 of these signature have a phenotypical twin signature, in that the two signatures will necessarily receive identical support in any given network and can therefore never be distinguished using a balance correlation. We also provide a proof that reveals the necessary and sufficient conditions for twins to exist and how to identify them.
We further demonstrate the power of this approach by exploring the balance correlations of all the possible predictions identified by the triad signatures over almost two centuries of conflict and alliances among extant nations in the world. We find systematic underlying patterns that deserve attention. First, we find strong evidence overall for balance theory on positive ties. In cases where the conditional 2-path relations contain either two positive ties or two negative ties, the probability of a direct positive tie is significant for most of the years, as would be expected according to balance theory.
Second, we find that the predictions around unbalanced states are not confirmed for the most part and occasionally even counter to Heider’s original contention. For example, the theoretically unbalanced <n.pp> correlation hovers around zero for most of the two centuries of data. And a clear example of a result that dis-confirms an unbalanced prediction is given in Fig 7, where the theoretically unbalanced <p.pn> signature is consistently positive in the post WWII era. This is consistent with an alternative to balance seeking behavior, namely a tertius gaudens strategy as described in [44]. However, the results also show that the balanced <n.pn> signature is stronger. This emphasizes that both strategies may be at play at the same time in the same network, revealing the complexity of studying international relations.
Thirdly, in a related general finding, it appears that the z-ties (neutral ties) play a larger role in establishing balance, or at least avoiding unbalanced states, than negative hostile ties over substantial time periods. For example, in the case of where the Ego − Alter dyad is embedded in many <pp> 2-path relations (see Fig 6), there is no discernible evidence that Ego avoids negative ties (<n.pp>), but there is substantial and significant suppression or avoidance of neutral ties (<z.pp>).
This study of embedded triadic balance behaviour over time suggests that although the strive for balance may be a component of individual countries’ motivation, it does not necessarily suggests social equilibrium or social resilience of the system. In fact, we find over time that balance can become more and less prevalent in whole networks and that these levels of prevalence can be stable over longer time spans. This is consistent with Heider’s [1] observation that the tension produced by unbalance may be attractive. In the setting of international relations such imbalance may hold promise for opportunities to achieve nodal benefits. The presented method indicates the prevalence of different types of balance, imbalance and undetermined behaviors.
In sum, it is our contention that balance correlations provide a simple, intuitive way to assess the extent to which specific assumption on balance behaviors might be true on the average across an entire network. Furthermore, by incorporating agentic zeroes into the exploration of balance conditions, we may uncover new properties of social order that have been prevalent, even dominant, but hidden from the researcher’s purvue or agenda. We hope this new approach opens up avenues of exploration that have been heretofore ignored.
As a final remark we want to emphasize that an updated Correlates of War database (real-time) may demonstrate the practical use of balance correlation as an indicator of the level of global stability. Although we leave it to historians to corroborate that the LOA-era was characterized by dismal turbulence more than in other periods, the consistent drop in balanced relations might well serve as warning signal. If so, balance correlations may inform politicians and military leaders on global temperature in international relations.
Supporting information
S1 File. This file contains the data on which the figures are generated in this paper and is suitable to be read in the R-program for statistical computing and graphics.
https://doi.org/10.1371/journal.pone.0315088.s001
(RDATA)
S2 File. This is a Jupyter Notebook file for Python with links to the data on the Correlates of War website used in this paper.
It is advised to first download the data and save them in the same folder from which the notebook is running under the names used in the notebook. In addition, it contains the code that was used in pre-processing the data and generating the yearly networks on the three different relations. By providing this notebook, we enable readers to inspect the code and understand the underlying methodology and replicate the data generation process for their own research. The required libraries are pandas and numpy.
https://doi.org/10.1371/journal.pone.0315088.s002
(PDF)
Acknowledgments
We like to thank participants at Networks 2021 (Bloomington, USA), Sunbelt 2022, (Cairns, Australia), Sunbelt 2023 (Portland, USA), ION IX (Lexington, USA), and EUSN 2023 (Ljubljana, Slovenia) for valuable remarks.
References
- 1.
Heider F. The Psychology Of Interpersonal Relations. New York: John Wiley & Sons; 1958.
- 2. Moody J. Balance theory and triads: Structural constraints on positive and negative relations. Social Psychology Quarterly. 1986;49(3):237–252.
- 3. Rawlings CM, Friedkin NE. The structural balance theory of sentiment networks: Elaboration and test. American Journal of Sociology. 2017;123(2):510–548.
- 4.
Krasner SD. Structural Realism and International Relations. Princeton University Press; 1985.
- 5. Newcomb TM. An approach to the study of communicative acts. Psychological Review. 1953;60(6):393–404. pmid:13112341
- 6. Hage P. Graph Theory as a Structural Model in Cultural Anthropology. Annual Review of Anthropology. 1979;8(1):115–136.
- 7. Doreian P, Krackhardt D. Pre-transitive balance mechanisms for signed networks. Journal of Mathematical Sociology. 2001;25(1):43–67.
- 8. Feld SL. The focused organization of social ties. American journal of sociology. 1981;86(5):1015–1035.
- 9. Snijders TA. The statistical evaluation of social network dynamics. Sociological methodology. 2001;31(1):361–395.
- 10. Fienberg SE, Meyer MM, Wasserman SS. Statistical analysis of multiple sociometric relations. Journal of the american Statistical association. 1985;80(389):51–67.
- 11. Kirkley A, Cantwell GT, Newman ME. Balance in signed networks. Physical Review E. 2019;99(1):012320. pmid:30780212
- 12. Cartwright D, Harary F. Structural balance: a generalization of Heider’s theory. Psychological review. 1956;63(5):277. pmid:13359597
- 13. Opp KD. Balance Theory: Progress and Stagnation of a Social Psychological Theory. Philosophy of the Social Sciences. 1984;14(1):27–49.
- 14. Doreian P. Evolution of human signed networks. Metodološki zvezki. 2004;2(1):277–293.
- 15. Estrada E. Rethinking structural balance in signed social networks. Discrete Applied Mathematics. 2019;268:70–90.
- 16. Heider F. Social perception and phenomenal causality. Psychological review. 1944;51(6):358.
- 17. Heider F. Attitudes and cognitive organization. The Journal of psychology. 1946;21(1):107–112. pmid:21010780
- 18.
Opp KD. Structural Balance, Social Networks, and the Diffusion of Innovations. Can Balance Theory Explain the Protestant Reformation? Linköping, Sweden: Institue for Analytical Sociology, Linköping University; 2021. Available from: https://www.youtube.com/watch?v=VQJ8ydNpwP4.
- 19.
Holland PW, Leinhardt S. Perspectives on social network research. New York: Academic Press; 1979.
- 20. Dinh L, Rezapour R, Jiang L, Diesner J. Enhancing structural balance theory and measurement to analyze signed digraphs of real-world social networks. Frontiers in Human Dynamics. 2023;4:1028393.
- 21. Aref S, Wilson MC. Measuring partial balance in signed networks. Journal of Complex Networks. 2018;6(4):566–595.
- 22. Aref S, Wilson MC. Balance and frustration in signed networks. Journal of Complex Networks. 2019;7(2):163–189.
- 23. Estrada E, Benzi M. Walk-based measure of balance in signed networks: Detecting lack of balance in social networks. Physical Review E. 2014;90(4):042802. pmid:25375544
- 24.
Rapoport A. Mathematical models of social interaction. In: Luce RD, Bush RR, Galanter E, editors. Handbook of Mathematical Psychology. vol. II. New York, NY: John Wiley and Sons; 1963. p. 493–579.
- 25. Harary F. On the measurement of structural balance. Behavioral Science. 1959;4(4):316–323.
- 26. Dekker D, Krackhardt D, Snijders TAB. Transitivity correlation: A descriptive measure of network transitivity. Network Science. 2019;7(3):353–375.
- 27. White HC, Boorman SA, Breiger RL. Social Structure from Multiple Networks. I. Blockmodels of Roles and Positions. American Journal of Sociology. 1976;81(4):730–780.
- 28. Pachucki MA, Breiger RL. Cultural Holes: Beyond Relationality in Social Networks and Culture. Annual Review of Sociology. 2010;36:205–224.
- 29. Davis JA. The davis/holland/leinhardt studies: An overview. Perspectives on social network research. 1979; p. 51–62.
- 30. Moore M. Structural balance and international relations. European Journal of Social Psychology. 1979;9(3):323–326.
- 31. Antal T, Krapivsky PL, Redner S. Social balance on networks: The dynamics of friendship and enmity. Physica D: Nonlinear Phenomena. 2006;224(1-2):130–136.
- 32. Doreian P, Mrvar A. Structural Balance and Signed International Relations. Journal of Social Structure. 2015;16(1):1–49.
- 33. Kinne BJ, Maoz Z. Local Politics, Global Consequences: How Structural Imbalance in Domestic Political Networks Affects International Relations. The Journal of Politics. 2023;85(1):107–124.
- 34. Zhang M, Chen S, Du W, Cao X, Li D, Zhang J, et al. Risky cascading transitions in international relationships. PNAS Nexus. 2023;2(1):pgac289. pmid:36712936
- 35. Singer JD. Reconstructing the correlates of war dataset on material capabilities of states, 1816–1985. International Interactions. 1988;14(2):115–132.
- 36.
Bremer SA, Keohane RO, Stanger MJ. Correlates of War: A Critical Assessment. Cornell University Press; 2022.
- 37. Palmeri HG, Carlson RN, Ostendorf M. An analysis of the COW database. Behavior Research Methods. 2020;52(2):493–503.
- 38. Jaynes ET. Information Theory and Statistical Mechanics. Physical Review. 1957;106(4):620–630.
- 39. Huitsing G, Van Duijn MAJ, Snijders TAB, Wang P, Sainio M, Salmivalli C, et al. Univariate and multivariate models of positive and negative networks: Liking, disliking, and bully–victim relationships. Social Networks. 2012;34(4):645–657.
- 40.
Fritz C, Mehrl M, Thurner PW, kauermann G. Exponential Random Graph Models for Dynamic Signed Networks: An Application to International Relations; 2022.
- 41. Robins G, Pattison P, Wasserman S. Logit models and logistic regressions for social networks: III. Valued relations. Psychometrika. 1999;64(3):371–394.
- 42. Krivitsky PN. Exponential-family random graph models for valued networks. Electronic Journal of Statistics. 2012;6. pmid:24678374
- 43. Gallo A, Garlaschelli D, Lambiotte R, Saracco F, Squartini T. Testing structural balance theories in heterogeneous signed networks. Communications Physics. 2024;7(1):154.
- 44.
Burt RS. Structural holes: the social structure of competition. Cambridge, Mass.: Harvard University Press; 1992.
- 45.
Festinger L. A theory of cognitive dissonance. A theory of cognitive dissonance. Stanford University Press; 1957.
- 46.
Harmon-Jones E, editor. Cognitive dissonance: Reexamining a pivotal theory in psychology (2nd ed). Washington: American Psychological Association; 2019.