If a Graph is a Faction Iis is Continuous or Discrete

Proc Natl Acad Sci U S A. 2011 Feb 1; 108(5): 1771–1776.

From the Cover

Applied Mathematics

Continuous-time model of structural balance

Seth A. Marvel

^aCenter for Applied Mathematics, and

Jon Kleinberg

^bDepartment of Computer Science, Cornell University, Ithaca, NY 14853

Robert D. Kleinberg

^bDepartment of Computer Science, Cornell University, Ithaca, NY 14853

Steven H. Strogatz

^aCenter for Applied Mathematics, and

Supplementary Materials: Supporting Information

GUID: 713BB5AE-2A78-4828-BDAF-B2C8A95BEE50

GUID: 849B4182-31ED-4392-BB07-A1608BA2EDD8

Abstract

It is not uncommon for certain social networks to divide into two opposing camps in response to stress. This happens, for example, in networks of political parties during winner-takes-all elections, in networks of companies competing to establish technical standards, and in networks of nations faced with mounting threats of war. A simple model for these two-sided separations is the dynamical system dX/dt =X ², where X is a matrix of the friendliness or unfriendliness between pairs of nodes in the network. Previous simulations suggested that only two types of behavior were possible for this system: Either all relationships become friendly or two hostile factions emerge. Here we prove that for generic initial conditions, these are indeed the only possible outcomes. Our analysis yields a closed-form expression for faction membership as a function of the initial conditions and implies that the initial amount of friendliness in large social networks (started from random initial conditions) determines whether they will end up in intractable conflict or global harmony.

Keywords: random matrix theory, polarization

The mathematical model that we want to study is best understood as an outgrowth of a theory from social psychology known as structural balance (1). So let us begin with a brief explanation of what this theory says.

Consider three individuals: Anna, Bill, and Carl, and suppose that Bill and Carl are friends with Anna, but are unfriendly with each other. If the sentiment in the relationships is strong enough, Bill may try to strengthen his friendship with Anna by encouraging her to turn against Carl, and Carl might likewise try to convince Anna to terminate her friendship with Bill. Anna, for her own part, may try to bring Bill and Carl together so they can reconcile and become friends. In abstract terms, relationship triangles containing exactly two friendships are prone to transition to triangles with either one or three friendships.

Alternately, suppose that Anna, Bill, and Carl all view each other as rivals. In many such situations, there are incentives for the two people in the weakest rivalry to cooperate and form a working friendship or alliance against the third. In these cases, a single friendship may be prone to appear in a relationship triangle that initially has none.

These two thought experiments suggest a notion of stability, or balance, that can be traced back to the work of Heider (2). Heider's theory was expanded into a graph-theoretic framework by Cartwright and Harary (3), who considered graphs on n nodes (representing people, countries, or corporations) with edges signed either positive (+) to denote friendship or negative (−) to denote rivalry. If a social network feels the proper social stresses (those felt by Anna, Bill, and Carl in the examples above), then Cartwright and Harary's theory predicts that in steady state the triangles in the graph should contain an odd number of positive edges—in other words, three positive edges or one positive edge and two negative edges. We refer to such triangles as balanced, and triangles with an even number of positive edges as unbalanced. Finally, we call a graph complete if it contains edges between all pairs of nodes, and we say that a complete graph with signs on its edges is balanced if all its triangles are balanced. (All graphs in our discussion will be complete.)

As it turns out, these local notions of balance theory are closely related to the global structure of two opposing factions. In particular, suppose that the nodes of a complete graph are partitioned into two factions such that all edges inside each faction are positive and all edges between nodes in opposite factions are negative. (One of these factions may be empty, in which case the other faction includes all the nodes in the graph, and consequently all edges of the network are positive.) Note that this network must be balanced, because each triangle either has all three members in the same faction (yielding three positive edges) or has two members in one faction and the third member in the other faction (yielding one positive edge and two negative ones). In fact, a stronger and less obvious statement is true: Any balanced graph can be partitioned into two factions in this way, with one faction possibly empty (3). As a result, when we speak of balanced graphs, we can equivalently speak of networks with this type of two-faction structure.

Model

Structural balance is a static theory—it posits what a "stable" signing of a social network should look like. However, its underlying motivation is dynamic, based on how unbalanced triangles ought to resolve to balanced ones. This situation has led naturally to a search for a full dynamic theory of structural balance. Yet finding systems that reliably guide networks to balance has proved to be a challenge in itself.

A first exploration of this issue was conducted by Antal et al. (4) who considered a family of discrete-time models. In one of the main models of this family, an edge of the graph is examined in each time step, and its sign is flipped if this produces more balanced triangles than unbalanced ones. Although a balanced graph is a stable point for these discrete dynamics, it turns out that many unbalanced graphs called jammed states are as well (4, 5).

Thus, the natural problem became to identify and rigorously analyze a simple system that could progress to balanced graphs from generic initial configurations. An approach to this problem was taken by Kułakowski et al. (6), who proposed a continuous-time model for structural balance. They represented the state of a completely connected social network using a real symmetric n ×n matrix X whose entry x _ij represents the strength of the friendliness or unfriendliness between nodes i and j (a positive value denotes a friendly relationship and a negative value an unfriendly one). Note that for a given X, there is a signed complete graph with edge signs equal to the signs of the corresponding elements x _ij in X. We will call X balanced if this associated signed complete graph is balanced.

Kułakowski et al. considered variations on the following basic differential equation, which they proposed as a dynamical system governing the evolution of the relationships over time:

equation image

[1]

Remarkably, simulations showed that for essentially any initial X(0), the system reached a balanced pattern of edge signs in finite time.

Writing Eq.1 directly in terms of the entries x _ij gives a sense of why this differential equation should promote balance:

equation image

[2]

Notice that x _ij is being pushed in a positive or negative direction based on the relationships that i and j have with k: If x _ik and x _kj have the same sign, their product guides the value of x _ij in the positive direction, whereas if x _ik and x _kj have opposite signs, their product guides the value of x _ij in the negative direction. In each case, this is the direction required to balance the triangle {i,j,k}. Note also that Eq.2 applies for the case that i =j. Although this case is harder to interpret, the monotonic increase of x _ii implied by Eq.2 might be viewed in psychological terms as an increase of self-approval or self-confidence as i becomes more resolute in its opinions about others in the network.

For a network with just three nodes, it can be easily proved that a variant of these dynamics generically balances the single triangle in this network; such a three-node analysis has been given by Kułakowski et al. (6), and we describe a short proof in SI Text. What is much less clear, however, is how the system should behave with a larger number of nodes, when the effects governing any one edge {i,j} are summed over all nodes k to produce a single aggregate effect on x _ij.

It has therefore been an open problem to prove that Eq.2 or any of the related systems studied by Kułakowski et al. will bring a generic initial matrix X(0) to a balanced state. It has also been an open problem to characterize the structure of the balanced state that arises as a function of the starting state X(0).

Results

In this paper, we resolve these two open problems. We first show that for a random initial matrix (with entries sampled independently from an absolutely continuous distribution with bounded support), the system reaches a balanced matrix in finite time with a probability converging to 1 in the number of nodes n. In addition, we provide a closed-form expression for this balanced matrix in terms of the initial one; essentially, we discover that the system of differential equations serves to "collapse" the starting matrix to a nearby rank-one matrix. We also characterize additional aspects of the process, giving, for example, a description of an "exceptional" set of matrices of probability measure converging to 0 in n for which the dynamics are not necessarily guaranteed to produce a balanced state.

We then analyze the solutions of the system for classes of random matrices in the large-n limit—in particular, we consider the case in which each unique matrix entry is drawn independently from a distribution with bounded support that is symmetric about a number μ (the mean value of the initial friendliness among the nodes). In this case, we find a transition in the solution as μ varies: When μ > 0, the system evolves to an all-positive sign pattern, whereas when μ ≤ 0, the system evolves to a state in which the network is divided evenly into two all-positive cliques connected entirely by negative edges. We end by discussing some implications of the model and the associated transition between harmony and conflict, including an evaluation of the model on empirical data and some potential connections to research on reconciliation in social psychology.

Behavior of the Model: Evolution to a Balanced State

Suppose we randomly select the x _ij(0)'s from a continuous distribution on the real line. Then the x _ij(t)'s found by numerical integration generally sort themselves in finite time into the sign pattern of two feuding factions. To reformulate this observation as a precise statement and explain why the behavior holds so pervasively, we now solve Eq.1 explicitly.

Solution to the Model.

The initial matrix X(0) is real and symmetric by assumption, so we can write it as QD(0)Q ^T, where D(0) is the diagonal matrix with the eigenvalues of X(0), denoted λ ₁≥λ ₂≥⋯≥λ _n, as diagonal entries ordered from largest to smallest, and Q is the orthogonal matrix with the corresponding eigenvectors of X(0), denoted ω ₁,ω ₂,…,ω _n, as columns. The superscript T signifies transposition.

The differential equation Eq.1 is a special case of a general family of equations known as matrix Riccati equations (7). The analysis of the full family is complicated and not fully resolved, but we now show that the special case of concern to us, Eq.1, has an explicit solution with a form that exposes its connections to structural balance. We proceed as follows. First, we observe that by separation of variables, the solution of the single-variable differential equation An external file that holds a picture, illustration, etc. Object name is pnas.1013213108eq1.jpg (overdot representing differentiation by time) with initial condition x(0) =λ _k is

equation image

[3]

Therefore the diagonal matrix D(t) = diag[ℓ ₁(t),ℓ ₂(t),…,ℓ _n(t)] is the solution of Eq.1 for the initial condition X(0) = diag(λ ₁,λ ₂,…,λ _n).

Moreover Y(t) =QD(t)Q ^T is also a solution of Eq.1 because An external file that holds a picture, illustration, etc. Object name is pnas.1013213108eq2.jpg . But Y(t) has the same initial condition as X(t) in our original problem: Y(0) =QD(0)Q ^T =X(0). So by uniqueness, Y(t) =QD(t)Q ^T must be the solution we seek.

Our solution X(t) can also be written in a different way to mimic the solution of the one-dimensional equation An external file that holds a picture, illustration, etc. Object name is pnas.1013213108eq3.jpg . Because , where q _ij is the (i,j)th entry of Q, we can expand the denominators of the ℓ _k(t) functions in powers of t to rewrite X(t) as X(0) +X(0)² t +X(0)³ t ² + …, or more concisely,

equation image

[4]

(Note that the matrices X(0) and [I -X(0)t]^-1 commute.) This equation is valid when t is less than the radius of convergence of every λ _k, that is, when t < 1/λ ₁ (assuming λ ₁ > 0).

Finally we note that the above method of solving Eq.1 contains a reduction of the number of dynamical variables of the system from An external file that holds a picture, illustration, etc. Object name is pnas.1013213108eq5.jpg to n. The constants of motion generated by this reduction are just the off-diagonal elements of Q ^T X(t)Q =D(t), or An external file that holds a picture, illustration, etc. Object name is pnas.1013213108eq7.jpg for all 1 ≤i <j ≤n. Furthermore, the procedure for reducing X(t) can be easily generalized to any system of the form , where f is a polynomial of X.

Behavior of the Solution.

Let us now examine the behavior of our solution X(t) to see why in the typical case it splits into two factions in finite time. It turns out that this is the guaranteed outcome if the following three conditions hold (and as we see below, they hold with probability converging to 1 as n goes to infinity):

λ ₁ > 0,
λ ₁ ≠λ ₂ (and hence λ ₁ >λ ₂), and
all components of ω ₁ are nonzero.

To see why these conditions imply a split into two factions, observe from Eq.3 that each ℓ _k(t) diverges to infinity at t = 1/λ _k. Because An external file that holds a picture, illustration, etc. Object name is pnas.1013213108eq9.jpg , all x _ij's diverge to infinity when the ℓ _k with the smallest positive 1/λ _k does. Under the first and second conditions, this ℓ _k is ℓ ₁, so the blow-up time t ^∗ of Eq.1 must be 1/λ ₁. To show that the nodes are partitioned into two factions as X(t) approaches t ^∗, let An external file that holds a picture, illustration, etc. Object name is pnas.1013213108eq10.jpg on the half-open interval [0,t ^∗), where ||X(t)|| denotes the Frobenius norm of X. The matrix has the sign pattern of X(t), and as t approaches t ^∗ it converges to the rank-one matrix

equation image

[5]

Now let ω _1k denote the value of the kth coordinate of ω ₁, and let S = {k:ω _1k > 0} and T = {k:ω _1k < 0}. Then S and T partition the node indices 1,2,…,n by our condition that ω ₁ has no zero components. From Eq.5, this partition must correspond to two cliques of friends joined by a complete bipartite graph of unfriendly ties.

The Three Conditions.

We now return to the three conditions above. We first show that the second and third hold with probability 1. We then show that the first condition holds with probability converging to 1 as n goes to infinity. Finally, we analyze the behavior of the system in the unlikely event that the first condition does not hold. The fact that the conjunction of all three conditions holds with probability converging to 1 as n grows large justifies our earlier claim that the behavior described above holds for almost all choices of initial conditions.

First we show why the second and third conditions hold with probability 1 so long as the (joint) distribution from which X(0) is drawn is absolutely continuous with respect to Lebesgue measure—in other words, assigns probability zero to any set of matrices whose Lebesgue measure is zero. Our arguments below make use of the following two basic facts:

The set of zeros of a nontrivial multivariate polynomial has Lebesgue measure zero, and
the existence of a common root of two univariate polynomials P and Q is equivalent to the vanishing of a multivariate polynomial in the coefficients of P and Q (specifically, it is equivalent to the vanishing of the determinant of the Sylvester matrix of P and Q, also called the resultant of P and Q).

To show that λ ₁ ≠λ ₂ with probability 1, let P denote the characteristic polynomial of X(0), and let Q denote the derivative of P. Then X(0) has a repeated eigenvalue if and only if P has a repeated root, which it does if and only if P and Q have a common root. This condition is equivalent to the vanishing of the resultant of P and Q, which is a multivariate polynomial in the entries of X(0). The polynomial cannot be zero everywhere, because there is at least one symmetric matrix that does not have a repeated eigenvalue. So the set of matrices having a repeated eigenvalue has Lebesgue measure zero.

Similarly, to show that all components of ω ₁ are nonzero, let P denote the characteristic polynomial of X(0) and P _i the characteristic polynomial of the (n - 1) × (n - 1) submatrix X _i(0) obtained by deleting the ith row and ith column of X(0). It is easy to check that if any eigenvector of X(0) has a zero in its ith component, then the vector obtained by deleting that component is an eigenvector of X _i(0) with the same eigenvalue. Consequently, P and P _i must have a common root, implying that the resultant of P and P _i vanishes. This resultant is once again a multivariate polynomial in the entries of X(0), and once again it must be nonzero somewhere because there is at least one symmetric matrix whose eigenvectors all have nonzero entries. Hence, the set of matrices having an eigenvector with zero in its ith component has Lebesgue measure zero.

Finally, to determine the likelihood of the first condition, we first must say a bit more about the way that X(0) is selected. Suppose that the off-diagonal x _ij(0)'s are drawn randomly from a common distribution F and the on-diagonal x _ii(0)'s are drawn randomly from a common distribution G. All selections are independent for i ≤j. [For i >j, we let x _ij(0) =x _ji(0), so that X(0) is symmetric.] For this construction of X(0), Arnold (8) has shown that with the remarkably weak additional assumption that F has a finite second moment, Wigner's semicircle law holds in probability as n grows to infinity. This in turn implies that λ ₁ > 0 in probability in the same limit.

Moreover, suppose we are in the low-probability case that λ ₁ ≤ 0. In this case, the analysis above shows that all the functions ℓ _i(t) converge to 0 as t → ∞. Thus, An external file that holds a picture, illustration, etc. Object name is pnas.1013213108eq12.jpg , and because X(t) =QD(t)Q ^T, we also have .

Although the entries of X(t) converge to zero when λ ₁ ≤ 0, one might still want to know if the sign pattern of X(t) is eventually constant (i.e., remains unchanged for all t above some threshold value) and, if so, what determines this sign pattern. It is possible to answer this question, again assuming the second and third conditions. By expanding the function ℓ _i(t) =λ _i/(1 -λ _i t) in powers of u = 1/t, we obtain the asymptotic series

equation image

[6]

which implies

equation image

[7]

In the limit of small u, the leading-order term of the diagonal entries of X(t) is the linear term, which has negative sign. For the off-diagonal entries of X(t), the leading-order term as u tends to zero is the quadratic term, whose sign matches the sign of the corresponding off-diagonal entry of the matrix -X(0)^-1.

Behavior of the Model: From Factions to Unification

The analysis in the previous section tells us how to find both the blow-up time t ^∗ and final sign configuration of a network if we know its initial state X(0). However, we might also want to know whether we can characterize the behavior of X(t) in the large-n limit in terms of statistical parameters of X(0). This could, for example, help us forecast the behavior of groups of individuals when collecting complete relationship-level data is not feasible. Clearly if the underlying network is a complete graph, it is not realistic to consider n that are too large, but we find fortunately in simulations that the asymptotic behavior we will derive in this section becomes apparent even for moderate values of n (less than 100). As a result, these large-n results are perhaps most usefully viewed as an approximate guide to what happens in medium-sized groups that are large enough to show predictable collective behavior but for which a completely connected set of relationships is still feasible to maintain.

In this section, we show that there is a transition from final states consisting of two factions to final states consisting of all-positive relations as the "mean friendliness" of X(0) (the mean of the distributions used to generate the off-diagonal entries of X(0)) is increased from negative to positive values. This is consistent with the numerical simulations shown in Fig. 1; as noted above, the asymptotic behavior we are studying is already clear in these simulations, which are performed with n = 90.

An external file that holds a picture, illustration, etc. Object name is pnas.1013213108fig1.jpg

Representative large-n plots of the model for (A) μ > 0 (μ = 3/10 in the plot shown), (B) μ = 0, and (C) μ < 0 (μ = -3 in the plot shown). For all three plots, σ = 1 and n = 90. To reduce image complexity, only one randomly sampled fifth of the trajectories is included. In the second plot, t ^∗ denotes the time at which the system diverges, and ε denotes a sufficiently small displacement. The white curves superimposed on the three plots are the large-n trajectories x _ij(t) =x _ij(0) -μ +μ/(1 -μnct) for x _ij(0) =μ,μ ± 3σ/2, where c represents a rescaling of time. Because we want to fix the blow-up time t ^∗ near 1 and because ct ^∗ = 1/λ ₁ as found in the text, we choose c = 1/(μn +ν -μ +σ ²/μ) for A and An external file that holds a picture, illustration, etc. Object name is pnas.1013213108eq26.jpg for B and C using estimates of λ ₁ taken from ref. 9. The black dotted lines mark the blow-up times t ^∗ = 1/(cλ ₁).

Before discussing the details, we describe how X(0) is selected in this section. We start by adopting the procedure of Füredi and Komlós (9): The elements x _ij(0) are drawn independently from distributions F _ij with zero mass outside of [-K,K]. The off-diagonal F _ij's have a common expectation μ and finite variance σ ², whereas the on-diagonal F _ii's have a common expectation ν and variance τ ². In addition, we require that each off-diagonal distribution F _ij be symmetric about μ. Random matrix models of this type have attracted considerable recent interest (see, e.g., refs. 10 and 11), but we need only the basic results of Füredi and Komlós (9) for our purposes, and so we use these in the development below. We consider the three cases of positive, zero, and negative μ.

Case 1: μ > 0.

The results of Füredi and Komlós (9) show that when μ > 0, the deviation of ω ₁ from An external file that holds a picture, illustration, etc. Object name is pnas.1013213108eq14.jpg vanishes in probability in the large-n limit. Hence the final state of the system consists of one large clique of friends containing all but at most a vanishing fraction of the nodes. Moreover, by assuming a bound on σ we can strengthen this statement further: If σ <μ/2, then the findings of Füredi and Komlós imply that the final state consists of a single clique of friends, with no negative edges. These observations are consistent with the representative numerical trial shown in Fig. 1 A. Moreover, Füredi and Komlós show that the asymptotic behavior of λ ₁ grows like μn +O(1), and hence the blow-up time scales like 1/(μn).

We can gain insight into the behavior of the system for small t using an informal Taylor series calculation: If we rescale time in Eq.1 by inserting a 1/n before the summation, compute the Taylor expansion of x _ij(t) term by term, and then take the expectation of each term, we obtain the geometric series x(t) =μ +μ ² t +μ ³ t ² + ⋯, or

equation image

[8]

With significantly more work, it can be proved that every trajectory x _ij(t) has this time dependence on [0,1/K) in the large-n limit with probability 1 (see SI Text), so we may write

equation image

[9]

for all t in [0,1/K). Observe that this limit has a blow-up time t ^∗ of 1/μ. Because our rescaling of time represents a zooming in or magnification of time by a factor of n, this t ^∗ corresponds to a blow-up time asymptotic to 1/(μn) for the unrescaled system, consistent with the results of Füredi and Komlós.

Case 2: μ = 0.

In the event that the network starts from a mean friendliness of zero, numerical experiments indicate that the system ends up with two factions of equal size in the large-n limit (Fig. 1 B). We now prove this to be the case. For the remainder of this discussion, we abbreviate X(0) as A and x _ij(0) as a _ij.

Because the off-diagonal entries of A have symmetric distributions by assumption, we have for any off-diagonal a _ij and any interval S _ij on the real line that P(a _ij∈S _ij) =P(-a _ij∈S _ij). Now let D be a diagonal matrix with some sequence of +1 and -1 along its diagonal (where the ith diagonal entry is denoted by d _i). Then the random matrices A and B =DAD are identically distributed, as we now show.

To say that A and B are identically distributed means that for every Borel set of matrices S, P(A∈S) =P(B∈S). To prove this, it suffices to consider the case in which S is a product of intervals S _ij, because these product sets generate the Borel σ-algebra. The upper triangular entries of A are independent, so An external file that holds a picture, illustration, etc. Object name is pnas.1013213108eq15.jpg . Similarly, . By the symmetry of the off-diagonal distributions, , which gives us P(A∈S) =P(B∈S) as desired. (Note that when i =j, the factor d _i d _j is 1 so the on-diagonal distributions need not be symmetric.)

Now consider the set S of matrices with an ω ₁ consisting of all-positive components. The above demonstration implies that the probability of choosing an A in this set is the same as choosing an A such that B is in this set. Regarding the later event, A(Dω _i) =λ _i(Dω _i) implies Bω _i =λ _i ω _i, so the λ ₁ eigenvector of the A used to compute B is Dω ₁. This demonstrates that all sign patterns for the components of ω ₁ are equally likely. In other words, the distribution of the number of positive components in ω ₁ is the binomial distribution B(n,1/2) and the fraction of positive components in ω ₁ converges (in several senses) to 1/2 as n grows large.

Additionally, we can consider how λ ₁ varies with n in the case that μ = 0 to determine when the blowup will occur. Füredi and Komlós (9) found for this case that An external file that holds a picture, illustration, etc. Object name is pnas.1013213108eq18.jpg with probability tending to 1, so with probability tending to 1 the blow-up time shrinks to zero like , an order of slower than in the μ > 0 case.

Case 3: μ < 0.

For this final case, Füredi and Komlós (9) found that An external file that holds a picture, illustration, etc. Object name is pnas.1013213108eq21.jpg with probability tending to 1. The semicircle law gives a lower bound: in probability. So the blow-up time goes to zero like in the unrescaled system.

Note also that if we define a matrix C = -A, where A is now the initial matrix X(0) of Case 3, then C satisfies the condition of Case 1, μ > 0. Thus the distance between the top eigenvector of C and An external file that holds a picture, illustration, etc. Object name is pnas.1013213108eq24.jpg declines to zero in probability just as in Case 1. Furthermore, every other eigenvector of C is orthogonal to the largest one. Hence if σ < |μ|/2, then with probability tending to 1, every other eigenvector acquires a mixture of positive and negative components in the large-n limit, including the bottom eigenvector of C, which is the top eigenvector of A. This establishes that in the case that μ < 0 and σ < |μ|/2, the system ends up in a state with two factions with probability converging to 1 for all finite n.

Numerical simulations of the case that μ < 0 suggest the conjecture that the two factions are approximately equal in size for large n. Furthermore, the derivation of Eq.9 is in fact valid for all μ, so each trajectory rapidly decays from x _ij(0) toward x _ij(0) -μ on [0,1/K) (Fig. 1 C). This transient decay appears to extend beyond t = 1/K in numerical simulations. So, for example, if time is rescaled by An external file that holds a picture, illustration, etc. Object name is pnas.1013213108eq25.jpg instead of 1/n, we would hypothesize that (i) each trajectory makes a complete jump from x _ij(0) to x _ij(0) -μ in the large-n limit and that (ii) from this point onward, the system behaves like an initial configuration of the μ = 0 case and so separates into two equal factions en route to its blowup at 1/(2σ).

Discussion

In this final section, we review our results and their significance relative to previous work in structural balance theory. We then compare the predictions of the model with data, discuss potential criticisms of the model, and finish with some intriguing connections between the behavior of the model and recent social-psychological work on neutralizing two-sided conflicts.

Our first result is a demonstration that the model forms two factions in finite time across a broad set of initial conditions. As noted at the outset, similar demonstrations have not been possible for dynamic models of structural balance in earlier literature because these models contained so-called jammed states that could trap a social network before it reached a two-faction configuration (4, 5). The model of Kułakowski et al. by contrast has no such jammed states for generic initial conditions and hence provides a robust means for a social network to balance itself.

The second result of the paper is the discovery and characterization of a transition from global polarization to global harmony as the initial mean friendliness of the network crosses from nonpositive to positive values. Similar transitions have been observed in other models of structural balance but so far none has been characterized at a quantitative level. For example, Antal et al. (4) found a nonlinear transition from two cliques of equal size to a single unified clique as the fraction of positively signed edges at t = 0 was increased from 0 to 1 (see figure 5 of ref. 4). The authors provided a qualitative argument for this transition, but left open the problem of its quantitative detail. Our results both confirm the generality of their observations and provide a quantitative account of a transition analogous to theirs.

To complement the theoretical nature of our work and get a better sense of how the model behaves in practice, we can numerically integrate it for several cases of empirical social network data where the real-life outcomes of the time evolution are known. Our first example is based on a study by Zachary (12) who witnessed the breakup of a karate club into two smaller clubs. Prior to the separation, Zachary collected counts of the number of social contexts in which each pair of individuals interacted outside of the karate club, with the idea being that the more social contexts they shared, the greater the likelihood for information exchange. These counts, or capacities as Zachary called them, can be converted to estimates of friendliness and rivalry in many different ways. For a large class of such conversions, Eq.1 predicts the same division that Zachary's method found, which misclassified only 1 of the 34 club members (Fig. 2 A and B).

An external file that holds a picture, illustration, etc. Object name is pnas.1013213108fig2.jpg

Tests of the model of Kułakowski et al. (Eq.1) against two existing datasets. (A) The evolution of the model starting from Zachary's capacity matrix with the capacity of each relationship reduced by 0.58. This is the minimal downward displacement necessary (to two significant figures) for the resulting separation to be correct for all but 1 of the 34 club members. For reasons described by Zachary (12), this is basically the best separation we can expect. (B) The evolution of the model from Zachary's capacity matrix with the capacity of zero between the two club leaders replaced by -11; the resulting factions are identical to those in A. Substituted values less than -11 yield the same two factions, whereas greater values produce less accurate divisions. (C) The evolution of the model starting from Axelrod and Bennett's 1939 propensity(i,j)·size(i)·size(j) matrix for the 17 countries involved in World War II (by Axelrod and Bennett's definition). The model finds the correct split into Allied and Axis powers with the exceptions of Denmark and Portugal. Axelrod and Bennett's own landscape theory of aggregation does slightly better—its only misclassification is Portugal.

A second example can be constructed from the data of a study by Axelrod and Bennett (13) regarding the aggregation of Allied and Axis powers during World War II. If we simply take the entries of their propensity(i,j)·size(i)·size(j) matrix to be proportional to the friendliness felt between the various pairs of countries in the war, then running the model gives the correct Allied-Axis split for all countries except Denmark and Portugal (Fig. 2 C).

Nevertheless, the model clearly contains several strong simplifications of the underlying social processes. The first of these is inherent to structural balance theory itself; it is a framework restricted to capture a particular kind of social situation, in which the need for consistency among one's friendships and rivalries brings about the emergence of two factions. Extensions of the theory have considered models in which it is possible to have multiple mutual enemies and hence more than two factions (14), and also networks that are not complete graphs (3). However, our focus here has been on the basic theory, because as we have seen, obtaining a satisfactory dynamics even for this simplest form of structural balance has been an elusive challenge. Moreover, the basic version of structural balance that we have considered here, with a complete graph of relationships and constraints leading to two factions, is relevant to a range of different situations. These span the kinds of settings discussed earlier in this section, including clubs, classrooms, and small organizations (15), as well as international relations during crisis (where a large set of nations can all mutually maintain friendly or unfriendly diplomatic relations) (13, 16).

Another consequence of the particular model studied here that has no direct analogue in real social situations is the divergence to infinity of the relationship strengths x _ij. However, because the purpose of the model is to study the pattern of signs that emerges, our main conclusions are based not on the actual magnitudes of these numbers but on the fact that the sign pattern eventually stabilizes at a point before the divergence. This stabilization of the sign pattern is our primary focus, and one could interpret the subsequent singularity as simply the straightforward and unimpeded "ramping up" of values caused by the system once all inconsistencies have been worked out of the social relations—the divergence itself can be viewed as taking place beyond the window of time over which the system corresponds to anything real. Alternately, one can imagine that as the community completes its separation into two groups, other social processes take over. For example, individuals with differing ideological views or social preferences may self-segregate, breaking the all-to-all assumption of the model. In other cases, mounting tensions may erupt into violence, reflecting a sort of bound on the relationship intensity achievable for pairs of nodes in the network.

Finally, we note that there is a large body of work in social psychology that studies issues such as the formation and reconciliation of factions from a much more empirical basis; see, for example, refs. 17 and 18. It is an interesting open problem to determine the extent to which the strictly mathematical development of the models here can be combined with the perspectives in this empirical body of literature, ultimately leading to a richer theory of these types of social processes.

Supplementary Material

Acknowledgments.

We thank Nick Trefethen and David Bindel for pointers to the literature on matrix Riccati equations. Research was supported in part by the John D. and Catherine T. MacArthur Foundation, a Google Research Grant, a Yahoo! Research Alliance Grant, an Alfred P. Sloan Foundation Fellowship, a Microsoft Research New Faculty Fellowship, a grant from the Air Force Office of Scientific Research, and National Science Foundation Grants CCF-0325453, BCS-0537606, CCF-0643934, IIS-0705774, and CISE-0835706.

Footnotes

References

1. Wasserman S, Faust K. Social Network Analysis: Methods and Applications, Structural Analysis in the Social Sciences. New York: Cambridge Univ Press; 1994. pp. 220–248. [Google Scholar]

2. Heider F. Attitudes and cognitive organization. J Psychol. 1946;21:107–112. [PubMed] [Google Scholar]

3. Cartwright D, Harary F. Structural balance: A generalization of Heider's theory. Psychol Rev. 1956;63:277–293. [PubMed] [Google Scholar]

4. Antal T, Krapivsky PL, Redner S. Dynamics of social balance on networks. Phys Rev E. 2005;72:036121. [PubMed] [Google Scholar]

5. Marvel SA, Strogatz SH, Kleinberg JM. Energy landscape of social balance. Phys Rev Lett. 2009;103:198701. [PubMed] [Google Scholar]

6. Kułakowski K, Gawroński P, Gronek P. The Heider balance—a continuous approach. Int J Mod Phys C. 2005;16:707–716. [Google Scholar]

7. Abou-Kandil H, Freiling G, Ionescu V, Jank G. Matrix Riccati Equations in Control and Systems Theory. Basel, Switzerland: Birkhäuser; 2003. p. 21. [Google Scholar]

8. Arnold L. On Wigner's semicircle law for the eigenvalues of random matrices. Probab Theory Rel. 1971;19:191. [Google Scholar]

9. Füredi Z, Komlós J. The eigenvalues of random symmetric matrices. Combinatorica. 1981;1:233–241. [Google Scholar]

10. Vu V. Spectral norm of random matrices. Combinatorica. 2007;27:721–736. [Google Scholar]

11. Tao T, Vu V. Smooth analysis of the condition number and the least singular value. Math Comput. 2010;79:2333–2352. [Google Scholar]

12. Zachary WW. An information flow model for conflict and fission in small groups. J Anthropol Res. 1977;33:452–473. [Google Scholar]

13. Axelrod R, Bennett DS. A landscape theory of aggregation. Brit J Polit Sci. 1993;23:211–233. [Google Scholar]

14. Davis JA. Clustering and structural balance in graphs. Hum Relat. 1967;20(2):181–187. [Google Scholar]

15. Doreian P, Mrvar A. A partitioning approach to structural balance. Soc Networks. 1996;18:149–168. [Google Scholar]

16. Moore M. An international application of Heider's balance theory. Eur J Soc Psychol. 1978;8:401–405. [Google Scholar]

17. Pettigrew TF. Intergroup contact theory. Annu Rev Psychol. 1998;49:65–85. [PubMed] [Google Scholar]

18. Pettigrew TF, Tropp LR. A meta-analytic test of intergroup contact theory. J Pers Soc Psychol. 2006;90:751–783. [PubMed] [Google Scholar]

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

smiththabees02.blogspot.com

Source: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3033300/

If a Graph is a Faction Iis is Continuous or Discrete

From the Cover

Continuous-time model of structural balance

Seth A. Marvel

Jon Kleinberg

Robert D. Kleinberg

Steven H. Strogatz

Abstract

Model

Results

Behavior of the Model: Evolution to a Balanced State

Solution to the Model.

Behavior of the Solution.

The Three Conditions.

Behavior of the Model: From Factions to Unification

Case 1: μ > 0.

Case 2: μ = 0.

Case 3: μ < 0.

Discussion

Supplementary Material

Acknowledgments.

Footnotes

References

0 Response to "If a Graph is a Faction Iis is Continuous or Discrete"

Postar um comentário

Iklan Atas Artikel

Iklan Tengah Artikel 1

Iklan Tengah Artikel 2

Iklan Bawah Artikel