Figures
Abstract
Inspired by recent works on evolutionary graph theory, an area of growing interest in mathematical and computational biology, we present examples of undirected structures acting as suppressors of selection for any fitness value r > 1. This means that the average fixation probability of an advantageous mutant or invader individual placed at some node is strictly less than that of this individual placed in a well-mixed population. This leads the way to study more robust structures less prone to invasion, contrary to what happens with the amplifiers of selection where the fixation probability is increased on average for advantageous invader individuals. A few families of amplifiers are known, although some effort was required to prove it. Here, we use computer aided techniques to find an exact analytical expression of the fixation probability for some graphs of small order (equal to 6, 8 and 10) proving that selection is effectively reduced for r > 1. Some numerical experiments using Monte Carlo methods are also performed for larger graphs and some variants.
Citation: Alcalde Cuesta F, González Sequeiros P, Lozano Rojo Á (2017) Suppressors of selection. PLoS ONE 12(7): e0180549. https://doi.org/10.1371/journal.pone.0180549
Editor: Sergio Gómez, Universitat Rovira i Virgili, SPAIN
Received: January 19, 2017; Accepted: June 16, 2017; Published: July 10, 2017
Copyright: © 2017 Alcalde Cuesta et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper and its Supporting Information files.
Funding: This work was supported by the Spanish Ministry of Economy and Competitiveness (Grants MTM2013-46337-C2-2-P and MTM2016-77642-C2-2-P) to FA, PGS, and AL, the Government of Galicia (Grant GPC2015/006), and the European Regional Development Fund. AL was also supported by the Government of Aragón, the European Regional Development Fund (Grant E15 Geometría), and the Defense University Center of Zaragoza (Grant CUD 2015-10). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Competing interests: The authors have declared that no competing interests exist.
Introduction
Evolutionary dynamics has been classically studied for well-mixed populations, but there is a wide interest in the evolution of complex networks after site invasion. The process transforming nodes occupied by residents into nodes occupied by mutants or invaders is described by the Moran model. Introduced by Moran [1] as the Markov chain counting the number of invading mutants in a well-mixed population, it was adapted to weighted graphs by Lieberman et al. [2] and Nowak [3] (see also [4–8]). For undirected networks where links have no orientation, invaders will either become extinct or take over the whole population, reaching one of the two absorbing states, extinction or fixation. The fixation probability is the fundamental quantity in the stochastic evolutionary analysis of a finite population.
If the population is well-mixed, at the beginning, one single node is chosen to be occupied by an invader individual among a population of N resident individuals. Afterwards, an individual is randomly chosen for reproduction, with probability proportional to its reproductive advantage (1 for residents and r ≥ 1 for invaders), and its clonal offspring replaces another individual chosen at random. In this case, the fixation probability is given by (1)
In evolutionary network theory, the nodes are occupied by resident or invader individuals (usually assuming that the invader arises uniformly at random) and the replacements are limited to the nodes which are connected by oriented links. As in the well-mixed case, we also restrict ourselves to birth-death updating when the process evolves. According to the Circulation Theorem [2], any weight-balanced network has the same fixation probability as the well-mixed population of the same size N. In the undirected case, this means that the temperature Ti = ∑j∼i 1/dj of every vertex i (where j is a neighbor of i and dj is the number of neighbors of j) is constant, and the network is said to be isothermal. But there are graph structures altering substantially the behavior of the fixation probability depending on the fitness. For example, the (average) fixation probability in the oriented line is equal to 1/N and the reproductive advantage of the invader individuals is completely suppressed. But in the directed case, absorbing barriers may not be accessible from any state, and the fixation probability may be even null (see [9] for an example).
Thus, we focus our attention on connected undirected networks where absorbing barriers can be reached from any state. As showed in [2, 3] (see also [10]), there are directed and undirected graph structures that asymptotically amplify this advantage. The fixation probability of a complete bipartite network KN−m,m (described in Fig 1) converges to the same limit as the fixation probability (2) of the Moran process with fitness r2 as m → ∞ and N − m is constant [9]. Assuming that fitness differences are amplified or reduced for network sequences of increasing size, a notion of amplifier and suppressor of selection has been introduced in [10] for several initialization types (describing the initial distribution of the invasion process). To distinguish both dynamics, that is, amplification and suppression of selection, a numerical analysis for a few fitness values r = 0.75, 1, 1.25, 1.5, 1.75 has been done in [11] for birth-death and death-birth processes on directed and undirected graphs (see [12] for a comparative analysis of both update mechanisms).
(A) In the star graph K1,m, the center is connected with m peripheral nodes. (B) The vertex set of a complete bipartite graph Kn,m is divided into two disjoint sets interconnected by edges.
Here we always assume that the distribution is uniform: the probability that a node will be occupied by the initial invader is equal for all the nodes, see Eq 7. We say that a network is an amplifier of selection if the fixation probability Φ(r) > Φ0(r) and a suppressor of selection if Φ(r) < Φ0(r) for all r > 1. Notice that Φ(1) = 1/N and the inequalities must be reversed for r < 1. Due to the exact analytical expression given by Monk et al. [13] using martingales (see also [10]), one can see that star graphs and complete bipartite graphs are amplifiers of natural selection whose fixation probabilities are bounded from above by Φ2(r). The existence of suppressors of selection was firstly showed in [14], but similarly to [11] only for some fitness values (namely, for values r ≤ 10). The aim of the paper is to present the first examples of graphs (of order 6, 8 and 10) which are suppressors of selection for any fitness value r > 1. From the point of view of robustness against invasion [15], these graphs are more robust than complete graphs (being now necessary to add a sign to ‖Φ − Φ0‖∞ = supr≥1|Φ(r) − Φ0(r)|). Better yet, we propose a complete family of graphs of even order 2n + 2 with n ≥ 2, called ℓ-graphs, which we believe are suppressors of selection. The proof of this assertion for the graphs of order 6, 8 and 10 is completed with a numerical simulation for larger orders. Some other variants are also explored numerically in order to understand why they are suppressors of selection.
Results
All the examples of so-called suppressors of selection given in [2, 3] are directed graphs. The abundance of amplifiers and suppressors of selection have been explored numerically by Hindersin et al. in [11] for directed and undirected graphs under birth-death and death-birth updating. Different types of initialization or placement of new invaders have been distinguished in [10] in order to classify different evolutionary dynamics on directed graphs. As explained, we focus our attention on connected undirected graphs under uniform initialization.
Firstly, we computed the fixation probability of all undirected graphs of order 10 or less for fitness values r varying from 0.25 to 10 with step size of 0.25 using with the FinisTerrae2 supercomputer (1024 cores of Haskell 2680v3 CPUs for almost 3 days) installed at CESGA, [16]. We found an unique suppressor of selection of order 6, namely the graph ℓ6, although there are other possible suppressors in orders varying from 7 to 10. We constructed the graphs ℓ8 and ℓ10 (as well the whole ℓ-family) from this initial example. More precisely, we call ℓ-graph an undirected graph of even order N = 2n + 2 ≥ 6 obtained from the complete graph K2n by dividing its vertex set into two halves with n ≥ 2 vertices and adding 2 extra vertices. Each of them is connected to one of the halves of K2n and with the other extra vertex. Graphs ℓ6, ℓ8 and ℓ10 of order 6, 8 and 10 are shown in Fig 2.
Graphs ℓ6, ℓ8 and ℓ10 are suppressors of selection
Computer aided techniques has been used to find exact analytical expressions of the fixation probability Φ for the first elements of this family with orders 6, 8 and 10 (see Fig 2). This computation proves that the graphs ℓ6, ℓ8 and ℓ10 are suppressors of selection with fixation probabilities Φ(r) < Φ0(r) for all r > 1 and Φ(r) > Φ0(r) for all r < 1.
At first, to bound the fixation probability from above, one could try to stop the process on ℓ2n+2 at the time that some extra vertex is occupied by a mutant. But as we will see later, the evolution from that time on seems to play an essential role in determining the suppressor character of the graph. Like for star and looping star graphs, which are amplifiers of selection for uniform initialization [10], we needed then to find the exact analytical expression of the fixation probability. Unfortunately, the elegant martingale method proposed in [13, Theorem 2.1] and later used in [10] (which is based on Doob’s optional stopping theorem [17]) is not useful for ℓ6, ℓ8 and ℓ10. We have had to implement a specific method to compute exactly their fixation probability.
As we shall see in the description of the mathematical model in the Methods section, the fixation probability Φ(r) is a rational function given as the quotient of two rational polynomials Φ′(r) and Φ′′(r) of degree bounded above by 2N − 2. Using the symmetries of each ℓ-graph, we can lower this bound to a quantity (3) (as proved in the Methods section and in S1 Text) and hence there are at most 2(d + 1) coefficients involved in Φ(r). Since Φ(r) converges to 1 as r → +∞, the leading coefficients of Φ′(r) and Φ′′(r) can be assumed to be 1 and that number is reduced to 2d. Then we can replace the system of 2N linear equations defining the fixation probability Φ(r) (see Eq 6) with a system of 2d linear equations (see Eq 8) corresponding to the 2d rational coefficients of Φ′(r) and Φ′′(r), which arise from evaluating Φ(r) for integer and rational values of the fitness r varying from 1 to d + 1 and from 1/2 to 1/d. Finally, we wrote a SageMath program [18] (see S1 File) to symbolically compute the exact fixation probability Φ(r) of the graphs ℓ6, ℓ8 and ℓ10 for these fitness values and then to solve the reduced linear system. Once the fixation probability Φ has been calculated, the sign of the difference Δ = Φ − Φ0 is analyzed to confirm that Δ(r) < 0 for all r > 1. In the Methods section, we give a more detailed explanation of both theoretical and computational arguments used to have exact analytical expressions of the fixation probability Φ for ℓ6, ℓ8 and ℓ10. The exact values of Φ and Δ are given in S1 Text. As a result of the uniform initialization, since 2/N converges to 0 as N = 2n + 2 goes to infinity, the fixation probabilities Φ(r) of ℓN and Φ0(r) of KN given by Eq 1 become more and more closer and the suppression effect tends to disappear for large populations. Notice however that this phenomenon can be avoided by modifying the initialization type or increasing the number of extra vertices.
Numerical experiments in larger orders. Further examples
However, the method used for ℓ6, ℓ8 and ℓ10 does not seem applicable to larger orders since it would require a substantial amount of memory and computation time. Therefore, we explored the suppression of selection for other graphs in the ℓ-family using Monte Carlo simulation (applying the Loop-Erasing technique of [9] to speedup the computations). Even if it does not require much memory and can be parallelized on a computer cluster, a very large number of trials—namely 1010 trails for each fitness value—has been necessary to compare the fixation probability of ℓ12 and ℓ24 with that of the complete graphs of the same order. In fact, since the fixation probabilities Φ(r) and Φ0(r) become more and more closer, we should need to increase this number more and more as N goes to ∞. Anyway, for fitness values r varying from 0 to 4 with step size of 0.25, we showed that the ℓ-graphs of orders 12 and 24 are also suppressors of selection as can be seen in Fig 3. In S1 Fig, a similar method is applied to all the graphs ℓ2n+2 with 2 ≤ n ≤ 11, including ℓ6, ℓ8 and ℓ10 in order to compare the fixation probabilities obtained by symbolic computation with the numerical solutions given by Monte Carlo simulation.
The functions Φ0(r) − Φ(r) associated to the ℓ-graphs of order 6, 8, and 10 are represented for fitness values r varying from 1 to 4. For the ℓ-graphs of order 12 and 24, we applied the Monte Carlo method to compute the difference between the fixation probabilities of each graph and the complete graph of the same order using 1010 trials for each fitness value r varying from 0 to 4 with step size of 0.25. A 99% confidence interval is showed for each simulated value.
To investigate the structural reasons of the suppression of selection in these graphs, this experiment has been completed by altering the balance in the connections of the two extra nodes with the central complete graph in order 6 and considering two variants (a fortiori unbalanced) of order 7 (see Fig 4). As showed in Fig 5, the graphs and become amplifiers of selection from relatively small values of the fitness, while the graph is a suppressor of selection for high fitness values. We discover a similar behavior for larger orders (see Fig 6).
The exponents are the sizes of the partition of the central Kn.
The differences Δ(r) = Φ(r) − Φ0(r) have been estimated using 1010 trials for each fitness value r varying from 0 to 10 with step size of 0.25.
Unbalanced ℓ-graphs of (A) order 8, (B) order 9 and (C) order 10.
Discussion
Motivated by interest in the robustness of networks against invasion, we tried to shed some light on the influence of the structural properties of graphs upon increasing or decreasing the fixation probability of new invaders occupying the nodes of a network. We computed the fixation probability of all undirected graphs of order 10 or less for fitness values r varying from 0.25 to 10 with step size of 0.25 using the FinisTerrae2 supercomputer (1024 cores of Haskell 2680v3 CPUs for almost 3 days) installed at CESGA [16]. Thanks to this experimental approach, we found that there are graph structures acting as suppressors of selection according to the terminology introduced in [2, 3]. This means that, for every fitness value r > 1, the average fixation probability Φ(r) of an advantageous invader individual placed at a random node is strictly less than that of this individual placed in a well-mixed population. For neutral drift r = 1, both probabilities Φ(1) and Φ0(1) are obviously equal, whereas the average fixation probability Φ(r) becomes strictly greater than Φ0(r) for a disadvantageous invader with fitness r < 1. We proposed a novel method to compute the fixation probability of graphs having low order and a big group of symmetries, and we used computer aided techniques to find an exact analytical expression of the fixation probability for three examples of size 6, 8 and 10. The SageMath program [18] used to compute the fixation probability of these graphs is available at [19]. Monte Carlo simulation was also used to see with high precision that other graphs in this family are suppressors of selection for some fitness values (varying from 1 to 4 with step size of 0.25). Memory requirements make it unfeasible to apply the same method for large orders, but it could be useful to study transitions between both regimes, suppression and amplification, in low order. On the other hand, although we are only concerned here with the evolutionary dynamics of graphs under birth-death updating, similarly to the work by Kaveh et al. [12] and Hindersin et al. [11], it could be also interesting to study the properties of the ℓ-family under death-birth updating. We also showed that the mechanism that activates the suppression of selection is quite subtle, since a certain imbalance in the number of nodes of the central complete graph which are connected with each additional node transforms our models into amplifiers from certain fitness values. Finally, if the spreading of favorable innovations can be enhanced by those network structures amplifying the advantage of mutant or invader individuals [20], as counterpart, the discovery of these examples is a first step towards finding structural properties that increase the robustness of a complex network against invasion [15]. This is a particularly interesting property for biological networks like brain and protein-protein interaction networks, as well as for technological networks like electrical power grids or backbone networks, where high fitness values are possible. In fact, these kind of models have had impact not only in evolutionary and invasion dynamics, but also in tumor growth [3, 21, 22] and economics and management [23].
Methods
Mathematical model
Let G be a connected undirected graph with node set V = {1, …, N}. Denote by di the degree of the node i. The Moran process on G is a Markov chain Xn whose states are the sets of nodes S inhabited by mutant or invader individuals at each time step n. The transition probabilities are obtained from a stochastic matrix W = (wij) where wij = 1/di if i ∼ j are neighbors and wij = 0 otherwise. More precisely, the transition probability between S and S′ is given by (4) where r > 0 is the fitness and (5) is the total reproductive weight of invaders and residents. The fixation probability of each subset S ⊂ V inhabited by invaders gives a solution of the system of 2N linear equations (6) Since G is undirected, the only recurrent states are S = ∅ and S = V. Then Eq 6 has a unique solution [24]. The (average) fixation probability is given by (7) It is a rational function depending on the fitness r ∈ (0, +∞). Notice that Φ(r) may be calculated using the embedded Markov chain instead of the standard Markov chain above described, both associated to the process, making the total reproductive weight disappear from the computations [9].
Computation method
As we proved in S1 Text, the average fixation probability is a rational function Φ(r) = Φ′(r)/Φ′′(r) where the numerator Φ′(r) = ∑i airi and the denominator Φ′′(r) = ∑i biri are polynomials with rational coefficients of degree less than or equal to 2N − 2. Using the symmetries of each ℓ-graph, we can reduce the space of states to the set of 4-uplas ordered lexicographically (from halves to extra vertices) by k ≥ k′ or e ≥ e′, or equivalently the system of linear equations Eq 6 to a new system with at most linear equations. For ℓ6, ℓ8 and ℓ10, we have 21, 36 and 55 reduced states respectively. Therefore, we can lower the former bound of the degree of Φ′(r) and Φ′′(r) to a quantity proving Eq 3. Details are explained in S1 Text. Hence, we should only compute the 2(d + 1) coefficients involved in Φ(r). Actually, since Φ(r) converges to 1 as r → +∞, we can assume that ad = bd = 1. Thus, we can replace Eq 6 with the system of 2d linear equations (8) which arise from evaluating the rational function Φ(r) for fitness values r ∈ {1, …, d + 1, 1/2, …, 1/d}. This choice is due to those are the least complex rational numbers, which can be described with only few bits, and the length in bits of the solution of Eq 8 grows exponentially depending on the coefficients [25]. Finally, we wrote a SageMath program [18] that symbolically
- computes the exact fixation probability Φ(r) of the graphs ℓ6, ℓ8 and ℓ10 for these fitness values, and
- solves the reduced linear system Eq 8.
This program is available at [19]. Once the fixation probability Φ of the graphs ℓ6, ℓ8 and ℓ10 has been calculated solving this system, the sign of the numerator Δ′ and the denominator Δ′′ of the rational function Δ(r) = Φ(r) − Φ0(r) is analyzed in order to prove that Δ(r) < 0 for all r > 1. The exact values of Φ and Δ are given in S1 Text.
Supporting information
S1 Text. The fixation probability as a rational function and the fixation formulas.
As supporting information, we include the essential tools in order to prove that ℓ6, ℓ8 and ℓ10 are suppressors of selection for any fitness value r > 1. In S1 Text, we prove that the fixation probability Φ(r) of any connected undirected graph of order N is a rational function obtained as the quotient of two polynomials Φ′ and Φ′′ with rational coefficients of degree at most 2N − 2, which is reduced to d for any ℓ-graph. Next, we give the exact values of Φ = Φ′/Φ′′ and Δ = Δ′/Δ′′ for the graphs ℓ6, ℓ8 and ℓ10.
https://doi.org/10.1371/journal.pone.0180549.s001
(PDF)
S1 Fig. The differences Φ0(r) − Φ(r) for the ℓ-graphs of orders between 6 and 24.
https://doi.org/10.1371/journal.pone.0180549.s002
(PDF)
S1 File. SageMath program.
To compute the fixation probabilities of ℓ6, ℓ8 and ℓ10 for any fitness value r > 1, we ran a SageMath program available at [19].
https://doi.org/10.1371/journal.pone.0180549.s003
(SAGE)
Acknowledgments
The authors are grateful to Prof. Antonio Gómez Tato for his valuable support, and thank the supercomputer facilities provided by CESGA.
Author Contributions
- Conceptualization: FA PGS AL.
- Formal analysis: FA PGS AL.
- Funding acquisition: AL FA.
- Investigation: FA PGS AL.
- Resources: FA AL.
- Software: AL.
- Validation: FA PGS AL.
- Visualization: AL.
- Writing – original draft: FA PGS AL.
- Writing – review & editing: FA PGS AL.
References
- 1. Moran PAP. Random processes in genetics. Proc Cambridge Philos Soc. 1958;54:60–71.
- 2. Lieberman E, Hauert C, Nowak MA. Evolutionary dynamics on graphs. Nature. 2005;433(7023):312–316. pmid:15662424
- 3.
Nowak MA. Evolutionary Dynamics: Exploring the Equations of Life. Belknap Press of Harvard University Press; 2006.
- 4. Broom M, Rychtář J. An analysis of the fixation probability of a mutant on special classes of non-directed graphs. Proceedings of the Royal Society of London A: Mathematical, Physical and Engineering Sciences. 2008;464(2098):2609–2627.
- 5. Broom M, Hadjichrysanthou C, Rychtář J, Stadler BT. Two results on evolutionary processes on general non-directed graphs. Proceedings of the Royal Society of London A: Mathematical, Physical and Engineering Sciences. 2010;466(2121):2795–2798.
- 6. Shakarian P, Roos P, Johnson A. A review of evolutionary graph theory with applications to game theory. Biosystems. 2012;107(2):66–80. pmid:22020107
- 7. Shakarian P, Roos P, Moores G. A novel analytical method for evolutionary graph theory problems. Biosystems. 2013;111(2):136–144. pmid:23353025
- 8. Voorhees B, Murray A. Fixation probabilities for simple digraphs. Proceedings of the Royal Society of London A: Mathematical, Physical and Engineering Sciences. 2013;469:20120676.
- 9. Alcalde Cuesta F, González Sequeiros P, Lozano Rojo Á. Fast and asymptotic computation of the fixation probability for Moran processes on graphs. Biosystems. 2015;129:25–35. pmid:25625871
- 10. Adlam B, Chatterjee K, Nowak MA. Amplifiers of selection. Proceedings of the Royal Society of London A: Mathematical, Physical and Engineering Sciences. 2015;471:20150114.
- 11. Hindersin L, Traulsen A. Most Undirected Random Graphs Are Amplifiers of Selection for Birth-Death Dynamics, but Suppressors of Selection for Death-Birth Dynamics. PLoS Comput Biol. 2015;11(11):e1004437. pmid:26544962
- 12. Kaveh K, Komarova NL, Kohandel M. The duality of spatial death–birth and birth–death processes and limitations of the isothermal theorem. Royal Society Open Science. 2015;2:140465. pmid:26064637
- 13. Monk T, Green P, Paulin M. Martingales and fixation probabilities of evolutionary graphs. Proceedings of the Royal Society of London A: Mathematical, Physical and Engineering Sciences. 2014;470:20130730.
- 14. Broom M, Rychtář J, Stadler BT. Evolutionary dynamics on graphs—the effect of graph structure and initial placement on mutant spread. J Stat Theory Pract. 2011;5(3):369–381.
- 15. Alcalde Cuesta F, González Sequeiros P, Lozano Rojo Á. Exploring the topological sources of robustness against invasion in biological and technological networks. Scientific Reports. 2016;6:20666. pmid:26861189
- 16.
Alcalde Cuesta F, González Sequeiros P, Lozano Rojo Á, Vigara Benito R. An Accurate Database of the Fixation Probabilities for All Undirected Graphs of Order 10 or Less. In: Rojas I, Ortuño F, editors. Bioinformatics and Biomedical Engineering: 5th International Work-Conference, IWBBIO 2017, Granada, Spain, April 26–28, 2017, Proceedings, Part II. Cham: Springer International Publishing; 2017. p. 209–220.
- 17.
Karlin S, Taylor HM. A first course in stochastic processes. 2nd ed. Academic Press [A subsidiary of Harcourt Brace Jovanovich Publishers], New York-London; 1975.
- 18.
Stein WA, et al. Sage Mathematics Software (Version 5.8); 2013.
- 19.
Alcalde Cuesta F, González Sequeiros P, Lozano Rojo Á. Suppressors of selection; 2016. Available from: https://bitbucket.org/snippets/alvarolozano/7jnka/.
- 20. Tan S, Lu J. Characterizing the effect of population heterogeneity on evolutionary dynamics on complex networks. Sci Rep. 2014;4:5034. pmid:24849192
- 21. Hindersin L, Werner B, Dingli D, Traulsen A. Should tissue structure suppress or amplify selection to minimize cancer risk? Biology Direct. 2016;11(1):41. pmid:27549612
- 22. Komarova NL, Sengupta A, Nowak MA. Mutation-selection networks of cancer initiation: tumor suppressor genes and chromosomal instability. Journal of Theoretical Biology. 2003;223(4):433–450. pmid:12875822
- 23. Salas-Fumás V, Sáenz-Royo C, Lozano Rojo Á. Organisational structure and performance of consensus decisions through mutual influences: A computer simulation approach. Decision Support Systems. 2016;86:61–72.
- 24.
Taylor HM, Karlin S. An introduction to stochastic modeling. 3rd ed. San Diego, CA: Academic Press Inc.; 1998.
- 25.
Fang XG, Havas G. On the Worst-case Complexity of Integer Gaussian Elimination. In: Proceedings of the 1997 International Symposium on Symbolic and Algebraic Computation. ISSAC’97. New York, NY, USA: ACM; 1997. p. 28–31.