Briefings in Bioinformatics Advance Access originally published online on February 3, 2006
Briefings in Bioinformatics 2006 7(1):70-85; doi:10.1093/bib/bbk006
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Published by Oxford University Press 2006.
Biological applications of the theory of birth-and-death processes
Corresponding author. Eugene V. Koonin, National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Besthesda, MD 20894. Tel: 301-435-5913; Fax: 301-435-7794; E-mail: koonin{at}ncbi.nlm.nih.gov
| ABSTRACT |
|---|
|
|
|---|
In this review, we discuss applications of the theory of birth-and-death processes to problems in biology, primarily, those of evolutionary genomics. The mathematical principles of the theory of these processes are briefly described. Birth-and-death processes, with some straightforward additions such as innovation, are a simple, natural and formal framework for modeling a vast variety of biological processes such as population dynamics, speciation, genome evolution, including growth of paralogous gene families and horizontal gene transfer and somatic evolution of cancers. We further describe how empirical data, e.g. distributions of paralogous gene family size, can be used to choose the model that best reflects the actual course of evolution among different versions of birth-death-and-innovation models. We conclude that birth-and-death processes, thanks to their mathematical transparency, flexibility and relevance to fundamental biological processes, are going to be an indispensable mathematical tool for the burgeoning field of systems biology.
Keywords: mathematical modeling, genome evolution, birth-and-death process, Moran model, horizontal gene transfer, tumorigenesis
| INTRODUCTION |
|---|
|
|
|---|
Mathematics has long been intertwined with the biological sciences. The importance of mathematical approaches in many areas of biology is obvious but it is less appreciated that biological questions have stimulated the emergence of a variety of new directions in mathematics. Among many others, the areas of mathematics fully or partially developed in response to demands of biology include branching processes, traveling wave solutions of reaction-diffusion systems, Turing bifurcation and diffusive instability, analysis of replicator equations, stochastic coalescent process, evolutionary game theory and analysis of variance [1, 2].
Another fruitful and diverse mathematical field inspired by biology is the theory of birth-and-death processes. This theory was developed in the beginning of the twentieth century as a result of attempts to model growth of a population, taking into account stochastic demographic factors. With time, the theory became increasingly sophisticated, spawning new branches of stochastic process analysis [3]. Importantly, however, the first and simplest birth-and-death processes considered by Yule [4], Feller [5] and Kendall [6] provide a natural and useful theoretical framework for several areas of modern biology, such as estimation of the age of alleles, reconstruction of phylogenies and modeling various aspects of genome evolution.
A birth-and-death process is a stochastic process in which jumps from a particular state (number of individuals, cells, lineages, etc.) are only allowed to neighboring states (Figure 1). A jump to the right, i.e. increase by one of the number of individual or similar quantities represents birth, whereas a jump to the left represents death (Figure 1). This property considerably simplifies the mathematical analysis, but the process remains applicable to numerous real-world systems. Birth-and-death models allow one to address any questions formulated in terms of transition or state probabilities of the process, stationary distribution, mean, variance and distribution of times of the first entrance to a particular set of states, probabilities of extinction, the mean time of existence, etc. The results obtained with these models can be compared with empirical data allowing one to either reject some of the initial assumptions, or accept the model as a useful tool for analysis and prediction of properties of the real system.
|
Early models in population biology, including those based on the theory of birth-and-death processes, were largely deterministic. However, from the very beginning of population growth modeling, it had been clear that a more refined analysis must take into account the role of stochastic factors in the evolution of the population. An early recognition of this fact is evident in the 1874 study of extinction of human families by Watson and Galton [7] (reviewed, e.g. in [6]). The classic deterministic theory of population growth treats the size of a population as a continuous variable. This means that the state space of the process under consideration is continuous in the deterministic setting. By contrast, the state space of the corresponding stochastic process is discrete. In this regard, the stochastic models are more realistic than the deterministic ones because counts of individuals (genes, cells, number of species or gene families, etc.) are discrete by definition. Loosely speaking, each deterministic model can be viewed as an approximation of the corresponding stochastic model. This, however, does not mean that deterministic models always yield qualitatively valid solutions. A case in point is the phenomenon of persistence and its complement, extinction: there are situations when a deterministic model predicts that a population approaches a positive stationary level, whereas the corresponding stochastic model shows that extinction occurs with certainty. A classic example is a population model that consists of a simple branching process with the mean number of children equal to one. This process inevitably goes to extinction whereas the corresponding deterministic model describes a population with a constant size. Another well-known example is the logistic stochastic process considered next [8, 9].
Here we review the basic principles of the theory of birth-and-death processes and discuss examples of recent studies that involve, as the main or an auxiliary approach, analysis of a birth-and-death process. A simple introduction to the theory of birth-and-death processes is given in [9] and [8]. A more complete mathematical presentation can be found in [1013] and in other mathematical texts on stochastic processes.
| BRIEF MATHEMATICAL BACKGROUND |
|---|
|
|
|---|
The general study of temporally continuous, stochastic models of population growth apparently started with the work of Feller [5]. The cardinal assumption was that the growth of a population can be represented by a Markov process, i.e. the state of the population at time t can be described by the value of a random variable X(t) with the property,
|
|
i
t0 and whenever t0 < t. The nature of the variable X(t) differs from model to model. In this review, we consider only models with continuous time although an analogous theory exists for stochastic birth-and-death processes with discrete time [8].
If we interpret X(t) as a population size, then a birth-and-death process is a Markov process {X(t), t
0} such that, in an interval (t, t+
t), each individual in the population has the probability
n
t + o(
t) of giving birth to a new individual (probability of transition from state n to state n + 1) and the probability µn
t + o(
t) of dying (probability of transition from state n to state n 1). The parameters
n and µn are called the birth rate and death rate, respectively (n is the population size). For an intuitively plausible depiction of a birth-and-death process, it is useful to imagine a material particle, which moves from an integer to the neighboring integer, the path function X(t) being the position of particle at time t (Figure 1). We emphasize that, by specifying the rates
n and µn in terms of known or, at least in principle, identifiable parameters of the real world system, we define the particular model that is used to extract salient information on the behavior of the system. In other words, application of the theory of birth-and-death processes consists of two stages: first, the rates
n and µn have to be specified, and second, the resulting process, which depends on the parameters of the biological system, is analyzed.
The state probabilities pn(t) = Pr{X(t) = n} of the process being in state n at time t satisfies the following system of differential equations, called Kolmogorov forward equations [9]:
|
|
Equations (1) have to be solved subject to an initial condition and some boundary conditions. It is sufficient to solve them for the initial condition pn(0) =
n,m, i.e. for the case when the process is initially in a definite state m; here
n,m = 1 if n = m and
n,m = 0 if n
m. The state probabilities pn(t) give full information about the analyzed process but it is usually difficult to solve system of Equation (1).
Here we present a brief classification of the simple birth-and-death processes. Throughout the text, we denote the general birth and death rates as
n and µn, respectively; these rates may depend on state, n, time, and other parameters, whereas symbols
and µ correspond to constants.
The simplest case when the solution of system of Equation (1) is straightforward is a pure birth process or the Poisson process. In this case, we have
n =
, µn = 0, and the solution of (1) subject to the initial condition p0(0) = 1 is the Poisson distribution
|
|
t.
It is well known that the distribution of the time intervals between any two successive jumps in any Markov process with continuous time and discrete space of states is exponential [8]. More precisely, let Wi be the instant of the ith jump of the birth-and-death process (1) and Ti = Wi+1 Wi be the sojourn time; suppose that X(Wi) = n, then the process spends exponentially distributed time Ti in the state X(t)=n with the mean E[Ti] = 1/(
n+µn). When a jump occurs, it will be a birth with the probability
n/(
n + µn) or a death with the probability µn/(
n + µn).
If the birth and death rates of system of Equation (1) are linear functions of n, then the so-called probability-generating functions technique can be applied for writing down the appropriate partial differential equations for the probability-generating function [14]. Historically, this method had been the main tool for analyzing various birth-and-death processes [6, 15]. It is straightforward to obtain results for a simple birth process (
n =
n, µn = 0 [9]), simple death process (
n = 0, µn = µn [9]), simple death-and-immigration process (
n =
, µn = µn [16]), simple birth-and-death process (
n =
n, µn = µn [9]), generalized simple birth-and-death process (
n =
(t)n, µn = µ(t)n [6]) and a simple birth-and-death process with immigration (
n =
n +
, µn = µn [9, 17]).
For example, for a simple birth process with the initial condition pm(0) = 1, it can be shown that the state probabilities are
|
|
For a simple birth-and-death process, one can find [9]
|
|
|
|
As pointed out above, the method of probability-generating functions generally works when birth and death rates are linear functions of n. Karlin and McGregor [17, 21] showed that the solution of (1) could be obtained with the help of a sequence of orthogonal polynomials, which are closely related to the birth-and-death process. The general linear case
n =
n +
, µn = µn +
, was solved in [17] for
= 0. The case of
0 was analyzed in [22]. The asymptotically symmetric quadratic case
n = (N n) (n + a), µn = n(n + b) first appeared in applications concerned with genetic models [23]. Some other special cases also have been described [24], and more on recent advances in the study of birth-and-death processes and associated polynomials can be found in [25]. The problem with exact solutions of system (1) is that, in many cases, the expressions for the state probabilities, although explicit, are intractable for analysis and include special polynomials. In such cases, it may be sensible to solve more modest problems concerning the birth-and-death process under consideration, without the knowledge of the time-dependent behavior of state probabilities pn(t).
It is usually not difficult to write down the differential equations for the first few moments of X(t) [14]. For example, for a simple birth process, the equation for the mean E[X(t)] is
|
|
t for the case p1(0)=1. It should be noticed that, in this case, the mean growth of the process follows the same exponential law as the one that appears in the simplest deterministic model of population growth, namely, dN(t)/dt =
N(t), where N(t) is the population size and
is a Malthusian parameter. Sometimes, this fact is used as justification for the assertion that the deterministic theory is simply an account of the expectation behavior of the random variables, which occur in the stochastic formulation. This is not generally true as first pointed out by Feller [5]. For example, this is not the case for the logistic stochastic process considered next.
The method for writing down equations for the moments of X(t) works only when
n and µn are linear functions of n. If
n and µn include terms with n of degree higher than 1, the equation for the mean involves, generally, the second moment, the equation for the second moment involves the third moment and so on. This hierarchy of equations can only be solved approximately, by making any of a variety of approximations. Recently, the moment closure approximation has become a popular technique for obtaining such solutions [26]. Applications of this method to the logistic stochastic process can be found in [27, 28].
Some other tractable quantities give insight into the evolution of a birth-and-death process, in particular:
- the steady-state probabilities
describing the process when it is in a dynamical equilibrium;
- the probability that a given state is ever reached;
- the first passage time, i.e. the time for the process to reach a given state for the first time, its probability density function and its moments.
| MODELS OF POPULATION EVOLUTION BASED ON THE BIRTH-AND-DEATH PROCESS |
|---|
|
|
|---|
The Moran model
One of the main goals of theoretical population genetics is to study changes in the genetic makeup of the population under various evolutionary forces, such as selection, mutation, migration and random drift. Theoretical studies rely on deterministic and stochastic mathematical models which capture the essential features of populations. Deterministic models are applicable when the size of the population is very large; and when the finite population size cannot be ignored, stochastic models should be used. Furthermore, certain problems can be solved only in stochastic settings.
Let us consider a population of haploid individuals of a fixed constant size N (this is not a very restrictive assumption [29]). Let us further consider one locus that can have two types of alleles, A and a. The individuals with different alleles may differ in fitness, i.e. some individuals have a selective advantage over others. A few of many problems intrinsic to this model that can be of a particular interest are the following: Because of random drift (stochastic nature of births and deaths), one of alleles eventually goes extinct; if there are no mutations, the population becomes homogeneous and the interesting quantity is the speed with which it approaches homozygosity. Another situation is appearance of a unique mutant (e.g. A); in this case, the probability of fixation of A and the mean time of fixation are of particular importance. If we assume that mutations can go in either direction, then the population will be heterozygous forever, and the variable of interest is the stationary distribution. To study all these problems quantitatively, a mathematical model is required.
Two such models have been the basis of most of the work in population genetics: the WrightFisher model [30, 31] and the Moran model [29]. The WrightFisher model describes populations with discrete, seasonal reproduction and non-overlapping generations, whereas the Moran model is most applicable to populations with continuous reproduction. One of the possible formulations of the Moran model represents a continuous time birth-and-death process with nonlinear rates and finite state space. The Moran model is important for two reasons: First, in contrast to the WrightFisher model, it applies to organisms with overlapping generations. Second, many results that can be obtained only approximately under the WrightFisher model can be derived exactly using the Moran model. As Moran pointed out, this model is simpler because from any state one can move only to the neighboring ones, which simplifies the theory [31].
In order to analyze the model, we need to define the birth and death rates. The biological system can be described using several parameters: the population size, the current number of individuals that carry allele A, the selective advantage of A over a (or vice versa) and the mutation rates from A to a and from a to A. If the rates
n and µn can be expressed as functions of these parameters, a complete description of the system (within the framework of the model) will be obtained.
Let there be n copies of allele A and N n copies of allele a; each individual lives for an exponentially distributed amount of time with a mean of 1 and then is replaced; the replacement for individual x is chosen at random from all individuals in the population including x itself; the new individual replaces the old one at x. It is assumed that individuals carrying A have the selection coefficient s. The transition rates for the Moran model can be written as
|
|
|
|
0 and u
0, we deal with a birth-and-death process with reflecting boundaries (
0
0, µN
0). The presence of reflecting boundaries means that there exists a stationary distribution p* which is easy to calculate numerically noting that, at equilibrium,
If we assume that there are no mutations, then
0 = µN = 0, and we have a birth-and-death process with absorbing boundaries. In this case, the fate of a unique mutant appearing in the population can be studied. One of the main questions in this situation is the probability of fixation, i.e. the probability that a new mutant penetrates the entire population. In mathematical terms, this can be expressed as the probability of reaching the absorbing state N before reaching the absorbing state 0. The probability that the system ends up in the state N (the probability of fixation) if initially there is only one A is [14]
|
|
= 0, it is readily evaluated to
|
|
Logistic growth
Since the very beginning of the study of birth-and-death processes, the logistic process had been considered [5]. The deterministic version of the logistic model was originally introduced in [34]. This model accounts for the density dependence in the growth of a single population. It is based on the hypothesis that the net birth rate per individual (i.e. the difference between the birth rate and the death rate) is a linearly decreasing function of the population size. This implies that the net population birth rate is a quadratic function of the population size. The model is closed in the sense that no immiagration or emigration is allowed. Mathematically, the deterministic logistic model leads to a nonlinear differential equation
|
|
) = K. Various stochastic formulations of the logistic model have been developed and analyzed [8, 35]. Apparently, the first stochastic version of the logistic model was formulated by Feller [5] as a finite-state birth-and-death process. Feller derived the Kolmogorov forward equations for the state probabilities. One of his results is that the solution of the deterministic model does not agree with the expectation of the solution of the stochastic model when both are studied as functions of time. However, the difference can be shown to be asymptotically small as the maximum population size increases.
The logistic stochastic process is important for several reasons. It is well appreciated that the genetic makeup of a population strongly depends on the population structure while most of the population evolution models (like the Moran model considered previously) assume a constant population size. So, for more realistic modeling of population evolution, stochastic models with changing population size are required. Density-dependent effects influence the size of the population, preventing indefinite growth, and the logistic model is the simplest stochastic model with changing population size and density-dependent mechanisms that affect this size.
In the logistic model, the state zero is usually an absorbing state such that eventual absorption at the origin is certain (contrary to the solution of the deterministic model), and all states except the origin are transient (the state is called transient if the process visits this state only finitely many times). Two qualitatively different behaviors are possible at any given time: the process either goes extinct after having reached the absorbing state at the origin or remains in the set of transient states. Thus, the immediate two issues to address are the calculation of the mean time to extinction and the possible behavior of the system prior to extinction. The mean time to extinction can be calculated using the known formulas. For example, Goel and Richter-Dyn use a stochastic version of a special case of the logistic model to examine the extinction of a colonizing species [14]. However, the time to extinction may not have a known distribution, because of which characterizing the system by the mean time to extinction can be misleading.
The behavior of the process prior to extinction can be productively explored within the framework of the so-called quasi-stationary distributions [36]. The quasi-stationary distribution cannot be found analytically but there are effective numerical methods for determining such distributions [35]. The simplest way to obtain an approximation of the quasi-stationary distribution is to restrict consideration to transient states (making the state space strictly positive). By excluding zero from the state space, one can establish a related process without an absorbing state. This method has been applied in several mathematical models [15, 37] and is valid when the time to extinction is reasonably large [35]. More recent results on stochastic logistic process can be found in [27, 28, 3840].
| SOME BIOLOGICAL APPLICATIONS OF BIRTH-AND-DEATH PROCESSES |
|---|
|
|
|---|
Rates and patterns of gene duplications and evolution of multigene families
Gene duplication is one of the principal mechanisms of genome evolution [41, 42]. In particular, in multicellular eukaryotes, gene duplications gave rise to numerous multigene families, and the mode of evolution of genes within multigene families has been the subject of many theoretical and empirical studies. These studies reveal a high degree of within-species homogeneity among duplicated sequences. It has been proposed that a correction mechanism must have operated to spread mutations among the paralogous genes in a multigene family, a phenomenon dubbed concerted evolution [4345]. Concerted evolution essentially means that members of a multigene family do not evolve independently of each other. A classic example of this type of gene family is the cluster of rRNA genes in which all several hundred paralogous genes have (nearly) identical sequences, even in nontranscribed spacer regions [46]. Thus, the genes of this cluster in humans are more similar to one another than to most of the rRNA genes of chimpanzees. This high degree of sequence homogeneity within species is believed to be achieved by gene conversion, i.e. frequent interlocus recombination.
However, not all patterns of variation between genes in multigene families can be explained by concerted evolution [44, 47]. The best characterized cases in point are the multigene families of major histocompatibility complex (MHC) genes and immunoglobulin genes (Ig). The members of these families from the same species are not necessarily more closely related to one another than to the genes from different species. To characterize the mode of evolution of these systems, Nei and coworkers conducted phylogenetic analyses of several multigene families of the MHC and Ig systems [47]. The results show that the evolutionary pattern of these families is quite different from that of concerted evolution. To explain this distinct pattern, a birth-and-death model of evolution has been proposed [47, 48]. Under this model, paralogous genes are produced by various mechanisms, including tandem and block duplication, and some of the duplicates diverge functionally whereas others become pseudogenes and eventually deteriorate beyond recognition. The end result of this mode of evolution is a multigene family consisting of a mixture of divergent groups of genes, with highly similar sequences within groups plus a substantial number of pseudogenes. The main difference between the birth-and death evolution process and concerted evolution is that, under the former, genes in multigene families evolve independently of each other. The results of phylogenetic studies on several other gene families, including ubiquitins, histones and various receptors, are consistent with the predictions of the birth-and-death evolution model [4953].
The birth-and-death evolution model of multigene family evolution, although originally presented at the verbal level, allows the use of the classical theory of birth-and-death processes to estimate the rates of gene duplication and loss for groups with a well-established phylogeny; in particular, such an analysis has been performed for vertebrate evolution over the last 500 million years [54]. This problem is closely related to the problem of phylogeny reconstruction. It has been shown that it is possible to estimate the birth and death rates of lineages for reconstructed phylogenies although they contain no explicit information about extinct lineages [18, 55]. A null model in phylogenetics is usually one in which, over a short time-span, each extant lineage has the same probability of dividing into two or dying as any other lineage. Assuming that the birth and death rates for each lineage are constant, this model is a continuous-time, linear birth-and-death process with transition rates
n =
n and µn = µn. The birth-and-death model relates the number of extant lineages NT and the expected number of lineages Nt at time t, which survive to or at least have one descendant at time T, by the formula [56]:
|
|
Cotton and Page applied this model to estimate rates of gene duplication (
) and loss (µ) [54]. They showed that constant rates of gene duplication and loss fit the pattern of recent gene family evolution reasonably well, implying that, contrary to several recent suggestions [5759], there is no evidence of a recent increase in duplication rate. The appearance of such an increase is likely to be an artifact caused by the fact that a greater number of recent duplications have survived in extant genomes compared to older duplications [18]. By contrast, the second, ancient peak of duplications at
500 million years ago, originally detected by Gu and coworkers [58], could not be accounted for by the constant-duplication-rate model and is likely to reflect a whole genome (2R) or, at least, very large scale duplication in the early evolution of vertebrates [54]. The estimates of both duplication and loss rates obtained by Cotton and Page (duplication rate of
0.00115 Myr1 lineage1 and a loss rate of
0.00740 Myr1 lineage1) are substantially (about an order of magnitude) lower than previous estimates [42].
Origin of power law distributions of genome-related quantities
A broad variety of phenomena in physics, biology and the social sphere is described by power law distributions [6062]. In the field of genomics, the power laws have been observed in the distribution of the number of transcripts per gene, the number of interactions per protein, the number of genes in coexpressed gene sets, the number of genes or pseudogenes in paralogous families, the number of connections per node in metabolic networks and other quantities that can be obtained by genome analysis [61, 6367]. These distributions are described by the formula P(i) = ci
, where P(i) is the frequency of nodes with exactly i connections or sets with exactly i members,
is a parameter which typically assumes values between 1 and 3, and c is a normalization constant. More recently, it has been shown that the distributions of several genome-related quantities are best described by the generalized Pareto function P(i) = c(i + a)
, where
> 0, a are parameters [6870]. At large i (i >> a), this distribution is indistinguishable from a power law, but at small i, it deviates substantially, with the magnitude of the deviation depending on a.
As already pointed out in the preceding section, in a somewhat different context, a birth-and-death process is a natural mathematical framework for modeling evolution of gene families, with duplication constituting a gene birth and gene loss treated as a death event. The birth-and-death approach has been applied to modeling the evolution of paralogous genome family sizes [65, 70, 71], the distribution of folds and families in the protein universe [72] and proteinprotein interaction networks [73].
For the analysis of the evolution of gene family sizes, which we consider here in somewhat greater detail, a third elementary process, innovation, via horizontal gene transfer or emergence of genes from non-coding sequences, has been incorporated into the model. Innovation is analogous to birth except that an innovation event specifically adds a member to class 1 (Figure 1) which corresponds to the acquisition of a new gene by the analyzed genome. Accordingly, this type of evolutionary models has been dubbed birth-death-and-innovation models (BDIM) [70]. Following [70], let us suppose that we deal with a general birth-and-death process with state probabilities given by Equation (1). Assume that
N = 0, where N is the maximal possible multigene family size. This means that the process under consideration is a general birth-and-death process with finite state space and reflecting boundaries (i.e.
0
0, µN
0). Consequently, there is a stationary equilibrium distribution p*. Let us define a function
(n)=
n1/µn. The asymptotic of the stationary distribution is completely defined by the asymptotic behavior of
(n).
Let us suppose that, for large n, the following expansion is valid:
|
|
are real numbers and
is positive. The following main result gives classification of possible stationary distributions of the birth-and-death model ([70], Theorem 1): - If s
0 then
, where
(n) is the gamma-function;
- If s = 0 and
1, then
;
- If s = 0,
= 1 and
0, then
;
- If s = 0,
= 1 and
= 0, then
.
(n) has the form
(n) = 1
/n + o(1/n), then the resulting stationary distribution asymptotically tends to a power law. Analysis of the fit of different versions of the BDIM to the empirical distributions of gene family size in a variety of genomes showed that the simplest form of BDIM that gave a good fit to the empirical data was the second-order balanced linear model (
n =
(n + a), µn = µ(n + b), where a and b are constants), whereas the simple model
n =
n, µn = µn could be confidently rejected ([70] and Figure 2). This result implied that the evolution of individual genes within a family of paralogs was not entirely independent, i.e. there was at least a weak dependence of gene birth and death rates on family size (or an interaction between paralogs, by analogy to chemical kinetics).
|
In a further development of this analysis, the characteristics of evolution of the system, such as the probability of formation of a family of the given size before extinction and the mean times of formation and extinction of a family of a given size, were examined by analyzing the stochastic version of BDIM [74, 75]. Given the published estimates of the rates of gene duplication and loss [42], it has been found that the linear model, which gives a good approximation of the stationary distributions of family sizes for different genomes, predicts completely unrealistic mean times for reaching the observed sizes of the largest gene families. In computer simulations with a large ensemble of genes, even the minimum time required for the formation of the largest family has been shown to be unrealistically long. Thus, the linear BDIM is incompatible with the estimates of the rate of genome size growth derived from the empirical data (the use of the lower estimates of duplication rates reported by Cotton and Page [54] would only exacerbate the problem). By contrast, models with the degree between 2 and 3 (e.g. for the model of degree 2, or the quadratic model,
n =
(n + a1)(n + a2), µn = µ(n + b1)(n + b2)), which involve stronger dependences between the evolutionary rates of paralogs, resulting in a self-accelerated evolutionary regime, could account for faster evolution, although the mean times of formation of the largest gene families are still too long to fit the actual time scale of evolution. However, it has been shown that the variance of the time required for the formation of the largest families is extremely large (coefficient of variation >>1), which means that some families would grow much faster than the mean rate. Thus, the minimal time required for family formation is more relevant for a realistic representation of genome evolution than the mean time. Monte Carlo simulations of family growth from an ensemble of simultaneously evolving singletons show [74, 75] that the time elapsed before the formation of the largest family was much shorter than the estimated mean time and approached realistic values (Figure 3). The biological interpretation of the dependence between evolutionary rates of paralogs that is intrinsic to the higher-degree BDIM, is not immediately obvious. However, it is tempting to propose that such dependence reflects the fact that evolution of paralogous families, particularly, large ones is an adaptive process governed by positive Darwinian selection. Birth-and-death models do not include selection explicitly but the requirement for higher-degree BDIM to present a realistic description of genome evolution may imply the decisive role of selection in this process.
|
Almost simultaneously, another model of gene family evolution generating the observed power law distributions has been proposed [76, 77]. Here, the process under consideration is a simple linear birth-and-death process with immigration. With a given intensity
, there is an incoming Poisson process which corresponds to appearance of new gene families, supposedly, via duplication of a pre-existing gene followed by radical modification. It has been shown that, asymptotically, power law distribution for the state probabilities is possible if
> µ [76, 77]. In the model of Karev and coworkers [70, 74, 75], the condition is the opposite, i.e. a balanced BDIM with a power law asymptotic can be obtained only if
< µ, which is in full agreement with the latest estimates of the duplication and loss rates [54].
Modeling horizontal gene transfer
Comparison of the numerous sequenced genomes from all three kingdoms has shown beyond reasonable doubt that horizontal gene transfer (HGT) occurred on numerous occasions, at least, in the evolution of prokaryotes and unicellular eukaryotes [7882]. However, rigorous quantitative evaluation of the amount of HGT is extremely hard which leads to vast differences in opinion on the role of HGT in evolution, from the concept of pervasive HGT fully shaping prokaryotic genomes [82] to a much more reserved view of HGT as a relatively minor evolutionary factor [8385].
The first evolutionary-theoretical analysis of horizontal gene transfer in microbial populations has been reported by Berg and Kurland who applied the Moran model for the analysis of genetic drift in a haploid population [86]. A more general model of horizontal gene transfer, which includes both intra- and inter-population gene fluxes, has been recently formulated and analyzed [87]. In particular, this work involved an attempt to identify the conditions under which a horizontally transferred sequence can be fixed or at least penetrate a significant fraction of the population.
The model includes five parameters: inactivating mutation rate u of the novel sequence, selection coefficient s, invasion rate
(the rate with which the individuals of the population acquire a new gene from other populations), within-population horizontal transmission (infection) rate
and population size N. The state of the process n represents the number of individuals carrying the novel sequence. The rates are
|
|
0
0, µN
0). This means that there is a stationary distribution for which a good approximation can be found. This stationary distribution can be used to estimate the mean population penetration of the novel sequence depending on the parameter values (Figure 4). Figure 4 shows that, if the rate of invasion
N is substantially lower than the rate of inactivating mutation uN, significant penetration (on average) can be reached only with high positive values of (s +
)N, i.e. when the new gene is rapidly spread within the population and/or confers a selective advantage onto the recipient.
|
Berg and Kurland [86] considered the model with rates Equation (5) and no inter-population transfer, i.e.
= 0, and came to the conclusion that horizontally acquired new genes can be fixed in a population only when these sequences confer a substantial selective advantage onto the recipient and therefore are subject to strong positive selection. If there is no invasion in the model, the new gene is doomed to extinct. But a more detailed analysis of the mean time to extinction and quasi-stationary distributions shows that a new gene can penetrate a significant part of the recipient population on its way to extinction [87]. Taking into account the processes of within-population transmission (infection) and invasion leads to conclusions that are substantially different from those of Berg and Kurland: if the rates of these processes are non-negligible, horizontally transferred sequences do get fixed or at least persist in a significant part of the recipient population for a long time, even if they are neutral or slightly deleterious. In other words, complete analysis of the state space given by Equation (5) shows that fixation or persistence of horizontally transferred genes in a population can be achieved via different routes: it can happen either due to a high rate of invasion or a high rate of infection, or substantial selective value. Generally, the modeling results are compatible with the notion that HGT, while certainly not completely promiscuous or uniformly rampant, could be a pivotal force in evolution, at least in the prokaryotic world.
Somatic evolution of cancer cells
Recently, the Moran model has been applied to model carcinogenesis [8892]. The main assumption in these studies is that cancer is initiated in tissue compartments where only a relatively small number of cells are at risk of mutating into a transformed state escaping homeostatic regulation. In this case, the evolutionary dynamics can be approximated by a low-dimensional stochastic process with a linear Kolmogorov forward equation that can be solved analytically. Most of the time, the cell population is homogeneous with respect to the relevant mutations.
First, assume that there are two types of cells, A and B, and let a and b be their numbers. Cells can reproduce, mutate or die. Mutation is only allowed from A to B with rate u. The total number of cells is kept constant (N), N = a + b. Cells can have different fitness. Under these assumptions, evolution of cells can be described with the Moran process with rates (2). The corresponding Markov process is a biased random walk with one absorbing state, b = N. If u
0, then, sooner or later, the system finds itself in this absorbing state. Under conditions given in [88], the probability of finding the system with b cells, 1 < b < N, is of the order of u << 1/N. The system spends most of the time in the homogeneous states b = 0 and b = N. If we denote the state with all a as A and the state with all b as B, then, approximately, we have the two-state chain
|
|
(s), is the probability of fixation given by (3); uN
(s) is the corresponding rate of jump from one (almost) homogeneous state to another. This stochastic process is described by the following Kolmogorov forward equations: |
|
The inactivation of both alleles of a tumor suppressor gene (TSG) implies a net reproductive advantage for the affected cell and might lead to clonal expansion. The inactivation of the first allele of the TSG may be neutral or may lead to a selective advantage or disadvantage. Thus, the process of inactivation of a TSG can be modeled by the system with three types of cells a, b and c. The mutational network can be depicted as:
|
|
To approximate the stochastic process under consideration, one should take into account the possibility of jumping from the state A to the state C (a stochastic tunnel). It can happen if a mutation from a to b gives birth to a lineage of cells of type b that does not go to fixation (these cells may go extinct due to random drift combined with selection pressure, especially if s1 < 1), at the same time giving birth to a lineage of cells of type c.
The same kind of model was used to answer the question whether genetic instability (an elevated mutation rate) is an early (i.e. a potential cause) or late (i.e. probable side-effect) event in tumorigenesis. The path to cancer cells can be described by the following mutational network:
|
|
| CONCLUSIONS |
|---|
|
|
|---|
Mathematical modeling always had been trying to find a compromise between simplicity of analysis and requirements of realism. On the one hand, we have extremely complex biological systems; on the other hand, we need to formally address some quantitative issues about these systems which often can be done only through the use of mathematical models that may rest on grossly over-simplified assumptions. On some occasions, however, a particular mathematical formalism seems to be pre-adapted to a variety of biological systems and can be profitably used to model a diverse set of processes. Birth-and-death stochastic processes are one class of such lucky models, as we attempted to show in this review using examples from several disparate areas of biology.
The utility of the theory of birth-and-death processes seems to be owing to its two crucial features. First, a huge variety of fundamental biological processes can be described in terms of elementary events that can be identified with birth-and-death event. These include actual birth of a new individual, cell division, mutation, gene duplication, horizontal gene transfer from one individual of a population to another (or from one population to another), appearance of a new lineage (e.g. a species) in a genealogy and others. Second, the mathematical theory is well studied, relatively simple, many results are known and the technique is flexible such that it can be adjusted in cases when the standard birth-and-death process is insufficient to describe the analyzed process. The BDIM discussed here are one such case where the additional elementary process of innovation had to be introduced to adequately incorporate for the existing basic understanding of genome evolution [70]. Another, conceptually similar example comes from population biology. Populations can suffer dramatic declines from disease or food shortage but, perhaps surprisingly, such populations can survive for long periods of time and, although they may eventually become extinct, they can exhibit an apparently stationary regime. This behavior has been successfully modeled using the so-called birth-death-and-catastrophe process [12, 93, 94]. Another extension of a birth-and-death process is the situation when transitions from a state are allowed not only to neighboring states but also to some other states within the given limits. Often, such systems are governed by a quasi-birth-and-death process (QBD process). These processes extend the classical birth-and-death processes to the vectorial case, and the tri-diagonal generator of parameters is substituted by a block-tridiagonal matrix ([95] and references therein). Furthermore, diffusion approximations of birth-and-death processes have been developed and characterized simultaneously with the standard version ([5]; see [11] for a mathematical description). Very recently, a diffusion approximation of the BDIM has been developed as a specific application to the analysis of evolution of gene families [96, 97].
It seems that, with the rapid accumulation of genomic data, which include not only sequences but also genome-wide information on gene expression, genetic interactions between genes, proteinprotein interactions, regulatory networks and more, the role of mathematical modeling in the new integrative (systems) biology is going to be indispensable and increasing for the foreseeable future. We believe that birth-and-death processes comprise an essential part of the mathematical framework of this new biology.
Key Points
|
| FOOTNOTES |
|---|
|
|
|---|
Artem Novozhilov is a post-doctoral Fellow at the National Center for Biotechnology Information of the National Institutes of Health in Bethesda, Maryland. He works on mathematical modeling of evolution at different levels. He received his PhD from Moscow State University of Communication Means.
Georgy Karev is a Research Scientist at the National Center for Biotechnology Information of the National Institutes of Health in Bethesda, Maryland. His research is focused on mathematical modeling in biology. Main research interests include modeling of genome evolution, dynamics of heterogeneous populations and communities (demographic models, forest ecosystems), bifurcation approach to modeling complex biological system (epidemiological models, cancer modeling, neuron firing model), structural individual-based models, stochastic theory of populations. He received his PhD in Mathematics from the Institute of Electronic Engineering in Moscow, Russia, and Dr. Sci. degree in Biophysics from the Institute of Biophysics, Krasnoyarsk, Russia.
Eugene V. Koonin is a Senior Investigator and Group Leader at the National Center for Biotechnology Information of the National Institutes of Health in Bethesda, Maryland. His research interests include all aspects of comparative and evolutionary genomics, in particular, mathematical modeling of genome evolution and evolutionary systems biology. He received his PhD in Molecular Biology from the Department of Biology of Moscow State University (then USSR).
Submitted: July 15, 2005. Received (in revised form): November 24, 2005.
| References |
|---|
|
|
|---|
- Hastings A, Palmer MA. Mathematics and biology. A bright future for biologists and mathematicians? Science 2003; 299:20034.
[Abstract/Free Full Text] - Cohen JE. Mathematics is biology's next microscope, only better; biology is mathematics next physics, only better. PLoS Biol 2004; 2:e439.[CrossRef][Medline]
- Jagers P. The growth and stabilization of populations. Statistical Science 1991; 6:26974.
- Yule GU. A mathematical theory of evolution, based on the conclusions of Dr. J.C. Willis, F.R.S. Philosophical Transactions of the Royal Society of London Ser B 1924; 213:2187.
- Feller W. Die Grundlagen der Volterrschen Theorie des Kampfes ums Dasein in wahrscheinlichkeitstheoretischer Behandlung. Acta Biometrica 1939; 5:1140.
- Kendall DG. On the generalized birth-and-death process. Ann Math Stat 1948; 19:115.
- Watson HW, Galton F. On the probability of the extinction of families. J Anthrop Inst 1874; 4:13844.
- Allen LJS. An Introduction to Stochastic Processes with Applications to Biology. Upper Saddle River, NJ: Prentice Hall 2003.
- Bailey NTJ. The Elements of Stochastic Processes. New York: Wiley 1990.
- Karlin S, Taylor HM. A First Course in Stochastic Processes. 2nd edn San Diego, New York, Boston, London: Academic Press 1975.
- Karlin S, Taylor HM. A Second Course in Stochastic Processes. San Diego, New York, Boston, London: Academic Press 1981.
- Anderson W. Continuous Time Markov Chains: An Applications-Oriented Approach. New York: Springer Verlag 1991.
- Bhattacharya R, Waymire E. Stochastic Processes with Applications. New York: Wiley 1990.
- Goel NS, Richter-Dyn N. Stochastic Models in Biology. New York: Academic Press 1974.
- Kendall DG. Stochastic processes and population growth. J Royal Stat Soc 1949; 11:23082.
- Cox DR, Miller HD. The Theory of Stochastic Processes. New York: Wiley 1965.
- Karlin S, McGregor J. Linear growth, birth and death processes. J Math Mech 1958; 7:643.
- Harvey PH, May RM, Nee S. Phylogenies without fossils. Evolution 1994; 48:5239.
- Nee S, May RM, Harvey PH. The reconstructed evolutionary process. Philos Trans R Soc Lond B Biol Sci 1994; 344:30511.[Web of Science][Medline]
- Slatkin M. The age of alleles. In Slatkin M, Veuille M (Eds.). Modern Developments in Theoretical Population Genetics: The Legacy of Gustave Malecot. Oxford University Press 2002 pp. 23360.
- Karlin S, McGregor J. The differential equations of birth-and-death processes, and the Stieltjes moment problem. Trans Amer Math Soc 1958; 85:489546.
- Ismail MEH, Letessier J, Valent G. Linear birth and death models and associated Laguerre polynomials. J Approx Theory 1988; 56:33748.
- Karlin S, McGregor J. On a genetic model of Moran. Proc Cambridge Philos Soc 1962; 58:299311.
- Valent J. Exact solutions of some quadratic and quartic birth and death processes and related orthogonal polynomials. J Comp Appl Math 1996; 67:10327.
- Van Doorn E. Birth-death processes and associated polynomials. J Comp Appl Math 2003; 153:497506.
- Levin SA, Pacala SW. Theories of simplification and scaling of spatially distributed processes. In Tilman D, Kareiva P (Eds.). Spatial Ecology: The Role of Space in Population Dynamics and Interspecific Interactions. Princeton University Press 1997.
- Nasell I. Moment closure and the stochastic logistic model. Theor Popul Biol 2003; 63:15968.[Medline]
- Newman TJ, Ferdy JB, Quince C. Extinction times and moment closure in the stochastic logistic process. Theor Popul Biol 2004; 65:11526.[Medline]
- Moran PA. Random processes in genetics. Proc Philos Soc Math Phys Sci 1958; 54:6071.
- Fisher RA. The Genetical Theory of Natural Selection. London & New York: Oxford University Press 1930.
- Wright S. Adaptation and selection. Genetics, Paleontology and Evolution. Princeton, NJ: Princeton Univ. Press 1949.
- Kimura M. The Neutral Theory of Molecular Evolution. Cambridge: Cambridge University Press 1983.
- Durrett R. Probability Models for DNA Sequence Evolution. New York: Springer 2002.
- Verhulst PF. Notice sur la loi que la population suit dans son accroisement. Corr Math et Phys 1938; X:11321.
- Nasell I. Extinction and quasi-stationarity in the Verhulst logistic model. J Theor Biol 2001; 211:1127.[CrossRef][Web of Science][Medline]
- Darroch JN, Seneta E. On quasi-stationary distributions in absorbing discrete-time finite Markov chains. J Appl Prob 1965; 2:88100.
- Pielou EC. An Introduction to Mathematical Ecology. New York: Wiley-Interscience 1969.
- Matis JH, Kiffe TR. Effects of immigration on some stochastic logistic models: a cumulant truncation analysis. Theor Popul Biol 1999; 56:13961.[CrossRef][Web of Science][Medline]
- Matis JH, Kiffe TR. Stochastic Population Models: A Compartmental Perspective New York: Springer 2000.
- Matis JH, Kiffe TR. On stochastic logistic population growth models with immigration and multiple births. Theor Popul Biol 2004; 65:89104.[Medline]
- Ohno S. Evolution by gene duplication. Berlin-Heidelberg-New York: Springer-Verlag 1970.
- Lynch M, Conery JS. The evolutionary fate and consequences of duplicate genes. Science 2000; 290:115155.
[Abstract/Free Full Text] - Graur D, Li WH. Fundamentals of Molecular Evolution. Sunderland, MA: Sinauer Assosiates, Inc. 2000.
- Nei M, Rooney AP. Concerted and birth-and-death evolution of multigene families. Annu Rev Genet 2004; 39:121152.[CrossRef]
- Santoyo G, Romero D. Gene conversion and concerted evolution in bacterial genomes. FEMS Microbiol Rev 2005; 29:16983.[CrossRef][Web of Science][Medline]
- Gonzalez IL, Sylvester JE. Human rDNA: evolutionary patterns within the genes and tandem arrays derived from multiple chromosomes. Genomics 2001; 73:25563.[CrossRef][Web of Science][Medline]
- Nei M, Gu X, Sitnikova T. Evolution by the birth- and-death process in multigene families of the vertebrate immune system. Proc Natl Acad Sci USA 1997; 94:7799806.
[Abstract/Free Full Text] - Ota T, Nei M. Divergent evolution and evolution by the birth-and-death process in the immunoglobulin VH gene family. Mol Biol Evol 1994; 11:46982.[Abstract]
- Nei M, Rogozin IB, Piontkivska H. Purifying selection and birth-and-death evolution in the ubiquitin gene family. Proc Natl Acad Sci USA 2000; 97:1086671.
[Abstract/Free Full Text] - Rooney AP, Piontkivska H, Nei M. Molecular evolution of the nontandemly repeated genes of the histone 3 multigene family. Mol Biol Evol 2002; 19:6875.
[Abstract/Free Full Text] - Piontkivska H, Rooney AP, Nei M. Purifying selection and birth-and-death evolution in the histone H4 gene family. Mol Biol Evol 2002; 19:68997.
[Abstract/Free Full Text] - Nikolaidis N, Nei M. Concerted and nonconcerted evolution of the Hsp70 gene superfamily in two sibling species of nematodes. Mol Biol Evol 2004; 21:498505.
[Abstract/Free Full Text] - Eirin-Lopez JM, Gonzalez-Tizon AM, Martinez A, et al. Birth-and-death evolution with strong purifying selection in the histone H1 multigene family and the origin of orphon H1 genes. Mol Biol Evol 2004; 21:19922003.
[Abstract/Free Full Text] - Cotton JA, Page RD. Rates and patterns of gene duplication and loss in the human genome. Proc Biol Sci 2005; 272:27783.
- Nee S, Mooers AO, Harvey PH. Tempo and mode of evolution revealed from molecular phylogenies. Proc Natl Acad Sci USA 1992; 89:83226.
[Abstract/Free Full Text] - Kubo J, Isawa Y. Inferring the rates of branching and extinction from molecular phylogenies. Evolution 1995; 49:694704.
- Eichler EE. Recent duplication, domain accretion and the dynamic mutation of the human genome. Trends Genet 2001; 17:6619.[CrossRef][Web of Science][Medline]
- Gu X, Wang Y, Gu J. Age distribution of human gene families shows significant roles of both large- and small-scale duplications in vertebrate evolution. Nat Genet 2002; 31:2059.[CrossRef][Web of Science][Medline]
- Fortna A, Kim Y, MacLaren E, et al. Lineage-specific gene duplication and loss in human and great ape evolution. PLoS Biol 2004; 2:E207.[CrossRef][Medline]
- Barabasi AL. Linked: The New Science of Networks. New York: Perseus Pr 2002.
- Barabasi AL, Oltvai ZN. Network biology: understanding the cells functional organization. Nat Rev Genet 2004; 5:10113.[CrossRef][Web of Science][Medline]
- Dorogovtsev SN, Mendes JF. Scaling properties of scale-free evolving networks: Continuous approach. Phys Rev E Stat Phys Plasmas Fluids Relat Interdiscip Topics 2001; 63:056125.
- Jeong H, Mason SP, Barabasi AL, et al. Lethality and centrality in protein networks. Nature 2001; 411:412.[CrossRef][Medline]
- Jeong H, Tombor B, Albert R, et al. The large-scale organization of metabolic networks. Nature 2000; 407:6514.[CrossRef][Medline]
- Qian J, Luscombe NM, Gerstein M. Protein family and fold occurrence in genomes: power-law behaviour and evolutionary model. J Mol Biol 2001; 313:67381.[CrossRef][Web of Science][Medline]
- In Koonin EV, Wolf YI, Karev GP (Eds.). Power Laws, Scalefree Networks and Genome Biology. Georgetown, TX: Landes Bioscience 2005.
- Koonin EV, Wolf YI, Karev GP. The structure of the protein universe and genome evolution. Nature 2002; 420:21823.[CrossRef][Medline]
- Kuznetsov VA. Statistics of the numbers of transcripts and protein sequences encoded in the genome. In Zhang W, Shmulevich I (Eds.). Computational and Statistical Approaches to Genomics. Kluwer 2002 pp. 12571.
- Kuznetsov VA. Distribution associated with stochastic processes of gene expression in a single eukariotic cell. EUROSIP J Appl Signal Processing 2001; 4:28596.
- Karev GP, Wolf YI, Rzhetsky AY, et al. Birth and death of protein domains: A simple model of evolution explains power law behavior. BMC Evol Biol 2002; 2:18.[CrossRef][Medline]
- Rzhetsky A, Gomez SM. Birth of scale-free molecular networks and the number of distinct DNA and protein domains per genome. Bioinformatics 2001; 17:98896.
[Abstract/Free Full Text] - Dokholyan NV, Shakhnovich B, Shakhnovich EI. Expanding protein universe and its origin from the biological Big Bang. Proc Natl Acad Sci USA 2002; 99:1413236.
[Abstract/Free Full Text] - Pastor-Satorras R, Smith E, Sole RV. Evolving protein interaction networks through gene duplication. J Theor Biol 2003; 222:199210.[CrossRef][Web of Science][Medline]
- Karev GP, Wolf YI, Berezovskaya FS, et al. Gene family evolution: an in-depth theoretical and simulation analysis of non-linear birth-death-innovation models. BMC Evol Biol 2004; 9:32.
- Karev GP, Wolf YI, Koonin EV. Simple stochastic birth and death models of genome evolution: was there enough time for us to evolve? Bioinformatics 2003; 19:18891900.
[Abstract/Free Full Text] - Reed WJ, Hughes BD. A model explaining the size distribution of gene and protein families. Math Biosciences 2004; 189:97102.
- Reed WJ, Hughes BD. From gene families and genera to incomes and internet file sizes: why power laws are so common in nature. Phys Rev E Stat Nonlin Soft Matter Phys 2002; 66:067103.[Medline]
- Doolittle WF. Phylogenetic classification and the universal tree. Science 1999; 284:21249.[CrossRef][Web of Science][Medline]
- Doolittle WF. Lateral genomics. Trends Cell Biol 1999; 9:58.[CrossRef][Web of Science][Medline]
- Koonin EV, Aravind L, Kondrashov AS. The impact of comparative genomics on our understanding of evolution. Cell 2000; 101:5736.[CrossRef][Web of Science][Medline]
- Koonin EV, Makarova KS, Aravind L. Horizontal gene transfer in prokaryotes: quantification and classification. Annu Rev Microbiol 2001; 55:70942.[CrossRef][Web of Science][Medline]
- Gogarten JP, Doolittle WF, Lawrence JG. Prokaryotic evolution in light of gene transfer. Mol Biol Evol 2002; 19:222638.
[Abstract/Free Full Text] - Kurland CG. What tangled web: barriers to rampant horizontal gene transfer. Bioessays 2005; 27:7417.[CrossRef][Web of Science][Medline]
- Kurland CG. Something for everyone. Horizontal gene transfer in evolution. EMBO Rep 2000; 1:925.[Medline]
- Kurland CG, Canback B, Berg OG. Horizontal gene transfer: a critical view. Proc Natl Acad Sci USA 2003; 100:965862.
[Abstract/Free Full Text] - Berg OG, Kurland CG. Evolution of microbial genomes: sequence acquisition and loss. Mol Biol Evol 2002; 19:226576.
[Abstract/Free Full Text] - Novozhilov AS, Karev GP, Koonin EV. Mathematical modeling of evolution of horizontally transferred genes. Mol Biol Evol 2005; 22:172132.
[Abstract/Free Full Text] - Komarova NL, Sengupta A, Nowak MA. Mutation-selection networks of cancer initiation: tumor suppressor genes and chromosomal instability. J Theor Biol 2003; 223:43350.[CrossRef][Web of Science][Medline]
- Nowak MA, Michor F, Komarova NL, et al. Evolutionary dynamics of tumor suppressor gene inactivation. Proc Natl Acad Sci USA 2004; 101:106358.
[Abstract/Free Full Text] - Iwasa Y, Michor F, Komarova NL, et al. Population genetics of tumor suppressor genes. J Theor Biol 2005; 233:1523.[Medline]
- Michor F, Iwasa Y, Nowak MA. Dynamics of cancer progression. Nat Rev Cancer 2004; 4:197205.[CrossRef][Web of Science][Medline]
- Michor F, Iwasa Y, Rajagopalan H, et al. Linear model of colon cancer initiation. Cell Cycle 2004; 3:35862.[Web of Science][Medline]
- Pollett PK. Quasi-stationarity in populations that are subject to large-scale mortality or emigration. Environ Int 2001; 27:2316.[Medline]
- Pollett PK. Modeling the long-time behavior of evanescent ecological systems. Ecological Modeling 1996; 86:1359.
- Perez-Ocona R, Montoro-Cazorlab D. A multiple system governed by a quasi-birth-and-death process. Reliability Engineering and System Safety 2004; 84:18796.
- Berezovskaya FS, Karev GP. Bifurcations of self-similar solutions of the Fokker-Plank Equation. Discrete and Continuous Dynamical Systems, Special 2005 9199.
- Karev GP, Berezovskaya FS, Koonin EV. Modeling genome evolution with a diffusion approximation of a birth-and-death process on World Wide Web URL: http://arxiv.org/ftp/q-bio/papers/0507/0507019pdf. 2005; 21:suppl. 3, iii12iii19.
This article has been cited by other articles:
![]() |
M. K. Basu, E. Poliakov, and I. B. Rogozin Domain mobility in proteins: functional and evolutionary implications Brief Bioinform, May 1, 2009; 10(3): 205 - 216. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||







