Team:USTC Software/hoWMMD

{|-
 * valign = "top"|


 * align = "justify"|

Mathematical Formulation
High dimensional model representation(HDMR) is a general set of quantitative model assessment and analysis tools for capturing high dimensional IO system behavior. As the impact of the multiple input variables on the output can be independent and cooperative, it is natural to express the model output fx as a finite hierarchical correlated function expansion in terms of the input variables:

$$ f(x) = f_{0}+\sum_{i=1}^{n}f_{i}(x_{i})+\sum_{1\leq i<j\leq n} f_{ij}(x_{i}, x_{j})+ $$


 * $$\sum_{1\leq i<j<k\leq

n}f_{ijk}(x_{i},x_{j},x_{k})+...+ $$



\sum_{1\leq i_{1}<...<i_{l}\leq n}f_{i_{1}i_{2}...i_{l}}(x_{i1},x_{i2},...,x_{il})+...+f_{12...n}(x_{1},x_{2},...,x_{n}) $$

where the zeroth-order component function $$f_{0}$$ is a constant representing the mean response to $$f(x)$$, and the first order component function $$ f_{i}(x_{i})$$ gives the independent contribution to $$f(x)$$ by the ith input variable acting alone, the second order component function $$f_{ij}(x_{i},x_{j})$$ gives the pair correlated contribution to $$f(x)$$ by the input variables $$x_{i}$$ and $$x_{j}$$, etc. The last term contains any residual $$n$$th order correlated contribution of all input variables.

The basic conjecture underlying HDMR is that the component functions arising in typical real problems are likely to exhibit only low order l cooperativity among the input variables such that the significant terms in the HDMR expansion are expected to satisfy the relation: l<1. An HDMR expansion to second order

$$\begin{align} f(x)=f_{0}+\sum\limits_{i=1}^{n}f_{i}(x_{i})+\sum\limits_{1\leq i<j\leq n}f_{ij}(x_{i},x_{j}), \\ \end{align}\,\!$$

often provides a satisfactory description of $$f(x)$$ for many high dimensional systems when the input variables are properly chosen. This is also the formula we use to achieve HDMR expansion. Similarly, reactants of two at most are usually involed in biochemical reactions. Thus, we can simplify the differential equations to the following form:

$$\begin{align} \frac{dx_{k}}{dt} &=&\sum\limits_{i=1}^{n}f_{i}(x_{i},{\mathbf{p}} _{i})+\sum\limits_{1\leq i<j\leq n}f_{ij}(x_{i},x_{j},{\mathbf{p}} _{ij})-kx_{i}+C_{k}\\ \end{align}\,\!$$

$$\begin{align}s.t.\text{ algebra equations} \\ \end{align}\,\!$$

for $$k=1,\ldots ,n.$$ The left side term of the equation above represents the changing rate of variable $$x_{k}.$$We assume that each term on the right side represents one step biochemical reaction which is equivalent to the rate term defined in SBML. It means that changing rate of one variavle is determined by different rate terms. $$C_{k}$$ is a constant number. $$f_{i}(\cdot )$$ and $$f_{ij}(\cdot )$$ have similar definitions and represent reaction rates that vary according to kinetic laws. $${\mathbf{p}}_{i}$$ and $${ \mathbf{p}}_{ij}$$ contain the parameter values of the kinetic law. It should mention that the last term$$-kx_{i}$$ represents the degradation of $$x_{i},$$ which seems to belong to $$f_{i}(x_{i})$$ that may render degradation term undermined, we should add such a term for its ubiqutious existence in biological systems.

It is illustrative to consider the network generation task using a compact mathematical framework. Especially for metabolic networks, the structural and kinetic information can be well summarized using a time variant concentration vector $$\mathbf{x}$$, a time invariant stoichimetric matrix $$S$$ , and a time variant reaction rate vector $$\mathbf{v}$$. Vector $$v$$ contains concentrations for all the $$k$$ species $$(x_{k},k=1,...,n)$$. The $$n\times m$$ matrix $$S$$ represents the network structure by storing stoichiometric coefficients of all n reactions in its columns. The element $$S(i,j)>0$$ if reaction $$j$$ produces species $$i$$, $$S(i,j)<0$$ if reaction $$j$$ consumes species $$i$$, and otherwise $$S(i,j)=0$$. The reaction rate vector $$\mathbf{v}$$ describes reaction rates $$v_{j}$$, $$j=1,...,n$$. Reaction rates vary according to kinetic laws which are linear or nonlinear algebraic functions.

Typically, kinetic laws determine the rates based on the amounts of species participating to reactions as well as various reaction specific parameters. Altogether, an ODE model can be formulated as

$$\begin{align} \frac{d\mathbf{x}}{dt}=S\mathbf{v} \\ \end{align}\,\!$$

The reaction rates $$v_{j}$$, $$j=1,...,m$$ are determined by kinetic laws $$f_{j} $$ as

$$\begin{align} v_{j}=f_{j}({\mathbf{x}}_{j},{\mathbf{p}}_{j}), \\ \end{align}\,\!$$

in which $${\mathbf{x}}_{j}$$ includes concentrations of species taking part in the reaction $$j$$, and $${\mathbf{p}}_{j}$$ contains the parameter values of the kinetic law.

Candidate Kinetic Laws
Inspired by Matlab SimBiology (The MathWorks™), we list possible kinetic laws that are mostly encountered in biochemical reactions. These candidate kinetic laws are utilized in design sub level 1,2,3 and will be shuffled in Genetic Algorithm.


 * valign = "top"|


 * }
 * }