Atkinson–Mingarelli theorem: Difference between revisions
en>Sodin →References: lk |
duplicate "piecewise continuous" in the second paragraph |
||
Line 1: | Line 1: | ||
In [[applied mathematics]], '''Graver bases''' enable iterative solutions of linear and various nonlinear [[integer programming]] problems in [[polynomial time]]. They were introduced by [[Jack E. Graver]].<ref name="Gra">Jack E. Graver: On the foundations of linear and linear integer programming, Mathematical Programming 9:207–226, 1975</ref> Their connection to the theory of [[Gröbner basis|Gröbner bases]] was discussed by [[Bernd Sturmfels]].<ref name="Stu">[[Bernd Sturmfels]], ''Gröbner Bases and Convex Polytopes'', [[American Mathematical Society]], xii+162 pp., 1996</ref> The algorithmic theory of Graver bases and its application to integer programming is described by [[Shmuel Onn]].<ref name="Onn">[http://ie.technion.ac.il/~onn Shmuel onn]: [http://books.google.co.il/books?id=yXvfDzHo-WsC&printsec=frontcover&dq=inauthor:%22Shmuel+Onn%22&hl=en&ei=ccjLToGAPMXLhAep-ZznDQ&sa=X&oi=book_result&ct=result&redir_esc=y#v=onepage&q=inauthor%3A%22Shmuel%20Onn%22&f=false ''Nonlinear Discrete Optimization''], [[European Mathematical Society]], x+137 pp., 2010</ref><ref name="MSRI">Shmuel Onn: [http://www.msri.org/web/msri/online-videos/-/video/showSpeaker/200002593 Linear and nonlinear integer optimization], Online Video Lecture Series, [[Mathematical Sciences Research Institute]], Berkeley, 2010</ref> | |||
==Formal definition== | |||
The '''Graver basis''' of an ''m'' × ''n'' integer matrix <math>A</math> is the finite set <math>G(A)</math> of minimal elements in the set | |||
: <math>\{x\in\mathbb{Z}^n : Ax=0,\ x\neq 0\} \, </math> | |||
under a well partial order on <math>\mathbb{Z}^n</math> defined by <math>x\sqsubseteq y</math> when <math>x_iy_i\geq 0</math> and <math>|x_i|\leq |y_i|</math> for all i. For example, the Graver basis of <math>A=(1 2 1)</math> consists of the vectors (2,−1,0), (0,−1,2), (1,0,−1), (1,−1,1) and their negations. | |||
==Solving integer programming using Graver bases== | |||
[[Integer programming]] is the problem of optimizing a linear or nonlinear objective function over the set of integer points satisfying a system of linear inequalities. Formally, it can be written in standard form as follows: | |||
:<math>\min\{f(x)\ :\ x \in \mathbb{Z}^n,\ Ax=b,\ l\leq x\leq u\}\ .</math> | |||
It is one of the most fundamental discrete optimization problems and has a very broad modeling power and numerous applications in a variety of areas, but is typically very hard computationally as noted below. However, given the Graver basis <math>G(A)</math> of <math>A</math>, the problem with linear and various nonlinear objective functions can be solved in polynomial time as explained next. | |||
===Linear integer programming=== | |||
The most studied case, treated thoroughly in,<ref name="Sch">[[Alexander Schrijver]]: ''Theory of Linear and Integer Programming'', Wiley, xii+471 pp., 1986</ref> is that of [[integer programming|linear integer programming]], | |||
:<math>\min\{wx\ :\ x \in \mathbb{Z}^n,\ Ax=b,\ l\leq x\leq u\}\ .</math> | |||
It may be assumed that all variables are bounded from below and above: such bounds either appear naturally in the application at hand, or can be enforced without losing any optimal solutions. But, even with linear objective functions the problem is NP-hard and hence presumably cannot be solved in polynomial time. | |||
However, given the Graver basis <math>G(A)</math> of <math>A</math> it ''can'' be solved in polynomial time using the following simple iterative algorithm.<ref name="Onn" /><ref name="HOW">Raymond Hemmecke, Shmuel Onn, Robert Weismantel: A polynomial oracle-time | |||
algorithm for convex integer minimization, Mathematical Programming | |||
126:97–117, 2011</ref> Assume first that some initial feasible point ''x'' is given. While possible, repeat the following iteration: find positive integer ''q'' and element ''g'' in <math>G(A)</math> such that ''x'' + ''qg'' does not violate the bounds and gives best possible improvement; update ''x'' := ''x'' + ''qg'' and proceed to the next iteration. The last point ''x'' is optimal and the number of iterations is polynomial. To find an initial feasible point, a suitable auxiliary program can be set up and solved in a similar fashion. | |||
===Nonlinear integer programming=== | |||
Turning to the case of general objective functions ''f'', if the variables are unbounded then the problem may in fact be uncomputable: it follows from the solution of [[Hilbert's 10th problem]] (see <ref name="Mat">Yuri V. Matiyasevich: ''Hilbert's Tenth Problem'', MIT Press, xxiv+264 pp., 1993</ref>), that there exists no algorithm which, given an integer polynomial ''f'' of degree 8 in 58 variables, decides if the minimum value of f over all 58-dimensional integer vectors is 0. However, when the variables are bounded, the problem | |||
: <math>\min\{f(x)\ :\ x \in \mathbb{Z}^n,\ Ax=b,\ l\leq x\leq u\}</math> | |||
can be solved using the Graver basis <math>G(A)</math> in polynomial time for | |||
several nonlinear objective functions including: | |||
* '''Separable-convex''' functions of the form <math>f(x)=\sum_{i=1}^n f_i(x_i)</math>; | |||
* In particular the '''p-norm''' <math>f(x)=\|x\|_p</math> for every positive integer ''p''; | |||
* '''Composite-concave''' functions ''f''(''x'') = ''g''(''Wx''), where ''W'' is a ''d'' × ''n'' integer matrix with ''d'' fixed, and where ''g'' is a ''d''-variate concave function; | |||
* Certain '''(in)-definite quadratic''' and '''higher degree polynomial''' functions. | |||
==Some applications== | |||
===Multi-dimensional tables=== | |||
Consider the following optimization problem over three-dimensional tables with prescribed line sums, | |||
:<math>\min\{wx\ :\ x\in{\mathbb{Z}}_+^{l\times m\times n}\,,\ \sum_i x_{i,j,k}=a_{j,k} \,,\ \sum_j x_{i,j,k}=b_{i,k}\,,\ \sum_k x_{i,j,k}=c_{i,j}\}\ ,</math> | |||
with <math>a_{j,k}</math>, <math>b_{i,k}</math>, <math>c_{i,j}</math> nonnegative integers (whose maximum value implicitly bounds all variables from above). Denote by <math>A</math> the (''lm'' + ''ln'' + ''mn'') × (''lmn'') defining matrix of this system. Note that this matrix is generally ''not'' [[totally unimodular]]. Nonetheless, it was shown in <ref name="DHOW">Jesus A. De Loera, Raymond Hemmecke, Shmuel Onn, Robert Weismantel: ''N''-fold integer programming, Discrete | |||
Optimization, 5:231–241, 2008</ref> that for every fixed ''l'' and ''m'', its Graver basis <math>G(A)</math> can be computed in time which is polynomial in ''n''. The results mentioned above allow then to solve this problem in polynomial time for fixed ''l'' and ''m'' and variable ''n''. Note that if only one side ''l'' of the table is fixed (even with ''l'' = 3) while both ''m'' and ''n'' are variable, then the problem is NP hard, as shown in.<ref name="DO1">Jesus A. De Loera, Shmuel Onn: The complexity of | |||
three-way statistical tables, SIAM Journal on Computing, 33:819–836, 2004</ref> More generally, similar positive results hold for higher dimensional table problems (introduced in <ref name="Mot">Theodore S. Motzkin: The multi-index transportation | |||
problem, Bulletin of the American Mathematical Society 58:494, 1952</ref>): for every fixed ''d'' and <math>m_1, \dots, m_d</math>, (non)-linear optimization over <math>m_1 \times \cdots \times m_d \times n</math> tables with variable n and prescribed margins can be done in polynomial time. This has further applications to entry security problems and privacy in statistical databases. | |||
===Multi-commodity flows=== | |||
Consider the ''integer'' [[multi-commodity flow problem]] of routing ''k'' types of integer commodities from ''m'' suppliers to ''n'' consumers subject to supply, consumption, and capacity constraints, so as to minimize the sum of linear or congestion-dependent convex costs on the arcs. Then for every fixed ''k'' and ''m'', the Graver basis of the defining system can be computed and the resulting separable-convex objective function | |||
minimized in time which is polynomial in the variable number ''n'' of consumers and in the rest of the data. | |||
===Other applications=== | |||
The many applications of the algorithmic theory of Graver bases also include stochastic integer programming,<ref name="HOW" /> ''N''-fold integer programming, ''N''-fold 4-block decomposable integer programming,<ref name="HKW">Raymond Hemmecke, Matthias Köppe, Robert Weismantel: A polynomial-time algorithm for optimizing over ''N''-fold 4-block decomposable integer programs, IPCO 14, 2010</ref> clustering, and disclosure control in statistical databases. In some of these applications the relevant Graver basis ''cannot'' be computed in polynomial time, but can be accessed in an indirect way that allows to use it in polynomial time. | |||
==Universality and parametrization== | |||
It was shown in <ref name="DO2">Jesus A. De Loera, Shmuel Onn: All | |||
linear and integer programs are slim 3-way transportation programs, SIAM Journal on Optimization, 17:806–821, 2006</ref> that every (bounded) integer programming problem is precisely equivalent to the 3 × ''m'' × ''n'' table problem discussed above for some ''m'' and ''n'' and some line sums. Thus, such 3 × ''m'' × ''n'' table problems are '''universal''' for integer programming. Moreover, for each fixed ''m'', the resulting class of integer programs can be solved in polynomial time by the iterative Graver basis method described above. So the table width ''m'' provides a '''parametrization''' of all integer programming problems. | |||
==Hierarchy of approximations for integer programming== | |||
Denote by <math>G(m,n)</math> the Graver basis of the matrix <math>A</math> defining the universal 3 × ''m'' × ''n'' table problem discussed above. The elements of <math>G(m,n)</math> are 3 × ''m'' × ''n'' tables with all line sums equal to 0. The ''type'' of such a table is the number of its nonzero 3 × ''m'' ''layers''. It turns out that there is a finite ''Graver complexity function'' ''g''(''m'') such that for any ''n'', the type of any element of the Graver basis <math>G(m,n)</math> is at most ''g''(''m''). It follows that the Graver basis <math>G(m,n)</math> is the union of the <math>{n\choose g(m)}</math> suitably embedded copies of the Graver basis <math>G(m,g(m))</math>. To approximately solve in practice an arbitrary integer programming problem, proceed as follows. First embed it in a suitable 3 × ''m'' × ''n'' table problem as enabled by the universality noted above. Now apply the following hierarchy of approximations of <math>G(m,n)</math>. At level ''k'' of this hierarchy, let <math>G_k(m,n)</math> be the subset of <math>G(m,n)</math> consisting only of those elements of type at most ''k''; use this approximation <math>G_k(m,n)</math> in the iterative algorithm as long as possible to obtain as good as possible feasible point for the integer programming problem embedded in the 3 × ''m'' × ''n'' table problem; if the objective function value of this point is satisfactory (say, as compared to the value of the [[linear programming relaxation]]), then use this point; otherwise increment ''k'' and proceed to the next level of the hierarchy. It can be shown <ref name="Onn" /> that for any fixed level ''k'', the approximation <math>G_k(m,n)</math> of the Graver basis has polynomial cardinality <math>O\left(m^{g(k)}n^k\right)</math> and can be computed in that much time. | |||
==Fixed parameter tractability: from polynomial to cubic time complexity== | |||
The time complexity of solving some of the applications discussed above, such as multi-dimensional table problems, multicommodity flow problems, and ''N''-fold integer programming problems, is dominated by the cardinality of the relevant Graver basis, which is a polynomial <math>O\left(n^g\right)</math> with typically large degree ''g'' which | |||
is a suitable Graver complexity of the system. In <ref name="HOR">Raymond Hemmecke, Shmuel Onn, Lyubov Romanchuk: ''N''-fold integer programming in cubic time, Mathematical Programming, 137:325–341, 2013</ref> a much faster algorithm is presented, which finds at each iteration the best improvement ''x'' + ''qg'' with ''q'' positive integer and ''g'' element in <math>G(A)</math> ''without explicitly constructing the Graver basis'', in cubic time <math>O\left(n^3\right)</math> regardless of the system. | |||
In the terminology of [[parameterized complexity]], this implies that all these problems suitably parameterized, and in particular ''l'' × ''m'' × ''n'' table problems parametrized by ''l'' and ''m'', are '''fixed parameter tractable'''. | |||
==References== | |||
{{reflist}} | |||
[[Category:Linear programming]] |
Revision as of 17:32, 26 July 2013
In applied mathematics, Graver bases enable iterative solutions of linear and various nonlinear integer programming problems in polynomial time. They were introduced by Jack E. Graver.[1] Their connection to the theory of Gröbner bases was discussed by Bernd Sturmfels.[2] The algorithmic theory of Graver bases and its application to integer programming is described by Shmuel Onn.[3][4]
Formal definition
The Graver basis of an m × n integer matrix is the finite set of minimal elements in the set
under a well partial order on defined by when and for all i. For example, the Graver basis of consists of the vectors (2,−1,0), (0,−1,2), (1,0,−1), (1,−1,1) and their negations.
Solving integer programming using Graver bases
Integer programming is the problem of optimizing a linear or nonlinear objective function over the set of integer points satisfying a system of linear inequalities. Formally, it can be written in standard form as follows:
It is one of the most fundamental discrete optimization problems and has a very broad modeling power and numerous applications in a variety of areas, but is typically very hard computationally as noted below. However, given the Graver basis of , the problem with linear and various nonlinear objective functions can be solved in polynomial time as explained next.
Linear integer programming
The most studied case, treated thoroughly in,[5] is that of linear integer programming,
It may be assumed that all variables are bounded from below and above: such bounds either appear naturally in the application at hand, or can be enforced without losing any optimal solutions. But, even with linear objective functions the problem is NP-hard and hence presumably cannot be solved in polynomial time.
However, given the Graver basis of it can be solved in polynomial time using the following simple iterative algorithm.[3][6] Assume first that some initial feasible point x is given. While possible, repeat the following iteration: find positive integer q and element g in such that x + qg does not violate the bounds and gives best possible improvement; update x := x + qg and proceed to the next iteration. The last point x is optimal and the number of iterations is polynomial. To find an initial feasible point, a suitable auxiliary program can be set up and solved in a similar fashion.
Nonlinear integer programming
Turning to the case of general objective functions f, if the variables are unbounded then the problem may in fact be uncomputable: it follows from the solution of Hilbert's 10th problem (see [7]), that there exists no algorithm which, given an integer polynomial f of degree 8 in 58 variables, decides if the minimum value of f over all 58-dimensional integer vectors is 0. However, when the variables are bounded, the problem
can be solved using the Graver basis in polynomial time for several nonlinear objective functions including:
- Separable-convex functions of the form ;
- In particular the p-norm for every positive integer p;
- Composite-concave functions f(x) = g(Wx), where W is a d × n integer matrix with d fixed, and where g is a d-variate concave function;
- Certain (in)-definite quadratic and higher degree polynomial functions.
Some applications
Multi-dimensional tables
Consider the following optimization problem over three-dimensional tables with prescribed line sums,
with , , nonnegative integers (whose maximum value implicitly bounds all variables from above). Denote by the (lm + ln + mn) × (lmn) defining matrix of this system. Note that this matrix is generally not totally unimodular. Nonetheless, it was shown in [8] that for every fixed l and m, its Graver basis can be computed in time which is polynomial in n. The results mentioned above allow then to solve this problem in polynomial time for fixed l and m and variable n. Note that if only one side l of the table is fixed (even with l = 3) while both m and n are variable, then the problem is NP hard, as shown in.[9] More generally, similar positive results hold for higher dimensional table problems (introduced in [10]): for every fixed d and , (non)-linear optimization over tables with variable n and prescribed margins can be done in polynomial time. This has further applications to entry security problems and privacy in statistical databases.
Multi-commodity flows
Consider the integer multi-commodity flow problem of routing k types of integer commodities from m suppliers to n consumers subject to supply, consumption, and capacity constraints, so as to minimize the sum of linear or congestion-dependent convex costs on the arcs. Then for every fixed k and m, the Graver basis of the defining system can be computed and the resulting separable-convex objective function minimized in time which is polynomial in the variable number n of consumers and in the rest of the data.
Other applications
The many applications of the algorithmic theory of Graver bases also include stochastic integer programming,[6] N-fold integer programming, N-fold 4-block decomposable integer programming,[11] clustering, and disclosure control in statistical databases. In some of these applications the relevant Graver basis cannot be computed in polynomial time, but can be accessed in an indirect way that allows to use it in polynomial time.
Universality and parametrization
It was shown in [12] that every (bounded) integer programming problem is precisely equivalent to the 3 × m × n table problem discussed above for some m and n and some line sums. Thus, such 3 × m × n table problems are universal for integer programming. Moreover, for each fixed m, the resulting class of integer programs can be solved in polynomial time by the iterative Graver basis method described above. So the table width m provides a parametrization of all integer programming problems.
Hierarchy of approximations for integer programming
Denote by the Graver basis of the matrix defining the universal 3 × m × n table problem discussed above. The elements of are 3 × m × n tables with all line sums equal to 0. The type of such a table is the number of its nonzero 3 × m layers. It turns out that there is a finite Graver complexity function g(m) such that for any n, the type of any element of the Graver basis is at most g(m). It follows that the Graver basis is the union of the suitably embedded copies of the Graver basis . To approximately solve in practice an arbitrary integer programming problem, proceed as follows. First embed it in a suitable 3 × m × n table problem as enabled by the universality noted above. Now apply the following hierarchy of approximations of . At level k of this hierarchy, let be the subset of consisting only of those elements of type at most k; use this approximation in the iterative algorithm as long as possible to obtain as good as possible feasible point for the integer programming problem embedded in the 3 × m × n table problem; if the objective function value of this point is satisfactory (say, as compared to the value of the linear programming relaxation), then use this point; otherwise increment k and proceed to the next level of the hierarchy. It can be shown [3] that for any fixed level k, the approximation of the Graver basis has polynomial cardinality and can be computed in that much time.
Fixed parameter tractability: from polynomial to cubic time complexity
The time complexity of solving some of the applications discussed above, such as multi-dimensional table problems, multicommodity flow problems, and N-fold integer programming problems, is dominated by the cardinality of the relevant Graver basis, which is a polynomial with typically large degree g which is a suitable Graver complexity of the system. In [13] a much faster algorithm is presented, which finds at each iteration the best improvement x + qg with q positive integer and g element in without explicitly constructing the Graver basis, in cubic time regardless of the system. In the terminology of parameterized complexity, this implies that all these problems suitably parameterized, and in particular l × m × n table problems parametrized by l and m, are fixed parameter tractable.
References
43 year old Petroleum Engineer Harry from Deep River, usually spends time with hobbies and interests like renting movies, property developers in singapore new condominium and vehicle racing. Constantly enjoys going to destinations like Camino Real de Tierra Adentro.
- ↑ Jack E. Graver: On the foundations of linear and linear integer programming, Mathematical Programming 9:207–226, 1975
- ↑ Bernd Sturmfels, Gröbner Bases and Convex Polytopes, American Mathematical Society, xii+162 pp., 1996
- ↑ 3.0 3.1 3.2 Shmuel onn: Nonlinear Discrete Optimization, European Mathematical Society, x+137 pp., 2010
- ↑ Shmuel Onn: Linear and nonlinear integer optimization, Online Video Lecture Series, Mathematical Sciences Research Institute, Berkeley, 2010
- ↑ Alexander Schrijver: Theory of Linear and Integer Programming, Wiley, xii+471 pp., 1986
- ↑ 6.0 6.1 Raymond Hemmecke, Shmuel Onn, Robert Weismantel: A polynomial oracle-time algorithm for convex integer minimization, Mathematical Programming 126:97–117, 2011
- ↑ Yuri V. Matiyasevich: Hilbert's Tenth Problem, MIT Press, xxiv+264 pp., 1993
- ↑ Jesus A. De Loera, Raymond Hemmecke, Shmuel Onn, Robert Weismantel: N-fold integer programming, Discrete Optimization, 5:231–241, 2008
- ↑ Jesus A. De Loera, Shmuel Onn: The complexity of three-way statistical tables, SIAM Journal on Computing, 33:819–836, 2004
- ↑ Theodore S. Motzkin: The multi-index transportation problem, Bulletin of the American Mathematical Society 58:494, 1952
- ↑ Raymond Hemmecke, Matthias Köppe, Robert Weismantel: A polynomial-time algorithm for optimizing over N-fold 4-block decomposable integer programs, IPCO 14, 2010
- ↑ Jesus A. De Loera, Shmuel Onn: All linear and integer programs are slim 3-way transportation programs, SIAM Journal on Optimization, 17:806–821, 2006
- ↑ Raymond Hemmecke, Shmuel Onn, Lyubov Romanchuk: N-fold integer programming in cubic time, Mathematical Programming, 137:325–341, 2013