Assignment 6: Linear arithmetic

In the class, we discussed the theory of linear arithmetic (shortened as LA in the following) and linear programming (shortened as LP in the following). Arithmetical constraints are ubiquitous in every aspect of formal reasoning and (hard) problem solving, and both LA and LP are very powerful tools to establish these models and to solve the problems.

From the computer science point of view, most LA and LP problems can be solved by special-purpose software or libraries, one such example is the IBM's very popular CPLEX. However, most modern solvers/provers also have good enough support for both LA and LP theories, by incorporating well-tuned decision procedures for LA/LP over the integers, rationals or real domains. Take Z3 as an example, it has decent support for both LA and LP.

In this assignment, you'll be familiarizing yourself with the background theories of LA/LP, with the important applications of LA/LP to real-world (hard) problem (by using Z3), and with the underpinning algorithms of modern provers/solvers to solve these problems.

This assignment is divided into three parts, each of which contains both some tutorials and problems. The first part introduces the basic background and knowledge of LA/LP theory; the second part discusses the applications of LA/LP theory, by solving some (NP-Complete hard) problems; and the third part covers the algorithm implementation including the Fourier-Motzkin variable elimination, Simplex. Some problems are tagged with Exercise, which you should solve. And several problems are tagged with Challenge, which are optional. Download this code template to start with.

Before starting with this assignment, make sure you've finished Software Setup in the assignment 1, and have both Z3 and Python (3.7+ recommended) properly installed on your computer (or laptop). For any problems, please feel free to contact us for help.

Hand-in Procedure

When you finished the assignment, zip you code files with file name studentid-assignment6.zip (e.g SA19225111-assignment6.zip), and submit it to Postgraduate Information Platform. Any late submission will NOT be accepted.

Part A: Basic Linear Arithmetic Theory

In this section, we'll discuss the basic LA/LP theories, by using the solver Z3.

Basic Linear Arithmetic

The solver Z3 has complete support for this syntax, and Z3 can solve a group of equalities/inequalities very efficiently. Suppose we can have two numbers $x, y\in $ ℝ, and the following constraints: \begin{cases} x + y = 0.8 \\ x - y = 0.2 \end{cases}

and Z3 will output the model it finds, if the above constraints are satisfiable.

Exercise 1: Read the code for this exercise we give in the la-theory.py Python file, we have given all the code for this exercise. Make sure you understand the code, and run the code, what's the output?

Of course, not all constraints are satisfiable, for example, consider the following constraints on two variables $x, y \in $ ℝ: \begin{cases} x + y = 0.8 \\ x + y = 0.2 \end{cases} where $$ x,y \in ℝ, $$ obviously, these constraints are unsatisfiable.

Exercise 2: Read the code for this exercise we give in the la-theory.py Python file. Finish this exercise by filling in the missing code. Your code should output "unsat", if your implementation is correct.

Besides the domain real ℝ ("Reals"), Z3 also has decent support for the integer domain ℤ. For instance, for two integer variables $x, y \in $ ℤ, this code

declares two integers $x$ and $y$ in Z3. Similarly, we can ask Z3 to check such constraints on Integers, for instance, given the following constraints: $$ \begin{cases} x + y = 8 \\ x - y = 2 \end{cases} $$ where $$ x,y \in ℤ, $$ and give the model.

Exercise 3: Read the code for this exercise we give in the la-theory.py Python file. Finish this exercise by filling in the missing code. What's the output?

One key point to note here is that: when we talk about the satisfiability of constraints, we must make it clear what domains (ℝ, ℤ, ℚ or others?) we are working on, because the satisfiability of constraints is dependent on the underlying domain. To understand this, let's consider the following example, given these constraints: \begin{cases} x + y = 8 \\ x - y = 1 \end{cases} and we can conclude that these constraints are satisfiable on domain ℝ, but not on domain ℤ.

Exercise 4: Read the code for this exercise we give in the la-theory.py Python file. Finish this exercise by filling in the missing code to check the above constraints are satisfiable domain ℝ, but not on domain ℤ.

0-1 Integer Linear Arithmetic

There is one important special case for the LA on domain ℤ, which is called the 0-1 integer linear arithmetic (or 0-1 ILA). In this case, all integer variables can only be of two values: $0$ or $1$, in this sense, they corresponds, perfectly, to boolean values (true or false).

The 0-1 Integer LA (as well as the 0-1 Integer Linear Programming we'll be discussing in the following) has many nice properties and many important applications, we'll discuss some in the following. But for now, let's start by considering one interesting problem, to let us appreciate 0-1 ILA's simplicity and power.

The problem is this: suppose we are given a list $l$ of integers $$ [e_1, e_2, \ldots, e_n], $$ and we are required to write a program, to check whether or not there is at least one zero $0$ in the list $l$.

Of course, it's not to write a simple Python program to solve this problem. But for our purpose, let's consider how to solve this problem using 0-1 integer LA. For this, let's first declare a list $X$ of auxiliary 0-1 integer variables $$ [x_1, x_2, \ldots, x_n], $$ which satisfy $$ x_i\in\left\{0, 1\right\}, \text{ for } 1 \le i\le n, $$ and also $$ \sum_{i=1}^{n} x_i = 1. \tag{1} $$

Informally speaking, there is just one $1$ in the list $X$. For the given input integer list $l$ (which is being searched for $0$s): $$ [e_1, e_2, \ldots, e_n], $$ we require $$ \sum_{i=1}^{n} \left(x_i*e_i\right) = 0. \tag{2} $$

Exercise 5: Why the above constraints $(1)$ and $(2)$ can guarantee that there is at least one $0$ in the input list $l$ of integers? You don't need to write code, just convince yourself the correctness of these constraints.

Exercise 6: Fill in the missing code in the function check_zero_la(), to generate the necessary constraints. If your code is right, this function will output something like $$ [x_2 = 0, x_1 = 1, x_3 = 0, x_0 = 0], $$ what does this output imply?

Challenge: Using 0-1 ILA, write a program to count the number of $0$s in a given list of integers. For instance, in the list $$ [4, 7, 0, 3, 0, 9, 0], $$ there are three $0$s. Don't forget to test your code. How does your program behave on large input list?

Non-linear constraints and domain $\mathbb{Q}$

In previous sections, we've always been talking about the linear arithmetics, but many solvers, including Z3, also support the satisfiability solving on non-linear constraints.

Suppose we want to check the constraint $$ x * x = 2, $$ we can write (as we may expect):

Exercise 7: Write some code to check that this non-linear constraint $$ x*x + y*y \lt 0 $$ on two real numbers $x, y \in $ ℝ, is unsat.

One important application of the non-linear arithmetic is to reason constraints on rational domain $\mathbb{Q}$. We have studied linear arithmetic both on the real domain ℝ and the integer domain ℤ, Z3 has built-in supports for these two domains. However, the bad news is that Z3 does not support rational domain $\mathbb{Q}$ directly. To mitigate this, we can use a very important property of rational numbers: suppose $x \in \mathbb{Q}$ is a rational number, then we have $$ x = p/q \tag{3}, $$

for $$ q \neq 0, $$ where $p, q \in \mathbb{Z}$. The formulae $(3)$ can be rewritten into $$ x*q = p \tag{4}. $$ Please notice that the equation $(4)$ is also non-linear.

Let's study one example to see how this strategy works in practice, consider our previous example once again, we can check the constraint $$ x*x = 2 \tag{5} $$ on rational domain $\mathbb{Q}$, it's unsatisfiable.

Exercise 8: Please fill in the missing code to check the satisfiability of the above equation $(5)$.

Part B: LA/LP Applications

In this section, we will study how to establish models and solve problems using the LA/LP theories, with the aid of the Z3 solver.

To be specific, we'll solve several very hard problems, by "hard", we mean that these problems are all NP-complete, that is, there are no known polynomial algorithms for these problems in general.

However, you will soon see that how easy (and how fun) it is to solve these problems with LA/LP solvers, such as Z3. And you'll acknowledge the power and simplicity of the solver-based problem solving strategy, in general.

The NP-Complete problems that we'll be solving are (some of them have been discussed in our previous lectures):

Note that this is not a complete list of problems that a SMT-based solver can handle, after you have learned this technique, you are encouraged to try to solve other NP-Complete problems as well.

Subset Sum Problem

The subset problem is a well-known satisfiability problem: given a multiset (a multiset is like a normal set, expect for the elements can be duplicated) $S$, we should decide a non-empty subset $T$ of $S$, such that $$ \sum T = 0, \text{ where } T \subseteq S. $$

For example, given this set $$ \left\{-7, -3, -2, 9000, 5, 8\right\}, $$ the answer is yes because this subset $$ \left\{-3, -2, 5\right\} $$ sums to zero.

This problem is NP-Complete, and for more background information of the subset problem, please refer to this article on the subset sum problem.

Exercise 9: Read the code in the subset_sum.py Python file, complete the subset_sum_la() method which solve the subset sum problem using 0-1 ILA. The basic idea is create flag $ F $ for each number in the target set $ T $, and we just need to add the constraint $$ \sum_{i=0}^{N-1} \left(F[i]*S[i]\right) = 0, $$ where $N$ is the length of target set $S$.

Exercise 10: Read the code in the subset_sum.py Python file, a dynamic programming-based (DP) solution is already provided by subset_sum_dp() method. And another LA-based solution subset_sum_la_opt() which is optimized by Z3's Pseudo-Boolean constraints is also provided. Try to compare the efficiency of the DP, the LA algorithm and the optimized LA algorithm, by changing the value of max_nums to other values, say, 200, 2000, 20000, ..., what's your observation? What conclusion you can draw from these data?

Challenge: The DP solution provided in subset_sum.py file is based on recursion, it may affect the efficiency of the algorithm. Try to implement a DP solution without recursion, and compare the efficiency of the DP and the LA algorithms. what's your observation?

N-Queen Problem

In the assignment 3 (challenge problem), we've solved the n-queen problem (4 Queens) by SAT. The problem is about placing $ N $ chess queens on an $ N*N $ chessboard, so that no two queens threaten each other. A solution requires that no two queens share the same row, column, diagonal or anti-diagonal. The following figure shows a solution to the sample n-queen puzzle where $ N = 4 $: Four Queen

The problem's target is try to find how many solutions exist, for a $N*N$ chess board.

The basic idea of SAT implementation is to construct the n-queen puzzle constraints by Bool values. Actually，we can solve n-queen problem by LA, which is much easier to understand and more efficient compare to SAT. The idea is same with the solution of subset sum problem. We use a two dimensional 0-1 flag $F$ to represent each cell of the chessboard, the $F$ has value: $$ F[i][j] = \begin{cases} 0, & \text{if cell in $i$ row $j$ column is not placed a queen}; \\ 1, & \text{if cell in $i$ row $j$ column is placed a queen}; \\ \end{cases} $$ where $0 \le i \lt N$ and $0 \le j \lt N$. We can establish the constraints of the n-queen puzzle as follows：

Exercise 11: Read the code in the queen.py Python file, complete the n_queen_la() method which solve the n-queen problem using 0-1 ILA. You can construct the constraints by referring to the models we have discussed above, or you can propose your own constraints.

Another method to solve the n-queen problem is to use the backtracking algorithm, but the complexity is exponential with respect to the chess board size $N$.

Exercise 12: Read the code in the the queen.py Python file, there is a backtracking-based solution in the n_queen_bt() method. Try to compare of the backtracking and the LA algorithms, by changing the value of the chessboard size $N$ to other values, which one is faster? What conclusion you can draw from the result?

The above LA implementation is not the unique algorithm to solve n-queen problem. In fact, the way we establish constraints to describe a problem often has a big impact on the efficiency of the algorithm.

Exercise 13: Read the code of the method n_queen_la_opt() in the queen.py Python file. Try to compare the efficiency of this method with your implementation in exercise 11. What's your observation? What conclusion you can draw?

Challenge: Try to run your LA solution to the n-queen problem by increasing the chessboard size $N$. What's the maximal value of $N$ your program can handle?

Linear Programming

Generally speaking, a linear programming problem (shorted as LP in the following) consists of two parts: the constraints $C$ and the target function $F$. And our goal is to get the minimal or the maximal values of $F$ under the constraint $C$. Both the constraints and the target function are of linear forms. In this part, we will study some LP problems, also with the aid of Z3.

For example, given the constraints \begin{cases} x &\lt 2 \\ y - x &\lt 1 \\ \end{cases} we want to calculate $$ \max\left(x + 2y\right). $$

To solve LP with Z3, we'll be using the Optimize module and, to be specific, the maximize() API. For more details about the module and the API, you can refer to the official documents.

Exercise 14: Read the code in the linear_programming.py Python file. Run the example and make sure you understand the code. Finish the exercise by filling in the missing code to add the constraints \begin{cases} x - y &\ge 2.1 \\ x + z &\le 5.5 \\ y - z &\le 1.1 \\ \end{cases} and calculate $$ \max\left(x + y + z\right). $$

Knapsack Problem

The knapsack problem is a typical optimization problem，which has been studied for hundred of years. The problem is: given a set of items, each item has a weight and a value, determine the number of items such that the total weight is less than a given limit and the total value is as large as possible. There are a number of sub-problems of the knapsack problem: 0-1 knapsack problem, complete knapsack problem, multiply knapsack problem, multi-dimensional knapsack problem and so on. Please refer to knapsack problem to get more information.

Let's discuss 0-1 knapsack problem first, which assumes that there is only one quantity per item. In the following, we use the following notations: \begin{align} W: &\text{ item's weight}\\ V: &\text{ item's value}\\ C: &\text{ the bag's capacity}\\ \end{align} and we use a flag $F$ for each item with $$ F[i] = \begin{cases} 0, & \text{if item $i$ is not chosen}; \\ 1, & \text{if item $i$ is chosen}. \\ \end{cases} $$ To guarantee that the total weight can not exceed the bag capacity $C$, we have $$ \sum_{i=0}^{N-1} \left(F[i]*W[i]\right) \le C, $$ where $N$ is the total number of items. The goal of our optimization is to maximize the value of the chosen items $$ \max\left(\sum_{i=0}^{N-1} \left(F[i]*V[i]\right)\right). $$

Exercise 15: Read the code in the knapsack.py Python file, complete the code of the zero_one_knapsack_lp() method, to solve the 0-1 knapsack problem by using 0-1 ILP. Don't forget to test your code.

Another sub-problem of knapsack problem is the complete knapsack problem, which assumes that the number of items of all kinds is unlimited, your can choose one kind of item any times. So we need to declare a variable for each kind of item $F$ have chosen by amount $A$： $$ F[i] = A, $$ where $0 \le i \lt N$ and $0 \le A$. So the constraint of total weight is $$ \sum_{i=0}^{N-1} (F[i]*W[i]) \le C, $$ where $N$ is the total number of the kinds of items. The maximal value of the chosen items is $$ \max\left(\sum_{i=0}^{N-1} (F[i]*V[i])\right). $$

Exercise 16: Read the code in the knapsack.py Python file, fill in the code in the complete_knapsack_lp() method to solve the complete knapsack problem. Don't forget to test your code.

Exercise 17: Read the code in the knapsack.py Python file, a dynamic programming-based (DP) solution of 0-1 knapsack problem is already provided in the zero_one_knapsack_dp() method. Try to compare the efficiency of the DP and the LP algorithms using the data generated by get_large_test() method. What's your observation? What conclusion you can draw from the result?

Linear Regression

In statistics, a linear regression is a linear approach to modelling the relationship between a scalar response and one or more explanatory variables (also known as dependent and independent variables).

In recent years, linear Linear regression plays an important role in the field of artificial intelligence and machine learning. The linear regression algorithm is one of the fundamental supervised machine learning algorithms for its relative simplicity and well-known properties. Interested readers can refer to materials on deep learning, for instance, Andrew Ng gives a nice introduction to linear regression from a deep learning point of view (up to page 7).

However, as this is not a deep learning course, so we'll get access to the problem from the mathematical point of view. We start by studying one concrete example, given the following data (in deep learning's terminology, these are called the training data): \begin{align} X &= [1.0, 2.0, 3.0, 4.0]\\ Y &= [1.0, 3.0, 5.0, 7.0] \end{align} Our goal is to produce a linear function: $$ y = k*x + b, $$ such that it fits the above data as close as possible, where the variables $k$ and $b$ are unknown variables. By "as close as possible", we use a least squares method, that is, we want to minimize the following expression: $$ \min\left(\sum_{i=0}^{N-1} (Y[i] - (k*X[i]+b))^2\right), \tag{6} $$ where $N$ is the length of $X$ or $Y$.

Now the next step is to solve the equation $(6)$ to calculate the values for the variables $k$ and $b$. We will use the Z3 to finish this task, as Z3 also supports some non-linear constraint solving.

Exercise 18: Read the code in the linear_regression.py Python file, you need to install the matplotlib package to run this code. You can install matplotlib by pip:

    pip install matplotlib

Or, you can install them via PyChram's preference as we've done in Software Setup of the assignment 1. After setting the environments, complete the lr_training() method to do the linear regression using Z3.

The popular approach used extensively in deep learning is the gradient decedent algorithm, if you're interested in this algorithm, the above Andrew Ng's notes contains a nice introduction. In most situations, you don't need to re-invent the wheels, there are lots of efficient machine learning libraries in Python, say, the scikit-learn. You can use them directly to finish the task.

Exercise 19: In the linear_regression_ml.py Python file, we have provided a linear regression implementation based on the scikit-learn deep learning library. You do not need to write any code, but you need to install both the numpy and scikit-learn package via pip:

    pip install numpy
    pip install scikit-learn

Or, you can install them via PyChram's preference as we've done in Software Setup of the assignment 1. After setting the environments, you need to compare the machine learning approach and the LP approach which you implemented in exercise 17 by trying some different training data. Do the two algorithms produce the same results? What conclusion you can draw from the result?

Part C: LA Algorithms (Optional)

All problems in this section are optional, please download the code template for part C to start with.

In the lecture, we've discussed three algorithms for solving LA constraints: the Fourier-Motzkin variable elimination algorithm, the Simplex algorithm, and the branch-bound algorithm. In this part of the assignment, you will implement the former two of them.

Preparation

Before we start implementing the algorithms, let's introduce some helpful data structures.

First we need a data structure to describe the inequalities， To simplify things here， we assume that all the constraints conform to the syntax for LA which is introduced in Part A section. It means that all the constraints only take $=$ or $\le$ or $<$ as relation operator.

Challenge: Read the code in the constraint.py Python file, and try to understand the data structure Constraint. You may try to write some constraints based on it.

The data structure in the constraint.py Python file may suitable for describing a proposition in LA theory，but it's really not convenient to implement the LA algorithms by using them。Implementing a convenient data structure cost you a lot of time. Fortunately, some libraries provide us these facilities. To be specific, we'll be using the library pandas, which is a powerful and easy-to-use open source library. By using the DataFrame data structure in pandas to store the LA proposition, we will save a lot of engineering efforts.

This is the core data structure we need to implement the simplex algorithm. But implementing such a data structure may cost you a lot of time. Fortunately, some libraries provide us these facilities. To be specific, we'll be using the library pandas, which is a powerful and easy-to-use open source library. By using the DataFrame data structure in pandas to present auxiliary tableau, we will save a lot of engineering efforts.

Challenge: Execute pip install pandas to install pandas and take a moment to read the quickstart page. And make yourself familiar with it by finishing the pandas exercises in pandas_exercise.py file.

Challenge: Read the code in the tableau.py Python file, and try to understand the data structure Tableau which based on the pandas DataFrame class. Try to convert some constraints to Tableau.

Fourier-Motzkin Variable Elimination

The Fourier-Motzkin elimination method, also known as the FME method, is a mathematical algorithm for eliminating variables from a system of linear inequalities. It can operate on variables from real domains. The algorithm can be summarized as the following steps:

1. Equation elimination

Assumes we want to solve the proposition $P$ based on LA theory. The first step is find the constraints which contain $=$ relation operator $$ \sum_{j=1}^{n}a_{ij} * x_j = b_i, $$ and rewrite them to $$ x_p = \frac{b_i}{a_{ip}} - \sum_{\substack{1 \leq j \leq n \\ j \neq p}} \frac{a_{ij}}{a_{ip}} * x_j. \tag{7} $$ Replace all the variable $x_p$ in proposition $P$ by the equation $(7)$. It can elimination the equation and all the variable $x_p$ in proposition $P$. Do this for all the equations in $P$ and rewrite the proposition $P$ to $$ \bigwedge\limits_{i=1}^m\sum_{j=1}^{n}a_{ij} * x_j \leq b_i. $$

2. Unbounded variable elimination

In LA theory proposition $P$, the variable $x_p$ have no lower bound if all the coefficients of variable $x_p$ are positive；the variable $x_p$ have no upper bound if if all the coefficients of variable $x_p$ are negative. We call a variable an unbounded variable if it has no upper bound or no lower bound，on the contrary, a bounded variable has both upper bound and lower bound. Unbounded variables can be simply removed from the proposition $P$ together with all constraints that use them. Removing these constraints can make other variables unbounded. Thus, this simplification stage iterates until no unbounded variables remain.

3. Bounded variable elimination

If there is no unbounded variable in the proposition $P$, we can choose one bounded variable to eliminate. The choice of variables is heuristic， we assume that the chosen variable is $x_q$. We can get an upper bound of variable $x_q$ by $$ x_q \leq \frac{b_i}{\lvert a_{iq} \rvert} - \sum_{\substack{1 \leq j \leq n \\ j \neq q}} \frac{a_{ij}}{\lvert a_{iq} \rvert} * x_j $$ if $a_{ij} > 0$, and get one lower bound by $$ x_q \geq \sum_{\substack{1 \leq j \leq n \\ j \neq q}} \frac{a_{ij}}{\lvert a_{iq} \rvert} * x_j - \frac{b_i}{\lvert a_{iq} \rvert}， $$ if $a_{ij} < 0$. Assume there are $m$ positive and $n$ negative occurrences of $a_{ij}$ in proposition $P$, we can set the upper bounds as $$ U_1(x)，\dots，U_m(x), $$ and the lower bounds of proposition $P$ as $$ L_1(x)，\dots，L_n(x). $$ Let's continue to assume that there are $k$ constraints do not contain the variable $x_q$: $$ D_1(x)，\dots，D_k(x). $$ We need all the lower bounds smaller than the upper bounds of variable $x_q$ if we want to eliminate it. So we need compare them one by one and get the proposition without variable $x_q$: $$ \left( \bigwedge\limits_{j=1}^m \bigwedge\limits_{i=1}^n(L_i(x) - U_j(x) \leq 0) \right) \bigwedge \left(\bigwedge\limits_{r=1}^k D_r(x) \right). $$ After eliminate the bounded variable $x_q$, if there only have one variable left in proposition $P$, go to step 4. Or else, go back to step 2.

4. Solve the last variable

Running an elimination step over n inequalities can result in at most $(m^2)/4$ inequalities in the output, if the proposition $P$ has $m$ constraints and $n$ variables. Thus running the successive steps can result in at most $m^{2^n}/4^n$ complexity.

Simplex

Simplex algorithm is originally developed to solve the linear programming (LP) problems by George B. Dantzig in 1947, and SMT for LA is a sub-problem of LP in which there is no the optimization goal. The key idea of Simplex algorithm can be divided into the following steps:

Assignment 6: Linear Arithmetic and Linear Programming

Overview