Literature DB >> 23251229

Linear program relaxation of sparse nonnegative recovery in compressive sensing microarrays.

Linxia Qin¹, Naihua Xiu, Lingchen Kong, Yu Li.

Abstract

Compressive sensing microarrays (CSM) are DNA-based sensors that operate using group testing and compressive sensing principles. Mathematically, one can cast the CSM as sparse nonnegative recovery (SNR) which is to find the sparsest solutions subjected to an underdetermined system of linear equations and nonnegative restriction. In this paper, we discuss the l₁ relaxation of the SNR. By defining nonnegative restricted isometry/orthogonality constants, we give a nonnegative restricted property condition which guarantees that the SNR and the l₁ relaxation share the common unique solution. Besides, we show that any solution to the SNR must be one of the extreme points of the underlying feasible set.

Entities: Chemical Disease Gene Species

Mesh：

Substances：
DNA

Year: 2012 PMID： 23251229 PMCID： PMC3509541 DOI： 10.1155/2012/646045

Source DB: PubMed Journal: Comput Math Methods Med ISSN： 1748-670X Impact factor: 2.238

1. Introduction

Nowadays, with the rapid development of molecular biology techniques, scientists use compressive sensing microarrays to collect the gene expression changes of patients suffer from specific diseases and test a lot of different drugs on cells genetically to look for medicine being able to change the abnormal gene expression [1, 2]. A DNA microarray is a collection of microscopic DNA spots attached to a solid surface. Each DNA spot contains a string of specific DNA sequences, known as probes. These can be a short section of a gene or other DNA element that are used to hybridize an organism's genetic sample under high-stringency conditions. Probe-target hybridization is usually detected and quantified by detection of chemiluminescence-labeled targets to infer the genetic makeup in the test sample. Although the number of DNA sequences is extremely large, not all agents are expected to be present in a significant concentration at a given time and location. In traditional microarrays, this results in many inactive probes during sensing. On the other hand, we are often interested in only a small quantity of certain harmful biological agents. Therefore, it is important to not just detect the presence of agents in a sample but also estimate the concentrations with which they are present. Assume that there are m spots and n labeled targets, and we have far fewer spots than target agents such that m ≪ n. Mathematically, one can represent the DNA concentration of each organism as an element in a vector x ∈ ℝ and the measurements as b ∈ ℝ. For 1 ≤ i ≤ m and 1 ≤ j ≤ n, the probe at spot i hybridizes to target j with probability a . The target j occurs in the tested DNA sample with concentration x , which is clearly nonnegative. Denoting by A : = (a ), the process of DNA microarrays leads to the sparse nonnegative recovery (SNR) which is to find the sparsest solutions subjected to an underdetermined system of linear equations and nonnegative constraints, with the mathematical model as follows: where the variable vector x ∈ ℝ, ||x||0 denotes the number of the nonzero entries of x, A ∈ ℝ is the measurement matrix with full row rank, m ≪ n, and b ∈ ℝ. SNR can be regarded as a special case of the sparse recovery, which is related to program min{||x||0 | Ax = b}. This program has sparked the significant concern and rapid development in recent years [3-5] owing to its wide applications. However, with the nonnegativity prior information about the object to be recovered in various applications such as CSM, solutions on (() tend to be closer to the actual situations and lead to substantial improvements in the image reconstruction. Moreover, with the nonnegative constraints, the feasible set becomes a polyhedral set instead of an affine subspace. This will bring us essential hardness in projecting on the feasible set. Thus, (() is more likely difficult to solve. Therefore, SNR deserves specific study. Problem (() has been shown to be NP-hard [6, 7] in general from the perspective of computational complexity. One popular approach is to reconstruct the vector via the l 1 relaxation, which refers to Since (() is a standard linear program, it is easy to solve. An important issue is how to guarantee the equivalence of (() and (() in the sense that they have the same unique k-sparse solution under some conditions. Here, we call a vector x k-sparse if the number of its nonzero entries is no more than k. There has been some increasing interest and activity in this area; see, for example, [8-14]. Donoho and Tanner [9] firstly proposed that (() and (() share the common k-sparse unique solution if the polytope AT is outwardly k-neighborly, where T is the standard simplex in ℝ. Zhang [13] proved that (() and (() share the common k-sparse unique solution if the null space of A is strictly half k-balance. Juditsky et al. [11] developed several different necessary and sufficient conditions for the (()-(() equivalence in the case of general type sign restrictions, including the nonnegative constraints as its special case. When the feasible set of (() is a singleton, the unknown can be recovered by optimizing any objective function over this constraint set, and (() and (() definitely get the same unique solution. In this case, Bruckstein et al. [8] got the uniqueness of the feasible solution under a sufficient condition that A has a rowspan intersecting the positive orthant. Furthermore, Wang et al. [14] proved that the above sufficient condition is also necessary to the uniqueness of the feasible solution. Donoho and Tanner [10] proved that the underlying feasible set is a singleton if and only if the polytope Aℝ+ and ℝ+ have the same number of k-faces. Khajehnejad et al. [12] gave another equivalent condition of the uniqueness property by characterizing the support size of vectors in the null space of A. For the l 1 relaxation of sparse recovery, one of the most significant conditions is the restricted isometry property (RIP), named by Candès and Tao [15] with the groundbreaking work of Donoho et al. [16, 17]. However, to the best of our knowledge, the nonnegative case of RIP has not been investigated. This paper will deal with this issue. We begin with investigating the solution property of SNR and show that any solution to the SNR must be one extreme point of its feasible set in Section 2. We prove in Section 3 the nonemptiness and boundedness of the solution set of (() and show that any solution of (() could be stated as the convex combination of its optimal extreme points. In Section 4, by defining the nonnegative restricted isometry/orthogonality constants, we derive a sufficient condition for exact recovery of the sparsest nonnegative image/signal via the linear program relaxation. Now we give some notations used in the text. We use sol(·) and v(·) to denote the solution set and optimal value of problem (·). The e ∈ ℝ would be the vector with only the ith entry 1 and the rest all 0. e ∈ ℝ is the vector with each entry equal to 1; we also use e to demonstrate that e ∈ ℝ for short. The a ∈ ℝ for i = 1,…, n denote the column vectors of the matrix A and A = (a ). For any x ∈ ℝ, x is the ith component and I(x) is the support set of x; that is, I(x) = {i | x ≠ 0, i = 1,…, n}. For any subset T ⊂ {1,…, n}, T denotes the complement set of T out of {1,…, n}.

2. Solution Property

Throughout the paper we assume that Apparently, 𝒮 is a polyhedral set in ℝ. According to the representation theorem, any x ∈ 𝒮 could be represented as follows: where ∑ λ = 1, λ ≥ 0, i = 1,…, t, and are the extreme point set of 𝒮; μ ≥ 0, j = 1,…, q, and {d ( ∈ ℝ | j = 1,2,…, q} are the extreme direction set of 𝒮. Apparently, , i = 1,…, t; Ad ( = 0, d ( ≥ 0, j = 1,…, q. Define subsets of ℝ as follows: where sub{e ,…, e } denotes the subspace spanned by the vectors e ,…, e , k = 1,…, n. Clearly, {S 1, S 2,…, S } forms a partition of ℝ; that is, ⋃ S = ℝ and S ⋂S = ∅, i ≠ j. Moreover, ||x||0 = r for any x ∈ S . Along with the nonemptiness of 𝒮, it is easy to see that 𝒮 must intersect one of these sets, hence v(P 0) = min{r ∈ ℝ | 𝒮⋂S ≠ ∅} and sol(P 0) ≠ ∅. Furthermore, we have the following result for the optimal value of (().

Lemma 1

Assume that v(P 0) = k, one must have k ≤ m.

Proof

Suppose that the conclusion is not true; that is, there is x* ∈ sol(P 0) with ||x*||0 = k > m. Without loss of generality, let x * > 0, i = 1,…, k. We get Meanwhile, rank(A) = m < k. Thus, {a ∈ ℝ | i = 1,…, k} must be linearly dependent; that is, there exist d 1,…, d , not all zero, such that Assume that d 1 > 0. By denoting d = (d 1,…, d , 0,…, 0) ∈ ℝ, we get Ad = 0. Taking δ = min{x */d | d > 0, i = 1,…, k}, it holds that This is a contradiction with x* ∈ sol(P 0). We complete the proof. To characterize property of the solution set sol(P 0), we need the next lemma. In particular, this brand new result will play a key role in proposing the sufficient condition of the uniqueness of sol(P 0) in Section 4.

Lemma 2

Any two distinct solutions of (() must have different support sets. Assume for contradiction that x* and are two different solutions of ((), . If , we have x * > 0, , for all i ∈ I(x*). Set . Since , it must hold λ < 1. When , take It is easy to see that Thus, we have , and . This is a contradiction with the optimality of x*. When , just taking instead, we get the contradiction by a similar way. The proof is completed. Now we are in a position to give the main theorem in this section.

Theorem 3

Any solution of (() must be one of the extreme points of 𝒮. Given any solution x* ∈ sol(P 0) with representation where ∑ λ * = 1, λ * ≥ 0, , , for all i = 1,…, t; μ * ≥ 0, d ( ≥ 0, Ad ( = 0, for all j = 1,…, q. We only need to prove that in (9). To this end, we have the following three steps. Firstly, we claim that in (9), According to the fact that , i = 1,…, t, and μ * ≥ 0, d ( ≥ 0, j = 1,…, q, one has for any , i ∈ {i | λ * > 0, i = 1,2,…, t}, which means that l ∈ I(x*). This implies . Similarly, we get I(d () ⊂ I(x*). On the contrary, from the optimality of x* and (9), we know that . Secondly, we will show that μ * = 0, j = 1,…, q. If this is not true, there is an index j 0 such that μ * > 0, then μ *d ( has at least one positive component. Denote d = ∑ μ *d (, so Ad = 0 and {d > 0 | l = 1,…, n} ≠ ∅. Noting that , (11) implies the fact that . Take , and Without loss of generality, set . It is easy to verify that hence which is a contradiction with the optimality of x*. Thirdly, we will prove that ∑ λ * = 1, λ * ∈ {0,1}, i = 1,…, t. Suppose that there exist λ 1* > 0, λ 2* > 0, and λ 1* + λ 2* = 1 and two different extreme points of 𝒮, say , such that Based on (11), we have , hence , . Nevertheless, by Lemma 2, this is impossible. Hence, we show that We complete the argument. Theorem 3 tells us that each solution of (() lies in the extreme point set of 𝒮. Here is a concrete example.

Example 4

Let . Obviously, the solution set and optimal value of (() are respectively. In this case, (2,1, 0) is the only extreme point of 𝒮. While the solution set and optimal value of (P) : = min{||x||0 | Ax = b} are respectively. At the end of this section, we consider the l (0 < p < 1) relaxation of (() Clearly, (() is a concave relaxation of ((). For the program ((), Ge et al. [7] derived the useful result as the following.

Lemma 5

The set of all extreme points of 𝒮 is exactly the set of all local minimizers to ((). This lemma implies that any global solution of l relaxation must be one of its extreme points. From Theorem 3 and Lemma 5, we immediately draw a new proposition.

Proposition 6

For any p ∈ (0,1), there exists an extreme point of 𝒮 that is both an exact solution of (() and a local minimizer of ((). This is different from the result of Fung and Mangasarian in [18], where they showed that for sufficiently small , there exists an extreme point of the polyhedral set 𝒯, obtained by lifting the set 𝒮, such that is an exact solution of (() and a global solution of the relaxation.

3. Linear Program Relaxation

Consider the linear program relaxation ((). Since the linear objective function 〈e, x〉 is bounded below over the feasible set, based on the Frank-Wolfe theorem, the minimum of (() is attainable. Among all the extreme points of 𝒮, , we call an optimal extreme point if it also meets .

Proposition 7

Any x* ∈ sol (P 1) could be stated as the convex combination of optimal extreme points of ((). Hence, sol (P 1) is bounded. Given any x* ∈ sol(P 0) with representation (9). If there is i 0 ∈ {1,…, t} such that is not an optimal extreme point of (() and λ * > 0, we have , hence which is a contradiction. Similarly, if there is j 0 ∈ {1,…, q} such that μ * > 0, we have μ *〈e, d (〉>0, hence which is a contradiction. This completes the proof. From the above proposition, we know that linear program (() has at least one optimal extreme point. Thus, we could use simplex method or interior point method to solve (().

4. Nonnegative Restricted Property

In the framework of l 1 relaxation, a significant problem is how to guarantee the exact recovery of sparse image/signal via the l 1 relaxation. One of the most important qualifications is the restricted isometry property; see [15]. Recall that the k-restricted isometry constants (RIC) δ is the smallest scalar satisfying Similarly, the k, k′-restricted orthogonality constants (ROC) θ for k + k′ ≤ n is defined as the smallest scalar satisfying where x and y have disjoint support sets. The RIC δ and ROC θ measure how close each submatrix of A with certain cardinality is behaving like an orthonormal system. Under some restricted isometry property, one can get the sparse recovery via its l 1 relaxation. Nevertheless, for the nonnegative case, the sparse recovery may maintain new characterizations. Above all, we define NRIC and NROC.

Definition 8

Let A ∈ ℝ. We define the nonnegative k-restricted isometry constants (NRIC) δ + as the smallest number satisfying Similarly, we define the nonnegative k, k′-restricted orthogonality constants (NROC) θ + for k + k′ ≤ n as the smallest number satisfying with I(x) and I(y) being disjoint sets. Clearly, Moreover, the numbers δ + and θ + are nondecreasing in k, k′. By employing the projections of vectors in the null space of A to ℝ+ , we now provide a sufficient condition to determine a solution of (().

Theorem 9

Suppose that k ≥ 1 is such that δ + + θ + < 1 and x* ∈ 𝒮 with ||x*||0 = k. Then, x* is a solution of ((). We complete the proof by contradiction. If this is not true, there exists such that , . Set . Clearly, Ah = 0. Take h = h + − h − with h + and h − being the projections of h and −h to ℝ+ , respectively. We have h + ≥ 0, h − ≥ 0, 〈h +, h −〉 = 0. In particular, Therefore, Thus, we get in which the first inequality is due to (24), (25), and the fact that 2ab ≤ a 2 + b 2, and the last inequality is because of the assumption of δ + + θ + < 1 and the monotonicity of δ + in k. This is a contradiction. Therefore, x* ∈ sol(P 0). With the special result that any two solutions of (() have different support sets, we next derive a sufficient condition on the uniqueness of solution to (().

Theorem 10

Suppose that k ≥ 1 is such that δ + + θ + < 1 and x* ∈ 𝒮 with ||x*||0 = k. Then, x* is the unique solution of ((). Since δ + + θ + < 1 implies δ + + θ + < 1, we know that x* ∈ sol(P 0). Now we just need to verify that x* is the unique solution of ((). Assume that this is not true; that is, there is another solution . According to Lemma 2, it must hold . Take . By the argument similar to that in the proof of Theorem 9, we get and the contradiction. We conclude the proof. Now we are ready to give the main result of this paper, which is called the nonnegative restricted property.

Theorem 11

Assume that k ≥ 1 is such that and x* ∈ 𝒮 with ||x*||0 = k. Then x* is exactly the common unique minimizer of (() and ((). Since (31) implies δ + + θ + < 1, sol(P 0) = {x*} by Theorem 10. Suppose that is a solution of ((). Take . To get sol(P 1) = {x*}, it suffices to verify that h = 0. The proof includes three steps, the first two steps are parallel to that in [19], in the third step, we utilize the technique of projecting the null space of A on ℝ+ ; for details, see (42) and the argument around it. Firstly, we introduce a partition of {1,…, n}. Let T 0 be the support set of x*, T 1 the index set including the first k large components of in T 0 , T 2 the index set including the next k large components of in T 0 ∖T 1, and so on. Thus, Moreover, for any j = 0,1,…, [n/k], we define which is exactly that for any j = 1,…, [n/k] Therefore, Ah = 0, h = ∑ [ h , and Next, we show that ||h (||2 is bounded by ||h ||1. Note that for each j ≥ 2, where the second inequality is because of the monotonicity of h on T 0 , and This gives In fact, which implies By applying (38) and (40), we have Finally, we show that ||h ||2 = 0. By utilizing the projection h = h + − h −, where h + and h − are projections of h and −h on ℝ+ , we have h ( = h ( + − h ( −. Moreover, It is easy to see From Ah = 0, we compute On one hand, based on the definition of NROC, (35), and (43), for j ≥ 2, where the last inequality is by the fact that (a + b)2 ≤ 2(a 2 + b 2). On the other hand, together with the definition of NRIC and (43), one has Therefore, we compute where the forth inequality is from the fact that ||h ||1 2 ≤ k||h ||2 2. Then the assumption forces Thus, Therefore, we get h ( = 0 by (41), hence ||h|| = 0. This is exactly what we want. We complete the proof.

5. Conclusion

In this paper, we have derived a nonnegative restricted property condition, which ensures the exact recovery of sparse nonnegative image/signal via the linear program relaxation. Since the NRIC and NROC are defined in ℝ+ , there may be more types of measurement matrices satisfying the nonnegative restricted property than that in the case of RIP, regardless of random matrices or deterministic matrices. As a byproduct of the main result, we have investigated the solution property of the sparse nonnegative recovery and shown that any solution of (() must be one of the extreme points of its feasible set. However, it is not clear whether a given extreme point of the feasible set is a solution to ((). This can serve as a target for future work.

2 in total

1. Sparse nonnegative solution of underdetermined linear equations by linear programming.

Authors: David L Donoho; Jared Tanner
Journal: Proc Natl Acad Sci U S A Date: 2005-06-23 Impact factor: 11.205

2. Compressive sensing DNA microarrays.

Authors: Wei Dai; Mona A Sheikh; Olgica Milenkovic; Richard G Baraniuk
Journal: EURASIP J Bioinform Syst Biol Date: 2009-01-13

2 in total