
ORIGINAL RESEARCH article

Front. Appl. Math. Stat., 14 March 2019
Sec. Mathematics of Computation and Data Science
This article is part of the Research Topic Recent Developments in Signal Approximation and Reconstruction.

A New Nonconvex Sparse Recovery Method for Compressive Sensing

  • Department of Mathematics and Mathematical Statistics, Umeå University, Umeå, Sweden

As an extension of the widely used ℓr-minimization with 0 < r ≤ 1, a new non-convex weighted ℓr − ℓ1 minimization method is proposed for compressive sensing. Theoretical recovery results based on the restricted isometry property and the q-ratio constrained minimal singular values are established. An algorithm that integrates the iteratively reweighted least squares algorithm and the difference of convex functions algorithm is given to approximately solve this non-convex problem. Numerical experiments are presented to illustrate our results.

1. Introduction

Compressive sensing (CS) has attracted a great deal of interest since its advent [1, 2]; see the monographs [3, 4] and the references therein for a comprehensive view. Basically, the goal of CS is to recover an unknown (approximately) sparse signal x ∈ ℝ^N from the noisy underdetermined linear measurements

y = Ax + e \in \mathbb{R}^m,    (1)

with m ≪ N, A ∈ ℝ^{m×N} being the pre-given measurement matrix, and e ∈ ℝ^m being the noise vector. If the measurement matrix A satisfies some kind of incoherence condition (e.g., the mutual coherence condition [5, 6], restricted isometry property (RIP) [7, 8], null space property (NSP) [9, 10], or constrained minimal singular values (CMSV) [11, 12]), then stable (w.r.t. sparsity defect) and robust (w.r.t. measurement error) recovery results can be guaranteed by using the constrained ℓ1-minimization [13]:

\min_{z \in \mathbb{R}^N} \|z\|_1 \quad \text{subject to} \quad \|Az - y\|_2 \le \eta.    (2)

Here the ℓ1-minimization problem works as a convex relaxation of the ℓ0-minimization problem, which is NP-hard to solve [14].

Meanwhile, non-convex recovery algorithms such as the ℓr-minimization (0 < r < 1) have been proposed to enhance sparsity [15–20]. ℓr-minimization enables one to reconstruct the sparse signal from fewer measurements compared to the convex ℓ1-minimization, although it is more challenging to solve because of its non-convexity. Fortunately, an iteratively reweighted least squares (IRLS) algorithm can be applied to approximately solve this non-convex problem in practice [21, 22].

As an extension of the ℓr-minimization, we study in this paper the following weighted ℓr − ℓ1 minimization problem for sparse signal recovery:

\min_{z \in \mathbb{R}^N} \|z\|_r^r - \alpha \|z\|_1^r \quad \text{subject to} \quad \|Az - y\|_2 \le \eta,    (3)

where y = Ax + e with ∥e∥_2 ≤ η, 0 ≤ α ≤ 1, and 0 < r ≤ 1. Throughout the paper, we assume that α ≠ 1 when r = 1. Obviously, it reduces to the traditional ℓr-minimization problem when α = 0. This hybrid norm model is inspired by the non-convex Lipschitz continuous ℓ1 − ℓ2 model (minimizing the difference of the ℓ1 norm and the ℓ2 norm) proposed in Lou et al. [23] and Yin et al. [24], which improves on ℓ1-minimization in a robust manner, especially for highly coherent measurement matrices. Roughly speaking, the underlying logic of adopting these kinds of norm differences or ratios of norms [25] comes from the fact that they can be viewed as sparsity measures; see the effective sparsity measure called q-ratio sparsity (involving the ratio of the ℓ1 norm and the ℓq norm) defined later in Definition 2 of section 2.2. Other recent related works include [26–29], to name a few.
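
As a quick illustration of the objective in (3), the following Python snippet (a minimal sketch of our own, not the paper's reference code; the function name and default values are purely illustrative) evaluates the penalty ∥z∥_r^r − α∥z∥_1^r for a given vector.

```python
# Minimal sketch: evaluate the weighted l_r - l_1 penalty ||z||_r^r - alpha * ||z||_1^r.
import numpy as np

def weighted_lr_l1(z, r=0.5, alpha=0.3):
    """Return ||z||_r^r - alpha * ||z||_1^r for 0 < r <= 1 and 0 <= alpha <= 1."""
    z = np.abs(np.asarray(z, dtype=float))
    return np.sum(z ** r) - alpha * np.sum(z) ** r

# The penalty is smaller for a sparser vector with the same l_2 norm:
print(weighted_lr_l1([1.0, 0.0, 0.0]))        # sparse vector
print(weighted_lr_l1([0.577, 0.577, 0.577]))  # dense vector, same l_2 norm
```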

To illustrate these weighted ℓr − ℓ1 norms (∥·∥_r^r − α∥·∥_1^r), we present their corresponding contour plots in Figure 1¹. As shown, different non-convex patterns arise as the difference weight α or the norm order r varies. The level curves of the weighted ℓr − ℓ1 norms approach the x and y axes as the norm values get small, which reflects their ability to promote sparsity. In the present paper, we focus on both the theoretical aspects and the computational study of this non-convex sparse recovery method.


Figure 1. Contour plots of the weighted ℓr − ℓ1 norms for different α and r. The first row corresponds to r = 0.5 with α = 0, 0.3, 1, while the second row shows r = 0.1, 0.4, 0.7 with fixed α = 1.

This paper is organized as follows. In section 2, we derive the theoretical performance bounds for the weighted ℓr − ℓ1 minimization based on both r-RIP and q-ratio CMSV. In section 3, we give an algorithm to approximately solve the unconstrained version of the weighted ℓr − ℓ1 minimization problem. Numerical experiments are provided in section 4. Section 5 concludes with a brief summary and an outlook on future extensions.

2. Recovery Analysis

In this section, we establish the theoretical performance bounds for the reconstruction error of the weighted ℓr − ℓ1 minimization problem, based on both the r-RIP and the q-ratio CMSV. Hereafter, we say a signal x ∈ ℝ^N is s-sparse if ∥x∥_0 = Σ_{i=1}^N 1{x_i ≠ 0} ≤ s, and denote by x_S the vector that coincides with x on the indices in S ⊆ [N] := {1, 2, ⋯, N} and is zero outside S.

2.1. r-RIP

We start with the definition of the s-th r-restricted isometry constant, which was introduced in Chartrand and Staneva [30].

Definition 1. ([30]) For integer s > 0 and 0 < r ≤ 1, the s-th r-restricted isometry constant (RIC) δs = δs(A) of a matrix A ∈ ℝm×N is defined as the smallest δ ≥ 0 such that

(1 - \delta)\|x\|_2^r \le \|Ax\|_r^r \le (1 + \delta)\|x\|_2^r    (4)

for all s-sparse vectors x ∈ ℝN.

Then, the r-RIP means that the s-th r-RIC δs is small for reasonably large s. In Chartrand and Staneva [30], the authors established the recovery analysis result for ℓr-minimization problem based on this r-RIP. To extend this to the weighted ℓr − ℓ1 minimization problem, the following lemma plays a crucial role.

Lemma 1. Suppose x ∈ ℝN, 0 ≤ α ≤ 1 and 0 < r ≤ 1, then we have

(N - \alpha N^r)\Big(\min_{i \in [N]} |x_i|\Big)^r \le \|x\|_r^r - \alpha\|x\|_1^r \le \big(N^{1-r} - \alpha\big)\|x\|_1^r.    (5)

In particular, when S = supp(x) ⊆ [N] and |S| = s, then

(s - \alpha s^r)\Big(\min_{i \in S} |x_i|\Big)^r \le \|x\|_r^r - \alpha\|x\|_1^r \le \big(s^{1-r} - \alpha\big)\|x\|_1^r.    (6)

Proof. The right hand side of (5) follows immediately from the norm inequality ∥x∥_r ≤ N^{1/r−1}∥x∥_1 for any x ∈ ℝ^N and 0 < r ≤ 1. As for the left hand side, it holds trivially if min_{i∈[N]} |x_i| = 0. When min_{i∈[N]} |x_i| ≠ 0, by dividing both sides by (min_{i∈[N]} |x_i|)^r, it is equivalent to show that

\sum_{j=1}^N \left(\frac{|x_j|}{\min_{i \in [N]} |x_i|}\right)^r - \alpha \left(\sum_{j=1}^N \frac{|x_j|}{\min_{i \in [N]} |x_i|}\right)^r \ge N - \alpha N^r.    (7)

By denoting a_j = |x_j| / \min_{i \in [N]} |x_i|, we have a_j ≥ 1 for any 1 ≤ j ≤ N, and to show (7) it suffices to show

\sum_{j=1}^N a_j^r - \alpha\left(\sum_{j=1}^N a_j\right)^r \ge N - \alpha N^r.

Consider the function f(a_1, a_2, ⋯, a_N) = \sum_{j=1}^N a_j^r - \alpha\left(\sum_{j=1}^N a_j\right)^r. Then, as a result of

\frac{\partial f}{\partial a_k} = r a_k^{r-1} - \alpha r \left(\sum_{j=1}^N a_j\right)^{r-1} > 0 \quad \text{for any } 1 \le k \le N,

we have f(a_1, a_2, ⋯, a_N) ≥ f(1, 1, ⋯, 1) = N − αN^r. Thus, the left hand side of (5) holds and the proof is completed. (6) follows by applying (5) to x_S.
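
As a quick numerical sanity check of inequality (5) (the snippet below is our own illustration with an arbitrary random test setup, not part of the paper), one can verify the two bounds on random vectors:

```python
# Check inequality (5) of Lemma 1 on random vectors (illustrative test, not from the paper).
import numpy as np

rng = np.random.default_rng(0)
r, alpha, N = 0.5, 0.8, 6
for _ in range(1000):
    ax = np.abs(rng.standard_normal(N))
    middle = np.sum(ax ** r) - alpha * np.sum(ax) ** r    # ||x||_r^r - alpha * ||x||_1^r
    lower = (N - alpha * N ** r) * np.min(ax) ** r        # left-hand side of (5)
    upper = (N ** (1 - r) - alpha) * np.sum(ax) ** r      # right-hand side of (5)
    assert lower <= middle + 1e-12 and middle <= upper + 1e-12
print("inequality (5) held for all random draws")
```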

Now, we are ready to present the r-RIP based bound for the ℓ2 norm of the reconstruction error.

Theorem 1. Let the ℓr-error of the best s-term approximation of x be σ_s(x)_r = inf{∥x − z∥_r : z ∈ ℝ^N is s-sparse}. We assume that a > 0 is properly chosen so that as is an integer. If

b = \frac{(as)^{1-\frac{r}{2}} - \alpha (as)^{\frac{r}{2}}}{s^{1-\frac{r}{2}} + \alpha s^{\frac{r}{2}}} > 1    (8)

and suppose the measurement matrix A satisfies the condition

\delta_{as} + b\,\delta_{(a+1)s} < b - 1,    (9)

then any solution x̂ to the minimization problem (3) obeys

\|\hat{x} - x\|_2 \le C_1\, m^{1/r - 1/2}\,\eta + C_2\,\big(s^{1-\frac{r}{2}} + \alpha s^{\frac{r}{2}}\big)^{-1/r}\sigma_s(x)_r    (10)

with C_1 = \frac{2^{1/r}(1 + b^{1/r})}{(b - b\delta_{(a+1)s} - 1 - \delta_{as})^{1/r}} and C_2 = \frac{2^{2/r-1}\big[(1+\delta_{as})^{1/r} + (1-\delta_{(a+1)s})^{1/r}\big]}{(b - b\delta_{(a+1)s} - 1 - \delta_{as})^{1/r}}.

Proof. We assume that S is the index set that contains the largest s absolute entries of x, so that σ_s(x)_r = ∥x_{S^c}∥_r, and let h = x̂ − x. Then we have

\|x_S\|_r^r + \|x_{S^c}\|_r^r - \alpha\|x\|_1^r = \|x\|_r^r - \alpha\|x\|_1^r
    \ge \|\hat{x}\|_r^r - \alpha\|\hat{x}\|_1^r
    = \|x_S + x_{S^c} + h_S + h_{S^c}\|_r^r - \alpha\|x + h_S + h_{S^c}\|_1^r
    \ge \|x_S + h_S\|_r^r + \|x_{S^c} + h_{S^c}\|_r^r - \alpha\big(\|x + h_S\|_1 + \|h_{S^c}\|_1\big)^r
    \ge \|x_S\|_r^r - \|h_S\|_r^r + \|h_{S^c}\|_r^r - \|x_{S^c}\|_r^r - \alpha\|x + h_S\|_1^r - \alpha\|h_{S^c}\|_1^r
    \ge \|x_S\|_r^r - \|h_S\|_r^r + \|h_{S^c}\|_r^r - \|x_{S^c}\|_r^r - \alpha\|x\|_1^r - \alpha\|h_S\|_1^r - \alpha\|h_{S^c}\|_1^r,

which implies

\|h_{S^c}\|_r^r - \alpha\|h_{S^c}\|_1^r \le \|h_S\|_r^r + \alpha\|h_S\|_1^r + 2\|x_{S^c}\|_r^r.    (11)

Using Hölder's inequality, we obtain

\|Ah\|_r^r \le \left(\sum_{i=1}^m \big(|(Ah)_i|^r\big)^{2/r}\right)^{r/2} \cdot \left(\sum_{i=1}^m 1\right)^{1-r/2} = m^{1-r/2}\|Ah\|_2^r.

Since ∥Ax − y∥_2 = ∥e∥_2 ≤ η, the triangle inequality gives

\|Ah\|_2 = \|(A\hat{x} - y) - (Ax - y)\|_2 \le \|A\hat{x} - y\|_2 + \|Ax - y\|_2 \le 2\eta.    (12)

Thus,

\|Ah\|_r^r \le m^{1-r/2}\|Ah\|_2^r \le m^{1-r/2}(2\eta)^r.    (13)

Partition S^c = S_1 ∪ S_2 ∪ ⋯, where S_1 is the index set of the M = as largest absolute entries of h in S^c, S_2 is the index set of the M largest absolute entries of h in (S ∪ S_1)^c, and so on. We also denote S_0 = S ∪ S_1. Then, by adopting Lemma 1, for each i ∈ S_k, k ≥ 2,

|h_i| \le \min_{j \in S_{k-1}} |h_j| \;\Longrightarrow\; |h_i|^r \le \Big(\min_{j \in S_{k-1}} |h_j|\Big)^r \le \frac{\|h_{S_{k-1}}\|_r^r - \alpha\|h_{S_{k-1}}\|_1^r}{M - \alpha M^r}.    (14)

Thus we have \|h_{S_k}\|_2^r = \big(\sum_{i \in S_k} |h_i|^2\big)^{r/2} \le M^{r/2}\,\frac{\|h_{S_{k-1}}\|_r^r - \alpha\|h_{S_{k-1}}\|_1^r}{M - \alpha M^r} = \frac{\|h_{S_{k-1}}\|_r^r - \alpha\|h_{S_{k-1}}\|_1^r}{M^{1-\frac{r}{2}} - \alpha M^{\frac{r}{2}}}. Hence it follows that

\sum_{k \ge 2} \|h_{S_k}\|_2^r \le \frac{\sum_{k \ge 1}\big(\|h_{S_k}\|_r^r - \alpha\|h_{S_k}\|_1^r\big)}{M^{1-\frac{r}{2}} - \alpha M^{\frac{r}{2}}} = \frac{\sum_{k \ge 1}\|h_{S_k}\|_r^r - \alpha\sum_{k \ge 1}\|h_{S_k}\|_1^r}{M^{1-\frac{r}{2}} - \alpha M^{\frac{r}{2}}}.    (15)

Note that

\sum_{k \ge 1} \|h_{S_k}\|_r^r = \|h_{S^c}\|_r^r \quad \text{and} \quad \sum_{k \ge 1} \|h_{S_k}\|_1^r \ge \Big(\sum_{k \ge 1} \|h_{S_k}\|_1\Big)^r = \|h_{S^c}\|_1^r,

therefore, with (11), it holds that

\sum_{k \ge 2}\|h_{S_k}\|_2^r \le \frac{\|h_{S^c}\|_r^r - \alpha\|h_{S^c}\|_1^r}{M^{1-\frac{r}{2}} - \alpha M^{\frac{r}{2}}}
    \le \frac{\|h_S\|_r^r + \alpha\|h_S\|_1^r + 2\|x_{S^c}\|_r^r}{M^{1-\frac{r}{2}} - \alpha M^{\frac{r}{2}}}
    \le \frac{s^{1-\frac{r}{2}}\|h_S\|_2^r + \alpha s^{\frac{r}{2}}\|h_S\|_2^r + 2\|x_{S^c}\|_r^r}{M^{1-\frac{r}{2}} - \alpha M^{\frac{r}{2}}}
    \le \frac{\big(s^{1-\frac{r}{2}} + \alpha s^{\frac{r}{2}}\big)\|h_{S_0}\|_2^r + 2\|x_{S^c}\|_r^r}{M^{1-\frac{r}{2}} - \alpha M^{\frac{r}{2}}}.    (16)

Meanwhile, according to the definition of r-RIC, we have

\|Ah\|_r^r = \Big\|Ah_{S_0} + \sum_{k \ge 2} Ah_{S_k}\Big\|_r^r
    \ge \|Ah_{S_0}\|_r^r - \Big\|\sum_{k \ge 2} Ah_{S_k}\Big\|_r^r
    \ge \|Ah_{S_0}\|_r^r - \sum_{k \ge 2}\|Ah_{S_k}\|_r^r
    \ge (1 - \delta_{M+s})\|h_{S_0}\|_2^r - (1 + \delta_M)\sum_{k \ge 2}\|h_{S_k}\|_2^r.

Thus by using (16), it follows that

\|Ah\|_r^r \ge (1 - \delta_{M+s})\|h_{S_0}\|_2^r - (1 + \delta_M)\cdot\frac{\big(s^{1-\frac{r}{2}} + \alpha s^{\frac{r}{2}}\big)\|h_{S_0}\|_2^r + 2\|x_{S^c}\|_r^r}{M^{1-\frac{r}{2}} - \alpha M^{\frac{r}{2}}}
    = \Big(1 - \delta_{M+s} - \frac{1 + \delta_M}{b}\Big)\|h_{S_0}\|_2^r - \frac{2(1 + \delta_M)}{M^{1-\frac{r}{2}} - \alpha M^{\frac{r}{2}}}\|x_{S^c}\|_r^r,

where b = \frac{M^{1-\frac{r}{2}} - \alpha M^{\frac{r}{2}}}{s^{1-\frac{r}{2}} + \alpha s^{\frac{r}{2}}} = \frac{(as)^{1-\frac{r}{2}} - \alpha (as)^{\frac{r}{2}}}{s^{1-\frac{r}{2}} + \alpha s^{\frac{r}{2}}}. Therefore, if δ_M + bδ_{M+s} < b − 1, then it yields that

\|h_{S_0}\|_2^r \le \frac{b}{b - b\delta_{M+s} - 1 - \delta_M}\|Ah\|_r^r + \frac{2(1 + \delta_M)\big(s^{1-\frac{r}{2}} + \alpha s^{\frac{r}{2}}\big)^{-1}}{b - b\delta_{M+s} - 1 - \delta_M}\|x_{S^c}\|_r^r
    \le \frac{b\, m^{1-\frac{r}{2}}(2\eta)^r}{b - b\delta_{M+s} - 1 - \delta_M} + \frac{2(1 + \delta_M)\big(s^{1-\frac{r}{2}} + \alpha s^{\frac{r}{2}}\big)^{-1}}{b - b\delta_{M+s} - 1 - \delta_M}\|x_{S^c}\|_r^r.    (17)

On the other hand,

\Big(\sum_{k \ge 2}\|h_{S_k}\|_2\Big)^r \le \sum_{k \ge 2}\|h_{S_k}\|_2^r \le \frac{\big(s^{1-\frac{r}{2}} + \alpha s^{\frac{r}{2}}\big)\|h_{S_0}\|_2^r + 2\|x_{S^c}\|_r^r}{M^{1-\frac{r}{2}} - \alpha M^{\frac{r}{2}}}
    = \frac{1}{b}\|h_{S_0}\|_2^r + \frac{2\big(s^{1-\frac{r}{2}} + \alpha s^{\frac{r}{2}}\big)^{-1}}{b}\|x_{S^c}\|_r^r
    \le \frac{1}{b}\left(\frac{b}{b - b\delta_{M+s} - 1 - \delta_M}\|Ah\|_r^r + \frac{2(1 + \delta_M)\big(s^{1-\frac{r}{2}} + \alpha s^{\frac{r}{2}}\big)^{-1}}{b - b\delta_{M+s} - 1 - \delta_M}\|x_{S^c}\|_r^r\right) + \frac{2\big(s^{1-\frac{r}{2}} + \alpha s^{\frac{r}{2}}\big)^{-1}}{b}\|x_{S^c}\|_r^r
    = \frac{1}{b - b\delta_{M+s} - 1 - \delta_M}\|Ah\|_r^r + \frac{2(1 - \delta_{M+s})\big(s^{1-\frac{r}{2}} + \alpha s^{\frac{r}{2}}\big)^{-1}}{b - b\delta_{M+s} - 1 - \delta_M}\|x_{S^c}\|_r^r
    \le \frac{m^{1-\frac{r}{2}}(2\eta)^r}{b - b\delta_{M+s} - 1 - \delta_M} + \frac{2(1 - \delta_{M+s})\big(s^{1-\frac{r}{2}} + \alpha s^{\frac{r}{2}}\big)^{-1}}{b - b\delta_{M+s} - 1 - \delta_M}\|x_{S^c}\|_r^r.    (18)

Since (v_1^r + v_2^r)^{1/r} ≤ 2^{1/r−1}(v_1 + v_2) for any v_1, v_2 ≥ 0, combining (17) and (18) gives

\|h\|_2 \le \|h_{S_0}\|_2 + \Big\|\sum_{k \ge 2} h_{S_k}\Big\|_2
    \le 2^{1/r-1}\left(\frac{2 b^{1/r} m^{1/r-1/2}\eta}{(b - b\delta_{M+s} - 1 - \delta_M)^{1/r}} + \frac{2^{1/r}(1 + \delta_M)^{1/r}\big(s^{1-\frac{r}{2}} + \alpha s^{\frac{r}{2}}\big)^{-1/r}}{(b - b\delta_{M+s} - 1 - \delta_M)^{1/r}}\|x_{S^c}\|_r\right)
    + 2^{1/r-1}\left(\frac{2 m^{1/r-1/2}\eta}{(b - b\delta_{M+s} - 1 - \delta_M)^{1/r}} + \frac{2^{1/r}(1 - \delta_{M+s})^{1/r}\big(s^{1-\frac{r}{2}} + \alpha s^{\frac{r}{2}}\big)^{-1/r}}{(b - b\delta_{M+s} - 1 - \delta_M)^{1/r}}\|x_{S^c}\|_r\right)
    = 2^{1/r} m^{1/r-1/2}\big(1 + b^{1/r}\big)\big(b - b\delta_{(a+1)s} - 1 - \delta_{as}\big)^{-1/r}\eta
    + 2^{2/r-1}\big[(1 + \delta_{as})^{1/r} + (1 - \delta_{(a+1)s})^{1/r}\big]\big(b - b\delta_{(a+1)s} - 1 - \delta_{as}\big)^{-1/r}\big(s^{1-\frac{r}{2}} + \alpha s^{\frac{r}{2}}\big)^{-1/r}\|x_{S^c}\|_r
    = C_1 m^{1/r-1/2}\eta + C_2\big(s^{1-\frac{r}{2}} + \alpha s^{\frac{r}{2}}\big)^{-1/r}\sigma_s(x)_r.    (19)

The proof is completed.

Based on this theorem, we can obtain the following corollary by assuming that the original signal x is s-sparse (σs(x)r = 0) and the measurement vector is noise free (e = 0 and η = 0), which acts as a natural generalization of Theorem 2.4 in Chartrand and Staneva [30] from the case α = 0 to any α ∈ [0, 1].

Corollary 1. For any s-sparse signal x, if the conditions in Theorem 1 hold, then the unique solution of (3) with η = 0 is exactly x.

Remarks. Observe that the r-RIP based condition for exact sparse recovery given in Chartrand and Staneva [30] reads

\delta_{as} < a^{1-\frac{r}{2}}\big(1 - \delta_{(a+1)s}\big) - 1,

while ours reads

\delta_{as} < b\big(1 - \delta_{(a+1)s}\big) - 1

with b = \frac{(as)^{1-\frac{r}{2}} - \alpha (as)^{\frac{r}{2}}}{s^{1-\frac{r}{2}} + \alpha s^{\frac{r}{2}}} < a^{1-\frac{r}{2}} when α ∈ (0, 1]. Thus, the sufficient condition established here is slightly stronger than that for the traditional ℓr-minimization in Chartrand and Staneva [30] if α ∈ (0, 1].

2.2. q-Ratio CMSV

Before the discussion of the q-ratio CMSV, we start by presenting the definition of the q-ratio sparsity as a kind of effective sparsity measure. We list the detailed statement here for the sake of completeness.

Definition 2. ([12, 31, 32]) For any non-zero z ∈ ℝN and non-negative q ∉ {0, 1, ∞}, the q-ratio sparsity level of z is defined as

s_q(z) = \left(\frac{\|z\|_1}{\|z\|_q}\right)^{\frac{q}{q-1}}.    (20)

The cases of q ∈ {0, 1, ∞} are evaluated as limits: s_0(z) = \lim_{q \to 0} s_q(z) = ∥z∥_0, s_∞(z) = \lim_{q \to \infty} s_q(z) = ∥z∥_1/∥z∥_∞, and s_1(z) = \lim_{q \to 1} s_q(z) = \exp(H_1(\pi(z))), where π(z) ∈ ℝ^N has entries π_i(z) = |z_i|/∥z∥_1 and H_1 is the ordinary Shannon entropy H_1(π(z)) = −Σ_{i=1}^N π_i(z) log π_i(z).
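
For concreteness, the following snippet (a minimal sketch of our own, following Definition 2; the helper name is illustrative) computes the q-ratio sparsity of a vector.

```python
# q-ratio sparsity s_q(z) = (||z||_1 / ||z||_q)^(q/(q-1)), with the q -> infinity limit handled.
import numpy as np

def q_ratio_sparsity(z, q=2.0):
    z = np.abs(np.asarray(z, dtype=float))
    l1 = z.sum()
    if np.isinf(q):
        return l1 / z.max()                # s_inf(z) = ||z||_1 / ||z||_inf
    lq = np.sum(z ** q) ** (1.0 / q)
    return (l1 / lq) ** (q / (q - 1.0))

z = np.array([3.0, 1.0, 0.0, 0.0])
print(q_ratio_sparsity(z, q=2.0))      # 1.6, between 1 and ||z||_0 = 2
print(q_ratio_sparsity(z, q=np.inf))   # about 1.33
```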

We are able to establish the performance bounds for both the ℓq norm and ℓr norm of the reconstruction error via a recently developed computable incoherence measure of the measurement matrix, called q-ratio CMSV. Its definition is given as follows.

Definition 3. ([12, 32]) For any real number s ∈ [1, N], q ∈ (1, ∞], and matrix A ∈ ℝm×N, the q-ratio constrained minimal singular value (CMSV) of A is defined as

\rho_{q,s}(A) = \min_{z \ne 0,\; s_q(z) \le s} \frac{\|Az\|_2}{\|z\|_q}.    (21)

Then, when the signal is exactly sparse, we have the following q-ratio CMSV based sufficient condition for valid upper bounds of the reconstruction error, which are much more concise to obtain than the r-RIP based ones.

Theorem 2. For any 1 < q ≤ ∞, 0 ≤ α ≤ 1, and 0 < r ≤ 1, if the signal x is s-sparse and the measurement matrix A satisfies the condition

\rho_{q,\left(\frac{2}{1-\alpha}\right)^{\frac{q}{r(q-1)}} s^{\frac{q-r}{r(q-1)}}}(A) > 0,    (22)

then any solution x̂ to the minimization problem (3) obeys

\|\hat{x} - x\|_q \le \frac{2\eta}{\rho_{q,\left(\frac{2}{1-\alpha}\right)^{\frac{q}{r(q-1)}} s^{\frac{q-r}{r(q-1)}}}(A)},    (23)
\|\hat{x} - x\|_r \le \left(\frac{2^{r+1}}{1-\alpha}\right)^{1/r}\cdot\frac{s^{1/r - 1/q}\,\eta}{\rho_{q,\left(\frac{2}{1-\alpha}\right)^{\frac{q}{r(q-1)}} s^{\frac{q-r}{r(q-1)}}}(A)}.    (24)

Proof. Suppose the support of x is S with |S| ≤ s and let h = x̂ − x; then, based on (11), we have

\|h_{S^c}\|_r^r - \alpha\|h_{S^c}\|_1^r \le \|h_S\|_r^r + \alpha\|h_S\|_1^r.    (25)

Hence, for any 1 < q ≤ ∞, it holds that

\|h\|_r^r - \alpha\|h\|_1^r \le \|h_S\|_r^r + \|h_{S^c}\|_r^r - \alpha\|h_S\|_1^r - \alpha\|h_{S^c}\|_1^r
    \le \|h_S\|_r^r - \alpha\|h_S\|_1^r + \|h_S\|_r^r + \alpha\|h_S\|_1^r
    = 2\|h_S\|_r^r \le 2 s^{1-r/q}\|h_S\|_q^r \le 2 s^{1-r/q}\|h\|_q^r.

Then, since (1 − α)∥h∥_1^r ≤ ∥h∥_r^r − α∥h∥_1^r, it implies that (1 − α)∥h∥_1^r ≤ 2s^{1−r/q}∥h∥_q^r. As a consequence,

s_q(h) = \left(\frac{\|h\|_1}{\|h\|_q}\right)^{\frac{q}{q-1}} \le \left(\frac{2 s^{1-r/q}}{1-\alpha}\right)^{\frac{q}{r(q-1)}} = \left(\frac{2}{1-\alpha}\right)^{\frac{q}{r(q-1)}} s^{\frac{q-r}{r(q-1)}}.    (26)

Therefore, according to the definition of the q-ratio CMSV, the condition (22), and the fact that ∥Ah∥_2 ≤ 2η [see (12)], we obtain

\rho_{q,\left(\frac{2}{1-\alpha}\right)^{\frac{q}{r(q-1)}} s^{\frac{q-r}{r(q-1)}}}(A) \le \frac{\|Ah\|_2}{\|h\|_q} \;\Longrightarrow\; \|h\|_q \le \frac{\|Ah\|_2}{\rho_{q,\left(\frac{2}{1-\alpha}\right)^{\frac{q}{r(q-1)}} s^{\frac{q-r}{r(q-1)}}}(A)} \le \frac{2\eta}{\rho_{q,\left(\frac{2}{1-\alpha}\right)^{\frac{q}{r(q-1)}} s^{\frac{q-r}{r(q-1)}}}(A)},    (27)

which completes the proof of (23). In addition, (1 − α)∥h∥_r^r ≤ ∥h∥_r^r − α∥h∥_1^r ≤ 2s^{1−r/q}∥h∥_q^r yields

\|h\|_r \le \left(\frac{2}{1-\alpha}\right)^{1/r} s^{1/r - 1/q}\|h\|_q \le \left(\frac{2^{r+1}}{1-\alpha}\right)^{1/r}\cdot\frac{s^{1/r-1/q}\,\eta}{\rho_{q,\left(\frac{2}{1-\alpha}\right)^{\frac{q}{r(q-1)}} s^{\frac{q-r}{r(q-1)}}}(A)}.    (28)

Therefore, (24) holds and the proof is completed.

Remarks. Note that the results (11) and (12) in Theorem 1 of Zhou and Yu [12] correspond to the special case of α = 0 and r = 1 in this result. As a by-product of this theorem, we have that perfect recovery can be guaranteed for any s-sparse signal x via (3) with η = 0, if there exists some q ∈ (1, ∞] such that the q-ratio CMSV of the measurement matrix A fulfils ρ_{q,(2/(1−α))^{q/(r(q−1))} s^{(q−r)/(r(q−1))}}(A) > 0. As studied in Zhou and Yu [12, 32], this kind of q-ratio CMSV based sufficient condition holds with high probability for subgaussian and a class of structured random matrices as long as the number of measurements is reasonably large.

Next, we extend the result to the case that x is compressible (i.e., not exactly sparse but can be well-approximated by an exactly sparse signal).

Theorem 3. For any 1 < q ≤ ∞, 0 ≤ α ≤ 1 and 0 < r ≤ 1, if the measurement matrix A satisfies the condition

\rho_{q,\left(\frac{4}{1-\alpha}\right)^{\frac{q}{r(q-1)}} s^{\frac{q-r}{r(q-1)}}}(A) > 0,    (29)

then any solution x̂ to the minimization problem (3) fulfils

\|\hat{x} - x\|_q \le \frac{2\eta}{\rho_{q,\left(\frac{4}{1-\alpha}\right)^{\frac{q}{r(q-1)}} s^{\frac{q-r}{r(q-1)}}}(A)} + s^{1/q - 1/r}\sigma_s(x)_r,    (30)
\|\hat{x} - x\|_r \le \left(\frac{4}{1-\alpha}\right)^{1/r}\frac{s^{1/r - 1/q}\,\eta}{\rho_{q,\left(\frac{4}{1-\alpha}\right)^{\frac{q}{r(q-1)}} s^{\frac{q-r}{r(q-1)}}}(A)} + \left(\frac{4}{1-\alpha}\right)^{1/r}\sigma_s(x)_r.    (31)

Proof. We assume that S is the index set that contains the largest s absolute entries of x, so that σ_s(x)_r = ∥x_{S^c}∥_r, and let h = x̂ − x. Then we still have (11), that is,

\|h_{S^c}\|_r^r - \alpha\|h_{S^c}\|_1^r \le \|h_S\|_r^r + \alpha\|h_S\|_1^r + 2\|x_{S^c}\|_r^r.    (32)

As a result,

(1-\alpha)\|h\|_r^r \le \|h\|_r^r - \alpha\|h\|_1^r
    \le \|h_S\|_r^r + \|h_{S^c}\|_r^r - \alpha\|h_S\|_1^r - \alpha\|h_{S^c}\|_1^r
    \le 2\|h_S\|_r^r + 2\|x_{S^c}\|_r^r
    \le 2 s^{1-r/q}\|h\|_q^r + 2\|x_{S^c}\|_r^r    (33)

holds for any 1 < q ≤ ∞, 0 ≤ α ≤ 1 and 0 < r ≤ 1.

To prove (30), we assume that h ≠ 0 and ∥h∥_q > 2η/ρ_{q,(4/(1−α))^{q/(r(q−1))} s^{(q−r)/(r(q−1))}}(A); otherwise (30) holds trivially. Then

\|h\|_q > \frac{\|Ah\|_2}{\rho_{q,\left(\frac{4}{1-\alpha}\right)^{\frac{q}{r(q-1)}} s^{\frac{q-r}{r(q-1)}}}(A)} \;\Longrightarrow\; s_q(h) > \left(\frac{4}{1-\alpha}\right)^{\frac{q}{r(q-1)}} s^{\frac{q-r}{r(q-1)}} \;\Longrightarrow\; \|h\|_1^r > \frac{4}{1-\alpha}\, s^{1-r/q}\|h\|_q^r,

which implies that (1 − α)∥h∥_r^r ≥ (1 − α)∥h∥_1^r > 4s^{1−r/q}∥h∥_q^r. Then, combining with (33), it yields that

\|h\|_q \le \big(s^{r/q - 1}\big)^{1/r}\|x_{S^c}\|_r = s^{1/q - 1/r}\|x_{S^c}\|_r = s^{1/q - 1/r}\sigma_s(x)_r.    (34)

Therefore, we have

\|h\|_q \le \frac{2\eta}{\rho_{q,\left(\frac{4}{1-\alpha}\right)^{\frac{q}{r(q-1)}} s^{\frac{q-r}{r(q-1)}}}(A)} + s^{1/q - 1/r}\sigma_s(x)_r,    (35)

which completes the proof of (30).

Moreover, by using (33) and the inequality (v_1^r + v_2^r)^{1/r} ≤ 2^{1/r−1}(v_1 + v_2) for any v_1, v_2 ≥ 0, we obtain that

\|h\|_r \le \left(\frac{1}{1-\alpha}\right)^{1/r}\big(2 s^{1-r/q}\|h\|_q^r + 2\|x_{S^c}\|_r^r\big)^{1/r}
    \le \left(\frac{1}{1-\alpha}\right)^{1/r} 2^{2/r-1}\big(s^{1/r-1/q}\|h\|_q + \|x_{S^c}\|_r\big)
    \le \left(\frac{1}{1-\alpha}\right)^{1/r} 2^{2/r-1}\left(s^{1/r-1/q}\,\frac{2\eta}{\rho_{q,\left(\frac{4}{1-\alpha}\right)^{\frac{q}{r(q-1)}} s^{\frac{q-r}{r(q-1)}}}(A)} + 2\sigma_s(x)_r\right)
    \le \left(\frac{4}{1-\alpha}\right)^{1/r}\frac{s^{1/r-1/q}\,\eta}{\rho_{q,\left(\frac{4}{1-\alpha}\right)^{\frac{q}{r(q-1)}} s^{\frac{q-r}{r(q-1)}}}(A)} + \left(\frac{4}{1-\alpha}\right)^{1/r}\sigma_s(x)_r.    (36)

Hence, (31) holds and the proof is completed.

Remarks. When we select α = 0 and r = 1, our results reduce to the corresponding results for the ℓ1-minimization or Basis Pursuit in Theorem 2 of Zhou and Yu [12]. In general, the sufficient condition provided here and that in Theorem 2 are slightly stronger than those established for the ℓ1-minimization in Zhou and Yu [12], noticing that (2/(1−α))^{q/(r(q−1))} s^{(q−r)/(r(q−1))} ≥ 2^{q/(q−1)} s and (4/(1−α))^{q/(r(q−1))} s^{(q−r)/(r(q−1))} ≥ 4^{q/(q−1)} s for any 1 < q ≤ ∞, 0 ≤ α ≤ 1, and 0 < r ≤ 1. This is caused by the fact that the technical inequalities used, such as (25) and (32), are far from tight; the same applies to the r-RIP based analysis. In fact, both the r-RIP and q-ratio CMSV based conditions are loose. The discussion of much tighter sufficient conditions, such as the NSP based conditions investigated in Tran and Webster [33], is left for future work.

3. Algorithm

In this section, we discuss the computational approach for the unconstrained version of (3), i.e.,

\min_{x \in \mathbb{R}^N} \frac{1}{2}\|Ax - y\|_2^2 + \lambda\big(\|x\|_r^r - \alpha\|x\|_1^r\big),    (37)

with λ > 0 being the regularization parameter.

We integrate the iteratively reweighted least squares (IRLS) algorithm [21, 22] and the difference of convex functions algorithm (DCA) [34, 35] to solve this problem. In the outer loop, we use the IRLS to approximate the term ∥x∥_r^r, and use an iteratively reweighted ℓ1 norm to approximate ∥x∥_1^r. Specifically, we begin with x^0 = argmin_{x∈ℝ^N} ∥y − Ax∥_2^2 and ε_0 = 1; for n = 0, 1, ⋯,

x^{n+1} = \arg\min_{x \in \mathbb{R}^N} \frac{1}{2}\|Ax - y\|_2^2 + \lambda\|W^n x\|_2^2 - \alpha\lambda v^n\|x\|_1,    (38)

where W^n = diag{((x_i^n)^2 + ε_n)^{r/4 − 1/2}} and v^n = ∥x^n∥_1^{r−1}. We let ε_{n+1} = ε_n/10 if the error ∥x^{n+1} − x^n∥_2 < ε_n/100. The algorithm is stopped when ε_{n+1} < 10^{−8} for some n.

As for the inner loop used to solve (38), we view it as the minimization of a difference of two convex functions, that is, the objective function F(x) = ((1/2)∥Ax − y∥_2^2 + λ∥W^n x∥_2^2) − αλv^n∥x∥_1 =: G(x) − H(x). We start with x^{n+1,0} = 0. For k = 0, 1, 2, ⋯, in the (k + 1)-th step, we linearize H(x) with the approximation H(x^{n+1,k}) + 〈y^{n+1,k}, x − x^{n+1,k}〉, where y^{n+1,k} ∈ ∂H(x^{n+1,k}), i.e., y^{n+1,k} is a subgradient of H(x) at x^{n+1,k}. Then we have

x^{n+1,k+1} = \arg\min_{x \in \mathbb{R}^N} \frac{1}{2}\|Ax - y\|_2^2 + \lambda\|W^n x\|_2^2 - \Big(\alpha\lambda v^n\|x^{n+1,k}\|_1 + \alpha\lambda v^n\big\langle \mathrm{sign}(x^{n+1,k}),\, x - x^{n+1,k}\big\rangle\Big)
    = \arg\min_{x \in \mathbb{R}^N} \frac{1}{2}\|Ax - y\|_2^2 + \lambda\|W^n x\|_2^2 - \alpha\lambda v^n\big\langle \mathrm{sign}(x^{n+1,k}),\, x\big\rangle
    = \big(A^T A + 2\lambda (W^n)^T W^n\big)^{-1}\big[A^T y + \alpha\lambda v^n\,\mathrm{sign}(x^{n+1,k})\big],

where sign(·) is the sign function. The termination criterion for the inner loop is set to be

\frac{\|x^{n+1,k+1} - x^{n+1,k}\|_2}{\max\{\|x^{n+1,k}\|_2,\, 1\}} < \delta

for some given tolerance parameter δ > 0. Basically, this algorithm can be regarded as a generalized version of the IRLS algorithm. Obviously, when α = 0, it reduces exactly to the traditional IRLS algorithm used for solving the ℓr-minimization problem.
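
To make the scheme concrete, the following Python sketch implements the outer IRLS reweighting and the inner DCA loop described above. It is our own minimal implementation under stated assumptions (dense A, direct linear solves, a small safeguard on v^n when ∥x^n∥_1 = 0, and iteration caps), not the authors' reference code, which is linked in the footnote.

```python
# Minimal sketch of the IRLS + DCA scheme for the unconstrained problem (37).
import numpy as np

def weighted_lr_l1_irls_dca(A, y, lam=1e-6, alpha=1.0, r=0.5,
                            delta=1e-3, eps_tol=1e-8, max_outer=100, max_inner=50):
    m, N = A.shape
    AtA, Aty = A.T @ A, A.T @ y
    x = np.linalg.lstsq(A, y, rcond=None)[0]            # x^0 = argmin ||y - Ax||_2^2
    eps = 1.0
    for _ in range(max_outer):
        w = (x ** 2 + eps) ** (r / 4.0 - 0.5)            # diagonal entries of W^n
        v = max(np.sum(np.abs(x)), 1e-12) ** (r - 1.0)   # v^n = ||x^n||_1^{r-1} (guarded against zero)
        M = AtA + 2.0 * lam * np.diag(w ** 2)            # A^T A + 2*lam*(W^n)^T W^n
        z = np.zeros(N)                                  # inner DCA start: x^{n+1,0} = 0
        for _ in range(max_inner):
            z_new = np.linalg.solve(M, Aty + alpha * lam * v * np.sign(z))
            rel_change = np.linalg.norm(z_new - z) / max(np.linalg.norm(z), 1.0)
            z = z_new
            if rel_change < delta:                       # inner termination criterion
                break
        if np.linalg.norm(z - x) < eps / 100.0:          # shrink eps once the outer step stalls
            eps /= 10.0
        x = z
        if eps < eps_tol:
            break
    return x
```

When α = 0 the sign term vanishes and each outer step is a plain reweighted least squares update, matching the remark above that the scheme generalizes the traditional IRLS algorithm.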

4. Numerical Experiments

In this section, numerical experiments with the algorithm proposed in section 3 are conducted to illustrate the performance of the weighted ℓr − ℓ1 minimization for simulated sparse signal recovery.

4.1. Successful Recovery

First, we focus on the weighted ℓr − ℓ1 minimization itself. In this set of experiments, the s-sparse signal x is of length N = 256 and is generated by choosing s entries uniformly at random and then drawing the non-zero values of these s entries from the standard normal distribution. The underdetermined linear measurements are y = Ax + e ∈ ℝ^m, where A ∈ ℝ^{m×N} is a standard Gaussian random matrix and the entries of the noise vector e_i, i = 1, 2, ⋯, m, are i.i.d. N(0, σ²). Here we fix the number of measurements m = 64 and select a sequence of sparsity levels s = 10, 12, ⋯, 36. We run the experiments for both noiseless and noisy cases. In all the experiments, we set the tolerance parameter δ = 10^{−3}. All results are averaged over 100 repetitions.
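
A minimal sketch of one instance of this simulation setup is given below (the seed and helper names are ours, and the solver call reuses the sketch from section 3; it is not the paper's experiment code).

```python
# Generate one random instance of the simulated recovery experiment described above.
import numpy as np

rng = np.random.default_rng(2019)
N, m, s, sigma = 256, 64, 20, 0.0                # noiseless example; use sigma = 0.01 for the noisy case

x = np.zeros(N)
support = rng.choice(N, size=s, replace=False)   # s entries chosen uniformly at random
x[support] = rng.standard_normal(s)              # non-zero values drawn from N(0, 1)

A = rng.standard_normal((m, N))                  # standard Gaussian measurement matrix
y = A @ x + sigma * rng.standard_normal(m)       # y = Ax + e

x_hat = weighted_lr_l1_irls_dca(A, y, lam=1e-6, alpha=0.2, r=0.3, delta=1e-3)
rel_err = np.linalg.norm(x_hat - x) / np.linalg.norm(x)
print("relative reconstruction error:", rel_err)
```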

In the noiseless case, i.e., σ = 0, we set λ = 10^{−6}. In Figure 2, we show the successful recovery rates for different α (i.e., α = 0, 0.2, 0.5, 0.8, 1) while fixing r and varying the sparsity level s. We count a recovery as successful if ∥x̂ − x∥_2/∥x∥_2 < 10^{−3}. We perform the experiments for r = 0.3 and r = 0.7, respectively. As we can see, when r is fixed, the influence of the weight α is negligible, especially when r is relatively small. However, the performance does improve in some scenarios when a proper weight α is used. The problem of adaptively selecting the optimal α seems challenging and is left for future work. In addition, we present the reconstruction performance for different r (i.e., r = 0.01, 0.2, 0.5, 0.8, 1) with the weight α fixed at 0.2 and 0.8 in Figure 3. Note that small r is favored when the weight α is fixed, and non-convex recovery with 0 < r < 1 performs much better than the convex case (r = 1).


Figure 2. Successful recovery rate for different α with r = 0.3 and r = 0.7 in the noise free case, while varying the sparsity level s.


Figure 3. Successful recovery rate for different r with α = 0.2 and α = 0.8 in the noise free case, while varying the sparsity level s.

Next, we consider the noisy case, that is, σ = 0.01, and set λ = 10^{−4}. We evaluate the recovery performance by the signal-to-noise ratio (SNR), which is given by

\mathrm{SNR} = 20\log_{10}\left(\frac{\|x\|_2}{\|\hat{x} - x\|_2}\right).
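
As a small worked example of this formula (the values are purely illustrative and not from the paper), a reconstruction whose error norm is 1% of the signal norm corresponds to an SNR of 40 dB:

```python
# SNR in dB between a signal x and its reconstruction x_hat.
import numpy as np

def snr_db(x, x_hat):
    return 20.0 * np.log10(np.linalg.norm(x) / np.linalg.norm(x_hat - x))

x = np.array([1.0, 0.0, -2.0])
x_hat = np.array([1.01, 0.0, -1.98])
print(snr_db(x, x_hat))   # exactly 40 dB here, since the error norm is 1% of the signal norm
```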

As shown in Figures 4, 5, the aforementioned findings can still be observed in this case.


Figure 4. SNR for different α with r = 0.3 and r = 0.7 in the noisy case, while varying the sparsity level s.


Figure 5. SNR for different r with α = 0.2 and α = 0.8 in the noisy case, while varying the sparsity level s.

4.2. Algorithm Comparisons

Second, we compare the weighted ℓr − ℓ1 minimization with some well-known algorithms. The following state-of-the-art recovery algorithms are considered:

• ADMM-Lasso, see Boyd et al. [36].

• CoSaMP, see Needell and Tropp [37].

• Iterative Hard Thresholding (IHT), see Blumensath and Davies [38].

• ℓ1 − ℓ2 minimization, see Yin et al. [24].

The tuning parameters used for these algorithms are the same as those adopted in section 5.2 of Yin et al. [24]. Specifically, for ADMM-Lasso, we choose λ = 10−6, β = 1, ρ = 10−5, εabs = 10−7, εrel = 10−5, and the maximum number of iterations maxiter = 5,000. For CoSaMP, maxiter=50 and the tolerance is set to be 10−8. The tolerance for IHT is 10−12. For ℓ1 − ℓ2 minimization, we choose the parameters as εabs = 10−7, εrel = 10−5, ε = 10−2, MAXoit = 10, and MAXit = 500. For our weighted ℓr − ℓ1 minimization, we choose λ = 10−6, r = 0.5 but with two different weights α = 0 (denoted as ℓ0.5) and α = 1 (denoted as ℓ0.5 − ℓ1).

We only consider exactly sparse signal recovery in the noiseless case and conduct the experiments under the same settings as in section 4.1. We present the successful recovery rates for the different reconstruction algorithms while varying the sparsity level s in Figure 6. It can be observed that both ℓ0.5 and ℓ0.5 − ℓ1 outperform the other algorithms, although their own performances are almost identical.


Figure 6. Sparse signal recovery performance comparison via different algorithms for Gaussian random matrix.

5. Conclusion

In this paper, we studied a new non-convex recovery method, developed as minimizing a weighted difference of ℓr (0 < r ≤ 1) norm and ℓ1 norm. We established the performance bounds for this problem based on both r-RIP and q-ratio CMSV. An algorithm was proposed to approximately solve the non-convex problem. Numerical experiments show that the proposed algorithm provides superior performance compared to the existing algorithms such as ADMM-Lasso, CoSaMP, IHT and ℓ1 − ℓ2 minimization.

Besides, there are some open problems left for future work. One is the convergence analysis of the proposed algorithm in section 3. Another is the generalization of this 1-D non-convex version to 2-D non-convex total variation minimization, as done in Lou et al. [39], and the exploration of its application to medical imaging. Moreover, analogous to the non-convex block-sparse compressive sensing studied in Wang et al. [40], the study of the following non-convex block-sparse recovery minimization problem:

\min_{z \in \mathbb{R}^N} \|z\|_{2,r}^r - \alpha\|z\|_{2,1}^r \quad \text{subject to} \quad \|Az - y\|_2 \le \eta,    (39)

where ∥z∥_{2,r} = \big(\sum_{i=1}^p \|z[i]\|_2^r\big)^{1/r} with z[i] denoting the i-th block of z, 0 ≤ α ≤ 1, and 0 < r ≤ 1, is also an interesting topic for further investigation.
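
For completeness, a minimal sketch (our own, assuming z is partitioned into equal-length blocks) of the mixed norm ∥z∥_{2,r} appearing in (39) is given below.

```python
# Mixed l_{2,r} norm ||z||_{2,r} = (sum_i ||z[i]||_2^r)^(1/r) over equal-length blocks.
import numpy as np

def mixed_l2r_norm(z, block_size, r=0.5):
    blocks = np.asarray(z, dtype=float).reshape(-1, block_size)  # p blocks z[1], ..., z[p]
    block_l2 = np.linalg.norm(blocks, axis=1)
    return np.sum(block_l2 ** r) ** (1.0 / r)

z = np.array([3.0, 4.0, 0.0, 0.0, 1.0, 0.0])     # blocks of size 2: (3,4), (0,0), (1,0)
print(mixed_l2r_norm(z, block_size=2, r=0.5))    # (5**0.5 + 0 + 1)**2, about 10.47
```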

Author Contributions

ZZ contributed to the initial idea and wrote the first draft. JY provided critical feedback and helped to revise the manuscript.

Funding

This work is supported by the Swedish Research Council grant (Reg.No. 340-2013-5342).

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Footnote

1. ^ All figures can be reproduced from the code available at https://github.com/zzy583661/Weighted-l_r-l_1-minimization

References

1. Candes EJ, Tao T. Decoding by linear programming. IEEE Trans Inf Theory. (2005) 51:4203–15. doi: 10.1109/TIT.2005.858979


2. Donoho DL. Compressed sensing. IEEE Trans Inf Theory. (2006) 52:1289–306. doi: 10.1109/TIT.2006.871582


3. Eldar YC, Kutyniok G. Compressed Sensing: Theory and Applications. Cambridge: Cambridge University Press (2012).


4. Foucart S, Rauhut H. A Mathematical Introduction to Compressive Sensing. Vol. 1. New York, NY: Birkhäuser (2013).


5. Gribonval R, Nielsen M. Sparse representations in unions of bases. IEEE Trans Inf Theory. (2003) 49:3320–5. doi: 10.1109/TIT.2003.820031


6. Tropp JA. Greed is good: algorithmic results for sparse approximation. IEEE Trans Inf Theory. (2004) 50:2231–42. doi: 10.1109/TIT.2004.834793


7. Candes EJ. The restricted isometry property and its implications for compressed sensing. Comptes Rendus Math. (2008) 346:589–92. doi: 10.1016/j.crma.2008.03.014


8. Candes EJ, Romberg JK, Tao T. Stable signal recovery from incomplete and inaccurate measurements. Commun Pure Appl Math. (2006) 59:1207–23. doi: 10.1002/cpa.20124


9. Cohen A, Dahmen W, DeVore R. Compressed sensing and best k-term approximation. J Am Math Soc. (2009) 22:211–31. doi: 10.1090/S0894-0347-08-00610-3


10. Dirksen S, Lecué G, Rauhut H. On the gap between restricted isometry properties and sparse recovery conditions. IEEE Trans Inf Theory. (2018) 64:5478–87. doi: 10.1109/TIT.2016.2570244


11. Tang G, Nehorai A. Performance analysis of sparse recovery based on constrained minimal singular values. IEEE Trans Signal Process. (2011) 59:5734–45. doi: 10.1109/TSP.2011.2164913


12. Zhou Z, Yu J. Sparse recovery based on q-ratio constrained minimal singular values. Signal Process. (2019) 155:247–58. doi: 10.1016/j.sigpro.2018.10.002


13. Chen SS, Donoho DL, Saunders MA. Atomic decomposition by basis pursuit. SIAM J Sci Comput. (1998) 20:33–61.

14. Natarajan BK. Sparse approximate solutions to linear systems. SIAM J Sci Comput. (1995) 24:227–34. doi: 10.1137/S0097539792240406


15. Chartrand R. Exact reconstruction of sparse signals via nonconvex minimization. IEEE Signal Process Lett. (2007) 14:707–10. doi: 10.1109/LSP.2007.898300


16. Foucart S, Lai MJ. Sparsest solutions of underdetermined linear systems via ℓq-minimization for 0 < q ≤ 1. Appl Comput Harmon Anal. (2009) 26:395–407. doi: 10.1016/j.acha.2008.09.001


17. Li S, Lin J. Compressed Sensing with coherent tight frames via lq-minimization for 0 < q ≤ 1. Inverse Probl Imaging. (2014) 8:761–77. doi: 10.3934/ipi.2014.8.761


18. Lin J, Li S. Restricted q-isometry properties adapted to frames for nonconvex lq-analysis. IEEE Trans Inf Theory. (2016) 62:4733–47. doi: 10.1109/TIT.2016.2573312


19. Shen Y, Li S. Restricted p–isometry property and its application for nonconvex compressive sensing. Adv Comput Math. (2012) 37:441–52. doi: 10.1007/s10444-011-9219-y


20. Xu Z, Chang X, Xu F, Zhang H. L1/2 regularization: a thresholding representation theory and a fast solver. IEEE Trans Neural Netw Learn Syst. (2012) 23:1013–27. doi: 10.1109/TNNLS.2012.2197412


21. Chartrand R, Yin W. Iteratively reweighted algorithms for compressive sensing. In: IEEE International Conference on Acoustics, Speech and Signal Processing, 2008. (2008). p. 3869–72.


22. Lai MJ, Xu Y, Yin W. Improved iteratively reweighted least squares for unconstrained smoothed ℓq minimization. SIAM J Numer Anal. (2013) 51:927–57. doi: 10.1137/110840364


23. Lou Y, Yin P, He Q, Xin J. Computing sparse representation in a highly coherent dictionary based on difference of L1 and L2. J Sci Comput. (2015) 64:178–96. doi: 10.1007/s10915-014-9930-1


24. Yin P, Lou Y, He Q, Xin J. Minimization of ℓ1−2 for compressed sensing. SIAM J Sci Comput. (2015) 37:A536–63. doi: 10.1137/140952363


25. Yin P, Esser E, Xin J. Ratio and difference of l1 and l2 norms and sparse representation with coherent dictionaries. Commun Inf Syst. (2014) 14:87–109. doi: 10.4310/CIS.2014.v14.n2.a2


26. Lou Y, Yan M. Fast L1–L2 minimization via a proximal operator. J Sci Comput. (2018) 74:767–85. doi: 10.1007/s10915-017-0463-2


27. Wang Y. New Improved Penalty Methods for Sparse Reconstruction Based on Difference of Two Norms. Optimization Online, the Mathematical Optimization Society (2015). Available online at: http://www.optimization-online.org/DB_HTML/2015/03/4849.html

28. Wang D, Zhang Z. Generalized sparse recovery model and its neural dynamical optimization method for compressed sensing. Circ Syst Signal Process. (2017) 36:4326–53. doi: 10.1007/s00034-017-0532-7


29. Zhao Y, He X, Huang T, Huang J. Smoothing inertial projection neural network for minimization Lpq in sparse signal reconstruction. Neural Netw. (2018) 99:31–41. doi: 10.1016/j.neunet.2017.12.008


30. Chartrand R, Staneva V. Restricted isometry properties and nonconvex compressive sensing. Inverse Probl. (2008) 24:035020. doi: 10.1088/0266-5611/24/3/035020


31. Lopes ME. Unknown sparsity in compressed sensing: denoising and inference. IEEE Trans Inf Theory. (2016) 62:5145–66. doi: 10.1109/TIT.2016.2587772


32. Zhou Z, Yu J. On q-ratio CMSV for sparse recovery. arXiv [Preprint]. arXiv:180512022. (2018). Available online at: https://arxiv.org/abs/1805.12022


33. Tran H, Webster C. Unified sufficient conditions for uniform recovery of sparse signals via nonconvex minimizations. arXiv preprint arXiv:171007348. (2017).


34. Tao PD, An LTH. Convex analysis approach to dc programming: theory, algorithms and applications. Acta Math Vietnam. (1997) 22:289–355.


35. Tao PD, An LTH. A DC optimization algorithm for solving the trust-region subproblem. SIAM J Optim. (1998) 8:476–505. doi: 10.1137/S1052623494274313


36. Boyd S, Parikh N, Chu E, Peleato B, Eckstein J, et al. Distributed optimization and statistical learning via the alternating direction method of multipliers. Found Trends Mach Learn. (2011) 3:1–122. doi: 10.1561/2200000016


37. Needell D, Tropp JA. CoSaMP: iterative signal recovery from incomplete and inaccurate samples. Appl Comput Harmon Anal. (2009) 26:301–21. doi: 10.1016/j.acha.2008.07.002


38. Blumensath T, Davies ME. Iterative hard thresholding for compressed sensing. Appl Comput Harmon Anal. (2009) 27:265–74. doi: 10.1016/j.acha.2009.04.002


39. Lou Y, Zeng T, Osher S, Xin J. A weighted difference of anisotropic and isotropic total variation model for image processing. SIAM J Imaging Sci. (2015) 8:1798–23. doi: 10.1137/14098435X


40. Wang Y, Wang J, Xu Z. Restricted p-isometry properties of nonconvex block-sparse compressed sensing. Signal Process. (2014) 104:188–96. doi: 10.1016/j.sigpro.2014.03.040


Keywords: compressive sensing, nonconvex sparse recovery, iteratively reweighted least squares, difference of convex functions, q-ratio constrained minimal singular values

Citation: Zhou Z and Yu J (2019) A New Nonconvex Sparse Recovery Method for Compressive Sensing. Front. Appl. Math. Stat. 5:14. doi: 10.3389/fams.2019.00014

Received: 28 September 2018; Accepted: 22 February 2019;
Published: 14 March 2019.

Edited by:

Jean-Luc Bouchot, Beijing Institute of Technology, China

Reviewed by:

Junhong Lin, École Polytechnique Fédérale de Lausanne, Switzerland
Richard G. Lynch, Texas A&M University, United States

Copyright © 2019 Zhou and Yu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Zhiyong Zhou, zhiyong.zhou@umu.se

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.