Backdoors into Two Occurrences

Johannsen, Jan

doi:10.3233/SAT-200125

Article type: Research Article

Authors: Johannsen, Jan*

Affiliations: Institut für Informatik, Ludwig-Maximilians-Universität München, Germany

Correspondence: [*] Corresponding author. E-mail: [email protected].

Keywords: Backdoor set, fixed-parameter tractability, parameterized complexity, completeness, formulas with two variable occurrences

DOI: 10.3233/SAT-200125

Journal: Journal on Satisfiability, Boolean Modeling and Computation, vol. 12, no. 1, pp. 1-15, 2020

Revision received November 2017

Published: 29 June 2020

Abstract

Backdoor sets for the class CNF(2) of CNF-formulas in which every variable has at most two occurrences are studied in terms of parameterized complexity. The question whether there exists a CNF(2)-backdoor set of size k is hard for the class W[2], for both weak and strong backdoors, and in both cases it becomes fixed-parameter tractable when restricted to inputs in d-CNF for a fixed d.

Besides that, it is shown that the problem of finding weak backdoor sets is W[2]-complete, for certain tractable cases. These are the first completeness results in lower levels of the W-hierarchy for any backdoor set problems.

1. Introduction

Despite the theoretical hardness of the SAT problem, being the canonical NP-complete problem [1] and conjectured to not be solvable in sub-exponential time [6], state-of-the-art SAT solvers have become very efficient, and routinely solve instances arising from applications with hundreds of thousands of variables and millions of clauses. Even though there are lots of known tractable cases of SAT, i.e., classes of formulas that can be solved in polynomial time, the efficiently solvable instances arising in practice usually do not belong to these tractable classes, and thus the existence of these classes does not suffice to explain this apparent discrepancy between theory and practice.

A possibly better attempt at explaining the discrepancy is that the large, efficiently solvable instances are in some way close to a tractable case. One possible such notion of closeness is that there is a small subset of variables, such that after giving values to these variables the residual formula is in the tractable class. This concept was introduced by Crama et al. [2] and Williams et al. [13]; the latter work coined the name backdoor set for such a set of variables.

There are several kinds of backdoor sets for each tractable case considered in the literature. A strong backdoor set for the class C is a set of variables, such that for every setting of these variables the residual formula is in C. A weak backdoor set for C is a set of variables, such that for some setting of these variables the residual formula is in C and satisfiable. There is also the auxiliary notion of a deletion backdoor set to be defined below.

A strong backdoor set of size k allows one to solve a formula of size m in time 2^k⋅m^{O(1)}. This runtime bound depends on the two parameters (size of the formula m and backdoor set size k) in essentially different ways, and the proper framework to analyze such complexity bounds is fixed-parameter tractability and parameterized complexity as introduced by Downey and Fellows [3].

Besides the class of fixed-parameter tractable problems, the theory of parameterized complexity provides a hierarchy of classes of increasingly hard problems that are probably fixed-parameter intractable, the W-hierarchy containing in particular the classes W[t] for t∈N, and a notion of reduction that allows one to define hardness and completeness for these classes.

The complexity bound mentioned above thus means that, given a formula together with a strong backdoor set of size k, testing for satisfiability is fixed-parameter tractable w.r.t. the parameter k. Hence, if the problem of finding a backdoor set of size k were fixed-parameter tractable w.r.t. the parameter k as well, then the size of a smallest backdoor set would be a parameter with respect to which SAT is fixed-parameter tractable. For this reason, starting with the work of Nishimura et al. [11], the parameterized complexity of finding backdoor sets has been determined for various tractable cases; most of the results in that direction are collected in the recent survey by Gaspers and Szeider [5].

A less well-known tractable case is the class CNF(2) of CNF-formulas in which every variable has at most two occurrences. The satisfiability problem for these formulas can be solved in linear time [9], and it has been shown to be complete for deterministic logarithmic space [8].

In this work, we determine the parameterized complexity of finding backdoor sets w.r.t. the class CNF(2). We show that the problem is hard for the class W[2], for both weak and strong backdoor sets, and that in both cases it becomes fixed-parameter tractable when restricted to inputs in d-CNF for a fixed d.

For those tractable cases where the problem of finding backdoor sets is W[2]-hard, including CNF(2), the smallest parameterized complexity class known to contain the problem is W[P], which lies higher up in the W-hierarchy, beyond the classes W[t]. For the tractable cases of 0-valid and 1-valid formulas, we show that the weak backdoor set problem is complete for W[2]. To the best of our knowledge, these are the first completeness results in the W-hierarchy for any weak backdoor set problems.

In order to prove this latter result, we study a related artificial problem of finding so-called very weak backdoor sets, whose definition differs from that of weak backdoor sets by weakening the condition that the residual formula has to be satisfiable. We show that the problem of finding very weak backdoor sets for these classes is in W[2], by utilizing the characterization of the W-hierarchy in terms of first-order logic definability. The method also allows us to put the very weak backdoor set problem for other tractable cases into the class W[2], including the class CNF(2).

The paper is structured as follows: in Section 2 we review the necessary background on the problem SAT and its tractable cases, in particular the class CNF(2), on parameterized complexity and on backdoor sets. Section 3 treats some general properties of CNF(2)-backdoor sets. In Section 4 we show the results about weak CNF(2)-backdoor sets, and in Section 5 those about strong CNF(2)-backdoor sets. Finally Section 6 presents the mentioned upper bounds in the W-hierarchy.

2. Preliminaries

We briefly review basic notions about the propositional satisfiability problem, mainly to fix the notation.

A literal is a variable x or a negated variable x¯. A clause is a disjunction C=a1∨⋯∨ad of literals ai. The width w(C) of C is d, the number of literals in C. We identify a clause with the set of literals occurring in it, even though for clarity we still write it as a disjunction.

A formula in conjunctive normal form (CNF) is a conjunction F=C1∧⋯∧Cm of clauses; it is usually identified with the set of clauses {C1,…,Cm}. A formula F in CNF is in d-CNF if every clause C in F is of width w(C)⩽d.

A restriction α is a partial assignment α:V→{0,1} from the set of variables V to the truth values {0,1}. A restriction α is extended to literals by setting α(x¯):=1−α(x). We occasionally identify a restriction α with the set of literals it sets, i.e., {a;α(a)=1}.

For a clause C over the variables V, we define C⌈α=1 if α(a)=1 for some literal a∈C, and otherwise C⌈α is the disjunction of those literals a∈C for which α(a)=0 does not hold, i.e., which are left unset by α. Here the empty disjunction is identified with the constant 0.

For a CNF-formula F over V, we define F⌈α=0 if C⌈α=0 for some C∈F, and otherwise F⌈α is the conjunction of the clauses C⌈α for those clauses C∈F for which C⌈α=1 is not the case. Here the empty conjunction is identified with the constant 1.

In other words, the formula F⌈α is obtained by deleting clauses satisfied by α from F, and deleting literals falsified by α from the remaining clauses in F.
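The restriction operation can be sketched in a few lines of Python. This is only an illustration under assumptions that are not part of the text: literals are encoded as non-zero integers (v for the variable x_v, -v for its negation), a clause is a frozenset of literals, a formula is a set of clauses, and a restriction is a dict from variables to 0 or 1. Unlike the definition above, the sketch does not collapse the whole formula to the constant 0 when a clause is falsified; the falsified clause simply appears as the empty clause.

```python
def restrict(formula, alpha):
    """Return F restricted by alpha (illustrative encoding, see lead-in).

    Clauses satisfied by alpha are deleted; literals falsified by alpha
    are deleted from the remaining clauses.
    """
    result = set()
    for clause in formula:
        # A literal l is true under alpha if its variable is set to the
        # matching polarity; alpha.get returns None for unset variables.
        if any(alpha.get(abs(l)) == (1 if l > 0 else 0) for l in clause):
            continue  # clause satisfied, drop it
        # The clause is not satisfied, so every set literal in it is false.
        result.add(frozenset(l for l in clause if abs(l) not in alpha))
    return result


# (x1 or x2 or not x3) and (not x1 or x2), restricted by [x1 := 1]
F = {frozenset({1, 2, -3}), frozenset({-1, 2})}
print(restrict(F, {1: 1}))
```

With this encoding, α satisfies F exactly when `restrict(F, alpha)` is the empty set.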

For 𝜖=0,1, we denote by [x:=𝜖] the restriction setting the variable x to 𝜖. This notation is also extended to literals by letting [x¯:=𝜖] denote [x:=(1−𝜖)].

If F⌈α=1, then we say that α satisfies F and write α⊧F. The satisfiability problem SAT is the decision problem:

Instance:	Formula F in CNF.
Question:	Is there a restriction α with α⊧F?

This problem SAT is the canonical NP-complete problem [1], and the strong exponential time hypothesis [6] is a widely-believed conjecture that implies that SAT is not solvable in sub-exponential time.

A clause C is tautological, if both x and x¯ occur in C for some variable x. Since tautological clauses are satisfied by all restrictions of their variables, they are irrelevant for the satisfiability of a formula they appear in. Therefore, SAT and other related problems are often restricted to formulas that do not contain any tautological clauses. Except when noted, all our results hold for problems restricted in this way.

2.1. Tractable Cases of SAT

Despite its hardness, many easy special cases of the problem SAT have been identified. A tractable case of SAT, sometimes also called an “island of tractability”, is a class C of CNF-formulas such that

membership F∈C can be decided in polynomial time,
the satisfiability problem for formulas F∈C can be decided in polynomial time.

Many tractable cases of SAT have been defined and studied in the literature. Some well-known examples of such classes are the following:

The class HORN of Horn formulas, i.e., formulas in which every clause contains at most one positive literal.
The class CO-HORN of formulas in which every clause contains at most one negative literal.
The class 2-CNF of formulas in which every clause is of width at most 2.
The class 1-VAL of formulas where every clause contains at least one positive literal.
The class 0-VAL of formulas where every clause contains at least one negative literal.

Note that formulas from the latter two classes are trivially satisfiable by the assignment setting every variable to 1 (resp., 0).

All the above tractable cases are defined by properties of the individual clauses in the formula, and by the classification result of Schaefer [12], these are the only maximal such classes of CNF formulas.

A different tractable class which is not defined by properties of clauses is the class of cluster formulas [7,10]. Two clauses C and C′ clash if they contain complementary literals, i.e. if a∈C and a¯∈C′ for some literal a. A formula F is a hitting formula if any two different clauses in F clash. The class CLU of cluster formulas is the class of CNF-formulas that are variable-disjoint unions of hitting formulas.

Another less well-known tractable class is the class CNF(2) of formulas with at most two occurrences of every variable – in a way a dual to 2-CNF. It is known that satisfiability of formulas in CNF(2) can be tested in linear time [9] and in logarithmic space – in fact it is complete for the class of problems solvable in deterministic logarithmic space [8].

2.2. Parameterized Complexity

We briefly review the basic concepts of parameterized complexity as used in this work, mostly following the textbook of Flum and Grohe [4].

A parameterized problem is a decision problem P together with a parameterization, i.e., a polynomial time computable mapping that associates with every instance x of P a parameter k=k(x)∈N. We denote by n=n(x)∈N the size of an instance, the number of bits required to represent x.

A parameterized problem P with parameter k is fixed-parameter tractable if there is an algorithm that solves P in time f(k)⋅O(n^d) for some computable function f and d∈N. The class of parameterized problems that are fixed-parameter tractable is called FPT.

An FPT-reduction between parameterized problems P and P′ with parametrizations k and k′, resp., is a function r mapping instances of P to instances of P′ such that

r is computable in FPT-time f(k)⋅O(n^d) for some computable function f and d∈N,
r(x) is a positive instance of P′ iff x is a positive instance of P,
there is a computable function g s.t. for every instance x of P we have k′(r(x))⩽g(k(x)).

For a first-order formula φ(X) with a free relation variable X of arity d, the parameterized problem WD(φ) is the following:

Instance:	Structure A for the language of φ, k∈N.
Parameter:	k.
Question:	Is there a relation R⊆A^d of size |R|⩽k with A⊧φ(R)?

A first-order formula is a Πt-formula if it has at most t alternations of quantifiers, with the outermost quantifier being universal. A Σt-formula has at most t alternations of quantifiers, with the outermost one an existential. The class W[t] is the class of parameterized problems that are FPT-reducible to some problem WD(φ) for a Πt-formula φ(X).

The parameterized problem Weighted Circuit SAT is:

Instance:	Boolean circuit C, k∈N.
Parameter:	k.
Question:	Is there a satisfying assignment α⊧C with |{x;α(x)=1}|⩽k?

The class W[P] is the class of parameterized problems that are FPT-reducible to Weighted Circuit SAT. Since restricted forms of Weighted Circuit SAT (for circuits of constant depth and weft t, see [3]) are complete for the classes W[t], we have that

FPT⊆W[1]⊆W[2]⊆⋯⊆W[t]⊆⋯⊆W[P].

The parameterized problem Hitting Set is the following:

Instance:	Family {S1,…,Sm} of subsets Si⊆U, k∈N.
Parameter:	k.
Question:	Is there a set H⊆U of size |H|⩽k with H∩Si≠∅ for every i?

Hitting Set is one of the canonical W[2]-complete problems. To see that it is in W[2], view an instance as a structure with unary predicates U(x) for the elements of U, and set(x) for the sets Si, and the element relation ∈ between elements of U and sets. Then Hitting Set is the problem WD(φ) for the following Π2-formula φ(X) with a unary relation variable X:

φ(X)≡∀x(set(x)→∃y(U(y)∧X(y)∧y∈x)).
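As a small illustration of the problem itself (not of the FPT-reduction machinery), the following Python sketch checks the hitting set condition in the same "for every set there is a chosen element in it" shape as the formula, and finds a smallest hitting set by exhaustive search. The function names are hypothetical, and the search is exponential, so it is only meant for toy instances.

```python
from itertools import combinations


def is_hitting_set(family, H):
    """For every set S in the family, some element of H lies in S."""
    return all(any(y in S for y in H) for S in family)


def min_hitting_set(family):
    """Smallest hitting set by brute force, or None if none exists."""
    universe = sorted(set().union(*family))
    for k in range(len(universe) + 1):
        for candidate in combinations(universe, k):
            if is_hitting_set(family, set(candidate)):
                return set(candidate)
    return None  # only happens if the family contains the empty set


family = [{1, 2}, {2, 3}, {4}]
print(min_hitting_set(family))
```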

2.3. Backdoor Sets

Let F be a CNF-formula in the variables V, and let C be a tractable case of SAT. Let X⊆V be a set of variables. The formula F∖X is the formula obtained by deleting from the clauses in F all occurrences of literals x and x¯ for x∈X.

A strong C-backdoor set for F is a subset X⊆V such that for every assignment α:X→{0,1}, the formula F⌈α is in C.
A weak C-backdoor set for F is a subset X⊆V such that there is an assignment α:X→{0,1} such that the formula F⌈α is in C and satisfiable.
A deletion C-backdoor set for F is a subset X⊆V such that the formula F∖X is in C.

If the class C is closed under subsets, then every deletion backdoor set is also a strong backdoor set, since F⌈α⊆F∖X for every α:X→{0,1}. This fact is useful since deletion backdoor sets are usually easier to find than strong backdoor sets. For classes C that are also closed under unions, strong and deletion backdoor sets coincide, since F∖X=⋃αF⌈α where the union is over all α:X→{0,1}.

The parameterized problem Strong C-Backdoor Set is:

Instance:	CNF-formula F, and k∈N.
Parameter:	k.
Question:	Is there a strong C-backdoor set for F of size k?

The problems Weak C-Backdoor Set and Deletion C-Backdoor Set are defined analogously.

The parameterized complexity of these problems for various tractable classes of formulas has been determined in the literature. For the classes C=HORN, CO-HORN, 2-CNF, 1-VAL and 0-VAL, the problem Strong C-Backdoor Set is fixed-parameter tractable. Since these classes are defined by properties of individual clauses, they are closed under subsets and unions, so the problem is the same as Deletion C-Backdoor Set, which is easily seen to be fixed-parameter tractable in these cases.

The problem Weak C-Backdoor Set for all of these classes, on the other hand, is W[2]-hard, but fixed-parameter tractable when the input is restricted to formulas in d-CNF for a fixed d∈N. For the class of cluster formulas, both problems Strong CLU-Backdoor Set and Weak CLU-Backdoor Set are W[2]-hard, but are in FPT when restricted to inputs in d-CNF. These results, together with many more results for other tractable cases, can be found in a recent survey by Gaspers and Szeider [5].

3. Backdoors into CNF(2)

Since the class CNF(2) is obviously not closed under unions, deletion and strong backdoor sets do not necessarily coincide for this class. Indeed, they do not coincide in general, as the following example shows. Consider the following set of clauses:

(x1∨x2∨x¯3),(x¯1∨x2∨x5),(x¯1∨x¯3∨x¯5),(x¯2∨x4),(x3∨x¯4).

The variables {x1,x2,x3} occur three times each, so the smallest deletion CNF(2)-backdoor set has size 3. But {x1} is a strong CNF(2)-backdoor set, showing that strong backdoors can be smaller than deletion backdoors for this class.
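The example can be checked mechanically. The sketch below uses an assumed encoding that is not part of the text (literals as non-zero integers, clauses as frozensets, formulas as sets of clauses) and brute-force search over variable subsets, so it is only suitable for tiny formulas; all function names are illustrative.

```python
from itertools import combinations, product


def restrict(formula, alpha):
    """F restricted by alpha: drop satisfied clauses, delete false literals."""
    out = set()
    for clause in formula:
        if any(alpha.get(abs(l)) == (1 if l > 0 else 0) for l in clause):
            continue
        out.add(frozenset(l for l in clause if abs(l) not in alpha))
    return out


def delete_vars(formula, X):
    """F minus X: delete all literals over variables in X from every clause."""
    return {frozenset(l for l in c if abs(l) not in X) for c in formula}


def in_cnf2(formula):
    """Every variable occurs in at most two clauses (set representation)."""
    counts = {}
    for clause in formula:
        for v in {abs(l) for l in clause}:
            counts[v] = counts.get(v, 0) + 1
    return all(n <= 2 for n in counts.values())


def is_strong_backdoor(formula, X):
    xs = sorted(X)
    return all(in_cnf2(restrict(formula, dict(zip(xs, bits))))
               for bits in product((0, 1), repeat=len(xs)))


def is_deletion_backdoor(formula, X):
    return in_cnf2(delete_vars(formula, set(X)))


def smallest(formula, predicate):
    """Smallest variable set satisfying the predicate, by exhaustive search."""
    variables = sorted({abs(l) for c in formula for l in c})
    for k in range(len(variables) + 1):
        for X in combinations(variables, k):
            if predicate(formula, X):
                return set(X)


# The five clauses of the example, with x_i encoded as i.
EXAMPLE = {frozenset({1, 2, -3}), frozenset({-1, 2, 5}),
           frozenset({-1, -3, -5}), frozenset({-2, 4}), frozenset({3, -4})}

print(smallest(EXAMPLE, is_deletion_backdoor))  # a smallest deletion backdoor
print(smallest(EXAMPLE, is_strong_backdoor))    # a smallest strong backdoor
```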

For the base class CNF(2), it matters whether formulas are represented as multisets, with multiple occurrences of the same clause allowed, or as sets, since the number of occurrences of variables is counted differently in both cases.

If formulas are represented as multisets, then the smallest deletion CNF(2)-backdoor set is exactly the set of variables with more than two occurrences, hence we trivially have:

Proposition 1.

For formulas represented as multisets, the problem Deletion CNF(2)-Backdoor Set can be solved in linear time.

On the other hand, for formulas represented as sets, clauses can become equal – and hence identified – if literals are deleted or set to 0. To show that this actually makes a difference note the following proposition.

Proposition 2.

For formulas represented as sets, the problem Deletion CNF(2)-Backdoor Set is NP-hard.

Proof: We reduce the well-known NP-hard problem Vertex Cover to Deletion CNF(2)-Backdoor Set. For a graph G=(V,E), define a formula F=F(G) as follows: F has a variable xv for every vertex v∈V, and for every edge e={u,v} a subformula Fe of three clauses:

xu∨xv,x¯u∨xv,xu∨x¯v.

Now if U is a vertex cover in G, the set XU={xu;u∈U} is a deletion CNF(2)-backdoor set: after deleting the variable xu for u∈U, the remaining formula consists of the unit clauses xv and x¯v for v∈V∖U.

On the other hand, if U⊆V is not a vertex cover, let e={u,v} be an uncovered edge. Then the subformula Fe remains unchanged after deleting the variables in XU, so xu and xv occur at least three times each, and XU is not a deletion CNF(2)-backdoor set. Thus F has a deletion CNF(2)-backdoor set of size k iff G has a vertex cover of size k. □
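The reduction is easy to try out on small graphs. The following sketch builds F(G) under an assumed encoding (vertex v is a positive integer and doubles as the variable x_v, clauses are frozensets of signed integers, formulas are sets of clauses) and compares the minimum deletion backdoor size with the minimum vertex cover size by brute force. The function names are hypothetical and the search is exponential.

```python
from itertools import combinations


def formula_from_graph(edges):
    """F(G): three clauses per edge {u, v}, as in the proof above."""
    formula = set()
    for u, v in edges:
        formula |= {frozenset({u, v}), frozenset({-u, v}), frozenset({u, -v})}
    return formula


def in_cnf2(formula):
    counts = {}
    for clause in formula:
        for v in {abs(l) for l in clause}:
            counts[v] = counts.get(v, 0) + 1
    return all(n <= 2 for n in counts.values())


def delete_vars(formula, X):
    # Building a set merges clauses that become equal after deletion.
    return {frozenset(l for l in c if abs(l) not in X) for c in formula}


def min_deletion_backdoor_size(formula):
    variables = sorted({abs(l) for c in formula for l in c})
    for k in range(len(variables) + 1):
        if any(in_cnf2(delete_vars(formula, set(X)))
               for X in combinations(variables, k)):
            return k


def min_vertex_cover_size(edges):
    vertices = sorted({v for e in edges for v in e})
    for k in range(len(vertices) + 1):
        if any(all(u in U or v in U for u, v in edges)
               for U in combinations(vertices, k)):
            return k


triangle = [(1, 2), (2, 3), (1, 3)]
print(min_vertex_cover_size(triangle),
      min_deletion_backdoor_size(formula_from_graph(triangle)))
```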

We will focus on the case of formulas represented as sets in the remainder of the paper.

4. Weak Backdoors

We now show the hardness of finding weak CNF(2)-backdoor sets.

Theorem 3.

Weak CNF(2)-Backdoor Set is W[2]-hard.

Proof: We reduce Hitting Set to Weak CNF(2)-Backdoor Set. Let an instance S={S1,…,Sm} of Hitting Set be given, with U=S1∪⋯∪Sm. We construct a formula F=F(S) in the variables xs for s∈U plus zi,1 and zi,2 for 1⩽i⩽m, such that F has a weak CNF(2)-backdoor set of size k iff S has a hitting set of size k.

The formula F has for every set Si∈S the subformula Di consisting of the three clauses

\[ \bigvee_{s\in S_i} x_s \lor \bar z_{i,1}, \qquad z_{i,1}\lor z_{i,2}, \qquad z_{i,1}\lor \bar z_{i,2}. \]

Let H⊆U be a hitting set for S. We show that XH:={xs;s∈H} is a weak CNF(2)-backdoor set for F(S) of the same size. Let α be the assignment with α(xs)=1 for every s∈H. Then since H hits every set Si, it follows that α⊧∨s∈Sixs for every i, and hence Di⌈α=(zi,1∨zi,2)∧(zi,1∨z¯i,2) is in CNF(2) and satisfiable for every i. Thus F⌈α is in CNF(2) and satisfiable.

For the other direction, let B be a weak CNF(2)-backdoor set for F, and let α be an assignment to the variables in B such that F⌈α is in CNF(2) and satisfiable. We first show that without loss of generality, B contains only variables xs for s∈U. If B contains a variable zi,b, then the assignment α restricted to the variables in B∖{zi,b} will still leave every subformula Dj for j≠i in CNF(2) and satisfiable. Thus if we let B′=B∖{zi,b}∪{xs} for an arbitrary s∈Si, and let α′(xs)=1 and α′ coincide with α for all variables in B∖{zi,b}, then F⌈α′ is in CNF(2) and satisfiable, thus B′ is a weak CNF(2)-backdoor set with |B′|⩽|B|.

Now we define HB:={s;xs∈B}, and show that HB is a hitting set for S of size |B|. Assume for contradiction that Si∩HB=∅. Then the formula Di is left unchanged in F⌈α, and therefore the variable zi,1 has three occurrences in F⌈α. Hence B cannot be a weak CNF(2)-backdoor set. □
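The construction F(S) can be tested on toy instances. The sketch below assumes an encoding that is not in the text: elements of U are mapped to variables 1, 2, … and the z-variables get the following indices, with literals as signed integers. Satisfiability and minimum sizes are computed by brute force, which only stands in for the real algorithms and is exponential.

```python
from itertools import combinations, product


def restrict(formula, alpha):
    out = set()
    for clause in formula:
        if any(alpha.get(abs(l)) == (1 if l > 0 else 0) for l in clause):
            continue
        out.add(frozenset(l for l in clause if abs(l) not in alpha))
    return out


def in_cnf2(formula):
    counts = {}
    for clause in formula:
        for v in {abs(l) for l in clause}:
            counts[v] = counts.get(v, 0) + 1
    return all(n <= 2 for n in counts.values())


def satisfiable(formula):
    """Brute-force satisfiability test (stand-in, exponential)."""
    variables = sorted({abs(l) for c in formula for l in c})
    return any(not restrict(formula, dict(zip(variables, bits)))
               for bits in product((0, 1), repeat=len(variables)))


def formula_from_family(family):
    """F(S): the subformula D_i for every set S_i of the family."""
    universe = sorted(set().union(*family))
    x = {s: i + 1 for i, s in enumerate(universe)}
    formula, nxt = set(), len(universe)
    for S in family:
        z1, z2 = nxt + 1, nxt + 2
        nxt += 2
        formula |= {frozenset({x[s] for s in S} | {-z1}),
                    frozenset({z1, z2}), frozenset({z1, -z2})}
    return formula


def is_weak_backdoor(formula, X):
    xs = sorted(X)
    for bits in product((0, 1), repeat=len(xs)):
        residual = restrict(formula, dict(zip(xs, bits)))
        if in_cnf2(residual) and satisfiable(residual):
            return True
    return False


def min_weak_backdoor_size(formula):
    variables = sorted({abs(l) for c in formula for l in c})
    for k in range(len(variables) + 1):
        if any(is_weak_backdoor(formula, X) for X in combinations(variables, k)):
            return k


def min_hitting_set_size(family):
    universe = sorted(set().union(*family))
    for k in range(len(universe) + 1):
        if any(all(set(H) & S for S in family) for H in combinations(universe, k)):
            return k


family = [{1, 2}, {2, 3}]
print(min_hitting_set_size(family),
      min_weak_backdoor_size(formula_from_family(family)))
```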

In the survey [5] of Gaspers and Szeider, there is a generic construction to show the W[2]-hardness of the problem Weak C-Backdoor Set for a class C. Our reduction is based on that construction, but it is simplified, and it has the property that the formula F(S) only depends on S and is independent from the parameter k. In other words, it is a polynomial time reduction between the underlying classical problems that does not increase the parameter. The same simplification can also be made to other applications of the generic construction in [5], e.g. for C=HORN and C=2-CNF.

On the other hand, the problem of finding weak CNF(2)-backdoor sets becomes fixed-parameter tractable when restricted to inputs in 3-CNF:

Theorem 4.

The problem Weak CNF(2)-Backdoor Set for 3-CNF-formulas is fixed-parameter tractable.

Proof: We will devise a bounded search tree algorithm that, given a formula F and parameter k, will search for a restriction α of size |α|⩽k such that F⌈α is in CNF(2) and satisfiable. We will call such a restriction a backdoor restriction; its domain is a weak CNF(2)-backdoor set. Obviously, a backdoor restriction exists if and only if a weak CNF(2)-backdoor set exists.

A set of three clauses C1,C2,C3 in F that share a common variable x will be called an obstruction.

Proposition 5.

Let an obstruction C1,C2,C3 in F be given. Then every backdoor restriction α for F must set some variable that occurs in C1,C2,C3.

This holds because otherwise we have Ci⌈α=Ci for i=1,2,3 and therefore F⌈α still contains the obstruction, and is thus not in CNF(2).

Once we have chosen a literal to be set, the following obvious proposition shows the correctness of the recurrence that the search tree algorithm is based on.

Proposition 6.

F has a backdoor restriction of size k that contains the assignment [a:=1] iff F⌈[a:=1] has a backdoor restriction of size k−1.

These two propositions show the correctness of the following algorithm, which finds a backdoor restriction of size k in a 3-CNF-formula F if one exists.

Build a search tree of depth k, where at each node v at depth d we keep a partial assignment αv of size |αv|=d. At the root we have α=∅. A node v is closed if F⌈αv is in CNF(2).

To extend the tree from a node v of depth d<k that is not closed and labeled with α, find an obstruction C1,C2,C3 in F⌈α with the common variable x.

Now for each literal a based on a variable occurring in C1,C2,C3, add a child to v with the assignment α∪[a:=1]. These are at most 14 children since the clauses C1,C2,C3 together contain at most 6 distinct variables besides x.

Now, for every closed leaf v labeled α, test whether F⌈α is satisfiable. If so, α is a backdoor restriction. If there is no closed leaf, or for no closed leaf v the residual formula F⌈αv is satisfiable, then F does not have a backdoor restriction of size k, and hence no weak CNF(2)-backdoor set of size k.

Since every inner node has at most 14 children, the size of the search tree is at most 14^k, and therefore the runtime is O(14^k⋅n). □
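The search tree algorithm can be sketched recursively as follows. This is a minimal illustration, not the implementation behind the stated bounds: the encoding (signed integers for literals, frozensets for clauses) is assumed, the tree is explored depth-first, and the satisfiability test for closed nodes uses plain Davis-Putnam variable elimination in place of the linear-time procedure of [9]. On CNF(2) inputs each elimination step produces at most one resolvent, so the formula does not grow.

```python
def restrict(formula, alpha):
    out = set()
    for clause in formula:
        if any(alpha.get(abs(l)) == (1 if l > 0 else 0) for l in clause):
            continue
        out.add(frozenset(l for l in clause if abs(l) not in alpha))
    return out


def find_obstruction(formula):
    """Three clauses sharing a variable, or None if the formula is in CNF(2)."""
    by_var = {}
    for clause in formula:
        for v in {abs(l) for l in clause}:
            by_var.setdefault(v, []).append(clause)
            if len(by_var[v]) == 3:
                return by_var[v]
    return None


def sat_cnf2(formula):
    """Satisfiability by variable elimination (stand-in for [9])."""
    clauses = {c for c in formula if not any(-l in c for l in c)}
    while True:
        if frozenset() in clauses:
            return False
        if not clauses:
            return True
        v = abs(next(iter(next(iter(clauses)))))
        pos = [c for c in clauses if v in c]
        neg = [c for c in clauses if -v in c]
        clauses -= set(pos) | set(neg)
        for p in pos:
            for n in neg:
                resolvent = (p | n) - {v, -v}
                if not any(-l in resolvent for l in resolvent):
                    clauses.add(resolvent)  # skip tautological resolvents


def backdoor_restriction(formula, k):
    """A restriction of size <= k leaving a satisfiable CNF(2) formula, or None."""
    obstruction = find_obstruction(formula)
    if obstruction is None:  # closed node
        return {} if sat_cnf2(formula) else None
    if k == 0:
        return None
    # Branch on both values of every variable of the obstruction.
    for v in sorted({abs(l) for c in obstruction for l in c}):
        for value in (0, 1):
            sub = backdoor_restriction(restrict(formula, {v: value}), k - 1)
            if sub is not None:
                return {v: value, **sub}
    return None


# The example formula from Section 3, with x_i encoded as i.
EXAMPLE = {frozenset({1, 2, -3}), frozenset({-1, 2, 5}),
           frozenset({-1, -3, -5}), frozenset({-2, 4}), frozenset({3, -4})}
print(backdoor_restriction(EXAMPLE, 1))
```

A closed node whose residual formula is unsatisfiable is not extended further, since any further restriction of an unsatisfiable formula stays unsatisfiable.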

The algorithm generalizes in the obvious way to formulas in d-CNF for every fixed d∈N, where the branching degree of the search tree is 2(3(d−1)+1)=6d−4, and hence its size is at most (6d−4)^k, which yields a runtime of (6d−4)^k⋅O(n).

5. Strong Backdoors

Next, we show the hardness of finding strong CNF(2)-backdoor sets. Unfortunately, the proof of this result only works when the input formulas are allowed to contain tautological clauses. The complexity of the problem restricted to formulas without tautological clauses remains open.

Theorem 7.

Strong CNF(2)-Backdoor Set is W[2]-hard.

Proof: We reduce Hitting Set to Strong CNF(2)-Backdoor Set. The reduction is similar to that used in the proof of Theorem 3.

Let an instance S={S1,…,Sm} of Hitting Set be given, with U:=S1∪⋯∪Sm. We construct a formula F=F(S) in the variables xs for s∈U plus zi,1,zi,2 for 1⩽i⩽m, such that F has a strong CNF(2)-backdoor set of size k iff S has a hitting set of size k.

The formula F has three clauses for every set Si∈S, viz.

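\[ C_i := \bigvee_{s\in U} \bar x_s \lor \bigvee_{s\in S_i} x_s \lor z_{i,1}, \qquad \bar z_{i,1}\lor z_{i,2}, \qquad \bar z_{i,1}\lor \bar z_{i,2}. \]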

Let H⊆U be a hitting set for S. We show that XH:={xs;s∈H} is a strong CNF(2)-backdoor set for F(S) of the same size.

Let α be an assignment to the variables in XH. If α(xs)=0 for some s∈H, then α satisfies ∨s∈Ux¯s, and hence α⊧Ci for every i. If, on the other hand, α(xs)=1 for every s∈H, then since H is a hitting set, α satisfies ∨s∈Sixs and hence α⊧Ci for every i. Thus in either case F⌈α∈CNF(2).

Now let B be a strong CNF(2)-backdoor set for F. As in the proof of Theorem 3, we first show that without loss of generality, B contains only variables xs for s∈U. If B contains a variable zi,b, then as before we can exchange it for a variable xs for an arbitrary s∈Si.

Now as above, we define HB:={s;xs∈B}, and show that HB is a hitting set for S of size |B|. Assume for contradiction that Si∩HB=∅, and let α be the assignment with α(xs)=1 for every xs∈B. Then α does not satisfy any of the clauses associated with Si, and thus the variable zi,1 occurs three times in F⌈α. Hence B cannot have been a strong CNF(2)-backdoor set. □

The comment after the proof of Theorem 3 applies here as well: the reduction from Hitting Set to Strong CNF(2)-Backdoor Set given is actually a polynomial time reduction that does not change the parameter.

As in the case of weak CNF(2)-backdoor sets, the problem of finding strong CNF(2)-backdoor sets becomes fixed-parameter tractable when restricted to inputs in 3-CNF.

Theorem 8.

Strong CNF(2)-Backdoor Set for 3-CNF-formulas is fixed-parameter tractable.

Proof: Let F be a 3-CNF-formula, and X a subset of the variables of F. The following two propositions show the correctness of the bounded search tree algorithm to be presented below.

Proposition 9.

Let α:X→{0,1} be a restriction. If C1,C2,C3 is an obstruction in F⌈α, then every strong CNF(2)-backdoor set for F extending X contains a variable that occurs in C1,C2,C3.

Otherwise, if Y⊇X is a set that is disjoint from the variables of C1,C2,C3, then for any restriction β:Y→{0,1} that extends α the formula F⌈β still contains the obstruction C1,C2,C3, and is therefore not in CNF(2).

Once we have chosen a variable to be included in the backdoor set, the following proposition, which holds obviously, shows the correctness of the recurrence that the search tree algorithm is based on.

Proposition 10.

F has a strong CNF(2)-backdoor set of size k that contains the variable x iff there is a set B of size |B|⩽k−1 that is a strong CNF(2)-backdoor set for F⌈[x:=0] and for F⌈[x:=1].

We build a search tree of depth k, where at each node v of depth d a set of variables Xv of size |Xv|=d is kept. Together with Xv we keep a set Cv⊆2^{Xv} of closed assignments, with the property that F⌈α∈CNF(2) for every α∈Cv. A node v is closed if Cv=2^{Xv}.

To extend the tree from a node of depth d<k labeled (X,C) that is not closed, pick an assignment α:X→{0,1} such that F⌈α∉CNF(2), and an obstruction, i.e., three clauses C1,C2,C3 in F⌈α that share a common variable x.

For each variable y occurring in C1,C2,C3, add a child to v labeled with X∪{y}. For each of these children, add a set C′ of assignments. To determine the set C′, we first put both extensions α∪{[y:=0]} and α∪{[y:=1]} of every closed assignment α∈C into C′. Then for every open assignment β∈2^X∖C we consider the extension β0:=β∪{[y:=0]}, and test whether F⌈β0 is in CNF(2). If that is the case, we add β0 to C′. We perform the same for the assignment β1:=β∪{[y:=1]}.

If a node is closed, i.e., C=2^X, then X is a strong backdoor set. If on the other hand, no closed node has been found up to depth k, then by the two Propositions 9 and 10 above there is no strong backdoor set of size k.

Since the three clauses C1,C2,C3 contain at most 7 variables, the size of the search tree is bounded by 7^k. At each node we need to perform at most 2^k⋅O(n) computation steps, so the runtime is bounded by 14^k⋅O(n). □
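A simplified version of this search can be sketched as follows. It deviates from the description above in one respect, which is an assumption made for brevity: instead of maintaining the set of closed assignments at each node, it re-tests all assignments to X at every node, which stays within the same 2^k⋅O(n) work per node. The encoding (signed integers, frozensets) and the function names are illustrative.

```python
from itertools import product


def restrict(formula, alpha):
    out = set()
    for clause in formula:
        if any(alpha.get(abs(l)) == (1 if l > 0 else 0) for l in clause):
            continue
        out.add(frozenset(l for l in clause if abs(l) not in alpha))
    return out


def find_obstruction(formula):
    """Three clauses sharing a variable, or None if the formula is in CNF(2)."""
    by_var = {}
    for clause in formula:
        for v in {abs(l) for l in clause}:
            by_var.setdefault(v, []).append(clause)
            if len(by_var[v]) == 3:
                return by_var[v]
    return None


def strong_backdoor(formula, k, X=frozenset()):
    """A strong CNF(2)-backdoor set of size <= k extending X, or None."""
    xs = sorted(X)
    obstruction = None
    # Look for an open assignment, i.e. one whose residual is not in CNF(2).
    for bits in product((0, 1), repeat=len(xs)):
        obstruction = find_obstruction(restrict(formula, dict(zip(xs, bits))))
        if obstruction is not None:
            break
    if obstruction is None:
        return set(X)  # node is closed: every assignment leads into CNF(2)
    if len(X) == k:
        return None
    # By Proposition 9, some variable of the obstruction must be added.
    for y in sorted({abs(l) for c in obstruction for l in c}):
        found = strong_backdoor(formula, k, X | {y})
        if found is not None:
            return found
    return None


# The example formula from Section 3, with x_i encoded as i.
EXAMPLE = {frozenset({1, 2, -3}), frozenset({-1, 2, 5}),
           frozenset({-1, -3, -5}), frozenset({-2, 4}), frozenset({3, -4})}
print(strong_backdoor(EXAMPLE, 1))
```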

As in the case of the algorithm for weak backdoor sets, this algorithm generalizes in the obvious way to formulas in d-CNF for every fixed d∈N, with a larger search tree size and thus a larger exponential dependence on the parameter k.

6. Upper Bounds

For most weak backdoor set problems that are not known to be in FPT, there is no exact characterization of their complexity. With the exception of a few cases which are W[P]-complete, for most of the other cases it is only known that they are W[2]-hard and in W[P]. We will now show the – to the best of our knowledge – first W[2]-completeness results for weak backdoor set problems in the following theorem.

Theorem 11.

For the classes C=1-VAL and 0-VAL, the problem Weak C-Backdoor Set is W[2]-complete.

Since finding weak backdoor sets for these classes is known to be W[2]-hard [5], we only need to show that they are in W[2].

Since all formulas in 1-VAL and 0-VAL are satisfiable, weak backdoor sets into these classes are the same as the very weak backdoor sets defined next. This is an admittedly artificial notion of backdoor set where we weaken the requirement that the residual formula is satisfiable to the condition that it is not already false.

We define a very weak C-backdoor set for F to be a subset X⊆V such that there is an assignment α:X→{0,1} such that the formula F⌈α is in C and non-trivial, i.e., F⌈α≠0.

The parameterized problem Very Weak C-Backdoor Set is:

Instance:	CNF-formula F, and k∈N.
Parameter:	k.
Question:	Is there a very weak C-backdoor set for F of size k?

For the classes 1-VAL and 0-VAL, this problem is equivalent to Weak C-Backdoor Set, thus to prove Theorem 11 it suffices to show it is in W[2]. Since our technique readily generalizes to other cases, we show that the problem of finding very weak C-backdoor sets is in W[2], for various tractable cases C. For the other cases besides 1-VAL and 0-VAL, this might turn out to be useful in the future.

We show that these problems are in the class W[2] by making use of the logical characterization of this class.

Proposition 12.

For each of the tractable classes

C=CNF(2),HORN,CO-HORN,2-CNF,CLU,1-VAL and 0-VAL,

the problem Very Weak C-Backdoor Set is in W[2].

Proof: View a CNF-formula F as a structure whose elements are the literals of F and the clauses of F, with the relations described in Table 1.

Table 1.

Relations of a formula as a structure

Relation	Meaning
lit(a)	a is a literal
cl(c)	c is a clause
occ(a,c)	literal a occurs in clause c
comp(a,b)	a and b are complementary literals
pos(a)	a is a positive literal
neg(a)	a is a negative literal

For each of the classes C in the statement of the proposition, we will define a Π2-formula φC(A) in this language with a free set variable A expressing that A is a backdoor restriction into the class C, i.e. F⊧φC(α) for a restriction α iff F⌈α∈C. We also define a Π2-formula ntrivA expressing that the formula F after restriction by A is nontrivial. Thus the problem Very Weak C-Backdoor Set is equivalent to the problem WD(ψC), where ψC:≡ntrivA∧φC(A). Since both formulas ntrivA and φC(A) are Π2-formulas, this shows that these problems are in W[2].

We start by expressing that literal a is false under A. By uniqueness of the complementary literal, this can be expressed by the following formula:

falseA(a):≡∃x(comp(a,x)∧A(x))≡∀x(comp(a,x)→A(x)).

Note that this formula is a Δ1-formula, i.e., it is equivalent to both a Π1- and a Σ1-formula. Therefore it can be used like an atomic formula without increasing the quantifier complexity.

We then can express the condition that the formula F restricted by A is non-trivial by the Π2-formula

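\[ \mathrm{ntriv}_A :\equiv \forall c\,\bigl(\mathrm{cl}(c)\to\exists a\,(\mathrm{lit}(a)\land\mathrm{occ}(a,c)\land\lnot\mathrm{false}_A(a))\bigr). \]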

We define a Σ1-formula stating that the clause c is satisfied by A as:

satA(c):≡∃x(occ(x,c)∧A(x)).

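The following $\Pi_1$-formulas express that, after restriction by $A$, clause $c_1$ is contained in clause $c_2$, and that clauses $c_1$ and $c_2$ are equal, respectively:

$$\mathrm{sub}_A(c_1,c_2) :\equiv \forall x\,(\mathrm{occ}(x,c_1) \rightarrow \mathrm{false}_A(x) \vee \mathrm{occ}(x,c_2)),$$

$$\mathrm{eq}_A(c_1,c_2) :\equiv \mathrm{sub}_A(c_1,c_2) \wedge \mathrm{sub}_A(c_2,c_1).$$

We can express that the variable underlying literal $a$ occurs in clause $c$ by the $\Delta_1$-formula

$$\mathrm{varocc}(a,c) :\equiv \mathrm{occ}(a,c) \vee \exists x\,(\mathrm{comp}(a,x) \wedge \mathrm{occ}(x,c)) \equiv \mathrm{occ}(a,c) \vee \forall x\,(\mathrm{comp}(a,x) \rightarrow \mathrm{occ}(x,c)).$$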

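With the aid of these formulas, we can write down the formula $\varphi_{\mathrm{CNF}(2)}(A)$ as:

$$\forall c_1,c_2,c_3\,\Bigl(\bigwedge_{1\leqslant i\leqslant 3}\mathrm{cl}(c_i) \rightarrow \Bigl(\bigvee_{1\leqslant i\leqslant 3}\mathrm{sat}_A(c_i) \vee \bigvee_{1\leqslant i<j\leqslant 3}\mathrm{eq}_A(c_i,c_j) \vee \forall a\,\bigl(\mathrm{lit}(a) \rightarrow A(a) \vee \mathrm{false}_A(a) \vee \bigvee_{1\leqslant i\leqslant 3}\neg\mathrm{varocc}(a,c_i)\bigr)\Bigr)\Bigr).$$

This formula $\varphi_{\mathrm{CNF}(2)}(A)$ states that for any three clauses one of the following holds:

either one of them is satisfied by $A$,
or two of them are equal under $A$,
or every variable that is not set by $A$ is absent from at least one of them.

Thus $F \models \varphi_{\mathrm{CNF}(2)}(\alpha)$ if and only if $F\lceil\alpha$ is in CNF(2).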
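The three-clause condition can be checked directly on the restricted formula. The following Python sketch assumes, purely for illustration, that literals are nonzero integers with complement `-a`, clauses are frozensets, and `A` is a consistent set of literals. The helper names `restrict` and `in_cnf2_after` are hypothetical. Like the formula, the sketch counts the distinct unsatisfied clauses in which an unset variable occurs, and it identifies clauses that become equal under `A`.

```python
from collections import defaultdict
from typing import FrozenSet, Iterable, Set


def restrict(F: Iterable[FrozenSet[int]], A: Set[int]) -> Set[FrozenSet[int]]:
    """Drop clauses satisfied by A and delete literals falsified by A.

    Returning a set merges clauses that are equal after restriction.
    """
    return {
        frozenset(x for x in c if -x not in A)
        for c in F
        if not any(x in A for x in c)
    }


def in_cnf2_after(F: Iterable[FrozenSet[int]], A: Set[int]) -> bool:
    """True iff no unset variable occurs in three distinct residual clauses."""
    clauses_per_var = defaultdict(int)
    for c in restrict(F, A):
        for v in {abs(x) for x in c}:
            clauses_per_var[v] += 1
    return all(n <= 2 for n in clauses_per_var.values())
```

A violation of the formula is exactly a triple of unsatisfied, pairwise distinct residual clauses sharing an unset variable, so counting residual clauses per variable gives the same answer as quantifying over all triples.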

We can use the same technique to obtain upper bounds on the complexity of finding very weak backdoor sets for other tractable cases:

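The following $\Pi_2$-formula $\varphi_{2\text{-CNF}}(A)$ states that $F$ restricted by $A$ is a 2-CNF-formula.

$$\forall c\,\Bigl(\mathrm{cl}(c) \rightarrow \mathrm{sat}_A(c) \vee \forall a_1,a_2,a_3\,\Bigl(\bigwedge_{1\leqslant i\leqslant 3}\mathrm{lit}(a_i) \rightarrow \Bigl(\bigvee_{1\leqslant i<j\leqslant 3} a_i=a_j \vee \bigvee_{1\leqslant i\leqslant 3}A(a_i) \vee \bigvee_{1\leqslant i\leqslant 3}\mathrm{false}_A(a_i) \vee \bigvee_{1\leqslant i\leqslant 3}\neg\mathrm{occ}(a_i,c)\Bigr)\Bigr)\Bigr).$$

This formula states that for every clause $c$ that is not satisfied by $A$, and any three literals, one of the following holds:

either two of the literals are equal,
or at least one of them is set by $A$,
or at least one of them does not occur in $c$.

Thus $F \models \varphi_{2\text{-CNF}}(\alpha)$ if and only if $F\lceil\alpha$ is in 2-CNF.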
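As a small illustration of the 2-CNF condition, the following Python sketch checks that every clause not satisfied by `A` keeps at most two literals. It assumes the same hypothetical encoding as before (nonzero integers as literals, `-a` as the complement, clauses as frozensets, `A` a consistent set of literals), and the function name is made up for this example.

```python
from typing import FrozenSet, Iterable, Set


def in_2cnf_after(F: Iterable[FrozenSet[int]], A: Set[int]) -> bool:
    """True iff every clause unsatisfied by A has at most two unset literals."""
    for c in F:
        if any(x in A for x in c):
            continue  # clause is satisfied by A and disappears
        unset = [x for x in c if -x not in A]  # literals not falsified by A
        if len(unset) > 2:
            return False  # three distinct unset literals occur in c
    return True
```

Since a clause is a set, its literals are pairwise distinct, so finding three surviving literals corresponds to falsifying every disjunct of the inner part of the formula.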

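The following $\Pi_2$-formula $\varphi_{\mathrm{HORN}}(A)$ states that $F$ restricted by $A$ is a Horn formula.

$$\forall c\,\Bigl(\mathrm{cl}(c) \rightarrow \mathrm{sat}_A(c) \vee \forall a,a'\,\bigl(\mathrm{lit}(a) \wedge \mathrm{lit}(a') \rightarrow (a=a' \vee A(a) \vee A(a') \vee \mathrm{false}_A(a) \vee \mathrm{false}_A(a') \vee \neg\mathrm{occ}(a,c) \vee \neg\mathrm{occ}(a',c) \vee \mathrm{neg}(a) \vee \mathrm{neg}(a'))\bigr)\Bigr).$$

This formula states that for every clause $c$ unsatisfied by $A$, and any two distinct literals that are not set by $A$ and occur in $c$, one of them is negative. The formula $\varphi_{\mathrm{CO\text{-}HORN}}(A)$ is defined symmetrically, with pos in place of neg.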
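The Horn condition amounts to saying that every unsatisfied clause keeps at most one positive literal. A minimal Python sketch, again under the illustrative assumption that literals are nonzero integers (positive integers for positive literals), clauses are frozensets, and `A` is a consistent set of literals, with hypothetical function names:

```python
from typing import FrozenSet, Iterable, Set


def in_horn_after(F: Iterable[FrozenSet[int]], A: Set[int]) -> bool:
    """True iff each clause unsatisfied by A has at most one unset positive literal."""
    for c in F:
        if any(x in A for x in c):
            continue  # satisfied clauses are removed by the restriction
        positives = [x for x in c if x > 0 and -x not in A]
        if len(positives) > 1:
            return False  # two distinct unset positive literals in c
    return True


def in_cohorn_after(F: Iterable[FrozenSet[int]], A: Set[int]) -> bool:
    """Symmetric check: at most one unset negative literal per unsatisfied clause."""
    for c in F:
        if any(x in A for x in c):
            continue
        negatives = [x for x in c if x < 0 and -x not in A]
        if len(negatives) > 1:
            return False
    return True
```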

A result of Nishimura et al. [10] characterizes the class of cluster formulas by excluded configurations: A formula is a cluster formula if it does not contain any of the following obstructions:

1. two clauses C and C′ that overlap, i.e., have a variable in common, but do not clash,
2. three clauses C1, C2 and C3 such that C1 and C2 clash, and C2 and C3 clash, but C1 and C3 do not clash.

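We define $\Sigma_1$-formulas stating that two clauses overlap or clash as:

$$\mathrm{overlap}(c_1,c_2) :\equiv \exists x\,\bigl(\mathrm{lit}(x) \wedge \neg A(x) \wedge \neg\mathrm{false}_A(x) \wedge \mathrm{varocc}(x,c_1) \wedge \mathrm{varocc}(x,c_2)\bigr),$$

$$\mathrm{clash}(c_1,c_2) :\equiv \exists x\,\bigl(\mathrm{lit}(x) \wedge \neg A(x) \wedge \neg\mathrm{false}_A(x) \wedge \mathrm{occ}(x,c_1) \wedge \exists y\,(\mathrm{comp}(x,y) \wedge \mathrm{occ}(y,c_2))\bigr).$$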

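With the help of these formulas, we define the $\Pi_2$-formula $\varphi_{\mathrm{CLU}}(A)$ as $\varphi_{\mathrm{CLU}}^1(A) \wedge \varphi_{\mathrm{CLU}}^2(A)$, where $\varphi_{\mathrm{CLU}}^1(A)$ states that the formula $F$ does not contain the first type of obstruction:

$$\forall c_1,c_2\,\bigl(\mathrm{cl}(c_1) \wedge \mathrm{cl}(c_2) \rightarrow \mathrm{sat}_A(c_1) \vee \mathrm{sat}_A(c_2) \vee \mathrm{eq}_A(c_1,c_2) \vee \neg\mathrm{overlap}(c_1,c_2) \vee \mathrm{clash}(c_1,c_2)\bigr)$$

and $\varphi_{\mathrm{CLU}}^2(A)$ states that $F$ does not have the second type of obstruction:

$$\forall c_1,c_2,c_3\,\Bigl(\bigwedge_{1\leqslant i\leqslant 3}\mathrm{cl}(c_i) \rightarrow \bigvee_{1\leqslant i\leqslant 3}\mathrm{sat}_A(c_i) \vee \bigvee_{1\leqslant i<j\leqslant 3}\mathrm{eq}_A(c_i,c_j) \vee \neg\mathrm{clash}(c_1,c_2) \vee \neg\mathrm{clash}(c_2,c_3) \vee \mathrm{clash}(c_1,c_3)\Bigr).$$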
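The two obstruction checks can be sketched in Python by working on the residual clauses. The encoding (nonzero integers as literals, `-a` as complement, frozensets as clauses, `A` a consistent set of literals) and all function names are assumptions made for this illustration only. Because the residual clauses contain only unset literals, the side conditions on the witness literal in the overlap and clash formulas hold automatically.

```python
from itertools import combinations, permutations
from typing import FrozenSet, Iterable, List, Set


def restrict(F: Iterable[FrozenSet[int]], A: Set[int]) -> List[FrozenSet[int]]:
    """Residual clauses under A, with clauses equal under A merged."""
    return list({
        frozenset(x for x in c if -x not in A)
        for c in F
        if not any(x in A for x in c)
    })


def overlap(c1: FrozenSet[int], c2: FrozenSet[int]) -> bool:
    """The clauses share a variable."""
    return bool({abs(x) for x in c1} & {abs(y) for y in c2})


def clash(c1: FrozenSet[int], c2: FrozenSet[int]) -> bool:
    """Some literal of c1 has its complement in c2."""
    return any(-x in c2 for x in c1)


def is_cluster_after(F: Iterable[FrozenSet[int]], A: Set[int]) -> bool:
    R = restrict(F, A)
    # Obstruction 1: two clauses that overlap but do not clash.
    for c1, c2 in combinations(R, 2):
        if overlap(c1, c2) and not clash(c1, c2):
            return False
    # Obstruction 2: clashing is not transitive on some triple.
    for c1, c2, c3 in permutations(R, 3):
        if clash(c1, c2) and clash(c2, c3) and not clash(c1, c3):
            return False
    return True
```

The sketch enumerates all ordered triples, which is cubic in the number of clauses. It is meant to mirror the shape of the two formulas, not to be efficient.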

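Finally, we define the $\Pi_2$-formula $\varphi_{1\text{-VAL}}(A)$ expressing that the formula $F$ restricted by $A$ is 1-valid. The formula $\varphi_{0\text{-VAL}}(A)$ is defined analogously.

We define the $\Sigma_1$-formula

$$\mathrm{cpos}(c) :\equiv \exists x\,(\mathrm{occ}(x,c) \wedge \neg\mathrm{false}_A(x) \wedge \mathrm{pos}(x))$$

expressing that clause $c$ contains a positive literal that is not falsified by $A$. Thus the following $\Pi_2$-formula

$$\varphi_{1\text{-VAL}}(A) :\equiv \forall c\,\bigl(\mathrm{cl}(c) \rightarrow \mathrm{sat}_A(c) \vee \mathrm{cpos}(c)\bigr)$$

states that $F$ restricted by $A$ is 1-valid. □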
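The 1-validity condition from the proof can also be illustrated by a short Python sketch. As an assumption for illustration only, literals are nonzero integers (positive integers for positive literals), clauses are frozensets, and `A` is a consistent set of literals. The function names are hypothetical.

```python
from typing import FrozenSet, Iterable, Set


def is_1valid_after(F: Iterable[FrozenSet[int]], A: Set[int]) -> bool:
    """True iff each clause is satisfied by A or keeps an unfalsified positive literal."""
    for c in F:
        if any(x in A for x in c):
            continue  # sat_A(c) holds
        if not any(x > 0 and -x not in A for x in c):
            return False  # cpos(c) fails
    return True


def is_0valid_after(F: Iterable[FrozenSet[int]], A: Set[int]) -> bool:
    """Analogous check with negative literals in place of positive ones."""
    for c in F:
        if any(x in A for x in c):
            continue
        if not any(x < 0 and -x not in A for x in c):
            return False
    return True
```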

6.1.Open Problems

We list some problems left open by this work.

Settle the parameterized complexity of the problem Strong CNF(2)-Backdoor Set in the restricted case when formulas do not contain tautological clauses, i.e., show the problem remains W[2]-hard in this case.
Determine the precise parameterized complexity of Weak C-Backdoor Set for tractable cases C other than 1-VAL and 0-VAL, possibly using the logical approach used in this paper.
Is the problem Strong C-Backdoor Set in W[2] for the tractable cases C=CNF(2) and C=CLU, for which we know it is W[2]-hard?

Acknowledgements

I thank Stefan Szeider, Sebastian Ordyniak and Ulrich Schöpp for useful discussions about the contents of the paper, and an anonymous referee whose comments helped to improve the presentation of the paper. The research leading to the results in this paper was initiated at Dagstuhl Seminar 12471 “SAT Interactions”.

References

[1]	S.A. Cook, The complexity of theorem-proving procedures, in: Proceedings of the 3rd Annual ACM Symposium on Theory of Computing, 1971, pp. 151–158.
[2]	Y. Crama, O. Ekin and P.L. Hammer, Variable and term removal from Boolean formulae, Discrete Applied Mathematics 75(3) (1997), 217–230. doi:10.1016/S0166-218X(96)00028-5.
[3]	R.G. Downey and M.R. Fellows, Fixed-parameter tractability and completeness, Congressus Numerantium 87 (1992), 161–187.
[4]	J. Flum and M. Grohe, Parameterized Complexity Theory, Texts in Theoretical Computer Science, Springer, 2006.
[5]	S. Gaspers and S. Szeider, Backdoors to satisfaction, in: The Multivariate Algorithmic Revolution and Beyond, H.L. Bodlaender, R. Downey, F.V. Fomin and D. Marx, eds, Lecture Notes in Computer Science, Vol. 7370, 2012, pp. 287–317. doi:10.1007/978-3-642-30891-8_15.
[6]	R. Impagliazzo, R. Paturi and F. Zane, Which problems have strongly exponential complexity?, Journal of Computer and System Sciences 63(4) (2001), 512–530. doi:10.1006/jcss.2001.1774.
[7]	K. Iwama, CNF-satisfiability test by counting and polynomial average time, SIAM Journal on Computing 18(2) (1989), 385–391. doi:10.1137/0218026.
[8]	J. Johannsen, Satisfiability problems complete for deterministic logarithmic space, in: 21st International Symposium on Theoretical Aspects of Computer Science (STACS 2004), V. Diekert and M. Habib, eds, Lecture Notes in Computer Science, Vol. 2996, 2004, pp. 317–325.
[9]	H. Kleine Büning and T. Lettmann, Propositional Logic: Deduction and Algorithms, Cambridge University Press, 1999.
[10]	N. Nishimura, P. Ragde and S. Szeider, Solving #SAT using vertex covers, Acta Informatica 44(7–8) (2007), 509–523. doi:10.1007/s00236-007-0056-x.
[11]	N. Nishimura, P. Ragde and S. Szeider, Detecting backdoor sets with respect to Horn and binary clauses, in: Proceedings of the 7th International Conference on Theory and Applications of Satisfiability Testing, 2004, pp. 96–103.
[12]	T.J. Schaefer, The complexity of satisfiability problems, in: Proceedings of the 10th ACM Symposium on Theory of Computing, 1978, pp. 216–226.
[13]	R. Williams, C. Gomes and B. Selman, Backdoors to typical case complexity, in: Proceedings of the 18th International Joint Conference on Artificial Intelligence, G. Gottlob and T. Walsh, eds, 2003, pp. 1173–1178.
