--- title: "introduction" output: rmarkdown::html_vignette vignette: > %\VignetteIndexEntry{introduction} %\VignetteEngine{knitr::rmarkdown} %\VignetteEncoding{UTF-8} --- ```{r, include = FALSE} knitr::opts_chunk$set( collapse = FALSE, comment = "#>" ) ``` ```{r setup} library(satire) ``` # satire `{satire}` is a package for defining Boolean satisfiability problems (i.e. SAT problems) and includes two pure R SAT solvers. A key feature of this package is that allows for the definition of the SAT problem by specifying the clauses of the Boolean formula as logical expressions. This can allow for a more natural definition of the problem and for remapping of the solution back to the original variable names. In addition, even though this style of SAT solver usually requires Boolean formula to be in [Conjunctive Normal Form(CNF)](https://en.wikipedia.org/wiki/Conjunctive_normal_form), `{satire}` will automatically translate non-CNF expressions to CNF using substitutions and applying [Tesytin transformation](https://en.wikipedia.org/wiki/Tseytin_transformation). ### What's in the box * `sat <- sat_new()` create a new problem object * `sat_add_literals()` add integer literals to a problem. * `sat_add_exprs()` add Boolean expressions to a problem. If expressions are not in CNF (conjunctive normal form), then substitution rules and Tseytin transformation will be applied to convert them to CNF. * `sat_card_*()` add cardinality constraints to a problem. * `sat_card_exactly_one()` * `sat_card_atleast_one()` * `sat_card_atmost_one()` * `sat_card_exactly_k()` * `sat_card_atleast_k()` * `sat_card_atmost_k()` * Report the current state of the problem * `sat$dimacs` * `sat$literals` * `sat$names` * `sat$exprs` * `read_dimacs()` read in a problem from a standard DIMACs-formatted file * SAT solvers * `sat_solve_naive()` is a naive solver suitable for problems with ~20 variables * `sat_solve_dpll()` implements basic [DPLL](https://en.wikipedia.org/wiki/DPLL_algorithm) solving techniques. This solver is suitable for use in larger problems, but is written in base R and is not fast. ## SAT Solvers The two SAT solvers included in this package (`sat_solve_naive()` & `sat_solve_dpll()`) are written in base R and are really for demonstration and education purposes. They are only suitable for smaller/easier problems as they are comparatively quite slow. For larger problems, either export to DIMACS format and use another solver of your choice, or try the following companion R packages: * [picosatire](https://github.com/coolbutuseless/picosatire) a wrapper for the C SAT solver library PicoSAT * [kissatire](https://github.com/coolbutuseless/kissatire) a wrapper for the C SAT solver library Kissat (not compatible with windows) ### SAT Example: Finding all possible work teams This is a demonstration of how `{satire}` could be used to formulate and solve a very simple SAT problem. #### Problem statement A, B, C, D are employed by the same company. The company needs to find all combinations of these employees who can work any given shift given the following constraints: * Exactly 2 people need to work a shift * B and D are not allowed to work together * C must always work with A or D (in order to be trained) ```{r} #~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ # Create a SAT problem #~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ sat <- sat_new() #~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ # Add constraints #~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ sat_card_exactly_k(sat, c('a', 'b', 'c', 'd'), 2) # Exactly two people working sat_add_exprs(sat, "!b | !d") # B and D not together sat_add_exprs(sat, "c -> (a | d)") # C must work with B or D sat #~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ # This is a small problem (only 16 variables) so can use the naive solver #~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ solns <- sat_solve_naive(sat) #~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ # The solutions are returned as a data.frame. # - column names correspond to variable names in the problem. # - each row is a solution #~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ solns #~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ # Extract the names of each pair that can work together #~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ apply(solns, 1, function(x) names(solns)[which(x)], simplify = FALSE) ``` ### Tools for defining a problem Problems can be defined in two general ways: * constructing the problem as an accumulation of Boolean expressions * constructing the problem as a sequence of literal integers All of the following are equivalent definitions of a single problem ```{r eval = TRUE} # Sequential adding of single expressions sat <- sat_new() sat_add_exprs(sat, "!a | b") sat_add_exprs(sat, "!b | c") sat_add_exprs(sat, "!c | a") # Adding a vector of expressions sat <- sat_new() sat_add_exprs(sat, c("!a | b", "!b | c", "!c | a")) # Adding a compound expression sat <- sat_new() sat_add_exprs(sat, c("(!a | b) & (!b | c) & (!c | a)")) # Using integer literals # 1 = a # -1 = !a # 2 = b # -2 = !b # 3 = c # -3 = !c sat <- sat_new() sat_add_literals(sat, c(-1, 2, 0)) sat_add_literals(sat, c(-2, 3, 0)) sat_add_literals(sat, c(-3, 1, 0)) # Using integer literals, trailing 0 is optional sat <- sat_new() sat_add_literals(sat, c(-1, 2)) sat_add_literals(sat, c(-2, 3)) sat_add_literals(sat, c(-3, 1)) # Using literals all in a single vector. Zeros required to separate clauses sat <- sat_new() sat_add_literals(sat, c(-1, 2, 0, -2, 3, 0, -3, 1, 0)) ``` ### Cardinality helpers A common constraint on SAT problems is **cardinality** i.e. how many different variables can be simultaneously true. `{satire}` includes the following helpers to add cardinality constraints to a SAT problem: * `sat_card_exactly_one()` * `sat_card_atleast_one()` * `sat_card_atmost_one()` * `sat_card_exactly_k()` * `sat_card_atleast_k()` * `sat_card_atmost_k()` The encoding for *at most one* can be done via *pairwise* encoding (a.k.a binominal encoding) or using *commander* variables (after Klieber & Kwon). Coding for *at least k*, *at most k* and *exactly k* uses Sinz's formulation of LTSeq. As an example, the following SAT problem finds combinations from the first 8 letters of the alphabet where exactly 3 are marked as TRUE. The solution is a data frame where the columns correspond to the variables in the SAT problem, and each row represents a solution. ```{r eval=TRUE} sat <- sat_new() sat_card_exactly_k(sat, letters[1:8], 3) sat sat_solve_dpll(sat, max_solutions = 4) ``` ## Unsatisfiable problem example A problem which is unsatisfiable will generate a `NULL` with the solvers included in this package (`sat_solve_naive()` and `sat_solve_dpll()`). Below, this unsatisiable problem wants `a` to be simultaneously `TRUE` and `FALSE` - an impossibility. ```{r} sat <- sat_new() sat_add_exprs(sat, "a") sat_add_exprs(sat, "!a") sat_solve_naive(sat) # Unsatisfiable sat_solve_dpll(sat) # Unsatisfiable ``` ## Solving a problem from a DIMACs file ```{r eval = FALSE} sat <- read_dimacs("inst/zebra_v155_c1135.cnf") sat sat_solve_dpll(sat) ```