#+BEGIN_abstract If you have a taste for NP-completeness, Sudoku, or literate programming, then this one's for you. #+END_abstract #+TITLE: Making and Slaying Monster Sudoku #+DATE: 2020-05-05 #+FILETAGS: sudoku:np-complete:backtracking:search #+PROPERTY: header-args :noweb no-export :noweb-sep "\n" :session :eval no-export :noweb-sep "\n\n\n" :mkdirp yes :comments link /All of the code described in this article is available [[https://github.com/reindeereffect/reindeereffect.github.io/tree/master/2020/05/05][here]]./ * 0xdeadbeef :noexport: ** todo - ** code #+NAME: install.sh #+BEGIN_SRC shell :exports none :results none :tangle install.sh :shebang "#! /bin/bash" ./setup.py sdist virtualenv -p `which python3` $HOME/test . $HOME/test/bin/activate pip install dist/sudoku* mkdir -p images #+END_SRC #+NAME: sdtx #+BEGIN_SRC shell :exports none :results output export PATH=$HOME/test/bin:$PATH function sudoset() { out=images/$1; shift sudoku2img -- $@ > $out echo -n $out } #+END_SRC #+RESULTS: sdtx * Introduction Quarantine is in full swing, and after watching a disturbing amount of TV, you plowed through a book of Sudoku puzzles that turned up in the basement. Now they're gone, and you're looking for something more. Bigger. Monstrous, you might say. Unfortunately, the giant Sudoku puzzles (think 16\times16 or 25\times25) are not quite so easy to find in any quantity. It turns out, though, that with a little effort, you can make machines churn them out while you sleep. In this article we will - consider the structure of the Sudoku board; - use that structure to capture, in a general way, common heuristics for solving Sudoku puzzles; - employ both constraint propagation and backtracking to create a suitably fast Sudoku solver; and - use that solver as building block for generating Sudoku puzzles of any size we might desire. By the end, we'll have implemented a library for dealing with Sudoku, as well as a small collection of command line tools for generating, solving, and formatting puzzles. * The Board Before endeavoring to operate on Sudoku boards, we should first pin down some useful representations, as well as conversions between them. ** Board Divisions The classic Sudoku board consists of a 9\times9 grid of cells, with a 3\times3 grid of boxes, each containing a 3\times3 grid of cells, overlaid: #+BEGIN_SRC shell :results file :exports results <> seq 0 80 | sudoset cells.png #+END_SRC #+RESULTS: [[file:images/cells.png]] where the cells have been numbered in row-major order. Each cell is a member of a set of three divisions, namely - rows: #+BEGIN_SRC shell :results file :exports results <> for i in {0..8}; do for j in {0..8}; do echo $i; done done | sudoset row-divs.png #+END_SRC #+RESULTS: [[file:images/row-divs.png]] - columns: #+BEGIN_SRC shell :results file :exports results <> for i in {0..8}; do for j in {0..8}; do echo $j; done done | sudoset col-divs.png #+END_SRC #+RESULTS: [[file:images/col-divs.png]] - boxes: #+BEGIN_SRC shell :results file :exports results <> for i in {1..9}; do for j in {1..9}; do echo -n "$(( ($i-1)/3 * 3 + ($j-1)/3 )) " done echo done | sudoset box-divs.png #+END_SRC #+RESULTS: [[file:images/box-divs.png]] Of these, the most interesting division of the board is into boxes. The board board can be viewed as a 3\times3 grid of boxes, each a 3\times3 grid of cells, resulting in a grid of 3^4 cells arranged into 3^2 rows and 3^2 columns, for a grand total of 3\times3^2 distinct divisions. When the board is complete, each cell will contain a number from 1 to 9 (or 3^2) inclusive. Every interesting dimension is describable in terms of powers of 3, which we can think of as the /order/ of a standard board. We can now think of Sudoku (or Sudoku-like) boards a bit more generally. Given a board of order $\omega$, for any cell $c\in [1, \omega^4]$, we can find the row number $I$ as $$I(c) = \left\lfloor\frac{c}{\omega^2}\right\rfloor$$ the column number $J$ as $$J(c) = c\bmod \omega^2$$ and the box number $B$ as $$B(c) = \omega\times\left\lfloor\frac{I(c)}{\omega}\right\rfloor + \left\lfloor\frac{J(c)}{\omega}\right\rfloor.$$ This generalization allows us to handle larger boards with ease. For simplicity, the discussion and examples below will center on conventional order 3 boards unless otherwise specified; even so, the principles remain the same for other orders. If we sequentially number divisions like | division | start | end | |----------+-------------+-----------------| | rows | 0 | $\omega^2 - 1$ | | columns | $\omega^2$ | $2\omega^2 - 1$ | | boxes | $2\omega^2$ | $3\omega^2 - 1$ | then we can write a function to compute a mapping from cells to divisions, as well as an inverse mapping from divisions to cells: #+NAME: functions #+BEGIN_SRC python :results none def board_divs(order): ''' generates a dictionary (cell2divs) mapping cells to their various divisions in boards of the given order. Also generates a complementary mapping, div2cells. Returns (cell2divs, div2cells). ''' n = order**2 box = lambda i, j: i//order * order + j//order cell2divs = dict(enumerate({i, n + j, 2*n + box(i, j)} for i in range(n) for j in range(n))) return cell2divs, transpose(cell2divs) #+END_SRC where #+NAME: functions #+BEGIN_SRC python :results none def transpose(m): ''' given a binary matrix represented as a dictionary whose values are sets, and where a 1 at (i,j) is indicated by j in m[i] return the transpose of m. ''' t = {} for i, js in m.items(): for j in js: t.setdefault(j, set()).add(i) return t #+END_SRC Besides allowing more concise expression of algorithms operating on Sudoku boards, thinking in terms of cells and divisions opens the door to adapting some of what we develop here to Sudoku variants featuring irregularly-shaped divisions (like [[http://www.dailysudoku.com/sudoku/archive.shtml?type=squiggly][squiggly Sudoku]]). ** Logical Representation We'll need a convenient representation of the board state at any given time, as well as a ways to sensibly change that state. For that, we'll define a simple class: #+NAME: data types #+BEGIN_SRC python :results none class board: 'Utility class for representing and tracking board state.' <> <> <> #+END_SRC Each cell is either known or unknown. For the known cells, we need only track their values. For the unknown cells, however, we need to either track or compute the values that they may possibly take. Since the requirements for the two cell classes are different, we handle them separately. #+NAME: board initialization #+BEGIN_SRC python :results none def __init__(self, known, unknown, cell2divs, div2cells): ''' known dictionary mapping known cells to their respective values unknown dictionary mapping unknown cells to sets of possible values cell2divs, div2cells complementary mappings describing the board structure, such as those produced by board_divs ''' assert not set(known) & set(unknown) self.known = known self.unknown = unknown self.cell2divs = cell2divs self.div2cells = div2cells #+END_SRC Solving a Sudoku involves repeatedly /marking/ the board until no empty cells remain, subject to the constraint that each division contains one each of the numbers from 1 to 9 inclusive. With each marking, we assert knowledge about a previously unknown cell, and the possible values that can be taken by unknown cells sharing a division become more constrained. To track this, #+NAME: cell marking #+BEGIN_SRC python :results none def mark(self, cell, val): 'set cell to val, updating unknowns as necessary' self.known[cell] = val self.unknown.pop(cell, None) for div in self.cell2divs[cell]: for cell2 in self.div2cells[div]: self.elim(cell2, val) def elim(self, cell, val): "remove val from cell's possibilities" self.unknown.get(cell, set()).discard(val) #+END_SRC This is the basic mechanism of /constraint propagation/ that ultimately allows us to develop usefully fast solution techniques. For brevity, whenever we speak of marking a cell, we'll assume that the possibilities for other cells are updated as necessary, too. Sometimes we may not know that a given marking will work out---perhaps we're guessing---so we should support marking cells speculatively and recovering when we realize how wrong we are. The simplest method is to mark a copy of the current board state: #+NAME: cell marking #+BEGIN_SRC python :results none def marked(self, cell, val): 'returns a new board, with cell marked as val and possibilities eliminated' new = self.copy() new.mark(cell, val) return new #+END_SRC #+NAME: imports #+BEGIN_SRC python :results none import copy #+END_SRC #+NAME: copying #+BEGIN_SRC python :results none def copy(self): 'copies board' return self.__class__(copy.deepcopy(self.known), copy.deepcopy(self.unknown), self.cell2divs, self.div2cells) #+END_SRC ** Textual Representation Humans hardly want to look at Python dictionaries when there are better representations available, so let's work out a textual representation for our boards, and let's make it flexible enough to handle boards of any order. *** Converting from Strings We'll impose the following requirements on strings that represent Sudoku boards of any order $\omega$: - Each cell will be represented by an integer (if known) or a '.' (if unknown). - The number of cells must be $\omega^4$, where $\omega$ is some integer. - Cells can be separated by any other character. - Values for known cells must be in $[1, \omega^2]$. These rules will allow us to handle #+BEGIN_EXAMPLE 1 3 | . . . . | 3 1 ----+---- 3 1 | . . . 2 | 1 3 #+END_EXAMPLE as easily as #+BEGIN_EXAMPLE 1 3 . . . . 3 1 3 1 . . . 2 1 3 #+END_EXAMPLE or #+BEGIN_EXAMPLE 1 3 . . . . 3 1 3 1 . . . 2 1 3 #+END_EXAMPLE They also allow us to compute the order directly from the number of cells. #+NAME: functions #+BEGIN_SRC python :results none def load_board(s, validate_vals=True): ''' given a string representing a board, returns a board object. For a board of a given order: - Order is computed as the fourth root of board length, and it must be an integer. - Each cell must be represented by an integer in [1, order**2] inclusive, or `.' to denote unknown cells. This check can be disabled by setting validate_vals to False. - Cells must be separated from each other by any sequences of characters in /[^0-9.]+/. On failure, raises ValueError. ''' vals = [cell for cell in ''.join(c if c in '0123456789.' else ' ' for c in s).strip().split() if cell.isdigit() or cell == '.'] order = int(len(vals) ** 0.25) n = order**2 if len(vals) != order**4: raise ValueError bd = blank(order) for (cell, val_) in enumerate(vals): if val_ == '.': continue val = int(val_) if validate_vals and (val < 1 or val > n): raise ValueError bd.mark(cell, val) return bd #+END_SRC where #+NAME: functions #+BEGIN_SRC python :results none def blank(order): 'generate a blank board' n = order**2 possible_vals = set(range(1, n + 1)) return board({}, {i:set(possible_vals) for i in range(n**2)}, ,*board_divs(order)) #+END_SRC It would also be good know whether a board brought in from the outside world is indeed valid, in the sense of having no conflicting cell values in any division. #+NAME: functions #+BEGIN_SRC python :results none def isvalid(bd): ''' returns True if - no known cells' values conflict - no unknown cell's possibilities conflict with any known cell's value ''' return not any(val0 in {bd.known.get(cell)} | bd.unknown.get(cell, set()) for (cell0, val0) in bd.known.items() for cell in neighbors(bd, cell0) if cell in bd.known and cell != cell0) def neighbors(bd, cell0): return union(bd.div2cells[div] for div in bd.cell2divs[cell0]) def union(xss): return {x for xs in xss for x in xs} #+END_SRC *** Converting to Strings Once we've solved a puzzle or otherwise modified a board, we'd like to get a readable representation back out. Given that there are further use cases for a completed Sudoku board, like deriving Sudoku puzzles of varying difficulty, it should be loadable via =load_board=, like: #+BEGIN_EXAMPLE 8 3 7 | 1 2 6 | 9 5 4 9 5 4 | 3 8 7 | 1 6 2 2 1 6 | 4 5 9 | 3 7 8 ------+-------+------ 7 . 9 | . 4 5 | 8 1 3 3 4 5 | 9 1 8 | 6 2 7 1 . 8 | . 7 3 | 4 9 5 ------+-------+------ 4 8 1 | 5 6 2 | 7 . 9 5 9 3 | 7 . 1 | 2 8 6 6 7 2 | 8 9 4 | 5 3 1 #+END_EXAMPLE #+NAME: functions #+BEGIN_SRC python :results none def dump_board(bd): 'returns a "pretty printed" string representation of board bd' order = int((len(bd.known) + len(bd.unknown)) ** 0.25) n = order**2 svals = [str(bd.known[i] if i in bd.known else '.') for i in range(n**2)] width = max(map(len, svals)) fmt = lambda cell: ('%%%ds' % width) % cell n_x_n = [svals[i*n : i*n + n] for i in range(n)] cols_grpd = [' | '.join(' '.join(map(fmt, row[j*order : j*order + order])) for j in range(order)) for row in n_x_n] rows_grpd = ['\n'.join(cols_grpd[i*order : i*order + order]) for i in range(order)] rule = '\n' + ''.join('+' if c == '|' else '-' for c in cols_grpd[0]) + '\n' return rule.join(rows_grpd) #+END_SRC * Solving Sudoku Having a suitable representation of the board state, we can now work out how to solve a Sudoku puzzle. All of the techniques discussed here rely on the constraint propagation that [[cell marking][=board.mark=]] performs automatically. ** Deductive Techniques Consider how a human might approach a grid like #+BEGIN_SRC shell :results file :exports results <> sudoset ex-1-1.png <> sudoset ex-1-2.png -p 43 <> sudoset ex-1-4.png -p 42 <> sudoset ex-1-5.png -p 10 53 <> sudoset ex-1-6.png -p 17 <> sudoset ex-1-7.png -p <> sudoset ex-1-8.png -p 18 < yields #+BEGIN_SRC shell :results file :exports results <> sudoset ex-1-soln.png < 1: break return nsolns == 1 #+END_SRC and #+NAME: functions #+BEGIN_SRC python :results none def marked_up(order, *marks): ''' returns a new board of the given order, with the given marks, (cell, val) pairs, applied ''' bd = blank(order) for mark in marks: bd.mark(*mark) return bd #+END_SRC However, the naive procedure's performance degrades rapidly with increasing order---checking a board's propriety requires solving it, and =solve='s complexity grows exponentially with the number of unknown cells. A few measures can salvage this situation: - We can safely mask out any cell that can be deduced based on the currently known cells. - Checking whether masking out a given cell would result in proper board requires attempting to solve the board resulting from masking the cell. We can constrain the solver to only generate solutions within a certain number of guesses. Doing so allows a faster, though weaker, check for propriety. It also provides us a means of limiting the end result's difficulty. - The solver chooses from the unknown cells with the fewest possible values, i.e., it attempts to minimize the branching factor. We can limit unknown cells only to those that the solver would choose among. We can estimate difficulty by multiplying the number of possibilities for each cell we mask; this represents the total number of choices that a perfect player would face. The generation procedure we'll actually use is #+NAME: functions #+BEGIN_SRC python :results none def generate_from(soln, minbranch=False, maxguesses=inf): ''' Generate a board for which soln is a solution, within at most maxguesses guesses. If set, minbranch restricts unknown cells to those that - can be easily deduced or - are among those with the fewest possible values. If maxguesses < inf, the generated board is guaranteed to be solvable within the prescribed number of guesses, but is not guaranteed to have only one solution. Returns (bd, difficulty) where bd is the generated board and difficulty is a difficulty estimate. ''' known = soln.known.copy() order = int(len(known) ** 0.25) clues = {} new = lambda: marked_up(order, *known.items(), *clues.items()) minunks = lambda bd: min(map(len, bd.unknown.values())) guesses = 0 difficulty = 1 while known: cell = random.choice(list(known)) val = known.pop(cell) bd2 = new() mark_forced(bd2) if cell in bd2.known: pass elif (guesses >= maxguesses or minbranch and len(bd2.unknown[cell]) > minunks(bd2) or not isproper(bd2, maxguesses=maxguesses, clue=(cell, val))): clues[cell] = val else: difficulty *= len(bd2.unknown[cell]) guesses += 1 return new(), difficulty #+END_SRC We know that marking the masked cell with the value it previously had will ultimately result in a solution; exploiting that knowledge when testing a board derived from a board known to be proper, #+NAME: functions #+BEGIN_SRC python :results none def isproper(bd, maxguesses=inf, clue=None): 'bd has exactly one solution within maxguesses guesses' nsolns = 0 if clue: cell0, val0 = clue nsolns += 1 for val in bd.unknown[cell0] - {val0}: for soln in solve(bd.marked(cell0, val), maxguesses): nsolns += 1 if nsolns > 1: return False else: for soln in solve(bd, maxguesses): nsolns += 1 if nsolns > 1: return False return nsolns == 1 #+END_SRC We can now create puzzles of various sizes; for example, order 2: #+NAME: order #+BEGIN_SRC python :results value :var order=2 :exports none from sudoku import * dump_board(generate_from(next(solve(blank(order))))[0]) #+END_SRC #+BEGIN_SRC shell :noweb yes :results file :exports results <> sudoset order2.png <> EOF #+END_SRC #+RESULTS: [[file:images/order2.png]] order 3: #+BEGIN_SRC shell :noweb yes :results file :exports results <> sudoset order3.png <> EOF #+END_SRC #+RESULTS: [[file:images/order3.png]] and order 4: #+BEGIN_SRC shell :noweb yes :results file :exports results <> sudoset order4.png <> EOF #+END_SRC #+RESULTS: [[file:images/order4.png]] * Utility Library Before going any further, let's package what we have so far into a library: #+NAME: sudoku/__init__.py #+BEGIN_SRC python :results none :tangle sudoku/__init__.py :shebang "#! /usr/bin/env python3\n" 'useful utilities for manipulating Sudoku puzzles' <> <> <> #+END_SRC The finished product is [[./sudoku/__init__.py]]. * Command Line Tools Having a library encapsulating the bulk of what we might wish to do, let's make it more operationally useful by creating a series of tools that we can use from a command line or shell script. #+NAME: common #+BEGIN_SRC python :results none import sys def usage(): return __doc__.lstrip() % sys.argv[0] if __name__ == '__main__': if set(sys.argv) & {'-h', '--help'}: sys.exit(usage()) else: main(sys.argv[1:]) #+END_SRC ** The Solver The solver should read a board, as defined by =load_board=, from either a file or standard input, and emit all the solutions to standard output. The overall program structure should look something like #+NAME: bin/sudoku #+BEGIN_SRC python :results none :tangle bin/sudoku :shebang "#! /usr/bin/env python3\n" <> <> <> <> #+END_SRC where #+NAME: solver usage #+BEGIN_SRC python :results none ''' Usage: %s [FILE] Find all solutions for a Sudoku puzzle. Options: -h, --help print this help and exit If FILE is omitted or `-', then the initial board is read from stdin. The input board should consist of a series of cells, each either a positive integer or a `.' to denote an unknown value, separated by any characters not in /[0-9.]/. The order of the board is automatically detected as the fourth root of the number of cells, and it must be an integer. The numerical values are constrained from 1 to order**2 inclusive. The solutions will always be ``pretty-printed'', e.g., solution 1: 4 2 7 | 1 3 6 | 5 8 9 6 5 1 | 9 2 8 | 4 7 3 3 8 9 | 5 4 7 | 1 6 2 ------+-------+------ 2 3 5 | 8 1 9 | 7 4 6 9 6 8 | 3 7 4 | 2 1 5 7 1 4 | 2 6 5 | 9 3 8 ------+-------+------ 8 9 6 | 7 5 1 | 3 2 4 1 4 3 | 6 9 2 | 8 5 7 5 7 2 | 4 8 3 | 6 9 1 solution 2: ... It is the case that a ``proper'' Sudoku can have only one solution; however, ``improper'' Sudoku puzzles do exist. ''' #+END_SRC #+NAME: solver imports #+BEGIN_SRC python :results none import sys import sudoku as sd #+END_SRC #+NAME: solver functions #+BEGIN_SRC python :results none def main(argv): fn = argv[0] if argv else '-' try: bd = sd.load_board((sys.stdin if fn == '-' else open(fn)).read()) except ValueError: sys.exit('ill-formed board') for (i, soln) in enumerate(sd.solve(bd), start=1): assert sd.isvalid(soln) and sd.issolved(soln) print('solution %s:' % i) print(sd.dump_board(soln)) print() #+END_SRC to give our [[file:bin/sudoku][finished Sudoku solver]]. ** The Generator The overall structure for the generator is much like that of the solver: #+NAME: bin/sudokugen #+BEGIN_SRC python :results none :tangle bin/sudokugen :shebang "#! /usr/bin/env python3" <> <> <> <> #+END_SRC where #+NAME: generator usage #+BEGIN_SRC python :results none ''' Usage: %s [-o ORDER] [-g MAXGUESSES] [-m] Generate a Sudoku puzzle. Options: -h, --help print this help and exit -g MAXGUESSES when testing potential clues, restrict solver to a depth of MAXGUESSES -m only remove cells that can be deduced or have that might be among the best candidates If the computed puzzle is not proper (i.e., has exactly one solution), exits with nonzero status. ''' #+END_SRC #+NAME: generator imports #+BEGIN_SRC python :results none import getopt from math import inf import sudoku as sd #+END_SRC #+NAME: generator functions #+BEGIN_SRC python :results none def main(argv): opts_, args = getopt.gnu_getopt(argv, 'g:mo:') opts = dict(opts_) order = int(opts.get('-o', 3)) maxguesses = int(opts['-g']) if '-g' in opts else inf minbranch = '-m' in opts soln = next(sd.solve(sd.blank(order))) bd, difficulty = sd.generate_from(soln, minbranch=minbranch, maxguesses=maxguesses) proper = sd.isproper(bd) print('difficulty:', difficulty) print('proper:', proper) print() print(sd.dump_board(bd)) print() print('> ' + sd.dump_board(soln).replace('\n', '\n> ')) if not proper: exit(1) #+END_SRC ** The Formatter Having the means to both generate and solve Sudoku puzzles, the next thing is to nicely present them. We'll generate Latex source code as an intermediate form, leaning on a custom Latex package for setting boards. Finally, we tie things together with a convenience script that orchestrates conversion from readable boards to transparent PNGs, like the figures in this article. What follows depends on Latex and ImageMagick. *** Conversion to Latex The overal structure of the Latex converter is #+NAME: bin/sudoku2tex #+BEGIN_SRC python :results none :tangle bin/sudoku2tex :shebang "#! /usr/bin/env python3" <> <> <> <> #+END_SRC where the usage statement is #+NAME: formatter usage #+BEGIN_SRC python :results none ''' Usage: %s [OPTIONS] [HIGHLIGHT]... Given a Sudoku board, generate Latex source code. Options: x -h, --help print this help and exit -p print pencil marks for all unknown cells Cells are numbered sequentially from 0 in row-major order. Each HIGHLIGHT indicates a cell whose value (or pencil marks) will have its value surrounded by a red box; HIGHLIGHTs and any cell sharing a possible value with a HIGHLIGHT will have their possibilities set in red. In the absence of the -p option, only cells sharing a division with a HIGHLIGHT will be pencil marked. Used separately, the code generated by this program requires the sudokuii Latex package, included in the source repository (as latex/sudokuii.sty). ''' #+END_SRC The Latex environment we'll use expects as input something like #+BEGIN_SRC latex :eval never \begin{sudoku}[2] |1|2|3|4|. |1|2|3|4|. |1|2|3|4|. |1|2|3|4|. \end{sudoku} #+END_SRC The individual cells can contain more complex items than numbers, provided they're suitably wrapped. Generating the =sudoku= environment falls to #+NAME: formatter functions #+BEGIN_SRC python :results none def sudoku_env(bd, pencil_marks, special): ncells = len(bd.known) + len(bd.unknown) order = int(ncells**0.25) n = order**2 cells = [str(bd.known.get(i, ' ')) for i in range(ncells)] if pencil_marks: apply_pencils(bd, cells, order) reds = set() redboxes = set() for cell in special: dr, drb = highlight(cell, bd, cells, order) reds |= dr redboxes |= drb cells_fmtd = fmt_cells(cells, bd, reds, redboxes) grid = form_body(cells_fmtd, n) sudokusize = n/9 * (17 if pencil_marks or redboxes else 12) unitlength = sudokusize / n fboxsep = {2: 2, 3: 7, 4: 9}.get(order, 9) / 4 / n return f''' \\setlength\\sudokusize{{{sudokusize}cm}} \\setlength\\unitlength{{{1/n}\\sudokusize}} \\setlength\\fboxsep{{-{fboxsep}\\unitlength}} \\renewcommand\\sudokuformat[1]{{\\Huge\\sffamily#1}} \\begin{{sudoku}}[{order}] {grid} \\end{{sudoku}} ''' def form_body(cells, n): rows = [cells[i*n : (i + 1) * n] for i in range(n)] lines = ['|%s|.' % '|'.join(row) for row in rows] return '\n'.join(lines) #+END_SRC The calculations for =sudokusize= and =fboxsep= are the product of considerable trial and error to determine what would look decent/reasonable/not terrible over a range of board sizes. Pencil marks should be formed in a square array containing just the values of interest and little else. In practice, we have to add some blank rows and columns to give more favorable placement in the cells. #+NAME: formatter functions #+BEGIN_SRC python :results none def pencils(possible, order): vals = [str(val) if val in possible else '.' for val in range(1, 1 + order**2)] coldesc = 'c' + 'c' * order grid = ' \\\\\n'.join(' & '.join(map(str, ['\\ \\ '] + vals[order*i : order*(i + 1)])) for i in range(order)) return f''' \\resizebox{{\\unitlength}}{{.6\\unitlength}}{{ \\begin{{tabular}}{{{coldesc}}} \\ \\\\ {grid} \\\\ \\ \\\\ \\end{{tabular}} }} ''' def apply_pencils(bd, cells, order): for (unk, vals) in bd.unknown.items(): cells[unk] = pencils(vals, order) #+END_SRC We wish to call out cells of interest, and we also want to indicate how constraints might propagate: #+NAME: formatter functions #+BEGIN_SRC python :results none def highlight(cell0, bd, cells, order): reds = set() redboxes = {cell0} for div in bd.cell2divs[cell0]: for cell in bd.div2cells[div] - set(bd.known): cells[cell] = pencils(bd.unknown[cell], order) if bd.unknown[cell0] & bd.unknown[cell]: reds.add(cell) return reds, redboxes #+END_SRC Once the pencil marks and highlights have been computed, we can format each cell to show pencil marks, highlighted cells, and the possible effects of constraint propagation: #+NAME: formatter functions #+BEGIN_SRC python :results none def fmt_cells(cells, bd, reds, redboxes): red = lambda s: '{\\color{red}%s}' % s redboxed = lambda s: '{\\color{red}\\fbox{%s}}' % s black = lambda s: '{\\color{black}%s}' % s return [redboxed(cell) if i in redboxes else red(cell) if i in reds else black(cell) for (i, cell) in enumerate(cells)] #+END_SRC With the formatting machinery out of the way, #+NAME: formatter imports #+BEGIN_SRC python :results none import getopt import sys import sudoku as sd #+END_SRC #+NAME: formatter functions #+BEGIN_SRC python :results none def main(argv): try: opts_, args = getopt.gnu_getopt(argv, 'hp') special = {int(cell) for cell in args} except getopt.GetoptError: sys.exit(usage()) except ValueError: sys.exit(usage()) opts = dict(opts_) pencil_marks = '-p' in opts try: bd = sd.load_board(sys.stdin.read(), validate_vals=False) except ValueError: sys.exit('ill-formed board') not_special = set(special) & set(bd.known) if not_special: print("Won't hightlight known cells", not_special, file=sys.stderr) exit(1) print(sudoku_env(bd, pencil_marks, special)) #+END_SRC Since we're not attempting to generate solutions, it is not critical that input boards be restricted in their cell values. Setting =validate_vals= to =False= gives the flexibility needed for such things as illustrations of the division memberships. *** The Latex Package Latex has had for years a package for formatting Sudoku boards, but it focuses purely on the classic 9\times9 grid. To get around this, we can create a package of our own that redefines the =sudoku= environment to deal with boards of any order. #+NAME: latex sudoku definitions #+BEGIN_SRC latex :results none \renewenvironment{sudoku}[1][3]{ \newcount\order \order = #1 \newcount\n \n = \numexpr(#1*#1) \FPeval{\sudodelta}{1/#1/#1} \renewenvironment{sudoku-block}{ \catcode`\|=\active \@sudoku@activate \setcounter{@sudoku@col}{-1} \setcounter{@sudoku@row}{\numexpr(\n-1)} \setlength\unitlength{\sudodelta\sudokusize} \begin{picture}(\n,\n) \@sudoku@grid\@sudoku@grab@arguments }{ \end{picture} } \renewcommand*\@sudoku@grid{ \linethickness{\sudokuthinline} \multiput(0,0)(1,0){\numexpr(\n+1)}{\line(0,1){\n}} \multiput(0,0)(0,1){\numexpr(\n+1)}{\line(1,0){\n}} \linethickness{\sudokuthickline} \multiput(0,0)(\order,0){\numexpr(\order+1)}{\line(0,1){\n}} \multiput(0,0)(0,\order){\numexpr(\order+1)}{\line(1,0){\n}} \linethickness{0.5\sudokuthickline} \put(0,0){\framebox(0,0){}} \put(\n,0){\framebox(0,0){}} \put(0,\n){\framebox(0,0){}} \put(\n,\n){\framebox(0,0){}}} \begin{center} \begin{sudoku-block} }{ \end{sudoku-block} \end{center} } #+END_SRC The original =\@sudoku@grab@arguments= also presumes too much about its input, which becomes a problem for boards of order 2. #+NAME: latex sudoku definitions #+BEGIN_SRC latex :results none \def\@sudoku@grab@arguments#1.{ \scantokens{#1.}} #+END_SRC Now we can assemble these with a bit of boilerplate and dependency information to form the [[file:latex/sudokuii.sty][finished Latex package]]. #+NAME: latex/sudokuii.sty #+BEGIN_SRC latex :results none :tangle latex/sudokuii.sty \NeedsTeXFormat{LaTeX2e}[1999/12/01] \ProvidesPackage{sudokuii}[2020/04/18 Big Sudoku] \RequirePackage{sudoku} \RequirePackage{fp} <> \endinput #+END_SRC *** Converting Boards to Images We can streamline board formatting a bit more. The output of =sudoku2tex= is meant to be combined with =sudokuii.sty= in a Latex document, which would then be converted to some convenient format. Let's assume that that format will be transparent PNG. The overall structure of the image converter will be #+NAME: bin/sudoku2img #+BEGIN_SRC shell :results none :tangle bin/sudoku2img :shebang "#! /bin/bash" <> <> <> #+END_SRC with the following usage: #+NAME: image converter functions #+BEGIN_SRC shell :results none function usage() { cat </dev/null { cat <<'EOF' > sudokuii.sty <> EOF pdflatex --jobname tmp >/dev/null [[ -f tmp.pdf ]] && cat tmp.pdf } popd > /dev/null rm -rf $d } #+END_SRC Including the contents of =sudokuii.sty= in this way ensures that we always have a copy on hand for this application, regardless of what happens on the wider system. It also side-steps any issues that might arise from installing in a non-=/usr= prefix, having a misconfigured =TEXINPUTS=, etc. With =pipetex= defined, we can express conversion of the Latex for a single board: #+NAME: image converter functions #+BEGIN_SRC shell :results none function topng() { convert - -trim -transparent white -colorspace RGB png:-; } function tex2png() { cat <' $infile | sudoku2tex "$@" | tex2png > $outd/solved.png egrep -v '[:>]' $infile | sudoku2tex "$@" | tex2png > $outd/new.png egrep -v '[:>]' $infile | sudoku2tex -p "$@" | tex2png > $outd/penciled.png } function convert_board() { sudoku2tex "$@" | tex2png } #+END_SRC Once we deal with the command line arguments #+NAME: handle image converter arguments #+BEGIN_SRC shell :results none while [[ "$1" ]]; do case "$1" in -h|--help) usage exit 0 ;; -P) shift outd="$1" problem=1 if ! [[ "$outd" ]]; then echo "'-P' requires output directory" usage exit 1 fi ;; --) shift break ;; ,*) echo unknown option "'$1'" usage exit 1 ;; esac shift done #+END_SRC we can get on with dispatching to the proper conversion routine: #+NAME: image converter dispatch #+BEGIN_SRC shell tmpfile=`mktemp` cat > $tmpfile err=0 if [[ "$problem" ]]; then convert_puzzle $tmpfile $outd "$@" elif grep -q difficulty $tmpfile; then echo 'sudokugen output detected; re-run with -P option.' >&2 err=1 else <$tmpfile convert_board "$@" fi rm -f $tmpfile exit $err #+END_SRC At this point, generating a large Sudoku is as simple as : sudokugen -o 5 -m -g2 | sudoku2img -P foo #+BEGIN_SRC shell :exports none :results none <> sudokugen -o 5 -m -g2 | sudoku2img -P images/5x5 #+END_SRC Now we have something to occupy a good bit of time: [[file:images/5x5/new.png]] And, when we finally give up, here's the solution: [[file:images/5x5/solved.png]] * Putting It All Together There's just one more item to make this into a usable package. #+NAME: setup.py #+BEGIN_SRC python :tangle setup.py :shebang "#! /usr/bin/env python3" import os from setuptools import setup, find_packages def ls(base): return [os.path.join(base, fn) for fn in os.listdir(base)] setup(name='sudoku', version='0.1', description='Sudoku', packages=find_packages(), scripts=ls('bin'), zip_safe=False) #+END_SRC Now installation is a simple : ./setup.py install away. * Performance With all the work we've put in, how well does all of this perform? Let's go by major use-case. ** Generating Puzzles The following depicts the run time distributions for creating puzzles, via =sudokugen=, of orders 2--5, with =maxguesses= varying in \omega steps from 0 to $\omega^2$. Each pairing was run 100 times, and each run was capped at 300 seconds of real time. #+NAME: genpuzzles.sh #+BEGIN_SRC shell :results none :exports none :shebang "#! /bin/bash" :tangle genpuzzles.sh mkdir -p data/puzzle/{2,3,4,5} for order in 2 3 4 5; do for (( guesses=0; $guesses <= $order**2; guesses += $order )); do for i in {1..100}; do echo sudokugen -o$order -m -g$guesses \> data/puzzle/$order/$guesses.$i done done done | sort -R | parallel -P2 --timeout 300 --joblog data/genlog #+END_SRC #+BEGIN_SRC shell :results none :exports none sed -re 's/-[og]/ /g' data/genlog | awk '{printf("%s %d %d\n", $4, $10, $12)}' | tail -n+2 | sort -k2,2n -k3,3n | awk '{printf("%s (%s,%s)\n", $1, $2, $3)}' > data/gentime.dat gnuplot < images/gentime.png set terminal pngcairo enhanced transparent size 1024,768 crop set style boxplot nooutliers set style data boxplot set logscale y 2 set title 'Generator Runtime for Various Settings' font 'Times,20' set xlabel '(order, maxguesses)' font 'Times,14' set ylabel 'Runtime (seconds)' font 'Times,14' set tics font "Times,12" plot 'data/gentime.dat' using (1.0):1:(0):2 notitle EOF #+END_SRC [[./images/gentime.png]] As a practical matter, =maxguesses= doesn't seem to matter until order 5, at least for performance. At order 5, though, once we allow 10 or more guesses, generation time ramps up very quickly. (As a practical matter, though, =maxguesses= is key to ensuring that humans can handle larger boards manually.) We can get an alternate perspective by looking at how the mix of job results varies: #+BEGIN_SRC shell :results none :exports none sed -re 's/-[og]/ /g' data/genlog \ | awk '{printf("%s (%.2d,%.2d)\n", $7, $10, $12)}' \ | tail -n+2 \ | sort -k2,2n -k1,1n | awk ' $1==0 {$1="proper\t\t"} $1==1 {$1="improper\t"} $1==-1 {$1="timed-out\t"} {print $2,$1,$3} ' | uniq -c | awk ' { s2c[$3][$2]=$1; c2s[$2][$3]=$1; } END { for (c in c2s) { printf("%s\t", c); for (s in s2c) { printf("%s\t", 0+s2c[s][c]); } printf("\n"); } } ' | sort -k1,1 | sed -re 's/\(0/(/; s/,0/,/'> data/genstat.dat gnuplot < images/genstat.png set terminal pngcairo enhanced transparent size 1024,768 crop set style histogram set style data histogram set style fill solid border -1 set title 'Generator Status for Various Settings' font 'Times,20' set xlabel '(order, maxguesses)' font 'Times,14' set ylabel 'Trials' font 'Times,14' set tics font "Times,12" plot 'data/genstat.dat' using 2:xtic(1) title 'improper', '' using 3 title 'proper', '' using 4 title 'timed out' EOF #+END_SRC [[./images/genstat.png]] The explosion in generation time shows here in the growth of job timeouts. We can also see the effect that =maxguesses= has on our ability to ensure that a generated board is proper. ** Solving #+BEGIN_SRC shell :results none :exports none :tangle gensolns.sh :shebang "#! /bin/bash" awk '$7==0 {print $NF}' data/genlog \ | sort -R \ | parallel -P2 --joblog=data/solvelog --timeout 300 'egrep -v \> {} | grep -v : | sudoku > {}.soln' #+END_SRC #+BEGIN_SRC shell :results none :exports none tail -n+2 data/solvelog \ | sed -re ' s|egrep.+puzzle/||; s|.soln||; s|(.+)/(.+)\..+$|\1 \2|' \ | awk '{print $4, $9, $10}' \ | sort -k2,2n -k3,3n \ | awk '{printf("%s (%s,%s)\n", $1, $2, $3)}' > data/solvetime.dat gnuplot < images/solvetime.png set terminal pngcairo enhanced transparent size 1024,768 crop set style boxplot nooutliers set style data boxplot set logscale y 2 set title 'Solver Runtime for Various Settings' font 'Times,20' set xlabel '(order, maxguesses)' font 'Times,14' set ylabel 'Runtime (seconds)' font 'Times,14' set tics font "Times,12" plot 'data/solvetime.dat' using (1.0):1:(0):2 notitle EOF #+END_SRC The generation test produced a total of 1438 proper boards. Solving each gives the following distribution of runtimes vs. generation parameters: [[./images/solvetime.png]] * Wrapping Up The library that implements all of the core logic for generating, solving, parsing, and serializing boards weighs in at 258 lines, excluding blanks; the command line tools, 363 lines. There is much, much more that we could do: - We could create related tools that, rather than assuming the nested-grid structure that we've been enforcing so far, instead read the cell/division structure from a file, allowing us to lean on both the generation and solution logic for nearly arbitrary board arrangements (like Squiggly Sudoku or Jigsaw Sudoku). - We could implement more strategies for solving puzzles, and then build out machinery for tracking which get used, allowing us to more meaningfully estimate difficulty. - We could rework the formatting tools to jettison the dependence on Latex and ImageMagick. - We could lavish attention on performance. But there's little reason. Now that we know how to generate and solve basically anything that is recognizably a Sudoku board, we can consider ourselves free to think of [[https://en.wikipedia.org/wiki/NP-completeness][other]] problems, like [[https://en.wikipedia.org/wiki/Register_allocation#Graph-coloring_allocation][register allocation]] and [[https://en.wikipedia.org/wiki/Job_shop_scheduling][job scheduling]].