Updated

bryan · bryan · commit 6fd75deebad0 · 2016-12-27T14:11:51.000+09:00
diff --git a/README.md b/README.md
@@ -1,6 +1,6 @@
 # Can Convolutional Neural Networks Crack Sudoku Puzzles?
 
-This project is motivated simply by my personal curiosity--can CNNs crack learn how to solve Sudoku? There are many computational approaches to do that. Why not neural networks?
+Sudoku is a popular number puzzle that requires you to fill blanks in a 9×9 grid with digits so that each column, each row, and each of the nine 3×3 subgrids contains all of the digits from 1 to 9. There have been various approaches to solving it, including computational ones. In this pilot project, we show that convolutional neural networks have the potential to crack Sudoku puzzles without any rule-based post-processing.
 
 ## Requirements
   * numpy >= 1.11.1
@@ -10,68 +10,72 @@ This project is motivated simply by my personal curiosity--can CNNs crack learn
 Can Convolutional Neural Networks Crack Sudoku Puzzles?
 
 ## Background
-* To see what Sudoku is, check the wikipedia [here](https://en.wikipedia.org/wiki/Sudoku)
-* To investigate this task comprehensively, read through [McGuire et al. 2013](https://arxiv.org/pdf/1201.0749.pdf)
-
-## Workflow
-* STEP 1. Generate [1M games of Sudoku](https://drive.google.com/open?id=0B0ZXk88koS2Ka0lVQWtBTUhWbUU). (=Y) (See `generate_sudoku.py`)<br/>
-* STEP 2. Make [1, 65] blanks randomly with uniform probabilities for every cell. (=X) (See `load_data` in `train.py`)<br/>
-* STEP 3. Build convolutional networks as follows. (See `Graph` in `train.py`)<br/>
-&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;5 convolutional layers of 512 dimensions<br/>
-&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;1 final convolutional layer with a 1 by 1 filter.<br/>
-STEP 4. Train the model, feeding X and Y. Loss is calucated from the predictions for the blanks. (See `train.py`)<br/>
-STEP 5. Evaluate (See `test.py`).
+* To see what Sudoku is, check the [Wikipedia article](https://en.wikipedia.org/wiki/Sudoku).
+* To investigate this task comprehensively, read through [McGuire et al. 2013](https://arxiv.org/pdf/1201.0749.pdf).
+
+## Training
+* STEP 1. Generate 1 million Sudoku games. (See `generate_sudoku.py`). The pre-generated games are available [here](https://www.kaggle.com/bryanpark/sudoku).
+* STEP 2. Construct convolutional networks as follows. (See `Graph` in `train.py`)<br/>
+![graph](graph.png?raw=true)
+* STEP 3. Train the model, feeding X (quizzes) and Y (solutions). Note that only the predictions at the blank positions count when computing the loss. (See `train.py`)<br/>
+
+## Evaluation
+We test the performance of the final model against 30 real Sudoku puzzles and their solutions, which range in difficulty from easy to evil.
 
 ## Results
-After 10 epochs, we got [this model file](https://drive.google.com/open?id=0B0ZXk88koS2KR0hETzI4dVdZV0k). Subsequently, we evaluated according to the following two methods.
+After 4 epochs, we got [the best model file](https://drive.google.com/open?id=0B0ZXk88koS2KV1VIT2RYUGhuOEU). We designed two test methods.
 
 * Test method 1: Predict the numbers in blanks all at once.
-* Test method 2: Predict the numbers sequentially from the most confident one at each step.
+* Test method 2: Predict the numbers sequentially, filling in the most confident one at a time.
 
  
 | Level  |  Test1 <br/>(#correct/#blanks=acc.)| Test2 <br/>(#correct/#blanks=acc.) |
 | ---    |---     |---     |
-|Easy|25/47=0.53|27/47=0.57|
-|Easy|26/45=0.58|29/45=0.64|
-|Easy|29/47=0.62|37/47=0.79|
-|Easy|24/45=0.53|24/45=0.53|
-|Easy|25/47=0.53|35/47=0.74|
-|Easy|23/46=0.50|30/46=0.65|
-|Medium|17/53=0.32|9/53=0.17|
-|Medium|20/55=0.36|13/55=0.24|
-|Medium|17/55=0.31|16/55=0.29|
-|Medium|25/53=0.47|39/53=0.74|
-|Medium|25/52=0.48|32/52=0.62|
-|Medium|28/56=0.50|12/56=0.21|
-|Hard|18/56=0.32|12/56=0.21|
-|Hard|19/55=0.35|14/55=0.25|
-|Hard|19/55=0.35|21/55=0.38|
-|Hard|22/57=0.39|16/57=0.28|
-|Hard|26/55=0.47|9/55=0.16|
-|Hard|25/56=0.45|36/56=0.64|
-|Expert|21/56=0.38|19/56=0.34|
-|Expert|22/55=0.40|25/55=0.45|
-|Expert|20/54=0.37|12/54=0.22|
-|Expert|25/55=0.45|25/55=0.45|
-|Expert|23/55=0.42|20/55=0.36|
-|Expert|24/54=0.44|19/54=0.35|
-|Evil|28/50=0.56|38/50=0.76|
-|Evil|20/50=0.40|26/50=0.52|
-|Evil|26/49=0.53|29/49=0.59|
-|Evil|21/53=0.40|17/53=0.32|
-|Evil|23/51=0.45|15/51=0.29|
-|Evil|26/51=0.51|16/51=0.31|
-Total Accuracy| 692/1568=0.44|672/1568=0.43|
+|Easy|43/47=0.91|**47/47=1.00**|
+|Easy|37/45=0.82|**45/45=1.00**|
+|Easy|40/47=0.85|**47/47=1.00**|
+|Easy|33/45=0.73|**45/45=1.00**|
+|Easy|37/47=0.79|**47/47=1.00**|
+|Easy|39/46=0.85|**46/46=1.00**|
+|Medium|27/53=0.51|32/53=0.60|
+|Medium|27/55=0.49|27/55=0.49|
+|Medium|32/55=0.58|36/55=0.65|
+|Medium|28/53=0.53|**53/53=1.00**|
+|Medium|27/52=0.52|33/52=0.63|
+|Medium|29/56=0.52|39/56=0.70|
+|Hard|30/56=0.54|41/56=0.73|
+|Hard|31/55=0.56|28/55=0.51|
+|Hard|33/55=0.60|**55/55=1.00**|
+|Hard|33/57=0.58|**57/57=1.00**|
+|Hard|27/55=0.49|50/55=0.91|
+|Hard|28/56=0.50|27/56=0.48|
+|Expert|32/56=0.57|22/56=0.39|
+|Expert|32/55=0.58|**55/55=1.00**|
+|Expert|37/54=0.69|**54/54=1.00**|
+|Expert|33/55=0.60|**55/55=1.00**|
+|Expert|30/55=0.55|23/55=0.42|
+|Expert|25/54=0.46|**54/54=1.00**|
+|Evil|32/50=0.64|**50/50=1.00**|
+|Evil|33/50=0.66|**50/50=1.00**|
+|Evil|34/49=0.69|**49/49=1.00**|
+|Evil|33/53=0.62|**53/53=1.00**|
+|Evil|35/51=0.69|**51/51=1.00**|
+|Evil|34/51=0.67|**51/51=1.00**|
+|Total Accuracy| 971/1568=0.62| **1322/1568=0.84**|
+|Success Rate| 0/30=0| **19/30=0.63**|
 
 ## Conclusions
 * I also tested fully connected layers, to no avail.
 * Up to some point, it seems that CNNs can learn to solve Sudoku.
-* For some problems, the second method was more effective than the first method. But I can't figure out more about that.
-* Probably reinforcement learning would be more appropriate for Sudoku solving.
+* For most problems, the second method outperformed the first one.
+* Humans cannot predict all numbers simultaneously. Probably neither can CNNs.
+
+## Further Study
+* Reinforcement learning would be more appropriate for Sudoku solving.
 
 ## Notes for reproducibility
-* Download pre-generated Sudoku games [here](https://drive.google.com/open?id=0B0ZXk88koS2Ka0lVQWtBTUhWbUU) and extract it to `data/` folder.
-* Download pre-trained model file [here](https://drive.google.com/open?id=0B0ZXk88koS2KR0hETzI4dVdZV0k) and extract it to `asset/train/ckpt` folder.
+* Download the pre-generated Sudoku games [here](https://www.kaggle.com/bryanpark/sudoku) and extract them to the `data/` folder.
+* Download the pre-trained model file [here](https://drive.google.com/open?id=0B0ZXk88koS2KV1VIT2RYUGhuOEU) and extract it to the `asset/train/ckpt` folder.
 	
 
 
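Test method 2 in the README (predict sequentially, most confident cell first) can be sketched in plain Python as below. This is only an illustration, not the code in `test.py`: `solve_sequentially`, `predict_probs` and `make_oracle` are hypothetical names, and the toy oracle stands in for the trained network so that the loop can run on its own.

```python
def solve_sequentially(quiz, predict_probs):
    """Fill blanks (0) one at a time, most confident cell first.

    quiz: 9x9 list of lists of ints, 0 marks a blank.
    predict_probs: hypothetical callable; given a grid it returns a dict
      mapping each blank (row, col) to a list of 10 probabilities
      (index = digit, index 0 unused).
    """
    grid = [row[:] for row in quiz]  # work on a copy
    while any(0 in row for row in grid):
        probs = predict_probs(grid)
        # Pick the blank whose best digit has the highest probability.
        (r, c), cell_probs = max(probs.items(), key=lambda kv: max(kv[1][1:]))
        grid[r][c] = max(range(1, 10), key=lambda d: cell_probs[d])
        # The next pass re-queries the model with this cell filled in.
    return grid


def make_oracle(solution):
    """Toy stand-in for the model: puts probability 1.0 on the true digit."""
    def predict_probs(grid):
        out = {}
        for r in range(9):
            for c in range(9):
                if grid[r][c] == 0:
                    p = [0.0] * 10
                    p[solution[r][c]] = 1.0
                    out[(r, c)] = p
        return out
    return predict_probs
```

Because the model is queried again after every filled cell, each later prediction is conditioned on the earlier, more confident ones. That is the difference from test method 1, which commits to all blanks in a single pass.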
diff --git a/generate_sudoku.py b/generate_sudoku.py
@@ -1,7 +1,8 @@
-
+#!/usr/bin/python2
 """
 This is adapted from https://www.ocf.berkeley.edu/~arel/sudoku/main.html.
-Kyubyong Park.
+Generates 1 million Sudoku games. 
+Kyubyong Park. kbpark.linguist@gmail.com www.github.com/kyubyong
 """
 
 import random, copy
@@ -139,16 +140,14 @@ def run(n = 28, iter=100):
 #     print "* creating the solution..."
     a_puzzle_solution = construct_puzzle_solution()
     
-    return a_puzzle_solution
-
 #     print "* constructing a puzzle..."
-#     for i in range(iter):
-#         puzzle = copy.deepcopy(a_puzzle_solution)
-#         (result, number_of_cells) = pluck(puzzle, n)
-#         all_results.setdefault(number_of_cells, []).append(result)
-#         if number_of_cells <= n: break
-# 
-#     return all_results
+    for i in range(iter):
+        puzzle = copy.deepcopy(a_puzzle_solution)
+        (result, number_of_cells) = pluck(puzzle, n)
+        all_results.setdefault(number_of_cells, []).append(result)
+        if number_of_cells <= n: break
+ 
+    return all_results, a_puzzle_solution
 
 def best(set_of_puzzles):
     # Could run some evaluation function here. For now just pick
@@ -170,13 +169,18 @@ def main(num):
     '''
     Generates `num` games of Sudoku.
     '''
-    Y = np.zeros((num, 9, 9), np.int32)
+    quizzes = np.zeros((num, 9, 9), np.int32)
+    solutions = np.zeros((num, 9, 9), np.int32)
     for i in range(num):
-        game = np.array(run())
-        Y[i] = game
+        all_results, solution = run(n=23, iter=10)
+        quiz = best(all_results)
+        
+        quizzes[i] = quiz
+        solutions[i] = solution
+
         if (i+1) % 1000 == 0:
             print i+1
-            np.save('data/sudoku.npy', Y)
+            np.savez('data/sudoku.npz', quizzes=quizzes, solutions=solutions)
 
 if __name__ == "__main__":
     main(1000000)
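A quick sanity check on generated quiz/solution pairs might look like the sketch below. It is not part of this commit: the helper names are hypothetical, grids are assumed to be 9x9 lists of ints (for NumPy arrays, call `.tolist()` first), and blanks are assumed to be encoded as 0, which is how `train.py` identifies target cells.

```python
def is_valid_solution(grid):
    """True if every row, column and 3x3 box holds the digits 1-9 once."""
    digits = set(range(1, 10))
    rows = [set(row) for row in grid]
    cols = [set(col) for col in zip(*grid)]
    boxes = [
        {grid[r][c] for r in range(br, br + 3) for c in range(bc, bc + 3)}
        for br in (0, 3, 6) for bc in (0, 3, 6)
    ]
    return all(unit == digits for unit in rows + cols + boxes)


def quiz_matches_solution(quiz, solution):
    """True if every given (non-zero) quiz cell agrees with the solution."""
    return all(
        q == 0 or q == s
        for q_row, s_row in zip(quiz, solution)
        for q, s in zip(q_row, s_row)
    )
```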
diff --git a/graph.png b/graph.png
diff --git a/test.py b/test.py
@@ -1,8 +1,15 @@
 # -*- coding: utf-8 -*-
+'''
+Test the performance of the model.
+'''
 import sugartensor as tf
 import numpy as np
 from train import Graph
 
+# Test sets
+# 6 * Easy + 6 * Medium + 6 * Hard + 6 * Expert + 6 * Evil 
+# From http://1sudoku.com/print/print-sudoku-free/
+
 problems = '''\
 080032001
 703080002
@@ -605,13 +612,14 @@
 438269517
 619875423'''
 
-def data_process():
-    # Convert problem and solution sets to the proper format
+def preprocess():
+    '''Converts problem and solution sets to the proper format
+    '''
     global problems, solutions
     
     nproblems = len(problems.strip().split("\n\n"))
     X = np.zeros((nproblems, 9, 9), np.float32)  
-    Y = np.zeros((nproblems, 9, 9), np.float32)  
+    Y = np.zeros((nproblems, 9, 9), np.int32)  
     
     for i, prob in enumerate(problems.strip().split('\n\n')):
         for j, row in enumerate(prob.splitlines()):
@@ -631,7 +639,7 @@ def test1():
     '''
     Predicts all at once.
     '''
-    X, Y = data_process()
+    X, Y = preprocess()
     g = Graph(is_train=False)
         
     with tf.Session() as sess:
@@ -668,7 +676,7 @@ def test2():
     '''
     Predicts sequentially.
     '''
-    X, Y = data_process()
+    X, Y = preprocess()
     g = Graph(is_train=False)
          
     with tf.Session() as sess:
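The text format used for `problems` and `solutions` (nine-digit rows, puzzles separated by a blank line) and the per-puzzle `#correct/#blanks` score reported in the README can be illustrated with a small standalone sketch. The function names here are hypothetical and the code uses plain lists rather than the NumPy arrays that `preprocess` builds.

```python
def parse_grids(text):
    """Parse blank-line-separated puzzles into 9x9 lists of ints."""
    return [
        [[int(ch) for ch in row] for row in block.splitlines()]
        for block in text.strip().split("\n\n")
    ]


def blank_accuracy(quiz, solution, prediction):
    """Return (#correct, #blanks), counted over the quiz's blank cells only."""
    blanks = [(r, c) for r in range(9) for c in range(9) if quiz[r][c] == 0]
    correct = sum(prediction[r][c] == solution[r][c] for r, c in blanks)
    return correct, len(blanks)
```

Cells that were given in the quiz are excluded from both counts, so the ratio reflects only what the model had to fill in.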
diff --git a/train.py b/train.py
@@ -5,90 +5,97 @@
 # set log level to debug
 tf.sg_verbosity(10)
 
+class Hyperparams:
+    batch_size = 64
+
 def load_data(is_train=True):
-    Y = np.load('data/sudoku.npy') # solutions
+    '''Loads training / validation data.
     
-    X = np.zeros_like(Y, dtype=np.float32)
-    for i, y in enumerate(Y): # game-wise
-        nblanks = np.random.randint(1, 65) # We generate a problem which varies from 1 to 65 in number of blanks.
-        blank_indices = np.random.choice(81, nblanks)
-        masks= np.ones((9*9))
-        masks[blank_indices] = 0
-        masks = masks.reshape((9, 9))
-
-        x = y * masks # puzzle. 0: blanks=targets.
-        X[i] = x
+    Args:
+      is_train: Boolean. If True, it loads training data.
+        Otherwise, it loads validation data.
+    
+    Returns:
+      X: 4-D array of float. Has the shape of (# total games, 9, 9, 1) (for train) 
+        or (batch_size, 9, 9, 1) (for validation)
+      Y: 3-D array of int. Has the shape of (# total games, 9, 9) (for train) 
+        or (batch_size, 9, 9) (for validation)            
+    '''
+    X = np.load('data/sudoku.npz')['quizzes'].astype(np.float32)
+    Y = np.load('data/sudoku.npz')['solutions']
     
     X = np.expand_dims(X, -1)
     
     if is_train:
-        return X[:-100], Y[:-100] # training data
+        return X[:-Hyperparams.batch_size], Y[:-Hyperparams.batch_size] # training data
     else:
-        return X[-100:], Y[-100:] # validation data
+        return X[-Hyperparams.batch_size:], Y[-Hyperparams.batch_size:] # validation data
+        
+def get_batch_data(is_train=True):
+    '''Returns batch data.
     
-def get_batch_data(is_train=True, batch_size=16):
-    '''
     Args:
-      is_train: Boolean. If True, load training data. Otherwise, load validation data. 
+      is_train: Boolean. If True, it returns batch training data. 
+        Otherwise, batch validation data. 
+        
     Returns:
-      A Tuple of X batch queues (Tensor), Y batch queues (Tensor), and number of batches (int) 
+      A Tuple of x, y, and num_batch
+        x: A `Tensor` of float. Has the shape of (batch_size, 9, 9, 1).
+        y: A `Tensor` of int. Has the shape of (batch_size, 9, 9).
+        num_batch: A Python int. Number of batches.
     '''
-    # Load data
     X, Y = load_data(is_train=is_train)
     
     # Create Queues
     input_queues = tf.train.slice_input_producer([tf.convert_to_tensor(X), 
                                                   tf.convert_to_tensor(Y)]) 
     
     # create batch queues
-    X_batch, Y_batch = tf.train.shuffle_batch(input_queues,
-                                      num_threads=8,
-                                      batch_size=batch_size, 
-                                      capacity=batch_size*64,
-                                      min_after_dequeue=batch_size*32, 
-                                      allow_smaller_final_batch=False)
+    x, y = tf.train.shuffle_batch(input_queues,
+                                  num_threads=8,
+                                  batch_size=Hyperparams.batch_size, 
+                                  capacity=Hyperparams.batch_size*64,
+                                  min_after_dequeue=Hyperparams.batch_size*32, 
+                                  allow_smaller_final_batch=False)
     # calc total batch count
-    num_batch = len(X) // batch_size 
+    num_batch = len(X) // Hyperparams.batch_size
     
-    return X_batch, Y_batch, num_batch  # (16, 9, 9, 1) int32. cf. Y_batch: (16, 9, 9) int32
+    return x, y, num_batch  # (64, 9, 9, 1), (64, 9, 9), ()
 
 class Graph(object):
     def __init__(self, is_train=True):
         # inputs
         if is_train:
-            self.X, self.Y, self.num_batch = get_batch_data() # (16, 9, 9, 1), (16, 9, 9)
-            self.X_val, self.Y_val, _ = get_batch_data(is_train=False)
+            self.x, self.y, self.num_batch = get_batch_data()
+            self.x_val, self.y_val, _ = get_batch_data(is_train=False)
         else:
-            self.X = tf.placeholder(tf.float32, [None, 9, 9, 1])
+            self.x = tf.placeholder(tf.float32, [None, 9, 9, 1])
 
         with tf.sg_context(size=3, act='relu', bn=True):
-            self.logits = self.X.sg_identity()
-            for _ in range(5):
+            self.logits = self.x.sg_identity()
+            for _ in range(10):
                 self.logits = (self.logits.sg_conv(dim=512))
-            self.logits = self.logits.sg_conv(dim=10, size=1, act='linear', bn=False) # (16, 9, 9, 10) float32
+
+            self.logits = self.logits.sg_conv(dim=10, size=1, act='linear', bn=False)
             
         if is_train:
-            self.ce = self.logits.sg_ce(target=self.Y, mask=False) # (16, 9, 9) dtype=float32
-            self.istarget = tf.equal(self.X.sg_squeeze(), tf.zeros_like(self.X.sg_squeeze())).sg_float() # zeros: 1, non-zeros: 0 (16, 9, 9) dtype=float32
-            self.loss = self.ce * self.istarget # (16, 9, 9) dtype=float32
+            self.ce = self.logits.sg_ce(target=self.y, mask=False)
+            self.istarget = tf.equal(self.x.sg_squeeze(), tf.zeros_like(self.x.sg_squeeze())).sg_float()
+            self.loss = self.ce * self.istarget
             self.reduced_loss = self.loss.sg_sum() / self.istarget.sg_sum()
             tf.sg_summary_loss(self.reduced_loss, "reduced_loss")
             
-            # accuracy evaluation ( for train set )
-            self.preds = (self.logits.sg_argmax()).sg_int()
-            self.hits = tf.equal(self.preds, self.Y).sg_float()
-            self.acc_train = (self.hits * self.istarget).sg_sum() / self.istarget.sg_sum()
-            
             # accuracy evaluation ( for validation set )
-            self.preds_ = (self.logits.sg_reuse(input=self.X_val).sg_argmax()).sg_int()
-            self.hits_ = tf.equal(self.preds_, self.Y_val).sg_float()
-            self.istarget_ = tf.equal(self.X_val.sg_squeeze(), tf.zeros_like(self.X_val.sg_squeeze())).sg_float()
-            self.acc_val = (self.hits_ * self.istarget_).sg_sum() / self.istarget_.sg_sum()
+            self.preds_ = (self.logits.sg_reuse(input=self.x_val).sg_argmax()).sg_int()
+            self.hits_ = tf.equal(self.preds_, self.y_val).sg_float()
+            self.istarget_ = tf.equal(self.x_val.sg_squeeze(), tf.zeros_like(self.x_val.sg_squeeze())).sg_float()
+            self.acc = (self.hits_ * self.istarget_).sg_sum() / self.istarget_.sg_sum()
 
 def main():
     g = Graph()
     
-    tf.sg_train(log_interval=10, loss=g.reduced_loss, eval_metric=[g.acc_train, g.acc_val], 
+    tf.sg_train(lr=0.0001, lr_reset=True, log_interval=10, save_interval=300, 
+                loss=g.reduced_loss, eval_metric=[g.acc], 
                 ep_size=g.num_batch, save_dir='asset/train', max_ep=10, early_stop=False)
     
 if __name__ == "__main__":
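The loss in `Graph` multiplies the per-cell cross-entropy by `istarget` and divides by the number of blanks, so given cells do not contribute. A framework-free sketch of the same idea, assuming flat Python lists instead of tensors and a hypothetical function name, is:

```python
import math


def masked_cross_entropy(logits, targets, inputs):
    """Mean cross-entropy over cells where the input is blank (0).

    logits: flat list of per-cell score lists (one score per class).
    targets: flat list of correct class indices.
    inputs: flat list of input digits; 0 marks a blank.
    """
    total, n_blanks = 0.0, 0
    for scores, target, given in zip(logits, targets, inputs):
        if given != 0:
            continue  # given cells contribute nothing to the loss
        # Numerically stable log-sum-exp, then the target's negative log-prob.
        top = max(scores)
        log_sum = top + math.log(sum(math.exp(s - top) for s in scores))
        total += log_sum - scores[target]
        n_blanks += 1
    return total / n_blanks  # raises ZeroDivisionError if there are no blanks
```

With uniform scores over 10 classes, every blank cell contributes `log(10)`, which is a handy reference value for an untrained model.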