{ "cells": [ { "cell_type": "code", "execution_count": 1, "metadata": { "collapsed": false }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Populating the interactive namespace from numpy and matplotlib\n" ] } ], "source": [ "%pylab inline\n", "from pylab import *\n", "import codecs,string,os,sys,os.path,glob,re" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "The CLSTM command line tools take their training data in HDF5 files (you will evenutally also be able to train directly from images saved on disk, as in ocropy, but that's not quite implemented yet). This illustrates how to store images into an HDF5 file and then how to run the `clstmctc` training tool on the data." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# The UW3-500 Dataset" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "We illustrate loading data with the `uw3-500.tgz` dataset, available from `tmbdev.net`." ] }, { "cell_type": "code", "execution_count": 2, "metadata": { "collapsed": false }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "--2015-06-03 14:02:53-- http://www.tmbdev.net/ocrdata/uw3-500.tgz\n", "Resolving www.tmbdev.net (www.tmbdev.net)... 69.163.203.33\n", "Connecting to www.tmbdev.net (www.tmbdev.net)|69.163.203.33|:80... connected.\n", "HTTP request sent, awaiting response... 200 OK\n", "Length: 1401847 (1.3M) [text/plain]\n", "Saving to: ‘uw3-500.tgz’\n", "\n", "100%[======================================>] 1,401,847 815KB/s in 1.7s \n", "\n", "2015-06-03 14:02:55 (815 KB/s) - ‘uw3-500.tgz’ saved [1401847/1401847]\n", "\n" ] } ], "source": [ "!test -f uw3-500.tgz || wget -nd http://www.tmbdev.net/ocrdata/uw3-500.tgz" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Let's untar the file unless it has already been untarred." ] }, { "cell_type": "code", "execution_count": 3, "metadata": { "collapsed": false, "scrolled": true }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "book/\n", "book/1241/\n", "book/1241/010055.bin.png\n", "book/1241/010055.gt.txt\n", "book/0134/\n", "book/0134/010016.gt.txt\n", "book/0134/010016.bin.png\n", "book/1180/\n", "book/1180/010028.bin.png\n", "book/1180/010028.gt.txt\n", "book/0654/\n", "book/0654/010043.bin.png\n", "book/0654/010043.gt.txt\n", "book/1583/\n", "book/1583/010065.gt.txt\n", "book/1583/010065.bin.png\n", "book/1147/\n", "book/1147/010030.bin.png\n", "book/1147/010030.gt.txt\n", "book/1395/\n", "book/1395/010044.bin.png\n", "book/1395/010044.gt.txt\n", "book/0873/\n", "book/0873/010074.bin.png\n", "book/0873/010074.gt.txt\n", "book/1480/\n", "book/1480/010084.gt.txt\n", "book/1480/010084.bin.png\n", "book/0604/\n", "book/0604/010010.bin.png\n", "book/0604/010010.gt.txt\n", "book/1074/\n", "book/1074/010051.gt.txt\n", "book/1074/010040.gt.txt\n", "book/1074/010040.bin.png\n", "book/1074/010051.bin.png\n", "book/1437/\n", "book/1437/010049.bin.png\n", "book/1437/010049.gt.txt\n", "book/0885/\n", "book/0885/010006.gt.txt\n", "book/0885/010006.bin.png\n", "book/0214/\n", "book/0214/010002.gt.txt\n", "book/0214/010002.bin.png\n", "book/0326/\n", "book/0326/010061.gt.txt\n", "book/0326/010061.bin.png\n", "book/1377/\n", "book/1377/010039.gt.txt\n", "book/1377/010039.bin.png\n", "book/0972/\n", "book/0972/010026.gt.txt\n", "book/0972/010043.bin.png\n", "book/0972/010026.bin.png\n", "book/0972/010043.gt.txt\n", "book/0461/\n", "book/0461/010074.bin.png\n", "book/0461/010074.gt.txt\n", "book/0157/\n", "book/0157/010037.bin.png\n", "book/0157/010037.gt.txt\n", "book/0225/\n", "book/0225/010074.bin.png\n", "book/0225/010074.gt.txt\n", "book/0395/\n", "book/0395/010088.gt.txt\n", "book/0395/010063.bin.png\n", "book/0395/010063.gt.txt\n", "book/0395/010064.gt.txt\n", "book/0395/010001.bin.png\n", "book/0395/010001.gt.txt\n", "book/0395/010088.bin.png\n", "book/0395/010064.bin.png\n", "book/0074/\n", "book/0074/010022.gt.txt\n", "book/0074/010022.bin.png\n", "book/1590/\n", "book/1590/010011.bin.png\n", "book/1590/010011.gt.txt\n", "book/0659/\n", "book/0659/010017.gt.txt\n", "book/0659/010017.bin.png\n", "book/0185/\n", "book/0185/010089.bin.png\n", "book/0185/010089.gt.txt\n", "book/0835/\n", "book/0835/010039.gt.txt\n", "book/0835/010039.bin.png\n", "book/1522/\n", "book/1522/010058.gt.txt\n", "book/1522/010065.gt.txt\n", "book/1522/010065.bin.png\n", "book/1522/010058.bin.png\n", "book/0961/\n", "book/0961/010054.gt.txt\n", "book/0961/010054.bin.png\n", "book/0599/\n", "book/0599/010020.bin.png\n", "book/0599/010020.gt.txt\n", "book/0318/\n", "book/0318/010085.bin.png\n", "book/0318/010085.gt.txt\n", "book/0931/\n", "book/0931/010007.gt.txt\n", "book/0931/010007.bin.png\n", "book/0931/010023.gt.txt\n", "book/0931/010023.bin.png\n", "book/1520/\n", "book/1520/010021.bin.png\n", "book/1520/010021.gt.txt\n", "book/0028/\n", "book/0028/010012.bin.png\n", "book/0028/010004.gt.txt\n", "book/0028/010004.bin.png\n", "book/0028/010012.gt.txt\n", "book/1355/\n", "book/1355/010089.bin.png\n", "book/1355/010089.gt.txt\n", "book/0460/\n", "book/0460/010004.gt.txt\n", "book/0460/010004.bin.png\n", "book/1125/\n", "book/1125/010013.gt.txt\n", "book/1125/010013.bin.png\n", "book/0539/\n", "book/0539/010061.gt.txt\n", "book/0539/010061.bin.png\n", "book/0539/010091.gt.txt\n", "book/0539/010091.bin.png\n", "book/1488/\n", "book/1488/010042.bin.png\n", "book/1488/010042.gt.txt\n", "book/1127/\n", "book/1127/010102.gt.txt\n", "book/1127/010102.bin.png\n", "book/1336/\n", "book/1336/010071.bin.png\n", "book/1336/010071.gt.txt\n", "book/1417/\n", "book/1417/010035.gt.txt\n", "book/1417/010035.bin.png\n", "book/1305/\n", "book/1305/010024.gt.txt\n", "book/1305/010024.bin.png\n", "book/0953/\n", "book/0953/010015.gt.txt\n", "book/0953/010015.bin.png\n", "book/1031/\n", "book/1031/010014.gt.txt\n", "book/1031/010014.bin.png\n", "book/0596/\n", "book/0596/010002.gt.txt\n", "book/0596/010002.bin.png\n", "book/1174/\n", "book/1174/010044.bin.png\n", "book/1174/010044.gt.txt\n", "book/1001/\n", "book/1001/010061.gt.txt\n", "book/1001/010061.bin.png\n", "book/0355/\n", "book/0355/010001.bin.png\n", "book/0355/010001.gt.txt\n", "book/0222/\n", "book/0222/010007.gt.txt\n", "book/0222/010007.bin.png\n", "book/0222/010017.gt.txt\n", "book/0222/010017.bin.png\n", "book/1409/\n", "book/1409/010004.gt.txt\n", "book/1409/010004.bin.png\n", "book/0066/\n", "book/0066/010016.gt.txt\n", "book/0066/010016.bin.png\n", "book/0971/\n", "book/0971/010005.bin.png\n", "book/0971/010005.gt.txt\n", "book/0191/\n", "book/0191/010040.gt.txt\n", "book/0191/010040.bin.png\n", "book/1325/\n", "book/1325/010036.bin.png\n", "book/1325/010036.gt.txt\n", "book/1544/\n", "book/1544/010022.gt.txt\n", "book/1544/010022.bin.png\n", "book/0787/\n", "book/0787/010001.bin.png\n", "book/0787/010001.gt.txt\n", "book/0331/\n", "book/0331/010014.gt.txt\n", "book/0331/010014.bin.png\n", "book/0768/\n", "book/0768/010009.gt.txt\n", "book/0768/010009.bin.png\n", "book/0911/\n", "book/0911/010008.bin.png\n", "book/0911/010008.gt.txt\n", "book/0012/\n", "book/0012/010036.bin.png\n", "book/0012/010036.gt.txt\n", "book/0012/010108.gt.txt\n", "book/0012/010108.bin.png\n", "book/0865/\n", "book/0865/010022.gt.txt\n", "book/0865/010022.bin.png\n", "book/0865/010042.bin.png\n", "book/0865/010091.gt.txt\n", "book/0865/010042.gt.txt\n", "book/0865/010091.bin.png\n", "book/1552/\n", "book/1552/010017.gt.txt\n", "book/1552/010017.bin.png\n", "book/1223/\n", "book/1223/010060.bin.png\n", "book/1223/010057.bin.png\n", "book/1223/010060.gt.txt\n", "book/1223/010057.gt.txt\n", "book/1177/\n", "book/1177/010046.bin.png\n", "book/1177/010046.gt.txt\n", "book/0857/\n", "book/0857/010109.bin.png\n", "book/0857/010001.bin.png\n", "book/0857/010109.gt.txt\n", "book/0857/010001.gt.txt\n", "book/0705/\n", "book/0705/010051.gt.txt\n", "book/0705/010045.gt.txt\n", "book/0705/010045.bin.png\n", "book/0705/010051.bin.png\n", "book/0808/\n", "book/0808/010031.gt.txt\n", "book/0808/010031.bin.png\n", "book/0158/\n", "book/0158/010007.gt.txt\n", "book/0158/010007.bin.png\n", "book/0468/\n", "book/0468/010060.bin.png\n", "book/0468/010060.gt.txt\n", "book/0360/\n", "book/0360/010027.bin.png\n", "book/0360/010027.gt.txt\n", "book/0540/\n", "book/0540/010011.bin.png\n", "book/0540/010011.gt.txt\n", "book/0376/\n", "book/0376/010048.gt.txt\n", "book/0376/010048.bin.png\n", "book/0376/010085.bin.png\n", "book/0376/010085.gt.txt\n", "book/0350/\n", "book/0350/010054.gt.txt\n", "book/0350/010054.bin.png\n", "book/1415/\n", "book/1415/010004.gt.txt\n", "book/1415/010004.bin.png\n", "book/1094/\n", "book/1094/010075.gt.txt\n", "book/1094/010075.bin.png\n", "book/1090/\n", "book/1090/010009.gt.txt\n", "book/1090/010009.bin.png\n", "book/1162/\n", "book/1162/010004.gt.txt\n", "book/1162/010004.bin.png\n", "book/1276/\n", "book/1276/010002.gt.txt\n", "book/1276/010002.bin.png\n", "book/0735/\n", "book/0735/010002.gt.txt\n", "book/0735/010002.bin.png\n", "book/0183/\n", "book/0183/010006.gt.txt\n", "book/0183/010006.bin.png\n", "book/0394/\n", "book/0394/010060.bin.png\n", "book/0394/010060.gt.txt\n", "book/0864/\n", "book/0864/010038.bin.png\n", "book/0864/010038.gt.txt\n", "book/0625/\n", "book/0625/010007.gt.txt\n", "book/0625/010007.bin.png\n", "book/0913/\n", "book/0913/010019.bin.png\n", "book/0913/010034.gt.txt\n", "book/0913/010034.bin.png\n", "book/0913/010019.gt.txt\n", "book/1462/\n", "book/1462/010070.bin.png\n", "book/1462/010070.gt.txt\n", "book/0123/\n", "book/0123/010003.gt.txt\n", "book/0123/010003.bin.png\n", "book/0123/010034.gt.txt\n", "book/0123/010034.bin.png\n", "book/1339/\n", "book/1339/010054.gt.txt\n", "book/1339/010054.bin.png\n", "book/0989/\n", "book/0989/010035.gt.txt\n", "book/0989/010035.bin.png\n", "book/1523/\n", "book/1523/010039.gt.txt\n", "book/1523/010039.bin.png\n", "book/0712/\n", "book/0712/010013.gt.txt\n", "book/0712/010013.bin.png\n", "book/1447/\n", "book/1447/010091.gt.txt\n", "book/1447/010091.bin.png\n", "book/0373/\n", "book/0373/010036.bin.png\n", "book/0373/010036.gt.txt\n", "book/0373/010046.bin.png\n", "book/0373/010001.bin.png\n", "book/0373/010001.gt.txt\n", "book/0373/010046.gt.txt\n", "book/1161/\n", "book/1161/010021.bin.png\n", "book/1161/010021.gt.txt\n", "book/0005/\n", "book/0005/010001.bin.png\n", "book/0005/010001.gt.txt\n", "book/0963/\n", "book/0963/010026.gt.txt\n", "book/0963/010026.bin.png\n", "book/0476/\n", "book/0476/010023.gt.txt\n", "book/0476/010023.bin.png\n", "book/1346/\n", "book/1346/010012.bin.png\n", "book/1346/010012.gt.txt\n", "book/1423/\n", "book/1423/010042.bin.png\n", "book/1423/010042.gt.txt\n", "book/0969/\n", "book/0969/010098.bin.png\n", "book/0969/010098.gt.txt\n", "book/0969/010034.gt.txt\n", "book/0969/010034.bin.png\n", "book/0893/\n", "book/0893/010116.bin.png\n", "book/0893/010116.gt.txt\n", "book/0291/\n", "book/0291/010070.bin.png\n", "book/0291/010070.gt.txt\n", "book/1117/\n", "book/1117/010049.bin.png\n", "book/1117/010049.gt.txt\n", "book/0207/\n", "book/0207/010017.gt.txt\n", "book/0207/010017.bin.png\n", "book/1039/\n", "book/1039/010051.gt.txt\n", "book/1039/010051.bin.png\n", "book/1240/\n", "book/1240/010045.gt.txt\n", "book/1240/010045.bin.png\n", "book/1240/010038.bin.png\n", "book/1240/010038.gt.txt\n", "book/1021/\n", "book/1021/010037.bin.png\n", "book/1021/010037.gt.txt\n", "book/1169/\n", "book/1169/010033.gt.txt\n", "book/1169/010057.bin.png\n", "book/1169/010100.gt.txt\n", "book/1169/010057.gt.txt\n", "book/1169/010100.bin.png\n", "book/1169/010033.bin.png\n", "book/1185/\n", "book/1185/010064.gt.txt\n", "book/1185/010064.bin.png\n", "book/0664/\n", "book/0664/010033.gt.txt\n", "book/0664/010049.bin.png\n", "book/0664/010049.gt.txt\n", "book/0664/010033.bin.png\n", "book/0709/\n", "book/0709/010002.gt.txt\n", "book/0709/010002.bin.png\n", "book/1109/\n", "book/1109/010059.bin.png\n", "book/1109/010059.gt.txt\n", "book/1216/\n", "book/1216/010035.gt.txt\n", "book/1216/010035.bin.png\n", "book/1139/\n", "book/1139/010029.gt.txt\n", "book/1139/010077.gt.txt\n", "book/1139/010077.bin.png\n", "book/1139/010104.gt.txt\n", "book/1139/010029.bin.png\n", "book/1139/010104.bin.png\n", "book/0948/\n", "book/0948/010014.gt.txt\n", "book/0948/010014.bin.png\n", "book/0995/\n", "book/0995/010028.bin.png\n", "book/0995/010028.gt.txt\n", "book/1402/\n", "book/1402/010020.bin.png\n", "book/1402/010020.gt.txt\n", "book/1323/\n", "book/1323/010053.gt.txt\n", "book/1323/010053.bin.png\n", "book/0041/\n", "book/0041/010039.gt.txt\n", "book/0041/010039.bin.png\n", "book/1473/\n", "book/1473/010041.bin.png\n", "book/1473/010041.gt.txt\n", "book/0590/\n", "book/0590/010033.gt.txt\n", "book/0590/010033.bin.png\n", "book/1002/\n", "book/1002/010025.gt.txt\n", "book/1002/010025.bin.png\n", "book/0036/\n", "book/0036/010043.bin.png\n", "book/0036/010043.gt.txt\n", "book/1307/\n", "book/1307/010074.bin.png\n", "book/1307/010074.gt.txt\n", "book/1038/\n", "book/1038/010005.bin.png\n", "book/1038/010005.gt.txt\n", "book/0639/\n", "book/0639/010060.bin.png\n", "book/0639/010060.gt.txt\n", "book/1371/\n", "book/1371/010031.gt.txt\n", "book/1371/010031.bin.png\n", "book/0349/\n", "book/0349/010036.bin.png\n", "book/0349/010036.gt.txt\n", "book/1032/\n", "book/1032/010071.bin.png\n", "book/1032/010071.gt.txt\n", "book/1394/\n", "book/1394/010089.bin.png\n", "book/1394/010045.gt.txt\n", "book/1394/010089.gt.txt\n", "book/1394/010045.bin.png\n", "book/1492/\n", "book/1492/010014.gt.txt\n", "book/1492/010014.bin.png\n", "book/1329/\n", "book/1329/010049.bin.png\n", "book/1329/010049.gt.txt\n", "book/0895/\n", "book/0895/010030.bin.png\n", "book/0895/010030.gt.txt\n", "book/0701/\n", "book/0701/010023.gt.txt\n", "book/0701/010023.bin.png\n", "book/1443/\n", "book/1443/010083.gt.txt\n", "book/1443/010072.gt.txt\n", "book/1443/010083.bin.png\n", "book/1443/010072.bin.png\n", "book/0629/\n", "book/0629/010020.bin.png\n", "book/0629/010020.gt.txt\n", "book/1477/\n", "book/1477/010082.bin.png\n", "book/1477/010082.gt.txt\n", "book/1477/010049.bin.png\n", "book/1477/010049.gt.txt\n", "book/1067/\n", "book/1067/010058.gt.txt\n", "book/1067/010058.bin.png\n", "book/0630/\n", "book/0630/010019.bin.png\n", "book/0630/010019.gt.txt\n", "book/0593/\n", "book/0593/010078.bin.png\n", "book/0593/010078.gt.txt\n", "book/1549/\n", "book/1549/010041.bin.png\n", "book/1549/010041.gt.txt\n", "book/1286/\n", "book/1286/010007.gt.txt\n", "book/1286/010007.bin.png\n", "book/1286/010051.gt.txt\n", "book/1286/010051.bin.png\n", "book/1215/\n", "book/1215/010013.gt.txt\n", "book/1215/010013.bin.png\n", "book/1150/\n", "book/1150/010008.bin.png\n", "book/1150/010008.gt.txt\n", "book/0011/\n", "book/0011/010111.gt.txt\n", "book/0011/010111.bin.png\n", "book/0795/\n", "book/0795/010011.bin.png\n", "book/0795/010011.gt.txt\n", "book/0653/\n", "book/0653/010086.bin.png\n", "book/0653/010086.gt.txt\n", "book/1252/\n", "book/1252/010051.gt.txt\n", "book/1252/010051.bin.png\n", "book/1252/010028.bin.png\n", "book/1252/010028.gt.txt\n", "book/0422/\n", "book/0422/010017.gt.txt\n", "book/0422/010017.bin.png\n", "book/1123/\n", "book/1123/010102.gt.txt\n", "book/1123/010102.bin.png\n", "book/0833/\n", "book/0833/010089.bin.png\n", "book/0833/010089.gt.txt\n", "book/0517/\n", "book/0517/010039.gt.txt\n", "book/0517/010039.bin.png\n", "book/0315/\n", "book/0315/010029.gt.txt\n", "book/0315/010029.bin.png\n", "book/0315/010034.gt.txt\n", "book/0315/010034.bin.png\n", "book/0877/\n", "book/0877/010004.gt.txt\n", "book/0877/010004.bin.png\n", "book/0677/\n", "book/0677/010015.gt.txt\n", "book/0677/010015.bin.png\n", "book/0095/\n", "book/0095/010007.gt.txt\n", "book/0095/010007.bin.png\n", "book/0498/\n", "book/0498/010019.bin.png\n", "book/0498/010019.gt.txt\n", "book/1376/\n", "book/1376/010006.gt.txt\n", "book/1376/010006.bin.png\n", "book/0945/\n", "book/0945/010008.bin.png\n", "book/0945/010008.gt.txt\n", "book/1352/\n", "book/1352/010022.gt.txt\n", "book/1352/010022.bin.png\n", "book/1599/\n", "book/1599/010019.bin.png\n", "book/1599/010019.gt.txt\n", "book/1420/\n", "book/1420/010013.gt.txt\n", "book/1420/010059.bin.png\n", "book/1420/010059.gt.txt\n", "book/1420/010013.bin.png\n", "book/1004/\n", "book/1004/010017.gt.txt\n", "book/1004/010017.bin.png\n", "book/0176/\n", "book/0176/010077.gt.txt\n", "book/0176/010077.bin.png\n", "book/0410/\n", "book/0410/010043.bin.png\n", "book/0410/010043.gt.txt\n", "book/1526/\n", "book/1526/010034.gt.txt\n", "book/1526/010049.bin.png\n", "book/1526/010034.bin.png\n", "book/1526/010049.gt.txt\n", "book/1184/\n", "book/1184/010045.gt.txt\n", "book/1184/010045.bin.png\n", "book/1398/\n", "book/1398/010015.gt.txt\n", "book/1398/010015.bin.png\n", "book/0700/\n", "book/0700/010021.bin.png\n", "book/0700/010021.gt.txt\n", "book/1141/\n", "book/1141/010022.gt.txt\n", "book/1141/010022.bin.png\n", "book/1011/\n", "book/1011/010053.gt.txt\n", "book/1011/010053.bin.png\n", "book/1442/\n", "book/1442/010117.gt.txt\n", "book/1442/010117.bin.png\n", "book/0380/\n", "book/0380/010074.bin.png\n", "book/0380/010074.gt.txt\n", "book/0661/\n", "book/0661/010061.gt.txt\n", "book/0661/010061.bin.png\n", "book/0823/\n", "book/0823/010034.gt.txt\n", "book/0823/010034.bin.png\n", "book/1422/\n", "book/1422/010016.gt.txt\n", "book/1422/010016.bin.png\n", "book/0798/\n", "book/0798/010045.gt.txt\n", "book/0798/010045.bin.png\n", "book/0896/\n", "book/0896/010059.bin.png\n", "book/0896/010059.gt.txt\n", "book/1317/\n", "book/1317/010023.gt.txt\n", "book/1317/010023.bin.png\n", "book/0097/\n", "book/0097/010042.bin.png\n", "book/0097/010042.gt.txt\n", "book/1080/\n", "book/1080/010090.gt.txt\n", "book/1080/010090.bin.png\n", "book/1140/\n", "book/1140/010018.gt.txt\n", "book/1140/010018.bin.png\n", "book/0187/\n", "book/0187/010046.bin.png\n", "book/0187/010046.gt.txt\n", "book/1263/\n", "book/1263/010022.gt.txt\n", "book/1263/010018.gt.txt\n", "book/1263/010022.bin.png\n", "book/1263/010018.bin.png\n", "book/1483/\n", "book/1483/010002.gt.txt\n", "book/1483/010002.bin.png\n", "book/0854/\n", "book/0854/010034.gt.txt\n", "book/0854/010034.bin.png\n", "book/1244/\n", "book/1244/010003.gt.txt\n", "book/1244/010003.bin.png\n", "book/0830/\n", "book/0830/010130.gt.txt\n", "book/0830/010130.bin.png\n", "book/1524/\n", "book/1524/010099.gt.txt\n", "book/1524/010099.bin.png\n", "book/0200/\n", "book/0200/010007.gt.txt\n", "book/0200/010007.bin.png\n", "book/0934/\n", "book/0934/010081.bin.png\n", "book/0934/010084.gt.txt\n", "book/0934/010081.gt.txt\n", "book/0934/010084.bin.png\n", "book/1569/\n", "book/1569/010076.bin.png\n", "book/1569/010076.gt.txt\n", "book/1160/\n", "book/1160/010026.gt.txt\n", "book/1160/010026.bin.png\n", "book/1199/\n", "book/1199/010084.gt.txt\n", "book/1199/010084.bin.png\n", "book/1458/\n", "book/1458/010008.bin.png\n", "book/1458/010008.gt.txt\n", "book/1557/\n", "book/1557/010034.gt.txt\n", "book/1557/010034.bin.png\n", "book/1124/\n", "book/1124/010041.bin.png\n", "book/1124/010041.gt.txt\n", "book/0192/\n", "book/0192/010012.bin.png\n", "book/0192/010012.gt.txt\n", "book/0129/\n", "book/0129/010016.gt.txt\n", "book/0129/010016.bin.png\n", "book/0635/\n", "book/0635/010021.bin.png\n", "book/0635/010021.gt.txt\n", "book/1449/\n", "book/1449/010014.gt.txt\n", "book/1449/010014.bin.png\n", "book/1449/010075.gt.txt\n", "book/1449/010051.gt.txt\n", "book/1449/010051.bin.png\n", "book/1449/010075.bin.png\n", "book/1425/\n", "book/1425/010035.gt.txt\n", "book/1425/010035.bin.png\n", "book/1269/\n", "book/1269/010064.gt.txt\n", "book/1269/010064.bin.png\n", "book/1075/\n", "book/1075/010005.bin.png\n", "book/1075/010005.gt.txt\n", "book/1337/\n", "book/1337/010044.bin.png\n", "book/1337/010044.gt.txt\n", "book/0643/\n", "book/0643/010024.gt.txt\n", "book/0643/010024.bin.png\n", "book/0255/\n", "book/0255/010031.gt.txt\n", "book/0255/010031.bin.png\n", "book/0623/\n", "book/0623/010038.bin.png\n", "book/0623/010038.gt.txt\n", "book/1206/\n", "book/1206/010018.gt.txt\n", "book/1206/010018.bin.png\n", "book/1069/\n", "book/1069/010107.bin.png\n", "book/1069/010008.bin.png\n", "book/1069/010107.gt.txt\n", "book/1069/010008.gt.txt\n", "book/1548/\n", "book/1548/010046.bin.png\n", "book/1548/010046.gt.txt\n", "book/1568/\n", "book/1568/010075.gt.txt\n", "book/1568/010075.bin.png\n", "book/0496/\n", "book/0496/010012.bin.png\n", "book/0496/010012.gt.txt\n", "book/0513/\n", "book/0513/010064.gt.txt\n", "book/0513/010064.bin.png\n", "book/0878/\n", "book/0878/010086.bin.png\n", "book/0878/010086.gt.txt\n", "book/0173/\n", "book/0173/010009.gt.txt\n", "book/0173/010015.gt.txt\n", "book/0173/010009.bin.png\n", "book/0173/010015.bin.png\n", "book/1471/\n", "book/1471/010085.bin.png\n", "book/1471/010085.gt.txt\n", "book/0796/\n", "book/0796/010041.bin.png\n", "book/0796/010043.bin.png\n", "book/0796/010041.gt.txt\n", "book/0796/010043.gt.txt\n", "book/0859/\n", "book/0859/010037.bin.png\n", "book/0859/010037.gt.txt\n", "book/1482/\n", "book/1482/010042.bin.png\n", "book/1482/010042.gt.txt\n", "book/0955/\n", "book/0955/010055.bin.png\n", "book/0955/010055.gt.txt\n", "book/1219/\n", "book/1219/010024.gt.txt\n", "book/1219/010024.bin.png\n", "book/0412/\n", "book/0412/010052.bin.png\n", "book/0412/010052.gt.txt\n", "book/1380/\n", "book/1380/010003.gt.txt\n", "book/1380/010086.bin.png\n", "book/1380/010003.bin.png\n", "book/1380/010086.gt.txt\n", "book/1107/\n", "book/1107/010018.gt.txt\n", "book/1107/010018.bin.png\n", "book/0194/\n", "book/0194/010024.gt.txt\n", "book/0194/010024.bin.png\n", "book/0587/\n", "book/0587/010001.bin.png\n", "book/0587/010001.gt.txt\n", "book/0587/010023.gt.txt\n", "book/0587/010023.bin.png\n", "book/0634/\n", "book/0634/010085.bin.png\n", "book/0634/010085.gt.txt\n", "book/0528/\n", "book/0528/010015.gt.txt\n", "book/0528/010015.bin.png\n", "book/0479/\n", "book/0479/010101.bin.png\n", "book/0479/010101.gt.txt\n", "book/0175/\n", "book/0175/010047.gt.txt\n", "book/0175/010047.bin.png\n", "book/0852/\n", "book/0852/010006.gt.txt\n", "book/0852/010006.bin.png\n", "book/0937/\n", "book/0937/010001.bin.png\n", "book/0937/010001.gt.txt\n", "book/0651/\n", "book/0651/010072.gt.txt\n", "book/0651/010072.bin.png\n", "book/0651/010054.gt.txt\n", "book/0651/010054.bin.png\n", "book/0377/\n", "book/0377/010039.gt.txt\n", "book/0377/010047.gt.txt\n", "book/0377/010044.bin.png\n", "book/0377/010047.bin.png\n", "book/0377/010044.gt.txt\n", "book/0377/010039.bin.png\n", "book/0956/\n", "book/0956/010023.gt.txt\n", "book/0956/010023.bin.png\n", "book/1519/\n", "book/1519/010046.bin.png\n", "book/1519/010046.gt.txt\n", "book/1260/\n", "book/1260/010002.gt.txt\n", "book/1260/010002.bin.png\n", "book/0812/\n", "book/0812/010008.bin.png\n", "book/0812/010008.gt.txt\n", "book/0638/\n", "book/0638/010025.gt.txt\n", "book/0638/010025.bin.png\n", "book/1130/\n", "book/1130/010061.gt.txt\n", "book/1130/010061.bin.png\n", "book/0240/\n", "book/0240/010002.gt.txt\n", "book/0240/010027.bin.png\n", "book/0240/010027.gt.txt\n", "book/0240/010002.bin.png\n", "book/1508/\n", "book/1508/010033.gt.txt\n", "book/1508/010033.bin.png\n", "book/0280/\n", "book/0280/010022.gt.txt\n", "book/0280/010022.bin.png\n", "book/1126/\n", "book/1126/010078.bin.png\n", "book/1126/010078.gt.txt\n", "book/0633/\n", "book/0633/010014.gt.txt\n", "book/0633/010014.bin.png\n", "book/0570/\n", "book/0570/010008.bin.png\n", "book/0570/010008.gt.txt\n", "book/0565/\n", "book/0565/010001.bin.png\n", "book/0565/010001.gt.txt\n", "book/0814/\n", "book/0814/010004.gt.txt\n", "book/0814/010004.bin.png\n", "book/1435/\n", "book/1435/010035.gt.txt\n", "book/1435/010035.bin.png\n", "book/1318/\n", "book/1318/010031.gt.txt\n", "book/1318/010031.bin.png\n", "book/1392/\n", "book/1392/010041.bin.png\n", "book/1392/010041.gt.txt\n", "book/0142/\n", "book/0142/010031.gt.txt\n", "book/0142/010031.bin.png\n", "book/0188/\n", "book/0188/010059.bin.png\n", "book/0188/010059.gt.txt\n", "book/0414/\n", "book/0414/010068.gt.txt\n", "book/0414/010068.bin.png\n", "book/0186/\n", "book/0186/010028.bin.png\n", "book/0186/010028.gt.txt\n", "book/0272/\n", "book/0272/010025.gt.txt\n", "book/0272/010008.bin.png\n", "book/0272/010025.bin.png\n", "book/0272/010008.gt.txt\n", "book/1132/\n", "book/1132/010007.gt.txt\n", "book/1132/010007.bin.png\n", "book/1132/010098.bin.png\n", "book/1132/010098.gt.txt\n", "book/1132/010040.gt.txt\n", "book/1132/010040.bin.png\n", "book/1158/\n", "book/1158/010035.gt.txt\n", "book/1158/010047.gt.txt\n", "book/1158/010015.gt.txt\n", "book/1158/010081.bin.png\n", "book/1158/010047.bin.png\n", "book/1158/010015.bin.png\n", "book/1158/010081.gt.txt\n", "book/1158/010035.bin.png\n", "book/1071/\n", "book/1071/010112.gt.txt\n", "book/1071/010031.gt.txt\n", "book/1071/010031.bin.png\n", "book/1071/010061.gt.txt\n", "book/1071/010061.bin.png\n", "book/1071/010112.bin.png\n", "book/0407/\n", "book/0407/010046.bin.png\n", "book/0407/010046.gt.txt\n", "book/0413/\n", "book/0413/010076.bin.png\n", "book/0413/010009.gt.txt\n", "book/0413/010009.bin.png\n", "book/0413/010076.gt.txt\n", "book/0306/\n", "book/0306/010003.gt.txt\n", "book/0306/010003.bin.png\n", "book/1138/\n", "book/1138/010077.gt.txt\n", "book/1138/010077.bin.png\n", "book/0241/\n", "book/0241/010015.gt.txt\n", "book/0241/010015.bin.png\n", "book/0603/\n", "book/0603/010051.gt.txt\n", "book/0603/010031.gt.txt\n", "book/0603/010031.bin.png\n", "book/0603/010018.gt.txt\n", "book/0603/010051.bin.png\n", "book/0603/010018.bin.png\n", "book/0160/\n", "book/0160/010036.bin.png\n", "book/0160/010036.gt.txt\n", "book/1502/\n", "book/1502/010057.bin.png\n", "book/1502/010044.bin.png\n", "book/1502/010044.gt.txt\n", "book/1502/010057.gt.txt\n", "book/1406/\n", "book/1406/010041.bin.png\n", "book/1406/010041.gt.txt\n", "book/1108/\n", "book/1108/010026.gt.txt\n", "book/1108/010026.bin.png\n", "book/0508/\n", "book/0508/010056.bin.png\n", "book/0508/010056.gt.txt\n", "book/0195/\n", "book/0195/010019.bin.png\n", "book/0195/010019.gt.txt\n", "book/1357/\n", "book/1357/010001.bin.png\n", "book/1357/010001.gt.txt\n", "book/0316/\n", "book/0316/010063.bin.png\n", "book/0316/010063.gt.txt\n", "book/0488/\n", "book/0488/010006.gt.txt\n", "book/0488/010006.bin.png\n", "book/0137/\n", "book/0137/010045.gt.txt\n", "book/0137/010045.bin.png\n", "book/1591/\n", "book/1591/010070.bin.png\n", "book/1591/010070.gt.txt\n", "book/1142/\n", "book/1142/010118.gt.txt\n", "book/1142/010118.bin.png\n", "book/1452/\n", "book/1452/010022.gt.txt\n", "book/1452/010022.bin.png\n", "book/0211/\n", "book/0211/010012.bin.png\n", "book/0211/010012.gt.txt\n", "book/0575/\n", "book/0575/010019.bin.png\n", "book/0575/010019.gt.txt\n", "book/1335/\n", "book/1335/010027.bin.png\n", "book/1335/010027.gt.txt\n", "book/0855/\n", "book/0855/010082.bin.png\n", "book/0855/010082.gt.txt\n", "book/0924/\n", "book/0924/010019.bin.png\n", "book/0924/010019.gt.txt\n", "book/0640/\n", "book/0640/010018.gt.txt\n", "book/0640/010018.bin.png\n", "book/1113/\n", "book/1113/010043.bin.png\n", "book/1113/010043.gt.txt\n", "book/0189/\n", "book/0189/010096.bin.png\n", "book/0189/010096.gt.txt\n", "book/1272/\n", "book/1272/010076.bin.png\n", "book/1272/010077.gt.txt\n", "book/1272/010077.bin.png\n", "book/1272/010076.gt.txt\n", "book/0834/\n", "book/0834/010025.gt.txt\n", "book/0834/010023.gt.txt\n", "book/0834/010025.bin.png\n", "book/0834/010023.bin.png\n", "book/0385/\n", "book/0385/010009.gt.txt\n", "book/0385/010009.bin.png\n", "book/0456/\n", "book/0456/010058.gt.txt\n", "book/0456/010058.bin.png\n", "book/1229/\n", "book/1229/010003.gt.txt\n", "book/1229/010056.bin.png\n", "book/1229/010003.bin.png\n", "book/1229/010056.gt.txt\n", "book/1152/\n", "book/1152/010058.gt.txt\n", "book/1152/010056.bin.png\n", "book/1152/010060.bin.png\n", "book/1152/010060.gt.txt\n", "book/1152/010056.gt.txt\n", "book/1152/010058.bin.png\n", "book/1469/\n", "book/1469/010006.gt.txt\n", "book/1469/010008.bin.png\n", "book/1469/010006.bin.png\n", "book/1469/010008.gt.txt\n", "book/1262/\n", "book/1262/010065.gt.txt\n", "book/1262/010065.bin.png\n", "book/1320/\n", "book/1320/010046.bin.png\n", "book/1320/010046.gt.txt\n", "book/0626/\n", "book/0626/010062.bin.png\n", "book/0626/010062.gt.txt\n", "book/0383/\n", "book/0383/010001.bin.png\n", "book/0383/010001.gt.txt\n", "book/0628/\n", "book/0628/010013.gt.txt\n", "book/0628/010013.bin.png\n", "book/1143/\n", "book/1143/010021.bin.png\n", "book/1143/010021.gt.txt\n", "book/1459/\n", "book/1459/010040.gt.txt\n", "book/1459/010040.bin.png\n", "book/0809/\n", "book/0809/010039.gt.txt\n", "book/0809/010039.bin.png\n", "book/0489/\n", "book/0489/010028.bin.png\n", "book/0489/010028.gt.txt\n", "book/1059/\n", "book/1059/010098.bin.png\n", "book/1059/010098.gt.txt\n", "book/1531/\n", "book/1531/010022.gt.txt\n", "book/1531/010022.bin.png\n", "book/0892/\n", "book/0892/010031.gt.txt\n", "book/0892/010031.bin.png\n", "book/1026/\n", "book/1026/010010.bin.png\n", "book/1026/010010.gt.txt\n", "book/1534/\n", "book/1534/010061.gt.txt\n", "book/1534/010061.bin.png\n", "book/1466/\n", "book/1466/010046.bin.png\n", "book/1466/010046.gt.txt\n", "book/0121/\n", "book/0121/010017.gt.txt\n", "book/0121/010017.bin.png\n", "book/1061/\n", "book/1061/010055.bin.png\n", "book/1061/010055.gt.txt\n", "book/0034/\n", "book/0034/010034.gt.txt\n", "book/0034/010034.bin.png\n", "book/1546/\n", "book/1546/010031.gt.txt\n", "book/1546/010031.bin.png\n", "book/0974/\n", "book/0974/010042.bin.png\n", "book/0974/010042.gt.txt\n", "book/1375/\n", "book/1375/010007.gt.txt\n", "book/1375/010007.bin.png\n", "book/0159/\n", "book/0159/010007.gt.txt\n", "book/0159/010007.bin.png\n", "book/1490/\n", "book/1490/010065.gt.txt\n", "book/1490/010065.bin.png\n", "book/1003/\n", "book/1003/010049.bin.png\n", "book/1003/010049.gt.txt\n", "book/1171/\n", "book/1171/010046.bin.png\n", "book/1171/010046.gt.txt\n", "book/1432/\n", "book/1432/010005.bin.png\n", "book/1432/010005.gt.txt\n", "book/1077/\n", "book/1077/010059.bin.png\n", "book/1077/010059.gt.txt\n", "book/1282/\n", "book/1282/010021.bin.png\n", "book/1282/010021.gt.txt\n", "book/1051/\n", "book/1051/010039.gt.txt\n", "book/1051/010039.bin.png\n", "book/0293/\n", "book/0293/010123.gt.txt\n", "book/0293/010123.bin.png\n", "book/0523/\n", "book/0523/010068.gt.txt\n", "book/0523/010068.bin.png\n", "book/1528/\n", "book/1528/010010.bin.png\n", "book/1528/010010.gt.txt\n", "book/0853/\n", "book/0853/010092.bin.png\n", "book/0853/010092.gt.txt\n", "book/1389/\n", "book/1389/010003.gt.txt\n", "book/1389/010003.bin.png\n", "book/1387/\n", "book/1387/010035.gt.txt\n", "book/1387/010064.gt.txt\n", "book/1387/010064.bin.png\n", "book/1387/010035.bin.png\n", "book/1078/\n", "book/1078/010076.bin.png\n", "book/1078/010076.gt.txt\n", "book/1554/\n", "book/1554/010025.gt.txt\n", "book/1554/010025.bin.png\n", "book/1164/\n", "book/1164/010063.bin.png\n", "book/1164/010063.gt.txt\n", "book/1164/010084.gt.txt\n", "book/1164/010084.bin.png\n", "book/0313/\n", "book/0313/010021.bin.png\n", "book/0313/010021.gt.txt\n", "book/1314/\n", "book/1314/010062.bin.png\n", "book/1314/010062.gt.txt\n", "book/0106/\n", "book/0106/010002.gt.txt\n", "book/0106/010002.bin.png\n", "book/1271/\n", "book/1271/010003.gt.txt\n", "book/1271/010003.bin.png\n", "book/1478/\n", "book/1478/010047.gt.txt\n", "book/1478/010047.bin.png\n", "book/0022/\n", "book/0022/010005.bin.png\n", "book/0022/010005.gt.txt\n", "book/1183/\n", "book/1183/010072.gt.txt\n", "book/1183/010072.bin.png\n", "book/1547/\n", "book/1547/010006.gt.txt\n", "book/1547/010052.bin.png\n", "book/1547/010006.bin.png\n", "book/1547/010052.gt.txt\n", "book/1386/\n", "book/1386/010032.bin.png\n", "book/1386/010032.gt.txt\n", "book/1489/\n", "book/1489/010089.bin.png\n", "book/1489/010089.gt.txt\n", "book/1489/010091.gt.txt\n", "book/1489/010091.bin.png\n", "book/1193/\n", "book/1193/010046.bin.png\n", "book/1193/010046.gt.txt\n", "book/0999/\n", "book/0999/010003.gt.txt\n", "book/0999/010047.gt.txt\n", "book/0999/010003.bin.png\n", "book/0999/010034.gt.txt\n", "book/0999/010034.bin.png\n", "book/0999/010047.bin.png\n", "book/0999/010022.gt.txt\n", "book/0999/010050.bin.png\n", "book/0999/010022.bin.png\n", "book/0999/010050.gt.txt\n", "book/1319/\n", "book/1319/010058.gt.txt\n", "book/1319/010058.bin.png\n", "book/0062/\n", "book/0062/010033.gt.txt\n", "book/0062/010033.bin.png\n", "book/0525/\n", "book/0525/010030.bin.png\n", "book/0525/010030.gt.txt\n", "book/1237/\n", "book/1237/010059.bin.png\n", "book/1237/010059.gt.txt\n", "book/1453/\n", "book/1453/010018.gt.txt\n", "book/1453/010018.bin.png\n", "book/0417/\n", "book/0417/010033.gt.txt\n", "book/0417/010056.bin.png\n", "book/0417/010056.gt.txt\n", "book/0417/010033.bin.png\n", "book/0408/\n", "book/0408/010003.gt.txt\n", "book/0408/010047.gt.txt\n", "book/0408/010003.bin.png\n", "book/0408/010049.bin.png\n", "book/0408/010047.bin.png\n", "book/0408/010049.gt.txt\n", "book/0026/\n", "book/0026/010010.bin.png\n", "book/0026/010010.gt.txt\n", "book/1175/\n", "book/1175/010006.gt.txt\n", "book/1175/010006.bin.png\n", "book/0941/\n", "book/0941/010039.gt.txt\n", "book/0941/010039.bin.png\n", "book/0518/\n", "book/0518/010044.bin.png\n", "book/0518/010044.gt.txt\n", "book/0370/\n", "book/0370/010099.gt.txt\n", "book/0370/010099.bin.png\n", "book/0648/\n", "book/0648/010028.bin.png\n", "book/0648/010028.gt.txt\n", "book/0140/\n", "book/0140/010095.gt.txt\n", "book/0140/010095.bin.png\n", "book/1295/\n", "book/1295/010009.gt.txt\n", "book/1295/010009.bin.png\n", "book/0755/\n", "book/0755/010030.bin.png\n", "book/0755/010030.gt.txt\n", "book/0416/\n", "book/0416/010055.bin.png\n", "book/0416/010055.gt.txt\n", "book/0601/\n", "book/0601/010068.gt.txt\n", "book/0601/010068.bin.png\n", "book/0104/\n", "book/0104/010009.gt.txt\n", "book/0104/010009.bin.png\n", "book/1507/\n", "book/1507/010042.bin.png\n", "book/1507/010042.gt.txt\n", "book/1083/\n", "book/1083/010056.bin.png\n", "book/1083/010056.gt.txt\n", "book/1116/\n", "book/1116/010030.bin.png\n", "book/1116/010089.bin.png\n", "book/1116/010089.gt.txt\n", "book/1116/010030.gt.txt\n", "book/1137/\n", "book/1137/010010.bin.png\n", "book/1137/010023.gt.txt\n", "book/1137/010023.bin.png\n", "book/1137/010010.gt.txt\n", "book/0868/\n", "book/0868/010037.bin.png\n", "book/0868/010037.gt.txt\n", "book/0868/010028.bin.png\n", "book/0868/010028.gt.txt\n", "book/0033/\n", "book/0033/010060.bin.png\n", "book/0033/010060.gt.txt\n", "book/0328/\n", "book/0328/010050.bin.png\n", "book/0328/010050.gt.txt\n", "book/1155/\n", "book/1155/010069.gt.txt\n", "book/1155/010069.bin.png\n", "book/0605/\n", "book/0605/010026.gt.txt\n", "book/0605/010026.bin.png\n", "book/0526/\n", "book/0526/010045.gt.txt\n", "book/0526/010045.bin.png\n", "book/0706/\n", "book/0706/010004.gt.txt\n", "book/0706/010004.bin.png\n", "book/1064/\n", "book/1064/010095.gt.txt\n", "book/1064/010095.bin.png\n", "book/0917/\n", "book/0917/010054.gt.txt\n", "book/0917/010054.bin.png\n", "book/1348/\n", "book/1348/010024.gt.txt\n", "book/1348/010024.bin.png\n", "book/0317/\n", "book/0317/010093.gt.txt\n", "book/0317/010133.bin.png\n", "book/0317/010133.gt.txt\n", "book/0317/010093.bin.png\n", "book/0317/010042.bin.png\n", "book/0317/010042.gt.txt\n", "book/0591/\n", "book/0591/010099.gt.txt\n", "book/0591/010099.bin.png\n", "book/1036/\n", "book/1036/010021.bin.png\n", "book/1036/010021.gt.txt\n", "book/1539/\n", "book/1539/010002.gt.txt\n", "book/1539/010002.bin.png\n", "book/0658/\n", "book/0658/010002.gt.txt\n", "book/0658/010002.bin.png\n", "book/1133/\n", "book/1133/010066.bin.png\n", "book/1133/010066.gt.txt\n", "book/1040/\n", "book/1040/010056.bin.png\n", "book/1040/010056.gt.txt\n", "book/1253/\n", "book/1253/010027.bin.png\n", "book/1253/010027.gt.txt\n", "book/0415/\n", "book/0415/010035.gt.txt\n", "book/0415/010033.gt.txt\n", "book/0415/010057.bin.png\n", "book/0415/010057.gt.txt\n", "book/0415/010035.bin.png\n", "book/0415/010033.bin.png\n", "book/0749/\n", "book/0749/010030.bin.png\n", "book/0749/010030.gt.txt\n", "book/1208/\n", "book/1208/010032.bin.png\n", "book/1208/010032.gt.txt\n", "book/1208/010017.gt.txt\n", "book/1208/010017.bin.png\n", "book/0691/\n", "book/0691/010073.bin.png\n", "book/0691/010073.gt.txt\n", "book/1243/\n", "book/1243/010056.bin.png\n", "book/1243/010056.gt.txt\n", "book/1430/\n", "book/1430/010025.gt.txt\n", "book/1430/010025.bin.png\n", "book/0172/\n", "book/0172/010072.gt.txt\n", "book/0172/010038.bin.png\n", "book/0172/010038.gt.txt\n", "book/0172/010072.bin.png\n", "book/1362/\n", "book/1362/010116.bin.png\n", "book/1362/010116.gt.txt\n", "book/1362/010055.bin.png\n", "book/1362/010055.gt.txt\n", "book/0184/\n", "book/0184/010002.gt.txt\n", "book/0184/010002.bin.png\n", "book/1363/\n", "book/1363/010007.gt.txt\n", "book/1363/010007.bin.png\n", "book/1418/\n", "book/1418/010025.gt.txt\n", "book/1418/010052.bin.png\n", "book/1418/010045.gt.txt\n", "book/1418/010025.bin.png\n", "book/1418/010045.bin.png\n", "book/1418/010052.gt.txt\n", "book/0043/\n", "book/0043/010006.gt.txt\n", "book/0043/010006.bin.png\n", "book/1170/\n", "book/1170/010108.gt.txt\n", "book/1170/010108.bin.png\n", "book/1168/\n", "book/1168/010023.gt.txt\n", "book/1168/010023.bin.png\n", "book/0174/\n", "book/0174/010060.bin.png\n", "book/0174/010060.gt.txt\n", "book/0015/\n", "book/0015/010023.gt.txt\n", "book/0015/010023.bin.png\n", "book/1070/\n", "book/1070/010027.bin.png\n", "book/1070/010027.gt.txt\n", "book/0607/\n", "book/0607/010008.bin.png\n", "book/0607/010008.gt.txt\n", "book/1367/\n", "book/1367/010084.gt.txt\n", "book/1367/010084.bin.png\n", "book/0939/\n", "book/0939/010086.bin.png\n", "book/0939/010086.gt.txt\n", "book/1097/\n", "book/1097/010025.gt.txt\n", "book/1097/010025.bin.png\n", "book/0038/\n", "book/0038/010048.gt.txt\n", "book/0038/010048.bin.png\n", "book/0785/\n", "book/0785/010022.gt.txt\n", "book/0785/010022.bin.png\n", "book/1015/\n", "book/1015/010015.gt.txt\n", "book/1015/010015.bin.png\n", "book/0257/\n", "book/0257/010001.bin.png\n", "book/0257/010001.gt.txt\n", "book/0981/\n", "book/0981/010043.bin.png\n", "book/0981/010043.gt.txt\n" ] } ], "source": [ "!test -d book || tar -zxvf uw3-500.tgz" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "The UW3-500 dataset is a collection of text line images and corresponding ground truth transcription. It's organized as a directory tree of the form `book//.bin.png` etc." ] }, { "cell_type": "code", "execution_count": 4, "metadata": { "collapsed": false }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "book/0005/010001.bin.png book/0005/010001.gt.txt\r\n" ] } ], "source": [ "!ls book/0005/010001.*" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Let's now run `clstmctc` training. We report every 100 training steps. Since we didn't dewarp or size-normalize the lines, we need to use a `dewarp=center` argument to training." ] }, { "cell_type": "code", "execution_count": 25, "metadata": { "collapsed": false }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "params hg_version e8159ab3bd6d+ lrate 0.0001 nhidden 100 nhidden2 -1 pseudo_batch 1 momentum 0.9 type BIDILSTM LSTM SoftmaxLayer\n", "center_normalizer 4 1 0.3\n", "dataset 500 48 center\n", "Segmentation fault (core dumped)\n" ] } ], "source": [ "!dewarp=center report_every=500 save_name=test save_every=10000 ntrain=11000 ../clstmctc uw3-500.h5" ] }, { "cell_type": "code", "execution_count": 5, "metadata": { "collapsed": false }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ " 100 uw3-test.files\r\n", " 400 uw3-train.files\r\n", " 500 uw3.files\r\n", " 1000 total\r\n" ] } ], "source": [ "!ls book/*/*.bin.png | sort -r > uw3.files\n", "!sed 100q uw3.files > uw3-test.files\n", "!sed 1,100d uw3.files > uw3-train.files\n", "!wc -l uw3*.files" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "collapsed": false }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "#: seed = 0\n", "got 400 files, 100 tests\n", "got 83 classes\n", "#: nhidden = 100\n", "#: momentum = 0.9\n", "#: rate = 0.0001\n", ".stacked: 0.0001 0.9 in 0 48 out 0 83\n", ".stacked.parallel: 0.0001 0.9 in 0 48 out 0 200\n", ".stacked.parallel.lstm: 0.0001 0.9 in 0 48 out 0 100\n", ".stacked.parallel.reversed: 0.0001 0.9 in 0 48 out 0 100\n", ".stacked.parallel.reversed.lstm: 0.0001 0.9 in 0 48 out 0 100\n", ".stacked.softmax: 0.0001 0.9 in 0 200 out 0 83\n", "#: maxtrain = 50000\n", "#: save_every = 1000\n", "#: save_name = uw3small\n", "#: report_every = 100\n", "#: display = 0\n", "#: test_every = 1000\n", "0\n", "TRU Table 3. Percent responses to questions ask-\n", "ALN \n", "OUT \n", "100\n", "TRU that only those neighbors are taken into account which do not\n", "ALN ttatt eeeeer e nn n o ooot\n", "OUT \n", "200\n", "TRU rived it, and the equation follows:\n", "ALN rived it,, a d h eeeeeqqaiioon flllwss:::\n", "OUT \n", "300\n", "TRU illustrate typical scene content. In order to obtain\n", "ALN illstraatetyyypical scene cconnttnI. Innorder to obtain\n", "OUT \n", "400\n", "TRU Table 3. Percent responses to questions ask-\n", "ALN ale 3Percent resonsssess oqquestsnssk-\n", "OUT eeeeeeeeeeee eeeee \n", "500\n", "TRU sition between pre-assembly relations and post-assembly\n", "ALN sition betweenpre-ssemby rellatiions aand ppost-asssemlyy\n", "OUT a a a a a a \n", "600\n", "TRU increasing commitment to im-\n", "ALN inncreasing cooommmitmmeent to imm-\n", "OUT eeeeee eeeeeeeeeee eeeeee eee\n", "700\n", "TRU gineering from ISU in 1984. From 1977 to 1982, he was a\n", "ALN gineeering froom IU iin 19884. From197 to718, hehe wass a\n", "OUT aoees ao aeoao aa e a oao oe a aesao\n", "800\n", "TRU Fig. 2. (a) The energy spectrum of the neutron pulses emerging from any of the LANSCE moderators is very broad. Details of the spectrum vary\n", "ALN i. 2. The energyysspesscctrruuum of thee neeeutrroon ppulsses emmerginngg frooom aany of thehe LNCE deratortrs iss eryyy broad. Dttaiss of thee sspeectrum vary\n", "OUT a aaa aaeaaaaaeaaaaaaaaaeai aaaeaaaaaaaaeaalaeaaaaaiaaa aaaaaaeaaaaeai aa ata eaaaaaaaaa eeaa aaaai eaaaaaeai aaeaaaaaaaaaeaaa\n", "900\n", "TRU 306\n", "ALN 3066\n", "OUT ae\n", "ERROR 1000 0.62998 2862 4543\n", "1000\n", "TRU 2. The major additional assumption at this well site is that there are both\n", "ALN 2. The majoor adddditionnnal asssuumptionnn at this well ste i that there are boothh\n", "OUT toe oo eoe aelot eoooot oeeeoaoot oan ot ti et i e not teee oee oeottn \n", "1100\n", "TRU Our work to date has been focused on an initial set\n", "ALN Ouuur worrk to date has been foocused on ann initial et\n", "OUT ttnn oeoat oo dnte n ee ooeeneed on inn nnitiil et\n", "1200\n", "TRU 2755\n", "ALN 2755\n", "OUT sa\n", "1300\n", "TRU 306\n", "ALN 306\n", "OUT t\n", "1400\n", "TRU instead of a series over time, to obtain\n", "ALN innsteaaad of a eries over immme, to obtainn\n", "OUT innslesned l n eries oer innne eo oltsninn\n", "1500\n", "TRU is one in which the neural activity is generally in\n", "ALN is one in which te neural activity is generally nn\n", "OUT is orne rn etnietn he ernral aetieity s ernerally rn\n", "1600\n", "TRU at the creative aspects of design and at the intricate\n", "ALN at he reative aspects of design and at heintricate\n", "OUT at he neatipe aseots l dessn aod at heintitate\n", "1700\n", "TRU page).\n", "ALN pagage)\n", "OUT ge \n", "1800\n", "TRU increasing commitment to im-\n", "ALN increasing commitment o im-\n", "OUT ircreasitug oruritnert oir-\n", "1900\n", "TRU The data for N and H were obtained from the\n", "ALN The data for N and H were obtained fromm he\n", "OUT The data or y anod f yere htaineod ronn he\n", "ERROR 2000 0.287255 1305 4543\n", "2000\n", "TRU mother to the benefits of differen-\n", "ALN mother to the benefits of differen-\n", "OUT rother o he oreflts f llffrer-\n", "2100\n", "TRU Brittonia, 45(1), 1993, pp. 28-31.\n", "ALN BBirtonia, 45(1), 1993, pp. 28-31\n", "OUT i2errroeina wlol yyp, oo, f-Sl\n", "2200\n", "TRU unit quaternions).\n", "ALN unit quatenions)\n", "OUT unit puatenions\n", "2300\n", "TRU Policymakers seek data to monitor\n", "ALN PPolicymakers seek data o monitor\n", "OUT T2olicynaiersseei data o nonitor\n", "2400\n", "TRU In recent years a number of papers have appeared in\n", "ALN In recent years a number ofpaapers have appeared in\n", "OUT lp ecen1 pevp w aumnher f oepers nave appearedin\n", "2500\n", "TRU processes of ripening and shedding in the\n", "ALN processes of ripening and shedding in tthe\n", "OUT processes of ipening and sheddingin hie\n", "2600\n", "TRU The slugs in the flow path and the source term at the repository are\n", "ALN The slugs in the flow path and the source term at the repository are\n", "OUT t2he slugsin he low path and the source ermn at he repositorg are\n", "2700\n", "TRU for the Mozambique Belt away from the Kenya\n", "ALN for the Mozambiqque Belt away from the Kenya\n", "OUT for he Moxambiouc Sclt away from the enya\n", "2800\n", "TRU from the image at the predicted point and performs cross-correlation\n", "ALN from the image at the predicted point and perfoms cross-correlation\n", "OUT fmnm theimage at the predicted oirt and perfomns moss-correlation\n", "2900\n", "TRU move a pointing device to the location of the\n", "ALN move a pointing device to the location of the\n", "OUT move a pointing device to thelocation of he\n", "ERROR 3000 0.125468 570 4543\n", "3000\n", "TRU burners.'' Further, a lighthouse could identify itself by ex-\n", "ALN burners.'' Further, a lighthouse could identify itself by ex-\n", "OUT burners urther, a ighthouse couldidentifyitself by er-\n", "3100\n", "TRU Chilopoda, and Araneae. Several pinned salamanders are also present.\n", "ALN Chilopoda, and Araneae. Several pinned salamanders are also present\n", "OUT Chilopoda. and hraneae. Several pinned salamanders are also present\n", "3200\n", "TRU 10%) relative to the much greater limitation on photo-\n", "ALN 10%%%) relative to the much greater limitation on photo-\n", "OUT 1 relative o the much greater limitation on photo-\n", "3300\n", "TRU IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, VOL. 8, NO. 5, OCTOBER 1992\n", "ALN IEEEEEE TTRANSACTIONS OON ROOBOTICS AND AUUTOMATION, OL. 8, NO. 5, OCTOBBERER 1992\n", "OUT 1ESESES TTRANSAcT1oNS oN RCEsCTEcSs AahNb AtTToMA1CN, o1, s, wwo. s. octoEERER wt2\n", "3400\n", "TRU burners.'' Further, a lighthouse could identify itself by ex-\n", "ALN burners.' Further, a lighthouse could identify itself by ex-\n", "OUT burners Further, a iighthouse could identify itself y ex-\n", "3500\n", "TRU lengths of the straight regions.\n", "ALN lengths of the straight regions\n", "OUT lengths af the straight regions\n", "3600\n", "TRU called the Navlab 2.\n", "ALN called the Navlab 2\n", "OUT called the Nawlab 2\n", "3700\n", "TRU tures for each frame. These features are dense, stable,\n", "ALN tures for each frame. These features are dense, stable,\n", "OUT tures for each frame, These features are dense, stalble\n", "3800\n", "TRU microstructures obtained under various cooling condi-\n", "ALN microstructures obtained under various cooling condi-\n", "OUT microstructures obtained under warious cooling condi-\n", "3900\n", "TRU for the Mozambique Belt away from the Kenya\n", "ALN for the Mozambique Belt away from the Kenya\n", "OUT for the Mozambique Belt away from the Fenya\n", "ERROR 4000 0.0664759 302 4543\n", "4000\n", "TRU its base (see Figure 1).\n", "ALN its base (see Figure 1).\n", "OUT its oase fsee iFigure i)\n", "4100\n", "TRU Three WM102 dual-frequency GPS receivers were used to track the GPS\n", "ALN Three WWM102 dual-frequency GPS receivers were used to track the GPS\n", "OUT Three MtO2 dual-freouency BPS receivers were used to track the 3PS\n", "4200\n", "TRU endless loop because in this strategy we use only the pick\n", "ALN endless loop because in this strategy we use only the pick\n", "OUT endless loop because in this strategy we use only the pick\n", "4300\n", "TRU [Tarabanis et al., 1991b]\n", "ALN [Tarabanis et al., 1991b]\n", "OUT TTarabanis ct ol., f91hl\n", "4400\n", "TRU 10%) relative to the much greater limitation on photo-\n", "ALN 10%%) relative to the much greater limitation on photo-\n", "OUT 1C9uo) relative to the much greater limitation on photo-\n", "4500\n", "TRU 6312 F. W. Bingham\n", "ALN 6312 F. W. Bingham\n", "OUT 6312 P. N. Bingham\n", "4600\n", "TRU Analysis and Machine Intelligence, 7(4):384-401,\n", "ALN Analysis and Machine Intelligence, 7(4):384-401,\n", "OUT Analgsis and Macbine Intelligence, TC4)i384-4D1\n", "4700\n", "TRU improves the fanout of the monitored signals. The phase-\n", "ALN improves the fanout of the monitored signals. The phase-\n", "OUT improves the fanout of the monitored signals. The phase-\n", "4800\n", "TRU \"A Model-Based Computer Vision System\n", "ALN \"\"A Model-Based Computer Vision System\n", "OUT T7A Mocdel-3ased Computer Mision System\n", "4900\n", "TRU to send the prepared poll to the selected advisers or\n", "ALN to send the prepared poll to the selected advisers or\n", "OUT to send the prepared poll to the selected advisers or\n", "ERROR 5000 0.0490865 223 4543\n", "5000\n", "TRU instead of a series over time, to obtain\n", "ALN instead of a series over time, to obtain\n", "OUT instead of a series over time, to obtain\n", "5100\n", "TRU processors.\n", "ALN processors.\n", "OUT processors\n", "5200\n", "TRU loosen adhesion at the zone cell interface\n", "ALN loosen adhesion at the zone cell interface\n", "OUT loosen adhesion at the 2one cell interface\n", "5300\n", "TRU page).\n", "ALN page)\n", "OUT page)\n", "5400\n", "TRU problems.\n", "ALN problems.\n", "OUT problems\n", "5500\n", "TRU [Tarabanis et al., 1991b]\n", "ALN [Tarabanis et al., 1991b]\n", "OUT TTarabanis et ol., 1991hl\n", "5600\n", "TRU 3.2 Spatio-Geometric and Optical Compu-\n", "ALN 3.2 Spatio-Geometric and Optical Compu-\n", "OUT 3.2 Spatio-MGeometric and Optical Compu-\n", "5700\n", "TRU Such a coarsening mechanism can either be due to a\n", "ALN Such a coarsening mechanism can either be due to a\n", "OUT Such a coarsening mechanism can either be due to a\n", "5800\n", "TRU 11b. A fault scarp on the east is formed after\n", "ALN 11b. A fault scarp on the east is formed after\n", "OUT 11b. A fault scarp on the east is formed after\n" ] } ], "source": [ "!params=1 save_name=uw3small save_every=1000 report_every=100 maxtrain=50000 test_every=1000 ../clstmocrtrain uw3-train.files uw3-test.files" ] }, { "cell_type": "code", "execution_count": null, "metadata": { "collapsed": true }, "outputs": [], "source": [] } ], "metadata": { "kernelspec": { "display_name": "Python 2", "language": "python", "name": "python2" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 2 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython2", "version": "2.7.6" } }, "nbformat": 4, "nbformat_minor": 0 }