{
"cells": [
{
"cell_type": "markdown",
"metadata": {},
"source": [
"\n",
" Text_Extensions_for_Pandas_Overview.ipynb:\n",
" Overview of the basic functionality and usage of Text Extensions for Pandas.
\n", "\n", "\n", "\n", " In\n", "\n", "\n", "\n", " AD\n", "\n", "\n", "\n", " 932\n", "\n", " , \n", "\n", " King\n", "\n", "\n", "\n", " Arthur\n", "\n", "\n", "\n", " and\n", "\n", "\n", "\n", " his\n", "\n", "\n", "\n", " squire\n", "\n", " , \n", "\n", " Patsy\n", "\n", " , \n", "\n", " travel\n", "\n", "\n", "\n", " throughout\n", "\n", "\n", "\n", " Britain\n", "\n", "\n", "\n", " searching\n", "\n", "\n", "\n", " for\n", "\n", "\n", "\n", " men\n", "\n", "\n", "\n", " to\n", "\n", "\n", "\n", " join\n", "\n", "\n", "\n", " the\n", "\n", "\n", "\n", " Knights\n", "\n", "\n", "\n", " of\n", "\n", "\n", "\n", " the\n", "\n", "\n", "\n", " Round\n", "\n", "\n", "\n", " Table\n", "\n", " . \n", "\n", " Along\n", "\n", "\n", "\n", " the\n", "\n", "\n", "\n", " way\n", "\n", " , \n", "\n", " he\n", "\n", "\n", "\n", " recruits\n", "\n", "\n", "\n", " Sir\n", "\n", "\n", "\n", " Bedevere\n", "\n", "\n", "\n", " the\n", "\n", "\n", "\n", " Wise\n", "\n", " , \n", "\n", " Sir\n", "\n", "\n", "\n", " Lancelot\n", "\n", "\n", "\n", " the\n", "\n", "\n", "\n", " Brave\n", "\n", " , \n", "\n", " Sir\n", "\n", "\n", "\n", " Galahad\n", "\n", "\n", "\n", " the\n", "\n", "\n", "\n", " Pure\n", "\n", " , \n", "\n", " Sir\n", "\n", "\n", "\n", " Robin\n", "\n", "\n", "\n", " the\n", "\n", "\n", "\n", " Not-Quite-So-Brave-as-Sir-Lancelot\n", "\n", " , \n", "\n", " and\n", "\n", "\n", "\n", " Sir\n", "\n", "\n", "\n", " Not-Appearing-in-this-Film\n", "\n", " , \n", "\n", " along\n", "\n", "\n", "\n", " with\n", "\n", "\n", "\n", " their\n", "\n", "\n", "\n", " squires\n", "\n", "\n", "\n", " and\n", "\n", "\n", "\n", " Robin's\n", "\n", "\n", "\n", " troubadours\n", " .\n", "
\n", "\n", "\n", " In AD 932, King Arthur and his squire, Patsy, travel throughout Britain searching for men to join the Knights of the Round Table. Along the way, he recruits Sir Bedevere the Wise, Sir Lancelot the Brave, Sir Galahad the Pure, \n", "\n", " Sir\n", "\n", "\n", "\n", " Robin\n", "\n", "\n", "\n", " the\n", "\n", "\n", "\n", " Not-Quite-So-Brave-as-Sir-Lancelot\n", " , and Sir Not-Appearing-in-this-Film, along with their squires and Robin's troubadours.\n", "
\n", "\n", "\n", " In AD 932, \n", "\n", " King Arthur\n", "\n", " and his squire, \n", "\n", " Patsy\n", "\n", " , travel throughout Britain searching for men to join the Knights of the Round Table. Along the way, he recruits \n", "\n", " Sir Bedevere the Wise\n", "\n", " , \n", "\n", " Sir Lancelot the Brave\n", "\n", " , \n", "\n", " Sir Galahad the Pure\n", "\n", " , \n", "\n", " Sir Robin the Not-Quite-So-Brave-as-Sir-Lancelot\n", "\n", " , and \n", "\n", " Sir Not-Appearing-in-this-Film\n", "\n", " , along with their squires and \n", "\n", " Robin's\n", " troubadours.\n", "
\n", "\n", "\n", "\n", "\n", " In\n", "\n", "\n", "\n", " AD\n", "\n", "\n", "\n", " 932\n", "\n", " , \n", "\n", " King\n", "\n", "\n", "\n", " Arthur\n", " and his squire, Patsy, travel throughout Britain searching for men to join the Knights of the Round Table. Along the way, he recruits Sir Bedevere the Wise, Sir Lancelot the Brave, Sir Galahad the Pure, Sir Robin the Not-Quite-So-Brave-as-Sir-Lancelot, and Sir Not-Appearing-in-this-Film, along with their squires and Robin's troubadours.\n", "
\n", "\n", "\n", " In AD 932, \n", "\n", " King Arthur\n", " and his squire, Patsy, travel throughout Britain searching for men to join the Knights of the Round Table. Along the way, he recruits Sir Bedevere the Wise, Sir Lancelot the Brave, Sir Galahad the Pure, Sir Robin the Not-Quite-So-Brave-as-Sir-Lancelot, and Sir Not-Appearing-in-this-Film, along with their squires and Robin's troubadours.\n", "
\n", "\n", "\n", "\n", "\n", " Second document\n", "\n", "
\n", "\n", " | match | \n", "
---|---|
0 | \n", "[157, 169): 'Sir Bedevere' | \n", "
1 | \n", "[180, 192): 'Sir Lancelot' | \n", "
2 | \n", "[204, 215): 'Sir Galahad' | \n", "
3 | \n", "[226, 235): 'Sir Robin' | \n", "
4 | \n", "[280, 310): 'Sir Not-Appearing-in-this-Film' | \n", "
\n", " | match | \n", "
---|---|
0 | \n", "[323, 328): 'their' | \n", "
0 | \n", "[98, 109): 'the Knights' | \n", "
1 | \n", "[113, 122): 'the Round' | \n", "
2 | \n", "[136, 143): 'the way' | \n", "
3 | \n", "[170, 178): 'the Wise' | \n", "
4 | \n", "[193, 202): 'the Brave' | \n", "
5 | \n", "[216, 224): 'the Pure' | \n", "
6 | \n", "[236, 274): 'the Not-Quite-So-Brave-as-Sir-Lan... | \n", "
\n", " | knight | \n", "virtue | \n", "
---|---|---|
0 | \n", "[157, 169): 'Sir Bedevere' | \n", "[170, 178): 'the Wise' | \n", "
1 | \n", "[180, 192): 'Sir Lancelot' | \n", "[193, 202): 'the Brave' | \n", "
2 | \n", "[204, 215): 'Sir Galahad' | \n", "[216, 224): 'the Pure' | \n", "
3 | \n", "[226, 235): 'Sir Robin' | \n", "[236, 274): 'the Not-Quite-So-Brave-as-Sir-Lan... | \n", "
\n", " | time | \n", "features | \n", "
---|---|---|
0 | \n", "2018-01-01 00:00:00 | \n", "[0, 1] | \n", "
1 | \n", "2018-01-01 01:00:00 | \n", "[2, 3] | \n", "
2 | \n", "2018-01-01 02:00:00 | \n", "[4, 5] | \n", "
3 | \n", "2018-01-01 03:00:00 | \n", "[6, 7] | \n", "
4 | \n", "2018-01-01 04:00:00 | \n", "[8, 9] | \n", "
\n", " | time | \n", "features | \n", "
---|---|---|
4 | \n", "2018-01-01 04:00:00 | \n", "[8, 9] | \n", "
3 | \n", "2018-01-01 03:00:00 | \n", "[6, 7] | \n", "
2 | \n", "2018-01-01 02:00:00 | \n", "[4, 5] | \n", "
1 | \n", "2018-01-01 01:00:00 | \n", "[2, 3] | \n", "
0 | \n", "2018-01-01 00:00:00 | \n", "[0, 1] | \n", "
\n", " | span | \n", "features | \n", "
---|---|---|
0 | \n", "[0, 2): 'In' | \n", "[0, 0, 0, 1] | \n", "
1 | \n", "[3, 5): 'AD' | \n", "[1, 0, 0, 0] | \n", "
2 | \n", "[6, 9): '932' | \n", "[0, 0, 1, 0] | \n", "
3 | \n", "[11, 15): 'King' | \n", "[0, 1, 0, 0] | \n", "
4 | \n", "[16, 22): 'Arthur' | \n", "[1, 0, 0, 0] | \n", "
\n", " | span | \n", "features | \n", "
---|---|---|
0 | \n", "[0, 2): 'In' | \n", "[0, 0, 0, 1] | \n", "
1 | \n", "[3, 5): 'AD' | \n", "[1, 0, 0, 0] | \n", "
2 | \n", "[6, 9): '932' | \n", "[0, 0, 1, 0] | \n", "
3 | \n", "[11, 15): 'King' | \n", "[0, 1, 0, 0] | \n", "
4 | \n", "[16, 22): 'Arthur' | \n", "[1, 0, 0, 0] | \n", "