{
"cells": [
{
"cell_type": "code",
"execution_count": 1,
"metadata": {
"slideshow": {
"slide_type": "skip"
}
},
"outputs": [],
"source": [
"import sys\n",
"sys.path.append(\"..\")\n",
"import statnlpbook.util as util"
]
},
{
"cell_type": "markdown",
"metadata": {
"slideshow": {
"slide_type": "slide"
}
},
"source": [
"# Introduction"
]
},
{
"cell_type": "markdown",
"metadata": {
"slideshow": {
"slide_type": "slide"
}
},
"source": [
"## What is NLP?\n",
"\n",
"![dogs](../img/whatwesaytodogs.png)\n",
"\n",
"* Building computer systems that **understand** and **generate** natural languages.\n",
"* Deep understanding of **broad** language\n",
" * not just string processing or keyword matching\n",
" "
]
},
{
"cell_type": "markdown",
"metadata": {
"slideshow": {
"slide_type": "slide"
}
},
"source": [
"Can you think of NLP Applications?"
]
},
{
"cell_type": "markdown",
"metadata": {
"slideshow": {
"slide_type": "slide"
}
},
"source": [
"## Speech Recognition\n",
"\n",
"![beach](../img/wreckanicebeach.png)\n",
"Speech Recognition is usually not considered NLP. We will not cover this topic here."
]
},
{
"cell_type": "markdown",
"metadata": {
"slideshow": {
"slide_type": "slide"
}
},
"source": [
"## Machine Translation\n",
"\n",
"![mt](../img/mt.png)"
]
},
{
"cell_type": "markdown",
"metadata": {
"slideshow": {
"slide_type": "slide"
}
},
"source": [
"## Personal Assistants\n",
""
]
},
{
"cell_type": "markdown",
"metadata": {
"slideshow": {
"slide_type": "slide"
}
},
"source": [
"## Information Extraction\n",
"![ie1](../img/ie1.png)"
]
},
{
"cell_type": "markdown",
"metadata": {
"slideshow": {
"slide_type": "slide"
}
},
"source": [
"## Information Extraction\n",
"![ie2](../img/ie2.png)"
]
},
{
"cell_type": "markdown",
"metadata": {
"slideshow": {
"slide_type": "slide"
}
},
"source": [
"## Summarization\n",
"![sum](../img/summarization.png)"
]
},
{
"cell_type": "markdown",
"metadata": {
"slideshow": {
"slide_type": "slide"
}
},
"source": [
"## Generation\n",
"![gen](../img/shirtless.jpeg)"
]
},
{
"cell_type": "markdown",
"metadata": {
"slideshow": {
"slide_type": "slide"
}
},
"source": [
"## Question Answering\n",
"![qa](../img/qa.png)"
]
},
{
"cell_type": "markdown",
"metadata": {
"slideshow": {
"slide_type": "slide"
}
},
"source": [
"## Sentiment Analysis\n",
"\n",
"![sent](../img/sentiment.png)"
]
},
{
"cell_type": "markdown",
"metadata": {
"slideshow": {
"slide_type": "slide"
}
},
"source": [
"## Machine Comprehension\n",
"\n",
"![mc](../img/comprehension.png)"
]
},
{
"cell_type": "markdown",
"metadata": {
"slideshow": {
"slide_type": "slide"
}
},
"source": [
"## Cognitive Science and Psycholinguistics \n",
"\n",
"![cog](../img/psycho.png)"
]
},
{
"cell_type": "markdown",
"metadata": {
"slideshow": {
"slide_type": "slide"
}
},
"source": [
"## Why is it difficult?\n",
"\n",
"![sailor_moon1](../img/sailor_moon1.jpg)"
]
},
{
"cell_type": "markdown",
"metadata": {
"slideshow": {
"slide_type": "slide"
}
},
"source": [
"## Why is it difficult?\n",
"\n",
"![sexcoffee](../img/sailor_moon2.jpg)"
]
},
{
"cell_type": "markdown",
"metadata": {
"slideshow": {
"slide_type": "slide"
}
},
"source": [
"## Structural Ambiguity\n",
"\n",
"[Play the Structural Ambiguity Game](http://madlyambiguous.osu.edu:1035/)"
]
},
{
"cell_type": "markdown",
"metadata": {
"slideshow": {
"slide_type": "slide"
}
},
"source": [
"### Ambiguity Everywhere\n",
"\n",
"* Fed raises interest rates 0.5% in effort to control inflation\n",
"* Fed raises interest rates 0.5% in effort to control inflation\n",
"* Fed raises interest rates 0.5% in effort to control inflation"
]
},
{
"cell_type": "markdown",
"metadata": {
"slideshow": {
"slide_type": "slide"
}
},
"source": [
"### Fool a [Sentiment Analyzer](http://text-processing.com/demo/sentiment/)"
]
},
{
"cell_type": "markdown",
"metadata": {
"slideshow": {
"slide_type": "slide"
}
},
"source": [
"### Fool a [Machine Translator](http://translate.google.com/?hl=en&tab=TT)"
]
},
{
"cell_type": "markdown",
"metadata": {
"slideshow": {
"slide_type": "slide"
}
},
"source": [
"### [Count](https://books.google.com/ngrams) N-grams"
]
},
{
"cell_type": "markdown",
"metadata": {
"slideshow": {
"slide_type": "slide"
}
},
"source": [
"## Syllabus\n",
"\n",
"* Structured Prediction \n",
"* Preprocessing\n",
"* Generative Learning\n",
"* Discriminative Learning\n",
"* Weak Supervision\n",
"* Representation and Deep Learning"
]
},
{
"cell_type": "markdown",
"metadata": {
"slideshow": {
"slide_type": "slide"
}
},
"source": [
"## NLP Tasks\n",
"* Tokenization, Segmentation\n",
"* Language Modelling\n",
"* Machine Translation\n",
"* Syntactic Parsing\n",
"* Document Classification\n",
"* Information Extraction\n",
"* Textual Entailment/Machine Comprehension"
]
}
],
"metadata": {
"celltoolbar": "Slideshow",
"kernelspec": {
"display_name": "Python 3",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.6.2"
}
},
"nbformat": 4,
"nbformat_minor": 1
}