{ "cells": [ { "cell_type": "markdown", "metadata": { "id": "Ss6WwQxZcyB3" }, "source": [ "# 머신 러닝 교과서 3판" ] }, { "cell_type": "markdown", "metadata": { "id": "lTv_uMjxcyB4" }, "source": [ "# 8장 - 감성 분석에 머신 러닝 적용\n" ] }, { "cell_type": "markdown", "metadata": { "id": "qeuUzWpZcyB5" }, "source": [ "**아래 링크를 통해 이 노트북을 주피터 노트북 뷰어(nbviewer.jupyter.org)로 보거나 구글 코랩(colab.research.google.com)에서 실행할 수 있습니다.**\n", "\n", "
\n", " 주피터 노트북 뷰어로 보기\n", " | \n", "\n", " 구글 코랩(Colab)에서 실행하기\n", " | \n", "
\n", " | review | \n", "sentiment | \n", "
---|---|---|
0 | \n", "In 1974, the teenager Martha Moxley (Maggie Gr... | \n", "1 | \n", "
1 | \n", "OK... so... I really like Kris Kristofferson a... | \n", "0 | \n", "
2 | \n", "***SPOILER*** Do not read this, if you think a... | \n", "0 | \n", "
GridSearchCV(cv=5,\n", " estimator=Pipeline(steps=[('vect',\n", " TfidfVectorizer(lowercase=False)),\n", " ('clf',\n", " LogisticRegression(random_state=0,\n", " solver='liblinear'))]),\n", " n_jobs=-1,\n", " param_grid=[{'clf__C': [1.0, 10.0, 100.0],\n", " 'clf__penalty': ['l1', 'l2'],\n", " 'vect__ngram_range': [(1, 1)],\n", " 'vect__stop_words': [['i', 'me', 'my', 'myself', 'we',\n", " 'our', 'ours', 'ourselves',\n", " 'you', "you're", "you've...\n", " 'our', 'ours', 'ourselves',\n", " 'you', "you're", "you've",\n", " "you'll", "you'd", 'your',\n", " 'yours', 'yourself',\n", " 'yourselves', 'he', 'him',\n", " 'his', 'himself', 'she',\n", " "she's", 'her', 'hers',\n", " 'herself', 'it', "it's", 'its',\n", " 'itself', ...],\n", " None],\n", " 'vect__tokenizer': [<function tokenizer at 0x7cb75a79de10>,\n", " <function tokenizer_porter at 0x7cb75a79dea0>],\n", " 'vect__use_idf': [False]}],\n", " scoring='accuracy')In a Jupyter environment, please rerun this cell to show the HTML representation or trust the notebook.
GridSearchCV(cv=5,\n", " estimator=Pipeline(steps=[('vect',\n", " TfidfVectorizer(lowercase=False)),\n", " ('clf',\n", " LogisticRegression(random_state=0,\n", " solver='liblinear'))]),\n", " n_jobs=-1,\n", " param_grid=[{'clf__C': [1.0, 10.0, 100.0],\n", " 'clf__penalty': ['l1', 'l2'],\n", " 'vect__ngram_range': [(1, 1)],\n", " 'vect__stop_words': [['i', 'me', 'my', 'myself', 'we',\n", " 'our', 'ours', 'ourselves',\n", " 'you', "you're", "you've...\n", " 'our', 'ours', 'ourselves',\n", " 'you', "you're", "you've",\n", " "you'll", "you'd", 'your',\n", " 'yours', 'yourself',\n", " 'yourselves', 'he', 'him',\n", " 'his', 'himself', 'she',\n", " "she's", 'her', 'hers',\n", " 'herself', 'it', "it's", 'its',\n", " 'itself', ...],\n", " None],\n", " 'vect__tokenizer': [<function tokenizer at 0x7cb75a79de10>,\n", " <function tokenizer_porter at 0x7cb75a79dea0>],\n", " 'vect__use_idf': [False]}],\n", " scoring='accuracy')
Pipeline(steps=[('vect', TfidfVectorizer(lowercase=False)),\n", " ('clf',\n", " LogisticRegression(random_state=0, solver='liblinear'))])
TfidfVectorizer(lowercase=False)
LogisticRegression(random_state=0, solver='liblinear')
\n", " | review | \n", "sentiment | \n", "
---|---|---|
0 | \n", "In 1974, the teenager Martha Moxley (Maggie Gr... | \n", "1 | \n", "
1 | \n", "OK... so... I really like Kris Kristofferson a... | \n", "0 | \n", "
2 | \n", "***SPOILER*** Do not read this, if you think a... | \n", "0 | \n", "