{ "nbformat": 4, "nbformat_minor": 0, "metadata": { "colab": { "name": "2021-07-28-audible-audiobook-recommender.ipynb", "provenance": [], "authorship_tag": "ABX9TyMrZO9INiS0MfLYdaPheRkx" }, "kernelspec": { "name": "python3", "display_name": "Python 3" }, "language_info": { "name": "python" } }, "cells": [ { "cell_type": "markdown", "metadata": { "id": "9kwmww3PqNY7" }, "source": [ "# Audible Book Recommender\n", "> Finding similar books using simple text countvectorizer model on audible dataset\n", "\n", "- toc: false\n", "- badges: true\n", "- comments: true\n", "- categories: [Books, CountVectorizer]\n", "- image:" ] }, { "cell_type": "code", "metadata": { "id": "3wQ4xg6-mJqQ" }, "source": [ "import numpy as np\n", "import pandas as pd\n", "\n", "from sklearn.feature_extraction.text import CountVectorizer\n", "from sklearn.metrics.pairwise import cosine_similarity" ], "execution_count": 1, "outputs": [] }, { "cell_type": "code", "metadata": { "colab": { "base_uri": "https://localhost:8080/", "height": 661 }, "id": "YU2R9RnOmLc8", "outputId": "15d4c7ba-57b8-446a-8d07-1a5f4c69b0cd" }, "source": [ "audible_data = pd.read_csv(\"https://github.com/sparsh-ai/reco-data/raw/audible/audible/audible.csv\",\n", " encoding='latin1')\n", "audible_data.head()" ], "execution_count": 2, "outputs": [ { "output_type": "execute_result", "data": { "text/html": [ "
\n", " | Book Title | \n", "Book Subtitle | \n", "Book Author | \n", "Book Narrator | \n", "Audio Runtime | \n", "Audiobook_Type | \n", "Categories | \n", "Rating | \n", "Total No. of Ratings | \n", "Price | \n", "Review 1 | \n", "Review 2 | \n", "Review 3 | \n", "Review 4 | \n", "Review 5 | \n", "Review 6 | \n", "Review 7 | \n", "Review 8 | \n", "Review 9 | \n", "Review 10 | \n", "Review 11 | \n", "Review 12 | \n", "Review 13 | \n", "Review 14 | \n", "Review 15 | \n", "Review 16 | \n", "Review 17 | \n", "Review 18 | \n", "Review 19 | \n", "Review 20 | \n", "Review 21 | \n", "Review 22 | \n", "Review 23 | \n", "Review 24 | \n", "Review 25 | \n", "Review 26 | \n", "Review 27 | \n", "Review 28 | \n", "Review 29 | \n", "Review 30 | \n", "... | \n", "Review 61 | \n", "Review 62 | \n", "Review 63 | \n", "Review 64 | \n", "Review 65 | \n", "Review 66 | \n", "Review 67 | \n", "Review 68 | \n", "Review 69 | \n", "Review 70 | \n", "Review 71 | \n", "Review 72 | \n", "Review 73 | \n", "Review 74 | \n", "Review 75 | \n", "Review 76 | \n", "Review 77 | \n", "Review 78 | \n", "Review 79 | \n", "Review 80 | \n", "Review 81 | \n", "Review 82 | \n", "Review 83 | \n", "Review 84 | \n", "Review 85 | \n", "Review 86 | \n", "Review 87 | \n", "Review 88 | \n", "Review 89 | \n", "Review 90 | \n", "Review 91 | \n", "Review 92 | \n", "Review 93 | \n", "Review 94 | \n", "Review 95 | \n", "Review 96 | \n", "Review 97 | \n", "Review 98 | \n", "Review 99 | \n", "Review100 | \n", "
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | \n", "Bamboozled by Jesus | \n", "How God Tricked Me into the Life of My Dreams | \n", "Yvonne Orji | \n", "Yvonne Orji | \n", "6 hrs and 31 mins | \n", "Unabridged Audiobook | \n", "Biographies & Memoirs | \n", "5 | \n", "47.0 | \n", "$29.65 | \n", "Thank you for being obedient and sharing your ... | \n", "This book was amazing. What made it amazing wa... | \n", "The narration of the book by the author was a ... | \n", "I'm sending Yvonne a tilth because this was th... | \n", "Yvonne is truly amazing at blending scripture ... | \n", "I enjoyed this book immensely. Thank you for m... | \n", "This book really blessed my life. I pray that ... | \n", "I have enjoyed Yvonnes work on Insecure and he... | \n", "to quote my wife \"I feel so seen!\" Yvonne must... | \n", "This content was amazing and being a fan of Yv... | \n", "Already surrendered my life to Jesus but this!... | \n", "I loved this book. I finished it in 2 days. I ... | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "... | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "
1 | \n", "Building Bridges | \n", "NaN | \n", "Marie Dunlop | \n", "Diane Books, Natalie Moore Williams, John Scou... | \n", "1 hr and 41 mins | \n", "Unabridged Audiobook | \n", "Literature & Fiction, Genre Fiction | \n", "5 | \n", "1.0 | \n", "$0.00 | \n", "Recent old times brought to life | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "... | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "
2 | \n", "King of Scotland | \n", "Modern Plays | \n", "Iain Heggie | \n", "Liam Brennan | \n", "52 mins | \n", "Unabridged Audiobook | \n", "Literature & Fiction, Drama & Plays | \n", "Not rated yet | \n", "NaN | \n", "$0.00 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "... | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "
3 | \n", "Mrs G | \n", "NaN | \n", "Mike Tibbetts | \n", "Sarah Rose Graber, Brett Whitted | \n", "34 mins | \n", "Unabridged Audiobook | \n", "Literature & Fiction | \n", "5 | \n", "1.0 | \n", "$0.00 | \n", "great story in 30 mins. you wont know who's si... | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "... | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "
4 | \n", "Signature | \n", "NaN | \n", "Bob Davidson | \n", "Sakshi Sharma, Lucy Goldie | \n", "36 mins | \n", "Unabridged Audiobook | \n", "Mystery, Thriller & Suspense, Mystery | \n", "Not rated yet | \n", "NaN | \n", "$0.00 | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "... | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "NaN | \n", "
5 rows × 110 columns
\n", "