# BentoML Demo: Iris Classifier with custom web UI


**BentoML makes moving trained ML models to production easy:**

* Package models trained with **any ML framework** and reproduce them for model serving in production
* **Deploy anywhere** for online API serving or offline batch serving
* High-Performance API model server with *adaptive micro-batching* support
* Central hub for managing models and deployment process via Web UI and APIs
* Modular and flexible design making it *adaptable to your infrastructure*

BentoML is a framework for serving, managing, and deploying machine learning models. It aims to bridge the gap between Data Science and DevOps, and to enable teams to deliver prediction services in a fast, repeatable, and scalable way.

Before reading this example project, be sure to check out the [Getting started guide](https://github.com/bentoml/BentoML/blob/master/guides/quick-start/bentoml-quick-start-guide.ipynb) to learn about the basic concepts in BentoML.


This notebook demonstrates how to use BentoML to __serve an Iris classification model through a REST API server with a custom web UI__.


In [1]:
%reload_ext autoreload
%autoreload 2
%matplotlib inline

import warnings
warnings.filterwarnings("ignore")

In [2]:
!pip install -q bentoml "scikit-learn>=0.23.2"

## Create BentoService for model serving

In [1]:
%%writefile iris_classifier.py
from bentoml import env, artifacts, api, BentoService
from bentoml.adapters import DataframeInput
from bentoml.frameworks.sklearn import SklearnModelArtifact

@env(infer_pip_packages=True)
@artifacts([SklearnModelArtifact('model')])
class IrisClassifier(BentoService):

    @api(input=DataframeInput(), batch=True)
    def predict(self, df):
        # Optional pre-processing, post-processing code goes here
        return self.artifacts.model.predict(df)

Writing iris_classifier.py


In [2]:
%%writefile main.py
from sklearn import svm
from sklearn import datasets

from iris_classifier import IrisClassifier

if __name__ == "__main__":
    # Load training data
    iris = datasets.load_iris()
    X, y = iris.data, iris.target

    # Model Training
    clf = svm.SVC(gamma='scale')
    clf.fit(X, y)

    # Create an iris classifier service instance
    iris_classifier_service = IrisClassifier()

    # Pack the newly trained model artifact
    iris_classifier_service.pack('model', clf)

    # Save the prediction service to disk for model serving
    saved_path = iris_classifier_service.save()

Writing main.py


In [3]:
!python main.py

[2020-09-09 11:50:41,005] INFO - Detected non-PyPI-released BentoML installed, copying local BentoML modulefiles to target saved bundle path..
no previously-included directories found matching 'e2e_tests'
no previously-included directories found matching 'tests'
no previously-included directories found matching 'benchmark'
UPDATING BentoML-0.8.6+34.g6123b8c6/bentoml/_version.py
set BentoML-0.8.6+34.g6123b8c6/bentoml/_version.py to '0.8.6+34.g6123b8c6'
[2020-09-09 11:50:45,682] INFO - BentoService bundle 'IrisClassifier:20200909115040_9FC7F4' saved to: /Users/bozhaoyu/bentoml/repository/IrisClassifier/20200909115040_9FC7F4
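
The last log line reports the bundle tag (`IrisClassifier:20200909115040_9FC7F4`) and the directory it was saved to. As a minimal sketch, assuming tags always follow the `Name:YYYYMMDDHHMMSS_SUFFIX` shape seen in this log (an inference from the output, not a documented guarantee), the helper below splits a tag into its parts so you can tell which bundle was built when. `parse_bento_tag` and `BentoTag` are hypothetical names for illustration and are not part of the BentoML API.

```python
from datetime import datetime
from typing import NamedTuple


class BentoTag(NamedTuple):
    name: str
    version: str
    created: datetime


def parse_bento_tag(tag: str) -> BentoTag:
    """Split 'Name:version' and read the timestamp prefix of the version.

    Assumes the version looks like '20200909115040_9FC7F4', as in the log
    above. Illustrative only; this is not a BentoML function.
    """
    name, sep, version = tag.partition(":")
    if not sep or not name or not version:
        raise ValueError(f"expected 'Name:version', got {tag!r}")
    # The part before the first underscore is treated as a timestamp.
    timestamp = version.split("_", 1)[0]
    created = datetime.strptime(timestamp, "%Y%m%d%H%M%S")
    return BentoTag(name, version, created)


tag = parse_bento_tag("IrisClassifier:20200909115040_9FC7F4")
print(tag.name, tag.version, tag.created.isoformat())
# IrisClassifier 20200909115040_9FC7F4 2020-09-09T11:50:40
```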


## REST API Model Serving


To start a REST API model server with the BentoService saved above, use the `bentoml serve` command:

In [6]:
!bentoml serve IrisClassifier:latest

[2020-07-28 16:38:55,465] INFO - Getting latest version IrisClassifier:20200728163753_DE85E9
 * Serving Flask app "IrisClassifier" (lazy loading)
 * Environment: production
   Use a production WSGI server instead.
 * Debug mode: off
 * Running on http://127.0.0.1:5000/ (Press CTRL+C to quit)
^C


If you are running this notebook from Google Colab, you can start the dev server with the `--run-with-ngrok` option to gain access to the API through a public endpoint managed by [ngrok](https://ngrok.com/):

In [None]:
!bentoml serve IrisClassifier:latest --run-with-ngrok

At this point you can test the REST API server either by opening http://localhost:5000 in a new browser tab, which serves the Swagger docs:

![alt text](https://raw.githubusercontent.com/bentoml/gallery/master/scikit-learn/iris-classifier/swagger.png)


or by making a curl request from another terminal window:

```bash
curl -i \
--header "Content-Type: application/json" \
--request POST \
--data '[[1,2,1,2]]' \
localhost:5000/predict
```
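
The same request can also be sent from Python. The following is a minimal sketch using only the standard library; it assumes the server is listening on `localhost:5000` and that the response body is JSON. `build_predict_request` and `send` are hypothetical helper names, not BentoML functions.

```python
import json
import urllib.request


def build_predict_request(rows, base_url="http://localhost:5000", api_name="predict"):
    """Build the same POST request that the curl command above sends."""
    body = json.dumps(rows).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/{api_name}",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send(request, timeout=10.0):
    """Send the request and decode the response (needs a running server).

    Assumes the server answers with a JSON body.
    """
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


request = build_predict_request([[1, 2, 1, 2]])
print(request.get_method(), request.full_url)
# With `bentoml serve` running in another process:
# print(send(request))
```

If the service exposes its API under a different name, pass that name as `api_name`.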

## Adding Custom Web Static Content


In [7]:
!curl https://raw.githubusercontent.com/bentoml/gallery/master/scikit-learn/iris-classifier/static.tar.xz -o static.tar.xz
!tar --xz -xf static.tar.xz
!rm static.tar.xz

  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed
100  197k  100  197k    0     0   200k      0 --:--:-- --:--:-- --:--:--  200k


The archive contains a very simple web UI, which BentoML will serve as static content.
Next, edit the BentoService so that it points to this static directory.

Add `@web_static_content('./static')` to `iris_classifier.py`:

**Note**: The path can be either relative or absolute.

In [9]:
%%writefile iris_classifier.py
from bentoml import env, artifacts, api, BentoService, web_static_content
from bentoml.adapters import DataframeInput
from bentoml.frameworks.sklearn import SklearnModelArtifact

@env(infer_pip_packages=True)
@artifacts([SklearnModelArtifact('model')])
@web_static_content('./static')
class IrisClassifier(BentoService):

    @api(input=DataframeInput(), batch=True)
    def test(self, df):
        # Optional pre-processing, post-processing code goes here
        return self.artifacts.model.predict(df)

Overwriting iris_classifier.py
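
Because the static path may be relative, it is easy to point the decorator at a directory that does not exist from where the service is built. The sketch below resolves the path and checks it before the bundle is re-saved. It takes the base directory as an explicit argument, since how a relative path is resolved is not spelled out here. The `index.html` entry page is an assumption about the extracted archive, and `check_static_dir` is a hypothetical helper, not a BentoML function.

```python
import tempfile
from pathlib import Path


def check_static_dir(static_path, base_dir=".", entry_page="index.html"):
    """Return the absolute static directory, or raise if it looks unusable.

    `entry_page` is an assumption about the archive contents, not a
    BentoML requirement; pass None to skip that check.
    """
    path = Path(static_path)
    if not path.is_absolute():
        path = Path(base_dir) / path
    path = path.resolve()
    if not path.is_dir():
        raise FileNotFoundError(f"static directory not found: {path}")
    if entry_page is not None and not (path / entry_page).is_file():
        raise FileNotFoundError(f"{entry_page} missing from {path}")
    return path


# Demonstration against a throwaway directory laid out like ./static
with tempfile.TemporaryDirectory() as tmp:
    static = Path(tmp) / "static"
    static.mkdir()
    (static / "index.html").write_text("<h1>Iris</h1>")
    print(check_static_dir("./static", base_dir=tmp).name)  # static
```

In the notebook, calling `check_static_dir("./static")` before `!python main.py` would surface a wrong path early.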


In [10]:
!python main.py

[2020-07-30 09:09:39,344] INFO - Detect BentoML installed in development model, copying local BentoML module file to target saved bundle path
running sdist
running egg_info
writing BentoML.egg-info/PKG-INFO
writing dependency_links to BentoML.egg-info/dependency_links.txt
writing entry points to BentoML.egg-info/entry_points.txt
writing requirements to BentoML.egg-info/requires.txt
writing top-level names to BentoML.egg-info/top_level.txt
reading manifest file 'BentoML.egg-info/SOURCES.txt'
reading manifest template 'MANIFEST.in'
no previously-included directories found matching 'e2e_tests'
no previously-included directories found matching 'tests'
no previously-included directories found matching 'benchmark'
writing manifest file 'BentoML.egg-info/SOURCES.txt'
running check
creating BentoML-0.8.3+42.gb8d36b6
creating BentoML-0.8.3+42.gb8d36b6/BentoML.egg-info
creating BentoML-0.8.3+42.gb8d36b6/bentoml
creating BentoML-0.8.3+42.gb8d36b6/bentoml/adapters
creating BentoML-0.8.3+42.gb8d36b

In [None]:
!bentoml serve IrisClassifier:latest

[2020-07-30 09:09:43,808] INFO - Getting latest version IrisClassifier:20200730090929_C02FB7
[2020-07-30 09:09:43,808] INFO - Starting BentoML API server in development mode..
 * Serving Flask app "IrisClassifier" (lazy loading)
 * Environment: production
   Use a production WSGI server instead.
 * Debug mode: off
 * Running on http://127.0.0.1:5000/ (Press CTRL+C to quit)
127.0.0.1 - - [30/Jul/2020 09:09:52] "POST /test HTTP/1.1" 200 -
127.0.0.1 - - [30/Jul/2020 09:09:55] "POST /test HTTP/1.1" 200 -
127.0.0.1 - - [30/Jul/2020 09:09:57] "POST /test HTTP/1.1" 200 -
127.0.0.1 - - [30/Jul/2020 09:10:00] "POST /test HTTP/1.1" 200 -


Now, if you visit http://localhost:5000/, you should see the custom web UI:

![Custom UI](https://raw.githubusercontent.com/bentoml/gallery/master/scikit-learn/iris-classifier/webui.png)

It's still possible to access the Swagger docs at `/docs`.

## Containerize model server with Docker


One common way of distributing this model API server for production deployment is via Docker containers, and BentoML provides a convenient way to do that.

Note that Docker is **not available in Google Colab**. You will need to download and run this notebook locally to try out containerization with Docker.

If you already have Docker configured, simply run the following command to build a Docker image serving the IrisClassifier prediction service created above:

In [4]:
!bentoml containerize IrisClassifier:latest

[2020-09-09 11:52:05,072] INFO - Getting latest version IrisClassifier:20200909115040_9FC7F4
Found Bento: /Users/bozhaoyu/bentoml/repository/IrisClassifier/20200909115040_9FC7F4
Tag not specified, using tag parsed from BentoService: 'irisclassifier:20200909115040_9FC7F4'
Building Docker image irisclassifier:20200909115040_9FC7F4 from IrisClassifier:latest 
-we in here
processed docker file
(None, None)
root in create archive /Users/bozhaoyu/bentoml/repository/IrisClassifier/20200909115040_9FC7F4 ['Dockerfile', 'IrisClassifier', 'IrisClassifier/__init__.py', 'IrisClassifier/artifacts', 'IrisClassifier/artifacts/__init__.py', 'IrisClassifier/artifacts/model.pkl', 'IrisClassifier/bentoml.yml', 'IrisClassifier/iris_classifier.py', 'MANIFEST.in', 'README.md', 'bentoml-init.sh', 'bentoml.yml', 'bundled_pip_dependencies', 'bundled_pip_dependencies/BentoML-0.8.6+34.g6123b8c6.tar.gz', 'docker-entrypoint.sh', 'environment.yml', 'requirements.txt', 'setup.py']
about to build
abo

Installing collected packages: threadpoolctl, joblib, scipy, scikit-learn, pytz, pandas
Successfully installed joblib-0.16.0 pandas-1.1.2 pytz-2020.1 scikit-learn-0.23.2 scipy-1.5.2 threadpoolctl-2.1.0
+ for filename in ./bundled_pip_dependencies/*.tar.gz
+ '[' -e ./bundled_pip_dependencies/BentoML-0.8.6+34.g6123b8c6.tar.gz ']'
+ pip install -U ./bundled_pip_dependencies/BentoML-0.8.6+34.g6123b8c6.tar.gz --no-cache-dir
Looking in indexes: https://pypi.python.org/simple/
Processing ./bundled_pip_dependencies/BentoML-0.8.6+34.g6123b8c6.tar.gz
  Installing build dependencies: started
  Installing build dependencies: finished with status 'done'
  Getting requirements to build wheel: started
  Getting requirements to build wheel: finished with status 'done'
    Preparing wheel metadata: started
    Preparing wheel metadata: finished with status 'done'


Collecting sqlalchemy-utils<0.36.8
  Downloading SQLAlchemy-Utils-0.36.7.tar.gz (131 kB)
Building wheels for collected packages: BentoML, sqlalchemy-utils
  Building wheel for BentoML (PEP 517): started
  Building wheel for BentoML (PEP 517): finished with status 'done'
  Created wheel for BentoML: filename=BentoML-0.8.6+34.g6123b8c6-py3-none-any.whl size=4712455 sha256=e6944330cc4d24e21e20865bf3b08f780a9c2fde3fbc129cbda87c7f36aef89b
  Stored in directory: /tmp/pip-ephem-wheel-cache-n87zjgsc/wheels/be/7b/58/8207840666d87408400c426e983365fbdcb71015a400a124ea
  Building wheel for sqlalchemy-utils (setup.py): started


  Building wheel for sqlalchemy-utils (setup.py): finished with status 'done'
  Created wheel for sqlalchemy-utils: filename=SQLAlchemy_Utils-0.36.7-py2.py3-none-any.whl size=93226 sha256=2f7ba4b0a4e6875f0b834144d9850d2a7e7e952d19eb97b5225e8fe8849415cb
  Stored in directory: /tmp/pip-ephem-wheel-cache-n87zjgsc/wheels/85/a9/d2/3377194742a9ad5eb57ee36f934438c15e1aaa0843dd41c899
Successfully built BentoML sqlalchemy-utils
Installing collected packages: sqlalchemy-utils, BentoML
  Attempting uninstall: sqlalchemy-utils
    Found existing installation: SQLAlchemy-Utils 0.36.8
    Uninstalling SQLAlchemy-Utils-0.36.8:
      Successfully uninstalled SQLAlchemy-Utils-0.36.8
  Attempting uninstall: BentoML
    Found existing installation: BentoML 0.8.6
    Uninstalling BentoML-0.8.6:
      Successfully uninstalled BentoML-0.8.6
Successfully installed BentoML-0.8.6+34.g6123b8c6 s

In [None]:
!docker run --rm -p 5000:5000 irisclassifier:20200909115040_9FC7F4 
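
In the command above, `-p 5000:5000` publishes container port 5000 on host port 5000, and the image name is the lower-cased bundle name reported in the containerize log. If port 5000 is already taken on your machine, only the host side of the mapping needs to change. The sketch below builds the argument list for that case; `image_tag_for` and `docker_run_args` are hypothetical helpers, and the container port of 5000 is taken from the command above rather than from any documented default.

```python
def image_tag_for(bento_tag: str) -> str:
    """Lower-case the bundle name, mirroring 'irisclassifier:...' in the log."""
    name, sep, version = bento_tag.partition(":")
    if not sep or not name or not version:
        raise ValueError(f"expected 'Name:version', got {bento_tag!r}")
    return f"{name.lower()}:{version}"


def docker_run_args(bento_tag: str, host_port: int = 5000, container_port: int = 5000):
    """Build the `docker run` argument list with a configurable host port."""
    if not 0 < host_port < 65536:
        raise ValueError(f"invalid host port: {host_port}")
    return [
        "docker", "run", "--rm",
        "-p", f"{host_port}:{container_port}",
        image_tag_for(bento_tag),
    ]


args = docker_run_args("IrisClassifier:20200909115040_9FC7F4", host_port=8080)
print(" ".join(args))
# docker run --rm -p 8080:5000 irisclassifier:20200909115040_9FC7F4
```

Where Docker is available, the list can be passed to `subprocess.run(args, check=True)`; the API would then be reachable on the chosen host port.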

## Launch inference job from CLI

The BentoML CLI can load and run a packaged model directly from the command line. With the DataframeInput adapter, the command can read input DataFrame data from a CLI argument or from local CSV or JSON files:

In [None]:
!bentoml run IrisClassifier:latest predict --input '[[5, 4, 3, 2]]'
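
The value passed to `--input` is a JSON string, so when the rows come from Python it helps to serialize and validate them before invoking the CLI. This is a minimal sketch under two assumptions: each row carries the four iris features, and the command takes the same arguments as the cell above. `build_run_args` is a hypothetical helper, not part of BentoML.

```python
import json
from numbers import Real

N_FEATURES = 4  # sepal length, sepal width, petal length, petal width


def build_run_args(rows, bento="IrisClassifier:latest", api_name="predict"):
    """Validate the rows and build the argument list for `bentoml run`."""
    for i, row in enumerate(rows):
        is_numeric = all(
            isinstance(v, Real) and not isinstance(v, bool) for v in row
        )
        if len(row) != N_FEATURES or not is_numeric:
            raise ValueError(f"row {i} must hold {N_FEATURES} numbers: {row!r}")
    return ["bentoml", "run", bento, api_name, "--input", json.dumps(rows)]


args = build_run_args([[5, 4, 3, 2]])
print(args[-1])
# [[5, 4, 3, 2]]
```

Passing the list to `subprocess.run(args, check=True)` avoids shell quoting issues with the brackets in the JSON string.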

## Load saved BentoService

`bentoml.load` is the API for loading a BentoML packaged model in Python:

In [None]:
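from bentoml import load

# `saved_path` only exists inside the `main.py` process, so look up the
# location of the latest saved bundle again from the notebook.
saved_path = !bentoml get IrisClassifier:latest --print-location --quiet

service = load(saved_path[0])

print(service.predict([[5,4,3,2]]))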

# Deployment Options

If you are on a small team with limited engineering or DevOps resources, try out automated deployment with the BentoML CLI, which currently supports AWS Lambda, AWS SageMaker, and Azure Functions:
- [AWS Lambda Deployment Guide](https://docs.bentoml.org/en/latest/deployment/aws_lambda.html)
- [AWS SageMaker Deployment Guide](https://docs.bentoml.org/en/latest/deployment/aws_sagemaker.html)
- [Azure Functions Deployment Guide](https://docs.bentoml.org/en/latest/deployment/azure_functions.html)

If the cloud platform you are working with is not on the list above, try out these step-by-step guides on manually deploying a BentoML packaged model to cloud platforms:
- [AWS ECS Deployment](https://docs.bentoml.org/en/latest/deployment/aws_ecs.html)
- [Google Cloud Run Deployment](https://docs.bentoml.org/en/latest/deployment/google_cloud_run.html)
- [Azure container instance Deployment](https://docs.bentoml.org/en/latest/deployment/azure_container_instance.html)
- [Heroku Deployment](https://docs.bentoml.org/en/latest/deployment/heroku.html)

Lastly, if you have a DevOps or ML Engineering team that operates a Kubernetes or OpenShift cluster, use the following guides as references for implementing your deployment strategy:
- [Kubernetes Deployment](https://docs.bentoml.org/en/latest/deployment/kubernetes.html)
- [Knative Deployment](https://docs.bentoml.org/en/latest/deployment/knative.html)
- [Kubeflow Deployment](https://docs.bentoml.org/en/latest/deployment/kubeflow.html)
- [KFServing Deployment](https://docs.bentoml.org/en/latest/deployment/kfserving.html)
- [Clipper.ai Deployment Guide](https://docs.bentoml.org/en/latest/deployment/clipper.html)

