{ "cells": [ { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "import ipywidgets as ipw\n", "import hvplot.xarray # noqa\n", "import hvplot.pandas # noqa\n", "import panel as pn\n", "import pandas as pd\n", "import panel.widgets as pnw\n", "import xarray as xr" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Interactive command-line or notebook interfaces are incredibly powerful tools for quickly doing exploratory analysis, letting you supply arguments to Python methods and functions and see the results immediately. However, this process of exploration can be slow and awkward for large parameter spaces because it requires manually typing each argument value. To further ease exploratory workflows, hvPlot ships with a convenient `.interactive` API, which mirrors the regular API of your favorite data analysis libraries like Pandas, Dask, and xarray but makes it possible to pass in _widgets_ for each argument value, not just a constant. When the widgets are used, the output will dynamically update the full pipeline of method calls so that it works just as if that particular value had been specified in the call being wrapped.\n", "\n", "In this user guide we will explore how to use the .interactive API on xarray and pandas objects:" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "ds = xr.tutorial.load_dataset('air_temperature')\n", "ds" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "from bokeh.sampledata.stocks import IBM\n", "\n", "df = pd.DataFrame(IBM)\n", "df['date'] = pd.to_datetime(df.date)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Interactive widgets" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "We can supply both regular values, widgets and parameters as arguments to methods on the `.interactive` accessor. Here, we'll use widgets from the [Panel](https://panel.holoviz.org) library. The repr of the resulting object will contain a layout of the widget and a view of the resulting output:" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "slider = pnw.IntSlider(name='time', start=0, end=10)\n", "\n", "ds.air.interactive(width=800).isel(time=slider)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "You can also use widgets from the [ipywidgets](https://ipywidgets.readthedocs.io) library:" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "slider = ipw.IntSlider(description='time', min=0, max=10)\n", "\n", "ds.air.interactive(width=800).isel(time=slider)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Note that this works just as well for DataFrame objects whether they are Pandas, Dask or cuDF dataframes:" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "nrows = pn.widgets.IntSlider(start=1, end=100, value=10)\n", "\n", "df.interactive(width=500).head(nrows)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "For Panel widgets, we can let .interactive automatically configure the widget, which is particularly convenient when working with `DiscreteSlider` widgets:" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "ds.air.interactive(width=800).sel(time=pnw.DiscreteSlider)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Functions as inputs\n", "\n", "In some cases your starting point for your interactive pipeline may not simply be a DataFrame or xarray Dataset but a function that fetches some data or applies some initial processing on your data. In such a case you can use the `hvplot.bind` function to bind static AND dynamic arguments to your function. Binding dynamic arguments such as a widget or parameter means that whenever the widget/parameter value changes the output of the function will change as well. This makes it possible to construct functions as the input to your interactive pipeline that themselves represent some data pipeline.\n", "\n", "In the example below we will explicitly declare a `Select` widget to select between multiple stock tickers and a function that loads dataframes containing data for each of those stocks. Using the `hvplot.bind` function we then bind the `ticker` select widget to the `ticker` argument of the `stock_df` function and call `.interactive` on the resulting bound function:" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "from bokeh import sampledata\n", "\n", "ticker = pn.widgets.Select(options=['AAPL', 'IBM', 'GOOG', 'MSFT'], name='Ticker')\n", "\n", "def stock_df(ticker):\n", " df = pd.DataFrame(getattr(sampledata.stocks, ticker))\n", " df['date'] = pd.to_datetime(df.date)\n", " return df\n", "\n", "stock_dfi = hvplot.bind(stock_df, ticker).interactive(width=600)\n", "\n", "stock_dfi.head(10)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "As you can see this `interactive` component behaves just like any other, allowing us to chain `.head` on it and updating when the `ticker` widget changes.\n", "\n", "Just like any other `interactive` component you may also chain it further:" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "ticker = pn.widgets.Select(options=['AAPL', 'IBM', 'GOOG', 'MSFT'], name='Ticker')\n", "\n", "def stock_df(ticker):\n", " df = pd.DataFrame(getattr(sampledata.stocks, ticker))\n", " df['date'] = pd.to_datetime(df.date)\n", " return df\n", "\n", "stock_dfi = hvplot.bind(stock_df, ticker).interactive()\n", "\n", "dt_range = pn.widgets.DateRangeSlider(start=df.date.iloc[-1000], end=df.date.max(), value=(df.date.iloc[-100], df.date.max()))\n", "\n", "stock_dfi[(stock_dfi.date>=dt_range.param.value_start) & (stock_dfi.date<=dt_range.param.value_end)].hvplot(kind='ohlc', grid=True, title=ticker)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Docstrings\n", "\n", "When accessing a method on the `.interactive` accessor it will transparently mirror the docstring of the equivalent method in the underlying library being wrapped:" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "print(ds.air.interactive.isel.__doc__)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Plotting\n", "\n", "One of the most useful aspects of the .interactive API is to feed the output of chained method calls into a plot." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "### Matplotlib" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "The output can be almost anything, such as the HTML repr (above) or a matplotlib plot:" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "ds.air.interactive.sel(time=pnw.DiscreteSlider).plot()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "If we like, we can animate the output with a `Player` widget, and customize the location of the widget using the `loc` keyword argument to `.interactive`:" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "time = pnw.Player(name='time', start=0, end=10, loop_policy='loop', interval=100)\n", "\n", "ds.air.interactive(loc='bottom').isel(time=time).plot()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "### hvPlot" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "We can also make use of the `.hvplot` method to get fully interactive Bokeh-based plots:" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "slider = pnw.FloatSlider(name='quantile', start=0, end=1)\n", "\n", "ds.air.interactive.quantile(slider, dim='time').hvplot(data_aspect=1)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "You can chain any number of methods, with as many widgets controlling steps in this pipeline as you wish:" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "q = pnw.FloatSlider(name='quantile', start=0, end=1)\n", "\n", "(ds.air.interactive(loc='left')\n", " .sel(time=pnw.DiscreteSlider)\n", " .quantile(q=q, dim='lon')\n", " .hvplot(aspect=1))" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "We can also use a `RangeSlider` to select a slice and compute the mean over that range instead of selecting a specific time:" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "range_slider = pnw.IntRangeSlider\n", "\n", "(ds.air.interactive\n", " .isel(time=range_slider)\n", " .mean('time')\n", " .hvplot())" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "`.interactive` supports arbitrary chains of method calls, including anything that is supported by your data object. For instance, you can even convert your xarray object into a dataframe using `.to_dataframe`, then call pandas methods:" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "ds.air.interactive.sel(lat=pnw.DiscreteSlider).to_dataframe().groupby('time').mean().hvplot('time', 'air')" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Operators\n", "\n", "You can further transform your output, if desired, by applying math operators on the interactive object:" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "slider = pnw.IntSlider(name='time', start=0, end=10)\n", "baseline = ds.air.mean().item()\n", "baseline" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "ds.air.interactive(width=800).isel(time=slider).mean().item() - baseline" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "You can even do math with a widget:" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "slider = pnw.IntSlider(name='time', start=0, end=10)\n", "offset = pnw.IntSlider(name='offset', start=0, end=500)\n", "\n", "ds.air.interactive.isel(time=slider).mean().item() + offset" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Math operators work with array data as well, such as the time-averaged value of each array value:" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "diff = ds.air.interactive.sel(time=pnw.DiscreteSlider) - ds.air.mean('time')\n", "kind = pnw.Select(options=['contour', 'contourf', 'image'])\n", "\n", "diff.hvplot(cmap='RdBu_r', clim=(-20, 20), kind=kind)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "If you want more control over the layout, you can use any of the features from [Panel](https://panel.holoviz.org):" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "diff = ds.air.interactive.sel(time=pnw.DiscreteSlider) - ds.air.mean('time')\n", "kind = pnw.Select(options=['contourf', 'contour', 'image'], value='image')\n", "interactive = diff.hvplot(cmap='RdBu_r', clim=(-20, 20), kind=kind)\n", "\n", "pn.Column(\n", " pn.Row(\n", " pn.panel(\"https://hvplot.holoviz.org/assets/hvplot-wm.png\", width=100), \n", " pn.Spacer(width=20),\n", " pn.Column(\n", " pn.panel(\"## Select a time and type of plot\", width=400),\n", " interactive.widgets()\n", " ),\n", " pn.panel(\"https://panel.holoviz.org/_static/logo_stacked.png\", width=100)\n", " ),\n", " interactive.panel()\n", ").servable()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "As you can see, the `.interactive` functionality makes it simple to work interactively with your data, letting you use widgets about as easily as any other method argument! See the Panel or ipwidgets docs for the various widgets and other functionality available." ] } ], "metadata": { "language_info": { "name": "python", "pygments_lexer": "ipython3" } }, "nbformat": 4, "nbformat_minor": 4 }