{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# Run DAG flow in Azure" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "\n", "**Requirements** - In order to benefit from this tutorial, you will need:\n", "- An Azure account with an active subscription - [Create an account for free](https://azure.microsoft.com/free/?WT.mc_id=A261C142F)\n", "- An Azure ML workspace - [Configure workspace](https://github.com/microsoft/promptflow/blob/main/examples/configuration.ipynb)\n", "- A python environment\n", "- Installed prompt flow SDK\n", "\n", "**Learning Objectives** - By the end of this tutorial, you should be able to:\n", "- Connect to your Azure AI workspace from the Python SDK\n", "- Create and develop a new promptflow run\n", "- Evaluate the run with a evaluation flow\n", "\n", "**Motivations** - This guide will walk you through the main user journey of prompt flow code-first experience. You will learn how to create and develop your first prompt flow, test and evaluate it." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## 0. Install dependent packages" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "%pip install -r ../../requirements.txt" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# 1. Connect to Azure Machine Learning Workspace\n", "\n", "The [workspace](https://docs.microsoft.com/en-us/azure/machine-learning/concept-workspace) is the top-level resource for Azure Machine Learning, providing a centralized place to work with all the artifacts you create when you use Azure Machine Learning. In this section we will connect to the workspace in which the job will be run.\n", "\n", "## 1.1 Import the required libraries" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "import json\n", "\n", "# Import required libraries\n", "from azure.identity import DefaultAzureCredential, InteractiveBrowserCredential\n", "\n", "# azure version promptflow apis\n", "from promptflow.azure import PFClient" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## 1.2 Configure credential\n", "\n", "We are using `DefaultAzureCredential` to get access to workspace. \n", "`DefaultAzureCredential` should be capable of handling most Azure SDK authentication scenarios. \n", "\n", "Reference for more available credentials if it does not work for you: [configure credential example](https://github.com/microsoft/promptflow/blob/main/examples/configuration.ipynb), [azure-identity reference doc](https://docs.microsoft.com/en-us/python/api/azure-identity/azure.identity?view=azure-python)." ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "try:\n", " credential = DefaultAzureCredential()\n", " # Check if given credential can get token successfully.\n", " credential.get_token(\"https://management.azure.com/.default\")\n", "except Exception as ex:\n", " # Fall back to InteractiveBrowserCredential in case DefaultAzureCredential not work\n", " credential = InteractiveBrowserCredential()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## 1.3 Get a handle to the workspace\n", "\n", "We use config file to connect to a workspace. The Azure ML workspace should be configured with computer cluster. [Check this notebook for configure a workspace](https://github.com/microsoft/promptflow/blob/main/examples/configuration.ipynb)" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "# Get a handle to workspace\n", "pf = PFClient.from_config(credential=credential)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## 1.4 Create necessary connections\n", "Connection helps securely store and manage secret keys or other sensitive credentials required for interacting with LLM and other external tools for example Azure Content Safety.\n", "\n", "In this notebook, we will use flow `web-classification` which uses connection `azure_open_ai_connection` inside, we need to set up the connection if we haven't added it before.\n", "\n", "Prepare your Azure Open AI resource follow this [instruction](https://learn.microsoft.com/en-us/azure/cognitive-services/openai/how-to/create-resource?pivots=web-portal) and get your `api_key` if you don't have one.\n", "\n", "Please go to [workspace portal](https://ml.azure.com/), click `Prompt flow` -> `Connections` -> `Create`, then follow the instruction to create your own connections. \n", "Learn more on [connections](https://learn.microsoft.com/en-us/azure/machine-learning/prompt-flow/concept-connections?view=azureml-api-2)." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## 2. Create a new run\n", "\n", "`web-classification` is a flow demonstrating multi-class classification with LLM. Given an url, it will classify the url into one web category with just a few shots, simple summarization and classification prompts." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "### Set flow path and input data" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "# load flow\n", "flow = \"../../flows/standard/web-classification\"\n", "data = \"../../flows/standard/web-classification/data.jsonl\"" ] }, { "attachments": {}, "cell_type": "markdown", "metadata": {}, "source": [ "### Submit run" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "# create run\n", "base_run = pf.run(\n", " flow=flow,\n", " data=data,\n", ")\n", "print(base_run)" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "pf.stream(base_run)" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "details = pf.get_details(base_run)\n", "details.head(10)" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "pf.visualize(base_run)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## 3. Evaluate your flow run result\n", "Then you can use an evaluation method to evaluate your flow. The evaluation methods are also flows which use Python or LLM etc., to calculate metrics like accuracy, relevance score.\n", "\n", "In this notebook, we use `eval-classification-accuracy` flow to evaluate. This is a flow illustrating how to evaluate the performance of a classification system. It involves comparing each prediction to the groundtruth and assigns a \"Correct\" or \"Incorrect\" grade, and aggregating the results to produce metrics such as accuracy, which reflects how good the system is at classifying the data." ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "eval_run = pf.run(\n", " flow=\"../../flows/evaluation/eval-classification-accuracy\",\n", " data=data,\n", " run=base_run,\n", " column_mapping={\n", " \"groundtruth\": \"${data.answer}\",\n", " \"prediction\": \"${run.outputs.category}\",\n", " },\n", ")" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "pf.stream(eval_run)" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "details = pf.get_details(eval_run)\n", "details.head(10)" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "metrics = pf.get_metrics(eval_run)\n", "print(json.dumps(metrics, indent=4))" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "pf.visualize([base_run, eval_run])" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "### Create another run with different variant node\n", "\n", "In this example, `web-classification`'s node `summarize_text_content` has two variants: `variant_0` and `variant_1`. The difference between them is the inputs parameters:\n", "\n", "variant_0:\n", "\n", " - inputs:\n", " - deployment_name: gpt-35-turbo\n", " - max_tokens: '128'\n", " - temperature: '0.2'\n", " - text: ${fetch_text_content_from_url.output}\n", "\n", "variant_1:\n", "\n", " - inputs:\n", " - deployment_name: gpt-35-turbo\n", " - max_tokens: '256'\n", " - temperature: '0.3'\n", " - text: ${fetch_text_content_from_url.output}\n", "\n", "\n", "You can check the whole flow definition at [flow.dag.yaml](https://github.com/microsoft/promptflow/blob/main/examples/flows/standard/web-classification/flow.dag.yaml)" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "# use the variant1 of the summarize_text_content node.\n", "variant_run = pf.run(\n", " flow=flow,\n", " data=data,\n", " variant=\"${summarize_text_content.variant_1}\", # here we specify node \"summarize_text_content\" to use variant 1 version.\n", ")" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "pf.stream(variant_run)" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "details = pf.get_details(variant_run)\n", "details.head(10)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "### Run evaluation against variant run" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "eval_flow = \"../../flows/evaluation/eval-classification-accuracy\"\n", "\n", "eval_run_variant = pf.run(\n", " flow=eval_flow,\n", " data=\"../../flows/standard/web-classification/data.jsonl\", # path to the data file\n", " run=variant_run, # use run as the variant\n", " column_mapping={\n", " # reference data\n", " \"groundtruth\": \"${data.answer}\",\n", " # reference the run's output\n", " \"prediction\": \"${run.outputs.category}\",\n", " },\n", ")" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "pf.stream(eval_run_variant)" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "details = pf.get_details(eval_run_variant)\n", "details.head(10)" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "metrics = pf.get_metrics(eval_run_variant)\n", "print(json.dumps(metrics, indent=4))" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [ "pf.visualize([eval_run, eval_run_variant])" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "# Next Steps\n", "\n", "Learn more on how to:\n", "- run the flow as a component in a azureml pipeline: [flow in pipeline](https://github.com/microsoft/promptflow/blob/main/examples/tutorials/run-flow-with-pipeline/pipeline.ipynb)." ] } ], "metadata": { "build_doc": { "author": [ "wangchao1230@github.com" ], "category": "azure", "section": "Flow", "weight": 20 }, "description": "A quickstart tutorial to run a flow in Azure AI and evaluate it.", "kernelspec": { "display_name": "promptflow", "language": "python", "name": "python3" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", "version": "3.9.17" }, "resources": "examples/requirements.txt, examples/flows/standard/web-classification, examples/flows/evaluation/eval-classification-accuracy" }, "nbformat": 4, "nbformat_minor": 2 }