{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "## LLM Compressor Workbench -- Getting Started\n", "\n", "This notebook will demonstrate how common [LLM Compressor](https://github.com/vllm-project/llm-compressor) flows can be run on the [opendatahub/llmcompressor-workbench](https://quay.io/repository/opendatahub/llmcompressor-workbench) image.\n", "\n", "We will show how a user can compress and evaluate a Large Language Model, first without data and then with a calibration dataset.\n", "\n", "The notebook will detect if a GPU is available. If one is not available, it will demonstrate an abbreviated run, so users without GPU access can still get a feel for `llm-compressor`.\n", "\n", "\n", "