**[Docs](https://docs.cortexlabs.com)** • **[Slack](https://community.cortexlabs.com)**

Note: This project is no longer actively maintained by its original authors. # Production infrastructure for machine learning at scale Deploy, manage, and scale machine learning models in production.
## Serverless workloads **Realtime** - respond to requests in real-time and autoscale based on in-flight request volumes. **Async** - process requests asynchronously and autoscale based on request queue length. **Batch** - run distributed and fault-tolerant batch processing jobs on-demand.
## Automated cluster management **Autoscaling** - elastically scale clusters with CPU and GPU instances. **Spot instances** - run workloads on spot instances with automated on-demand backups. **Environments** - create multiple clusters with different configurations.
## CI/CD and observability integrations **Provisioning** - provision clusters with declarative configuration or a Terraform provider. **Metrics** - send metrics to any monitoring tool or use pre-built Grafana dashboards. **Logs** - stream logs to any log management tool or use the pre-built CloudWatch integration.
## Built for AWS **EKS** - Cortex runs on top of EKS to scale workloads reliably and cost-effectively. **VPC** - deploy clusters into a VPC on your AWS account to keep your data private. **IAM** - integrate with IAM for authentication and authorization workflows.