Website • Slack • Docs

Cost-effective serverless computing

Cortex is a highly scalable and cost-effective serverless computing platform that runs on your AWS account. It scales microservices, data processing, machine learning, and other compute-intensive realtime and batch workloads. Cortex is designed to handle production traffic of up to 20M QPS and is up to 90% less expensive than AWS Lambda.

Maximize instance utilization

Workload autoscaling - set autoscaling policies per workload based on its traffic.

Resource requests - configure CPU, GPU, and memory requests per workload, without limits.

Container deployments - customize the runtime and request concurrency for each container.
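
As a rough illustration of what a traffic-based autoscaling policy computes, the sketch below derives a replica count from the number of in-flight requests and a per-replica concurrency target, clamped to a configured range. It is not Cortex's actual algorithm or configuration schema: the function `desired_replicas` and the parameters `target_per_replica`, `min_replicas`, and `max_replicas` are hypothetical names chosen for this example.

```python
import math


def desired_replicas(in_flight_requests: int,
                     target_per_replica: int,
                     min_replicas: int = 1,
                     max_replicas: int = 10) -> int:
    """Return a replica count for the current load (hypothetical policy).

    Divides the in-flight request count by the concurrency each replica
    is expected to handle, rounds up, and clamps to [min_replicas, max_replicas].
    """
    if target_per_replica <= 0:
        raise ValueError("target_per_replica must be positive")
    if min_replicas > max_replicas:
        raise ValueError("min_replicas must not exceed max_replicas")

    # Round up so that no replica is asked to exceed its concurrency target.
    needed = math.ceil(in_flight_requests / target_per_replica)

    # Clamp to the per-workload bounds.
    return max(min_replicas, min(max_replicas, needed))


if __name__ == "__main__":
    for load in (0, 45, 1000):
        print(load, desired_replicas(load, target_per_replica=8))
```

With these assumed defaults, an idle workload stays at the minimum replica count, while a traffic spike is capped at the maximum, which is one way a per-workload policy can keep utilization high without unbounded scale-out.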

Minimize instance costs

Cluster autoscaling - elastically scale your cluster to meet demand.

Spot instances - run workloads on spot instances without sacrificing reliability.

Multi-instance - use multiple instance types to optimize the price-performance ratio per workload.

Control your spend

Workload observability - monitor latency and resource utilization with pre-built dashboards.

Cost transparency - visualize your costs using the latest AWS pricing information.

Predictable spend - set limits on resource consumption globally and per workload.
