Skip to content

Releases: cortexlabs/cortex

v0.34.0

27 Apr 23:41
Compare
Choose a tag to compare

v0.34.0

New features

Breaking changes

  • "predictor" has been renamed to "handler" throughout the product (API configuration and Python APIs). In addition, as a result of supporting additional HTTP method verbs, predict() has been renamed to handle_post() in Realtime APIs (handle_get(), handle_put(), handle_patch(), and handle_delete() are now also supported). For consistency, predict() has been renamed to handle_async() for Async APIs, and handle_batch() for Batch APIs. See the examples for Realtime, Async, and Batch APIs. Task APIs have not been changed.

Bug fixes

Docs

Misc

v0.33.0

13 Apr 21:41
Compare
Choose a tag to compare

v0.33.0

New features

Breaking changes

  • The onnx predictor type has been replaced by the python predictor type; please use the python predictor type instead (all onnx models are fully supported by the python predictor type)

Bug fixes

Misc

v0.32.0

30 Mar 23:55
Compare
Choose a tag to compare

v0.32.0

New features

Breaking changes

  • GCP support has been removed so that we can focus our efforts on improving the scalability, reliability, and security for Cortex on AWS. Cortex on GCP will still be available in v0.31. If you are currently using Cortex on GCP, our team will be happy to help you migrate to AWS or work with you to find alternative solutions. Please feel free to reach out to us on slack or email us at hello@cortex.dev if you're interested.

Bug fixes

Docs

Misc

v0.31.1

23 Mar 20:07
Compare
Choose a tag to compare

v0.31.1

Bug fixes

  • Preemptible node pools on GCP aren't autoscaling #1981 (vishalbollu)
  • Replica autoscaler targets incorrect deployments on operator restart #1982 (miguelvr)
  • Replica autoscaler is not reinitialized for running APIs on operator restart on GCP #1984 (vishalbollu)

v0.31.0

17 Mar 01:38
Compare
Choose a tag to compare

v0.31.0

New features

Breaking changes

  • on_demand_backup in cluster configuration has been removed in favour of using a cluster with a mixture of spot and on-demand nodegroups. See multi-instance documentation for aws and gcp for more details.

Bug fixes

  • Fix Python client not respecting CORTEX_CLI_CONFIG_DIR environment variable for client-id.txt #1953 (jackmpcollins)
  • Prevent threads from being stuck in DynamicBatcher #1915 (cbensimon)
  • Fix unexpected cortex logs termination by increasing buffer size #1939 (vishalbollu)
  • Decouple cluster deletion from EBS volume deletion for cortex cluster down #1954 (deliahu)
  • Fix spot/on-demand GPU instances not joining the cluster by upgrading to eksctl 0.40.0 #1955 (vishalbollu)
  • Prevent premature queue not found errors by preserving the SQS for minutes till after the job has completed #1952 (vishalbollu)

Docs

Misc

v0.30.0

03 Mar 00:30
Compare
Choose a tag to compare

v0.30.0

New features

Breaking changes

  • Remove support for installing Cortex on your own Kubernetes Cluster #1921 (RobertLucian)

Bug fixes

Docs

  • Add docs for accessing dashboard through private load balancer (docs) #1907 (deliahu)

Misc

v0.29.0

17 Feb 04:52
Compare
Choose a tag to compare

v0.29.0

New features

Breaking changes

  • The default Docker images for APIs have been slimmed down to not include packages other than what Cortex requires to function. Therefore, when deploying APIs, it is now necessary to include the dependencies that your predictor needs in requirements.txt (docs) and/or dependencies.sh (docs).

Bug fixes

Docs

  • Add documentation for using a version of Python other than the default via dependencies.sh (docs) or custom images (docs) #1862 #1779 (RobertLucian)

Misc

v0.28.0

03 Feb 20:46
Compare
Choose a tag to compare

v0.28.0

New features

Breaking changes

  • The cloudwatch dashboard has been removed as a result of our switch to Prometheus for metrics aggregation. The dashboard will be replaced with an alternative in an upcoming release.

Bug fixes

  • Fix bug which can cause requests to APIs from a Python client to timeout during cluster autoscaling #1841 #1840 (RobertLucian)
  • Fix bug which can cause downscale_stabilization_period to be disregarded during downscaling #1847 #1846 (RobertLucian)

Misc

  • AWS credentials are no longer required to connect the CLI to the cluster operator. If you need to restrict access to your cluster operator, configure the operator's load balancer to be private by setting operator_load_balancer_scheme: internal in your cluster configuration file, and set up VPC Peering. We plan in supporting a new auth strategy in an upcoming release.
  • Improve S6 error code/signal handling #1825 #1703 (RobertLucian)

v0.27.0

21 Jan 05:23
Compare
Choose a tag to compare

v0.27.0

New features

Breaking changes

  • cortex logs now streams logs from a single replica at random when there are multiple replicas for an API. The recommended way to analyze production logs is via a dedicated logging tool (by default, logs are sent to CloudWatch on AWS and StackDriver on GCP)

Bug fixes

Docs

Misc

v0.26.0

06 Jan 08:18
Compare
Choose a tag to compare

v0.26.0

New features

Breaking changes

  • The local provider has been removed. The best way to test your predictor implementation locally is to import it in a separate Python file and call your __init__() and predict() functions directly. The best way to test your API is to deploy it to a dev/test cluster.
  • Built-in support for API Gateway has been removed. If you need to create an https endpoint with valid certs, some options are to set up a custom domain or to manually create an API Gateway.
  • Prediction monitoring has been removed. We are exploring how to build a more powerful and customizable solution for this.
  • The predict CLI command has been deleted. curl, requests, etc. are the best tools for testing APIs.

Bug fixes

Docs