Skip to content

Releases: cortexlabs/cortex

v0.25.0

23 Dec 06:28
Compare
Choose a tag to compare

v0.25.0

New features

Breaking changes

  • The predictor.model_path field of the realtime api configuration has been moved to predictor.models.path. In addition, for the Python predictor type, predictor.models has been renamed to predictor.multi_model_reloading. Here is the entire API configuration schema.

Bug fixes

Docs

Misc

v0.24.1

13 Dec 05:18
Compare
Choose a tag to compare

v0.24.1

Bug fixes

  • Propagate the exit code from the predictor's initialization so that the API status is set to "error" when initialization fails #1680 #1691 (RobertLucian)

v0.24.0

09 Dec 07:01
Compare
Choose a tag to compare

v0.24.0

New features

Breaking changes

  • The Python client's deploy() function has been renamed to create_api(), and some of the argument names have changed (docs)

Bug fixes

Docs

Misc

  • Improve Python client cross Python version compatibility #1640 (vishalbollu)
  • Reinstall TensorFlow and ONNX dependencies when the Python version is overridden #1652 (vishalbollu)
  • Terminate container when bootloader script fails #1639 (vishalbollu)

v0.23.0

25 Nov 00:35
Compare
Choose a tag to compare

v0.23.0

New features

  • Update Python client deploy() to accept a Python dictionary for API configuration (previously, only a file path was supported) (docs) #1587 (vishalbollu)
  • Show API deployment history in cortex get API_NAME command #1544 #1496 (deliahu)
  • Add cortex export API_NAME and cortex export API_NAME API_ID commands to export specific and historical API deployments #1544 #1497 (deliahu)
  • Build and push python-predictor-gpu-slim image with different combinations of cuda and cudnn (cuda10.0-cudnn7, cuda10.1-cudnn7, cuda10.1-cudnn8, cuda10.2-cudnn7, cuda10.2-cudnn8, cuda11.0-cudnn8, cuda11.1-cudnn8) (docs) #1575 #1574 (deliahu)

Bug fixes

Docs

Misc

v0.22.1

19 Nov 23:39
Compare
Choose a tag to compare

v0.22.1

Bug fixes

  • Set the predictor's working directory to the root Cortex project directory #1573 #1572 (deliahu)
  • Allow max_instances to be updated via cortex cluster configure #1568 #1567 (deliahu)
  • Gracefully stop the serving container when a multi-processed cron throws exception #1560 #1552 (RobertLucian)

Docs

Misc

  • Build and upload the Python package/CLI to a public S3 bucket #1562 (vishalbollu)

v0.22.0

11 Nov 16:30
a9e166b
Compare
Choose a tag to compare

v0.22.0

New features

  • Multi-model caching: serve a collection of models that is collectively bigger than what will fit in memory (via LRU cache eviction) (docs) #1428 #619 (RobertLucian)
  • Live reloading: support updating models in running APIs by adding new versions to the model's S3 directory (docs) #1428 #1252 (RobertLucian)
  • Inter-process fairness: distribute requests within an API replica evenly across all processes #1526 #839 #1298 (RobertLucian)
  • Support requests between APIs within the same cluster (docs) #1503 #1241 (deliahu)
  • Allow overriding of CLI install path and config directory (via $CORTEX_INSTALL_PATH and $CORTEX_CLI_CONFIG_DIR) (docs) #1521 #1222 (deliahu)

Breaking changes

  • ONNX model paths in API configuration files must now point to a directory containing a single ONNX file, rather than the onnx file itself. For example model_path: s3://cortex-examples/onnx/yolov5-youtube/yolov5s.onnx becomes model_path: s3://cortex-examples/onnx/yolov5-youtube.
  • The --env/-e flag in all cortex cluster commands has been renamed to --configure-env/-e, and if not provided, the environment named aws will no longer be configured in the cortex cluster info command

Bug fixes

  • Fix intermittent failed requests during rolling updates #1526 #814 (RobertLucian)
  • Prevent CLI environments from getting overwritten when multiple cortex cluster commands are run concurrently #1520 #1410 (deliahu)

Docs

Misc

  • Stagger Predictor __init__() calls to reduce peak memory consumption #1543 #1450 (RobertLucian)
  • Add --name/-n and --region/-r flags to cortex cluster info, cortex cluster export, and cortex cluster down commands #1492 #1363 (RobertLucian)
  • Rename --env/-e flag to --configure-env/-e in cortex cluster commands and update its behavior #1533 #1412 (deliahu)
  • Disallow ARM-based instances, which are not currently supported #1536 (deliahu)
  • Validate AWS vCPU quota is sufficient for up to max_instances instances when running cortex cluster up and cortex cluster configure #1537 #1461 (deliahu)

v0.21.0

27 Oct 17:24
Compare
Choose a tag to compare

New features

Bug fixes

Docs

Misc

  • Change default local port from 8888 to 8890 to avoid port conflicts with Jupyter #1456 (vishalbollu)
  • Disallow instance types that aren't supported by NLB #1436 #1433 (deliahu)
  • Add --cluster-aws-key and --cluster-aws-secret flags to cortex cluster configure command #1404 (deliahu)
  • Add --output flag to cortex env list command #1444 (vishalbollu)

v0.20.0

29 Sep 22:37
Compare
Choose a tag to compare

v0.20.0

New features

Breaking changes

  • The valid values for api_gateway in the cluster configuration file have been changed from enabled/disabled to public/none (to match the values for networking.api_gateway in the API configuration file).

Bug fixes

New examples

Docs

Misc

  • Update sources of AWS credentials for cortex cluster * commands, and improve transparency (docs) #1378 #1229 (vishalbollu)
  • Rename cluster api_gateway config values to match API config #1335 #1334 (deliahu)
  • Set the default value for networking.api_gateway in the API configuration to none if api gateway is disabled cluster-wide #1337 #1336 (deliahu)
  • Support c6g and r6g instances #1332 #809 (deliahu)
  • Display autoscaling group activity history when cortex cluster up fails #1342 #1340 (deliahu)
  • Print debug info if cortex cluster up times out #1396 (deliahu)
  • Add Inferentia compute statistics to cortex cluster info command #1354 #1304 (RobertLucian)
  • Disable prompts in get-cli.sh if not running interactively #1372 #1371 (deliahu)
  • Update cortex help output #1398 (deliahu)

v0.19.0

25 Aug 15:31
Compare
Choose a tag to compare

New features

Breaking changes

  • kind is now a required top-level field for all API configurations. Existing APIs should add kind: RealtimeAPI. This release adds support for kind: BatchAPI and kind: TrafficSplitter.

Bug fixes

New examples

Docs

Misc

v0.18.1

30 Jun 19:38
Compare
Choose a tag to compare

Bug fixes

  • Fix dynamic axes for ONNX models #1187 #1186 (RobertLucian)
  • Fix memory node capacity calculation for multi-api configuration files #1185 (deliahu)
  • Check cluster-name tag when choosing load balancer for VPC Link integration #1173 (deliahu)

New guides

Misc