ZetaSQL defines a language (grammar, types, data model, and semantics) as well as a parser and analyzer. It is not itself a database or query engine. Instead it is intended to be used by multiple engines wanting to provide consistent behavior for all semantic analysis, name resolution, type checking, implicit casting, etc. Specific query engines may not implement all features in the ZetaSQL language and may give errors if specific features are not supported. For example, engine A may not support any updates and engine B may not support analytic functions.
ZetaSQL BigQuery Analysis Example
This codebase is being open sourced in multiple phases:
- Parser and Analyzer: Complete
- Reference Implementation: In Progress
  - Base capability: Complete
  - Function library: In Progress
- Compliance Tests: Complete
  - Includes a framework for validating compliance of arbitrary engines
- Misc tooling
  - Improved Formatter: Complete
Multiplatform support is planned for the following platforms:
- Linux (Ubuntu 20.04 is our reference platform, but others may work).
  - gcc-9+ is required; recent versions of clang may work.
- macOS (Experimental)
- Windows (version TBD)
We do not provide any guarantees of API stability and cannot accept contributions.
ZetaSQL uses the Abseil Flags library to handle command-line flags. Unless otherwise documented, all flags are for debugging purposes only and may change, stop working, or be removed at any time.
ZetaSQL uses Bazel for building and dependency resolution. After installing Bazel (check .bazelversion for the specific version we test with; other versions may work), run:
bazel build ...
See the WORKSPACE file, as it is a little unusual.
TODO: Add docker build instructions.
A very basic command line tool is available to run simple queries with the
reference implementation:
bazel run //zetasql/tools/execute_query:execute_query -- "select 1 + 1;"
The reference implementation is not yet completely released and currently supports only a subset of functions and types.
For questions, documentation, and examples of ZetaSQL's implementation of Differential Privacy, please check out https://github.com/google/differential-privacy.
ZetaSQL makes no guarantees regarding compatibility between releases. Breaking changes may be made at any time. Our releases are numbered based on the date of the commit the release is cut from. The number format is YYYY.MM.n, where YYYY is the year, MM is the two-digit month, and n is a sequence number within that month.
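As an illustration of the YYYY.MM.n format, here is a minimal Python sketch that parses a release number into its parts so releases can be compared chronologically. It is not part of ZetaSQL; the names `ReleaseVersion` and `parse_release` and the sample version strings are hypothetical, and the sketch assumes n is a plain non-negative integer with no fixed width.

```python
import re
from typing import NamedTuple


class ReleaseVersion(NamedTuple):
    """Hypothetical container for a YYYY.MM.n release number."""
    year: int
    month: int
    sequence: int


# Four-digit year, two-digit month, then a sequence number (assumed unpadded).
_RELEASE_PATTERN = re.compile(r"([0-9]{4})\.([0-9]{2})\.([0-9]+)")


def parse_release(version: str) -> ReleaseVersion:
    """Parse a release string such as '2024.03.1' into its components."""
    match = _RELEASE_PATTERN.fullmatch(version)
    if match is None:
        raise ValueError(f"not in YYYY.MM.n format: {version!r}")
    year, month, sequence = (int(group) for group in match.groups())
    if not 1 <= month <= 12:
        raise ValueError(f"month out of range in {version!r}")
    return ReleaseVersion(year, month, sequence)


if __name__ == "__main__":
    # Made-up version strings, used only to show ordering.
    releases = ["2024.03.1", "2023.12.2", "2024.03.10"]
    for release in sorted(releases, key=parse_release):
        print(release)
```

Comparing the parsed tuples rather than the raw strings keeps the ordering correct when the sequence number reaches two digits, since a string comparison would place "2024.03.10" before "2024.03.2".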
This is not an officially supported Google product.