Skip to content

Latest commit

 

History

History
74 lines (42 loc) · 6.89 KB

architecture.md

File metadata and controls

74 lines (42 loc) · 6.89 KB

Architecture

The following figure shows the architecture of the Blaze consisting of ingress, frontend, backend and auth provider services.

Blaze consists of four services, the nginx ingress, the frontend, the backend and the Keycloak auth provider. The nginx ingress is responsible for TLS termination and routing HTTP requests to the corresponding components. The frontend contains a Web UI that uses server-side and client-side technologies. The backend provides the FHIR API and a custom Admin API. Keycloak is currently the supported auth provider that can be used with Blaze. Others may follow. Also Keycloak doesn't have to be deployed behind the nginx ingress.

Ingress

The main role of the ingress is to provide TLS termination for the Blaze frontend and backend which is not available for the individual services. In the above shown diagram, the ingress provides routes for the browser going exclusively to the frontend component, direct API routes which can be used by API clients like blazectl and routes for the auth provider. However the ingress doesn't have to be implemented using nginx. Other reverse proxies like Apache, Envoy or Traefik can also be used. Furthermore separate ingress services can be used for the frontend, backend and auth provider.

Frontend

The frontend contains a Web UI that covers the FHIR API and the custom Admin API of Blaze. It's a standalone service that delivers server-side rendered HTML, intercepts possible browser-side API requests and directly talks to the backend. Authentication is done by server-side OAuth2 Authorization Code Flow and the tokens are stored via secure, HTTP only, encrypted JWT session cookies. User credentials are only visible to the auth provider. Tokens are not accessible via Javascript. So all browser-side requests are authorized by the frontend server-side via the session cookie and backend requests are only done by the server side of the frontend.

The frontend is stateless and can be deployed more than once.

Backend

The following figure shows the architecture of the Blaze backend.

In this architecture, the FHIR API on the left is supported by the Queries, Resource Pulls and Transactions functionalities, provided by the Database Node API. The Database Node itself consists of an embedded RocksDB key-value store that holds all Indices required to answer the queries and an Indexer that creates those indices from transactions. The Resource Pulls functionality accesses the Resource Store directly, which is a key-value store from resource content hash (key) to resource content (value). The Transactions functionality pushes transaction commands to the Transaction Log and resource content hashes and resource contents to the Resource Store. The Transaction Log can be standalone (RocksDB) or distributed (Kafka). In both cases only one single Transaction Log exists. The transaction commands will be streamed into the Indexer that also fetches resources from the Resource Store in order to create the local Indices.

Transaction Log

The Transaction Log is a central component in Blaze. it is used to ensure ACID (atomicity, consistency, isolation, durability) properties of Blaze by maintaining a total order of all transactions which corresponds to the isolation level Serializable.

In the standalone deployment scenario, the Transaction Log is backed by RocksDB and embedded in the overall Blaze Process, in order to keep it simple and use the same technology that the Indices use already. However other implementations are possible.

In the distributed case a single Kafka topic with a single partition is used to ensure the total order of transaction commands while multiple Database Nodes write into this topic. Performance wise, the single topic is no problem, because the transaction commands are small.

Resource Store

The Resource Store holds all versions of all resources by their content hash. This approach is similar to Git which keeps commits by their SHA1 content hashes. Using the content hash instead of the resource id ensures key-value pairs are never updated and so are easily cacheable. On top of that, older versions of resources are still available for the FHIR History interactions.

In the standalone deployment scenario, the Resource Store is embedded and backed by RocksDB for the same reason why the Transaction Log uses RocksDB. Again other implementations are possible.

In the distributed case a Cassandra database is used. However adapters for any suitable central key-value store could be implemented. Even relational databases like Postgres would be possible. Cassandra was chosen as first implementation, because it can be deployed in a cluster, so that it will scale and be high available.

Database Node

As written already above, the Database Node represents the data access layer in Blaze. In the distributed deployment scenario, a Database Node will exist in every Blaze Process. This also means that every Blaze Process contains it's own set of Indices and can answer Queries without relaying on central resources. Only at the point were resources actually should be returned by the FHIR API, and a cache miss in the Resource Cache happens, access to the central Resource Store is needed. The Transaction Log is not needed for Queries and Pulls at all.

One important note is, that Blaze uses full replication over all Database Nodes. In order to ensure full consistency and keep the implementation simple, sharding isn't a feature of Blaze yet. That means that the storage requirements for the Indices grow with the number of Database Nodes deployed.

Blaze Process

Each Blaze Process runs in a JVM. In the standalone case, in addition to the FHIR API and the Database Node, the Blaze Process contains also the Transaction Log and the Resource Store. In the distributed case, both the Transaction Log and the Resource Store will run in separate processes, meaning in the respective processes of their implementations.

Authorization

The backend authorizes each request optionally by validating the signature of bearer tokens against the public key of the auth provider.

Other Work

The architecture of Blaze is inspired by the architecture of XTDB but optimized for the FHIR use-case.

Auth Provider

Currently the only supported auth provider is Keycloak. Others can be added in the future. The auth provider doesn't have to be part of teh Blaze deployment. External Keycloak instanced can be used.