# General flow

The main flow of the application can be illustrated as follows:

```mermaid
flowchart TB
  Geyser --> ZeroMQ([ZeroMQ]) -->|"1. Receive\ntransaction data"| Ingester
  BigTable[(BigTable)] --->|"1. Receive\ntransaction data"| Ingester
  SolanaRPC --->|"1. Receive\ntransaction data"| Ingester
  Ingester -->|"2. Schedule metadata\ndownload task"| Postgre[("PostgreSQL\n(download queue\n&\nindex)")]
  JSONDownloader -->|"3. Take metadata\ndownload task"| Postgre
  JSONDownloader["JSON\nDownloader"] -->|"4. Download\nmedia\nmetadata"| DigitalAssetResource["Digital\nAsset\nResource"]

  subgraph system[" "]
    Backfiller -->|"0. Trigger historical\ndata backfill\nif required"| Ingester
    Ingester --> RocksDB[("RocksDB\n(transaction & asset metadata)")]
    JSONDownloader -->|"5. Save\nmetadata"| RocksDB
    JsonRPCServer["JSON RPC\nserver"] -->|"9. Retrieve\ndesired\nmetadata"| RocksDB
  end

  RocksDB --> Synchronizer -->|"6. Update index"| Postgre

  Clients <--->|"7. Request digital assets\ndirectly by key\nor by fields"| JsonRPCServer
  JsonRPCServer -->|"8. Search key by fields\nusing index\n(optional)"| Postgre
```

Data preparation part:

  1. The Ingester continuously receives fresh transactions from Geyser, which sends them via ZeroMQ (similar data can also be obtained from Solana RPC nodes and Google BigTable).
  2. The Ingester filters the newly fetched transactions (we are interested only in media-related records) and saves them as download tasks in the PostgreSQL db (we also use PostgreSQL as a queue).
  3. The JsonDownloader (interface::json::JsonDownloader) then picks up the next task from the PostgreSQL "queue" (see the sketch after this list).
  4. The JsonDownloader fetches the media metadata from the source where the actual media asset is persisted.
  5. The fetched metadata is saved to RocksDB.
  6. The Synchronizer (a separate process) updates the index in PostgreSQL to enable searching by different metadata fields.
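
To make the handoff in steps 3–5 concrete, here is a minimal sketch of the downloader contract. It is an illustration only: the real trait lives at `interface::json::JsonDownloader`, and its exact shape may differ; the task struct, error type, and method signature below are assumptions.

```rust
// A minimal sketch of the downloader contract, NOT the project's actual
// trait: interface::json::JsonDownloader may differ in shape and naming.
use async_trait::async_trait;

/// A download task as scheduled by the Ingester in PostgreSQL
/// (field names are illustrative).
pub struct MetadataDownloadTask {
    pub asset_key: String,
    pub metadata_url: String,
}

#[derive(Debug)]
pub enum DownloadError {
    Http(String),
    InvalidJson(String),
}

#[async_trait]
pub trait JsonDownloader {
    /// Fetch the JSON metadata from wherever the asset is persisted
    /// (Arweave, IPFS, a plain HTTP server, ...).
    async fn download_file(&self, url: &str) -> Result<String, DownloadError>;
}
```

Keeping the downloader behind a trait like this lets the queue-processing loop stay agnostic about where metadata is hosted, and makes it easy to swap in a mock implementation for tests.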

There is also a backfill mechanism that is used to load historical data (metadata of already existing transactions).

Data search part:

  1. A client (an end user or another service) makes a call to our JSON RPC endpoint, specifying the fields it wants to search by.
  2. The server first queries the PostgreSQL index to find the ID of the required record.
  3. Using that ID, the server fetches the required metadata from RocksDB and returns it to the client (see the sketch after this list).
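
The sketch below illustrates this two-step lookup: the PostgreSQL index resolves search fields to asset keys, and RocksDB holds the full metadata. All type and method names here (`PgIndex`, `RocksStore`, `search_assets`) are hypothetical stand-ins, not the project's actual API; plain `HashMap`s stand in for the two databases.

```rust
// Hypothetical sketch of the JSON RPC server's lookup path: resolve keys
// via the index (PostgreSQL), then load full records from the key-value
// store (RocksDB). HashMaps stand in for both databases.
use std::collections::HashMap;

type AssetKey = String;

/// Stand-in for the PostgreSQL index: maps a searchable field value
/// (here, an owner) to the keys of matching assets.
struct PgIndex {
    by_owner: HashMap<String, Vec<AssetKey>>,
}

/// Stand-in for the RocksDB store: full metadata keyed by asset key.
struct RocksStore {
    assets: HashMap<AssetKey, String>,
}

fn search_assets(pg: &PgIndex, rocks: &RocksStore, owner: &str) -> Vec<String> {
    // Step 8: the (optional) index lookup resolves the search field
    // to a list of asset keys.
    let keys = pg.by_owner.get(owner).cloned().unwrap_or_default();

    // Step 9: fetch the full metadata for each key from the store.
    keys.iter()
        .filter_map(|key| rocks.assets.get(key).cloned())
        .collect()
}

fn main() {
    let pg = PgIndex {
        by_owner: HashMap::from([("alice".to_string(), vec!["key1".to_string()])]),
    };
    let rocks = RocksStore {
        assets: HashMap::from([("key1".to_string(), r#"{"name":"Asset #1"}"#.to_string())]),
    };
    assert_eq!(search_assets(&pg, &rocks, "alice").len(), 1);
}
```

Splitting the search this way keeps RocksDB reads to cheap point lookups, while all field-based filtering is delegated to the PostgreSQL index.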