From 7a6375fe0e7be4dadb0be04587010d476d1c2fb6 Mon Sep 17 00:00:00 2001 From: "John A. Lotoski" <39752197+johnalotoski@users.noreply.github.com> Date: Sat, 14 Oct 2023 09:43:29 -0500 Subject: [PATCH] SRE update 2023-10-13 (#269) --- blog/2023-10-13-sre.md | 66 ++++++++++++++++++++++++++++++++++++++++++ 1 file changed, 66 insertions(+) create mode 100644 blog/2023-10-13-sre.md diff --git a/blog/2023-10-13-sre.md b/blog/2023-10-13-sre.md new file mode 100644 index 0000000000..efc3e5f62b --- /dev/null +++ b/blog/2023-10-13-sre.md @@ -0,0 +1,66 @@ +--- +title: SRE Team Update +slug: 2023-10-13-sre +authors: johnalotoski +tags: [sre] +hide_table_of_contents: false +--- + +## High level summary + +The SRE team continues work on cardano environment improvements and general environment maintenance. + +Some notable recent changes, updates or improvements include: +* Sanchonet environment was updated to 8.5.0-pre. +* Cardano-parts now supports cardano-db-sync, cardano-smash, cardano-faucet and grafana cloud monitoring + +## Lower level summary + +### Capkgs +* Various improvements and fixes including: + * Add GHA cron schedule + * Add nix auto-gc to avoid running out of storage during large package set builds + * Add new packages to capkgs + * Reduce runner storage requirement leaving more room for builds + * Restructure capkgs attribute names to avoid package name collisions + * Return to non-musl builds for cardano packages to retain journald compatability + * Update cache usage to from file level to folder level to reduce network and latency overhead + * Commit diff: [capkgs-compare](https://github.com/input-output-hk/capkgs/compare/a274b72...b197e22) + +### Cardano-parts +* Updates cardano-node-ng to 8.5.0-pre and adds a Conway era automation job: [cardano-parts-pull-16](https://github.com/input-output-hk/cardano-parts/pull/16) +* Dbsync, smash, faucet and more: [cardano-parts-pull-15](https://github.com/input-output-hk/cardano-parts/pull/15) + * Adds cardano-db-sync, cardano-faucet, cardano-postgres, cardano-smash, profiles and/or services and related changes + * Adds nginx vhost metrics exporter profile + * Adds smash registered-relay-dump service and exporter for use until legacy relay nodes are scaled down + * Adds bash *-ng autocompletion compatible wrappers + * Adds a list-machines just recipe using nushell dataframe outer joins and scj ssh_config parser for fast cluster evals of machine state overview + * Adds downstream grafana cloud dashboard as templates + * Adds downstream grafana cloud alerts as templates + * Updates grafana-agent profile with new exporter scrape hooks: cardano-db-sync, cardano-faucet, nginx-vts, varnish + * Updates the basic profile with IOG cache and commonly used bins + * Updates the pre-release profile to support cardano-db-sync, cardano-faucet, cardano-smash *-ng versioning + * Updates flakeModule jobs with new conway era automation and additional IO encryption shimming and file type checks + * Updates .sops.yaml template for supporting faucet secrets, workbench secrets, state-demo secrets + * Updates the Justfile template with terraform fixes for workspace switching and provider auto-reconfiguration + * Updates the cloudFormation terraformState template with stack modifications to preserve all resources in case of deletion + * Updates the colmena template with dbsync, smash, faucet machines profiles and roles + * Improves prior cardano-postgres modules to now automatically tune pg parameters based on machine cpuCount, memMiB and desired conns + * Bumps capkgs node-ng to 8.5.0-pre + +### Cardano-playground +* Dbsync, smash, faucet and more: [cardano-playground-pull-3](https://github.com/input-output-hk/cardano-playground/pull/3) + * Adds a list-machines just recipe using nushell dataframe outer joins and scj ssh_config parser for fast cluster evals of machine state overview + * Adds dbsync, smash, faucet machines and corresponding metrics exporters, dashboards and alerts + * Moves the flake.cardano-parts.cluster.group attrSet name to groups to accurately reflect the plurality and the upstream corresponding change + * Optimizes machine sizes + * Updates .sops.yaml for supporting faucet secrets, workbench secrets, state-demo secrets + * Updates the cloudFormation terraformState file with stack modifications to preserve all resources in case of deletion + * Updates the cluster isNg definition to support cardano-db-sync, cardano-faucet, cardano-smash *-ng versioning + * Updates the Justfile with terraform fixes for workspace switching and provider auto-reconfiguration + +### Cardano-world +* Update sanchonet network to cardano-node 8.5.0: [cardano-world-commit](https://github.com/input-output-hk/cardano-world/commit/c8a480fa7cdb6eff4f8b4565450c57361ddfdec4) + +### Sanchonet-demo +* Update for cardano-node 8.5.0, conway job recipes and cardano-parts interface changes: [sanchonet-demo-commit](https://github.com/input-output-hk/sanchonet-demo/commit/4124465c0c663d11c8cf4ef80af18a5246f89b31)