jecio/MESSAGES_RELEASE_FLOW.md
2026-02-14 21:10:26 +01:00

Messages Release Flow

This flow creates a Nessie tag for lake.db1.messages, writes a JSON manifest for the release, and appends a row to the release registry table lake.db1.releases_v2.

Run on lakehouse-core

ssh niklas@lakehouse-core.rakeroots.lan 'cd /tmp/jecio && ./create-messages-release-via-spark-container.sh'

Custom release name

ssh niklas@lakehouse-core.rakeroots.lan 'cd /tmp/jecio && ./create-messages-release-via-spark-container.sh rel_2026-02-14_messages-v1'
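
The custom name above appears to follow the pattern rel_<date>_<table>-v<version>. As a minimal sketch, assuming that pattern is the intended convention (it is inferred from the single example, and the script's own default name is not documented here), a small helper can build such names consistently. The function name make_release_name and the choice of UTC for the default date are hypothetical.

```shell
# Hypothetical helper: build a release name shaped like the example above.
# Usage: make_release_name <table> <version> [date]
make_release_name() {
  table="$1"
  version="$2"
  # Assumption: default to today's date in UTC; pass a third argument to pin it.
  day="${3:-$(date -u +%Y-%m-%d)}"
  printf 'rel_%s_%s-v%s\n' "$day" "$table" "$version"
}

# Reproduces the name used in the example command.
make_release_name messages 1 2026-02-14
```

The result can be passed as the first argument to create-messages-release-via-spark-container.sh in the same way as the literal name above.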

Outputs

  • Manifest file written to ./manifests/<release_name>.json
  • Nessie tag <release_name> created at the current hash of the main branch (or reused if the tag already exists)
  • Registry row appended to lake.db1.releases_v2
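
The first output can be checked without Spark. The sketch below only tests that a manifest file for a given release exists and is non-empty; it does not validate the JSON, since the manifest schema is not described here. The function name manifest_exists is hypothetical, and the default directory assumes the check runs from the same working directory the script was started in.

```shell
# Hypothetical helper: succeed only if the manifest for a release exists and is non-empty.
# Usage: manifest_exists <release_name> [manifest_dir]
manifest_exists() {
  release_name="$1"
  # Assumption: manifests are written to ./manifests relative to the current directory.
  manifest_dir="${2:-./manifests}"
  [ -s "${manifest_dir}/${release_name}.json" ]
}
```

For example, `manifest_exists rel_2026-02-14_messages-v1 && echo "manifest present"` prints the message only when the file is there. Whether the file lands on the host or inside the spark container depends on how the script mounts its output directory, which is not stated here.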

Verify

ssh niklas@lakehouse-core.rakeroots.lan "docker exec spark /opt/spark/bin/spark-sql \
  --properties-file /opt/lakehouse/spark-conf/lakehouse-spark-defaults.conf \
  --packages 'org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:1.10.1,org.apache.iceberg:iceberg-aws-bundle:1.10.1,org.projectnessie.nessie-integrations:nessie-spark-extensions-3.5_2.12:0.104.5' \
  -e \"SELECT release_name, table_identifier, snapshot_id, created_at_utc \
      FROM lake.db1.releases_v2 \
      WHERE table_identifier='lake.db1.messages' \
      ORDER BY created_at_utc DESC LIMIT 10\""