jecio/MESSAGES_SCHEMA.md

24 lines
785 B
Markdown
Raw Permalink Normal View History

# Messages Schema
Creates Iceberg table `lake.db1.messages` with ingest fields:
- `thread_id` STRING
- `message_id` STRING
- `sender` STRING
- `channel` STRING
- `sent_at` TIMESTAMP
- `body` STRING
- `metadata_json` STRING
## Run on lakehouse-core
```bash
ssh niklas@lakehouse-core.rakeroots.lan 'cd /tmp/jecio && ./create-messages-table-via-spark-container.sh'
```
## Verify
```bash
ssh niklas@lakehouse-core.rakeroots.lan "docker exec spark /opt/spark/bin/spark-sql --properties-file /opt/lakehouse/spark-conf/lakehouse-spark-defaults.conf --packages 'org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:1.10.1,org.apache.iceberg:iceberg-aws-bundle:1.10.1,org.projectnessie.nessie-integrations:nessie-spark-extensions-3.5_2.12:0.104.5' -e 'DESCRIBE TABLE lake.db1.messages'"
```