Golden Path

Build Flow

Execution-path details for how the automation is staged and run.

Orchestration Plumbing

Use this page to answer the execution-boundary questions:

what runs on the operator workstation
what runs on virt-01
what runs on bastion-01
when the tracked runner files move from local state to bastion state
why the dashboard can show different data before and after handoff

Use AUTOMATION FLOW for the build order. Use this page when the question is "who is actually running this step right now?"

If the question is "what order should I run things in?", use AUTOMATION FLOW instead. This page is specifically about execution ownership and handoff.

The Short Version

The automation is not one uninterrupted process.

It is a staged execution chain:

outer AWS prep
workstation-side bootstrap and bastion staging
bastion-side lab orchestration

The key transition is:

site-bootstrap.yml is a workstation-driven phase
site-lab.yml starts on the workstation, but only for validation and bastion staging
the real long-running site-lab body then moves to bastion

Execution Contexts

Phase Split

Prep

This is the outer substrate:

cloudformation/deploy-stack.sh tenant
cloudformation/deploy-stack.sh host

That gets you:

VPC/subnet/security-group shape
virt-01
attached guest EBS volumes
public ingress to the hypervisor

`site-bootstrap.yml`

This is still workstation-led.

It imports:

What that means in practice:

the operator workstation talks to metal-01
virt-01 is bootstrapped
bastion-01 is created and configured
the repo and execution inputs are staged onto bastion

It does not automatically continue into site-lab.yml.

`site-lab.yml`

This starts on the workstation but does not stay there.

The real sequence is:

sequenceDiagram participant W as Workstation participant M as metal-01 participant B as bastion-01 W->>W: validate-orchestration.sh W->>M: run bastion-stage.yml M->>B: refresh staged repo, inventory, secrets, helpers W->>B: run_bastion_playbook.sh playbooks/site-lab.yml B->>B: create /var/tmp/bastion-playbooks/site-lab.* B->>B: run the long-lived site-lab body

So when someone asks "why doesn't bastion see site-lab yet?", the usual answer is:

because the run is still in workstation-side validation or bastion staging
the bastion-side tracked runner does not exist until after handoff

Runner Files And Telemetry

Workstation-tracked state

Workstation-side wrappers write state under:

~/.local/state/calabi-playbooks/

Typical files:

<stem>.pid
<stem>.log
<stem>.rc
<stem>.remote.env

Examples:

site-bootstrap.pid
site-bootstrap.log
site-lab.log
site-lab.remote.env

site-lab.remote.env is the handoff marker. When it exists, the workstation knows where the bastion-side runner lives.

Bastion-tracked state

The long-running bastion-side runner writes state under:

/var/tmp/bastion-playbooks/

Typical files:

site-lab.pid
site-lab.log
site-lab.rc

Those files do not exist during:

workstation validation
bastion-stage.yml

They appear only after the SSH handoff into:

scripts/run_bastion_playbook.sh

Wrapper Responsibilities

`scripts/run_local_playbook.sh`

Use this for workstation-side tracked execution.

It is appropriate for:

site-bootstrap.yml
other workstation-resident playbook runs

It is responsible for:

creating local pid/log/rc files
making workstation dashboards usable before bastion exists

`scripts/run_remote_bastion_playbook.sh`

Use this when the intended steady-state runner is bastion.

It is responsible for:

running the validation lane locally
refreshing bastion staging locally
SSH handoff to bastion
recording the remote runner paths in <stem>.remote.env

This is why site-lab.yml has two visible telemetry phases:

local site-lab.log during validation/staging
bastion /var/tmp/bastion-playbooks/site-lab.log after handoff

`scripts/run_bastion_playbook.sh`

This is the bastion-native runner.

It is responsible for:

the actual long-running site-lab.yml body
tracked state under /var/tmp/bastion-playbooks/

Dashboard Behavior

The dashboard follows the same split.

Before bastion handoff

lab-dashboard.sh site-lab on the workstation can only see:

local validation output
local bastion-stage.yml output

So during that window:

workstation dashboard is authoritative
bastion dashboard will show nothing for site-lab

After bastion handoff

Once site-lab.remote.env exists and bastion creates:

/var/tmp/bastion-playbooks/site-lab.pid
/var/tmp/bastion-playbooks/site-lab.log

the dashboard can follow bastion-native telemetry.

flowchart TD A[site-lab launched from workstation] --> B[local validation] B --> C[bastion-stage.yml] C --> D{remote.env exists?} D -- no --> E[workstation dashboard only] D -- yes --> F[bastion runner exists] F --> G[workstation dashboard can follow bastion state] F --> H[bastion dashboard can follow site-lab]

Why The Split Exists

The split is intentional, not accidental.

Reasons:

the workstation owns the outer SSH path to virt-01
the bastion owns the inner lab network and support-service reachability
the project wants the real long-running lab build to happen from the same host that later admin workflows use

That means the handoff is part of the design, not just a helper-script detail.

Common Misreads

"Bastion dashboard is broken because it does not show `site-lab`"

Usually false.

Often true instead:

site-lab has not handed off yet
bastion runner files do not exist yet

"The run stopped after `site-bootstrap.yml`"

That is normal unless the next command was started.

Current design is still a two-step operator flow:

site-bootstrap.yml
site-lab.yml

"The local `site-lab` log is the real lab run"

Only partly.

Before handoff:

After handoff:

the local log is mostly wrapper and handoff context
the bastion log is the real long-running orchestration log

Operator View

If you are just running the build, the practical mental model is:

prepare AWS
run site-bootstrap.yml
run site-lab.yml
watch the workstation dashboard first
expect bastion tracking only after the handoff is complete

If you are debugging orchestration plumbing, this is the order to check:

local pid/log/rc
local validation lane
bastion-stage.yml
local remote.env
bastion /var/tmp/bastion-playbooks/*

Calabi

Orchestration Plumbing

The Short Version

Execution Contexts

Phase Split

Prep

`site-bootstrap.yml`

`site-lab.yml`

Runner Files And Telemetry

Workstation-tracked state

Bastion-tracked state

Wrapper Responsibilities

`scripts/run_local_playbook.sh`

`scripts/run_remote_bastion_playbook.sh`

`scripts/run_bastion_playbook.sh`

Dashboard Behavior

Before bastion handoff

After bastion handoff

Why The Split Exists

Common Misreads

"Bastion dashboard is broken because it does not show `site-lab`"

"The run stopped after `site-bootstrap.yml`"

"The local `site-lab` log is the real lab run"

Operator View

Continue

Orchestration Plumbing

The Short Version

Execution Contexts

Phase Split

Prep

site-bootstrap.yml

site-lab.yml

Runner Files And Telemetry

Workstation-tracked state

Bastion-tracked state

Wrapper Responsibilities

scripts/run_local_playbook.sh

scripts/run_remote_bastion_playbook.sh

scripts/run_bastion_playbook.sh

Dashboard Behavior

Before bastion handoff

After bastion handoff

Why The Split Exists

Common Misreads

"Bastion dashboard is broken because it does not show site-lab"

"The run stopped after site-bootstrap.yml"

"The local site-lab log is the real lab run"

Operator View

Continue

`site-bootstrap.yml`

`site-lab.yml`

`scripts/run_local_playbook.sh`

`scripts/run_remote_bastion_playbook.sh`

`scripts/run_bastion_playbook.sh`

"Bastion dashboard is broken because it does not show `site-lab`"

"The run stopped after `site-bootstrap.yml`"

"The local `site-lab` log is the real lab run"