The "don't reach across *-core crates" rule was being read as prohibiting any cross-crate import, but the load-bearing intent is to keep *behavior* decoupled (so cluster-mode can swap implementations behind traits in shared). Importing transport DTOs across crates is fine — ExecRequest/ExecResponse/ExecError live in executor-core because that's where they're produced, and the v1.1.1 dispatcher in manager-core legitimately consumes them. Bright line: structs/enums/type-aliases crossing is fine; traits, functions, and service handles crossing is not. Surfaced during the v1.1.1 audit (see REVIEW.md §4).
8.5 KiB
CLAUDE.md
This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.
Project
PiCloud is a self-hosted, event-driven serverless compute platform. Users upload Rhai scripts, get HTTP endpoints. Optimized for solo-dev / consumer hardware (single node MVP, multi-node cluster in v1.3+).
Authoritative design: serverless_cloud_blueprint.md. The blueprint is a living document — when architecture decisions are made in conversation that contradict it, treat the latest decision as truth and update the blueprint.
Current focus (Phase 4, v1.1.0): SDK foundation + stdlib utilities — the shape every v1.1.x service module hangs off, see docs/sdk-shape.md. Stdlib reference at docs/stdlib-reference.md. Subsequent v1.1.x releases (KV in v1.1.1, docs in v1.1.2, …) fill it in; see blueprint §12 for the full table. Phase 3 shipped end-to-end: admin auth, multi-app scoping, and Phase 3.5 capability gating (manager-core::authz::{can, require, Capability} + migration 0006_users_authz.sql). Every v1.1+ table starts with app_id UUID NOT NULL REFERENCES apps(id) ON DELETE CASCADE and every Rhai SDK call resolves its app from the execution context.
Three-Service Architecture
The platform splits into three logical services, each backed by a *-core library crate so the same logic runs in single-process MVP mode and split-process cluster mode:
| Service | Role | Library crate |
|---|---|---|
| Manager | Control plane: script CRUD, scheduling/cron, dashboard backend, config. Single-writer to Postgres. | manager-core |
| Orchestrator | Per-node ingress: receives HTTP (later SMTP, queue) events, resolves script, dispatches to local executor. Stateless. | orchestrator-core |
| Executor | Per-node compute: runs Rhai scripts in a sandboxed engine. Stateless. | executor-core |
In MVP, all three run in one process (picloud binary). In cluster mode, each runs as its own binary on each node, with one manager total and one orchestrator + executor per node.
Key boundary: the orchestrator never imports executor-core directly — it depends on an ExecutorClient trait. The local impl calls executor-core in-process; the remote impl is an HTTP client. Same pattern keeps cluster mode a swap, not a rewrite.
Path Scheme
Versioned API surfaces live under /api/v{N}/.... See docs/versioning.md for the full scheme.
/api/v1/admin/*— manager (control plane: script CRUD, routes CRUD + check + match, logs, config; apps CRUD once Phase 3b lands)/api/v1/execute/{id}— orchestrator (data plane: invoke a script by ID, always-available bypass)/admin/*— dashboard SPA (SvelteKit,paths.base = '/admin')/healthz— liveness (string"ok")/version— every compatibility-surface version +public_base_url(JSON)- everything else — orchestrator's user-route matcher: user scripts bind to arbitrary paths via
POST /api/v1/admin/scripts/{id}/routes; if no route matches, picloud returns 404 with a JSON error.
Reserved path prefixes (rejected at route creation): /api/, /admin/, /healthz, /version.
Caddy fronts everything. Same Caddyfile shape works for single-node and cluster — only upstream targets change.
Param syntax convention: route paths use :name (e.g., /users/:id); domains (once apps land) use {name} (e.g., {tenant}.example.com). These are deliberately distinct — never use : in a domain context or {} in a route-path context.
Two-phase dispatch (Phase 3b onward): the orchestrator first resolves Host → app (most-specific domain claim wins), then runs that app's route trie. The route matcher itself is unchanged and never sees other apps' routes.
Tech Stack
- Rust 1.92+ workspace, pinned via
rust-toolchain.toml - Axum for HTTP, Tokio async, sqlx for Postgres
- Rhai embedded scripting (in
executor-core) - PostgreSQL 15+ with
pgcrypto. v1.1+ data-plane tables use JSONB for value columns (hstore was considered for KV and rejected — see blueprint §8.1). - SvelteKit dashboard, static adapter, CodeMirror 6 for the script editor
- Caddy 2 reverse proxy (auto-HTTPS in prod)
- Docker Compose for dev and single-node prod
Common Commands
# Rust workspace
cargo check --workspace
cargo test --workspace
cargo fmt --all
cargo clippy --all-targets --all-features -- -D warnings
# Run the all-in-one binary (MVP entry point)
cargo run -p picloud
# Run a single test
cargo test -p executor-core sandbox::tests::respects_operation_budget
# Dashboard (from dashboard/)
npm run dev
npm run build
npm run check
# Full stack (once docker-compose.yml exists)
docker compose up
docker compose down -v # reset Postgres data
Workspace Layout
crates/
shared/ # cross-cutting types (Script, IDs, error enum, db pool)
executor-core/ # Rhai engine, sandbox, ctx, log, SDK
orchestrator-core/ # event ingress + ExecutorClient trait + dispatch
manager-core/ # control plane: repos, scheduler, config
picloud/ # ★ MVP all-in-one binary
picloud-manager/ # cluster mode binary (skeleton)
picloud-orchestrator/ # cluster mode binary (skeleton)
picloud-executor/ # cluster mode binary (skeleton)
dashboard/ # SvelteKit
caddy/ # Caddyfile, Caddyfile.prod
docker/ # Dockerfiles
docs/
git-workflow.md # trunk-based workflow
architecture.md # (TBD)
Working Rules
- Honor the three-service boundary. Don't reach across
*-corecrates for behavior. Iforchestrator-coreneeds to invoke logic frommanager-core, define a trait insharedand inject the impl — keep implementations decoupled. Transport DTOs are not behavior: types likeExecRequest/ExecResponse/ExecErrorrepresent values produced or consumed across the wire, and depending on the originating crate's type definitions is fine. The bright line is "don't call across crates," not "don't import types." When in doubt: if the imported item is astruct/enum/type aliaswith no methods (or only data-shape methods), it's a DTO and crossing is fine; if it's a trait, function, or service, define the abstraction insharedand inject. executor-corehas no Postgres dependency. Data-plane services (kv, docs, users — v1.1+) come in via injectedServiceProvidertraits.- Database writes only from
manager-core.orchestrator-corereads scripts (cached);executor-coredoesn't touch the DB. - Stateful SDK services use the handle pattern +
SdkCallCx. Collection-scoped surfaces look likekv::collection("x").get(k), notkv::get("x", k). Every service trait method takes&SdkCallCxand MUST deriveapp_idfromcx.app_id— never trust a script-passedapp_id. That is the cross-app isolation boundary. See docs/sdk-shape.md. - MVP builds only the
picloudall-in-one binary. The three split binaries exist as skeletons so the crate boundaries stay honest; flesh them out only when cluster mode is being implemented. - Trunk-based dev. See docs/git-workflow.md. No long-lived branches. Feature flags for incomplete work.
Runtime configuration
Environment variables consumed by the picloud binary:
| Variable | Default | Purpose |
|---|---|---|
PICLOUD_BIND |
0.0.0.0:8080 |
HTTP listen address. Port 8080 is owned by another process on this host — override locally. |
PICLOUD_MAX_CONCURRENT_EXECUTIONS |
32 |
Global concurrency cap on data-plane script executions. Overflow returns HTTP 503 with Retry-After: 1 immediately (no queue). |
DATABASE_URL |
— | Required. Postgres connection string. |
PICLOUD_SESSION_TTL_HOURS |
24 |
Sliding-window session lifetime. |
PICLOUD_SANDBOX_MAX_* |
conservative defaults | Per-knob admin ceilings on Rhai sandbox overrides. See manager-core::sandbox::SandboxCeiling. |
Out of MVP
Queue triggers, cron triggers, SMTP ingress, KV / docs / email / users / HTTP SDKs in scripts, interceptors, workflows, function-to-function invoke(), secrets, metrics dashboard. All deferred to v1.1+ per the blueprint. Don't pre-build for them — but don't make decisions that close the door on them either.
Pulled forward to Phase 3 (pre-v1.1): admin auth, multi-app scoping. Cross-app data sharing (export/import) stays at v1.3+; the initial cut enforces strict isolation. See blueprint §11.5.