
Git-Native Pipelines

Write SQL, commit to Git, and Delta Forge automatically discovers your pipelines, extracts lineage, computes execution order, and schedules runs. No DAGs. No YAML. No orchestration code.

Git-First by Design

Your pipeline code lives in Git. Delta Forge does the rest.

  • Pipelines cannot exist without Git — source control is mandatory, not optional
  • Every pipeline version, change, and decision is tracked in commit history
  • Standard developer workflow: branch, develop, commit, PR, merge, auto-deploy
  • No separate DAG definition, no YAML manifests, no Python orchestration code

How It Works

  1. Connect — Link a Git repository (GitHub, Azure DevOps, GitLab, Bitbucket)
  2. Write SQL — Create .sql files with PIPELINE and SCHEDULE declarations
  3. Commit & Push — Standard Git workflow with branches and pull requests
  4. Auto-Discovery — Delta Forge scans your repo, extracts pipelines, schedules, and lineage
  5. Execute — Pipelines run automatically on schedule with computed execution order

Declarative Pipeline SQL

Schedules, pipelines, and queries — all expressed in pure SQL

pipelines/daily_etl.sql
-- Schedule: when and how to run
SCHEDULE daily_etl
  CRON '0 6 * * *'
  TIMEZONE 'America/New_York'
  TARGET_NODES ALL
  DESCRIPTION 'Daily morning ETL batch'
  RETRIES 0
  RETRY_DELAY 60
  TIMEOUT 3600
  MAX_CONCURRENT 1
  PRIORITY 10
  CATCHUP false
  NOTIFY 'team@example.com'
  WEBHOOK 'https://hooks.slack.com/...'
  ACTIVE
;

-- Pipeline: what to run
PIPELINE my_etl_pipeline
  DESCRIPTION 'Daily ETL pipeline for customer data'
  SCHEDULE 'daily_etl'
  TAGS 'etl', 'customers', 'daily'
  SLA 4.0
  FAIL_FAST true
  DEFAULTS ($run_date = '2024-01-01')
  STATUS ACTIVE
;

-- The actual SQL that runs
CREATE DELTA TABLE IF NOT EXISTS bronze.sales
LOCATION '/lake/bronze/sales'
AS SELECT *
FROM raw.csv.sales
WHERE sale_date = $run_date
ORDER BY id;

SCHEDULE defines when

Cron expressions, timezone, retries, timeout, concurrency limits, email notifications, Slack webhooks, and priority — all in one declaration.

PIPELINE defines what

Description, schedule reference, SLA targets, tags, fail-fast mode, and parameterized defaults. The pipeline points to its schedule by name.

Pure SQL follows

No special syntax after the declarations. Write the SQL you already know — CREATE TABLE, INSERT, MERGE, any standard SQL statement.

Parameterized runs

Variables like $run_date are declared with defaults in the PIPELINE block and can be overridden at execution time.

Automatic Execution Order

Delta Forge computes the DAG from your SQL — you never define it

SQL Parsing

Parses every SQL statement to find which tables each pipeline reads and which tables it writes. No annotations needed.

Dependency Graph

Builds a directed acyclic graph automatically from the read/write relationships across all pipelines in the workspace.

Topological Sort

Uses topological sort to compute execution layers. Pipelines in the same layer have no mutual dependencies and run concurrently.

Cycle Detection

Detects circular dependencies between pipelines and surfaces warnings before execution, preventing infinite loops or deadlocks.

Example: Three pipelines, automatically ordered

  • Layer 0: Pipeline A writes bronze.sales
  • Layer 1: Pipeline B reads bronze.sales, writes silver.sales
  • Layer 2: Pipeline C reads silver.sales
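A sketch of three pipeline files whose read/write tables would produce exactly this ordering. The filenames, the minimal PIPELINE form, and the column names are illustrative assumptions; only the table names come from the example above:

```sql
-- pipelines/pipeline_a.sql  (Layer 0: reads only raw sources)
PIPELINE pipeline_a;
CREATE DELTA TABLE IF NOT EXISTS bronze.sales
AS SELECT * FROM raw.csv.sales;

-- pipelines/pipeline_b.sql  (Layer 1: reads what A writes)
PIPELINE pipeline_b;
CREATE DELTA TABLE IF NOT EXISTS silver.sales
AS SELECT * FROM bronze.sales WHERE amount > 0;

-- pipelines/pipeline_c.sql  (Layer 2: reads what B writes)
PIPELINE pipeline_c;
INSERT INTO gold.sales_summary
SELECT region, SUM(amount) FROM silver.sales GROUP BY region;
```

No ordering is declared anywhere: the layers fall out of the read/write analysis alone.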

Why Git-First Matters

A fundamentally different approach to data pipeline orchestration

vs DAG-based tools

No Python DAG code to maintain. The execution graph is computed automatically from the SQL itself — read/write analysis replaces manual dependency wiring.

vs YAML orchestrators

No manifest files, no configuration drift. The pipeline IS the SQL file. One artifact, one source of truth, zero synchronization overhead.

vs UI-defined pipelines

Code review, versioning, branching, and rollback are built in via Git. No screenshots of drag-and-drop canvases in pull requests.

vs Cloud-managed workflows

No vendor lock-in. Your pipelines are portable SQL files that live in your Git repository. Move between platforms without rewriting orchestration logic.

Pipeline Capabilities

Enterprise-grade features, declared in SQL

Approval Workflows

Add --APPROVAL REQUIRED to enforce review gates before production execution. Approval clears on source change.
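A minimal sketch of how the gate might look, assuming the --APPROVAL REQUIRED comment sits at the top of the pipeline file (the pipeline name and fields here are illustrative):

```sql
--APPROVAL REQUIRED
PIPELINE customer_pii_export
  DESCRIPTION 'Exports customer PII; requires sign-off before production runs'
  SCHEDULE 'daily_etl'
  STATUS ACTIVE
;
```

Because approval clears on source change, any new commit to this file re-triggers the review gate.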

SLA Monitoring

Declare SLA targets in hours. Delta Forge tracks execution time and alerts when pipelines exceed their SLA window.

Slack & Webhook Notifications

Built-in NOTIFY and WEBHOOK fields on schedules. Get alerts on success, failure, or SLA breach without external tooling.

Parameterized Defaults

DEFAULTS block declares variables with fallback values. Override at runtime for backfills or ad-hoc executions.
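For example, a pipeline might declare its variables like this (the pipeline name, the $region variable, and the table columns are illustrative; the DEFAULTS syntax follows the earlier example):

```sql
PIPELINE sales_backfill
  DESCRIPTION 'Reloads one day of sales'
  DEFAULTS ($run_date = '2024-01-01', $region = 'us-east')
  STATUS ACTIVE
;

-- $run_date and $region resolve to the defaults above unless
-- overridden at execution time, e.g. for a backfill of a past day.
DELETE FROM bronze.sales WHERE sale_date = $run_date;
INSERT INTO bronze.sales
SELECT * FROM raw.csv.sales
WHERE sale_date = $run_date AND region = $region;
```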

INCLUDE SCRIPT

Shared SQL modules that multiple pipelines can include. Write common logic once, reference it everywhere.
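A sketch of how a shared module might be referenced; the exact INCLUDE SCRIPT argument form and the module path are assumptions:

```sql
-- shared/cleanup.sql holds common logic, e.g. pruning old rows.
-- Pull it into this pipeline before the main statements run:
INCLUDE SCRIPT 'shared/cleanup.sql';

CREATE DELTA TABLE IF NOT EXISTS silver.sales
AS SELECT * FROM bronze.sales;
```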

Pipeline Lifecycle

STATUS field supports DRAFT and ACTIVE. Develop and test pipelines in draft mode before enabling scheduled execution.
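A draft declaration might look like this (pipeline name illustrative):

```sql
-- Discovered and editable, but excluded from scheduled execution
PIPELINE churn_model_features
  DESCRIPTION 'Feature table for churn model (under development)'
  STATUS DRAFT
;
```

Flipping STATUS to ACTIVE and committing the change is what promotes the pipeline to scheduled execution.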

Fail-Fast Mode

FAIL_FAST true stops the pipeline on the first statement error. Set to false to continue executing remaining statements.

Concurrent Execution Limits

MAX_CONCURRENT controls how many instances of a schedule can run at once. Prevent overlapping runs and resource contention.
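The two controls above combine naturally in one declaration pair; this is a minimal sketch using only fields shown in the earlier example (the names hourly_load and hourly_sales_load are illustrative):

```sql
-- Only one instance of this schedule runs at a time; a trigger that
-- fires while a run is still in progress does not start an overlap.
SCHEDULE hourly_load
  CRON '0 * * * *'
  MAX_CONCURRENT 1
  ACTIVE
;

-- Stop at the first failing statement instead of running the rest.
PIPELINE hourly_sales_load
  SCHEDULE 'hourly_load'
  FAIL_FAST true
  STATUS ACTIVE
;
```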

From Code to Production

A complete development lifecycle for SQL pipelines — built around the tools your team already knows

[Lifecycle diagram: Develop (write SQL in the Desktop GUI or VS Code; highlight a statement and run it SSMS-style with inline results) → Version Control (commit pipelines/daily_etl.sql and push to the workspace repo) → Production (approval gate that clears on source change; cron-scheduled execution on compute workers, with the Git SHA recorded for audit; execution history, statement-level results, and lineage for monitoring). Pipeline results are written as Delta tables — time travel, schema evolution, ACID transactions.]
1. Develop

Write SQL pipelines in a full-featured editor with IntelliSense, catalog browsing, and SSMS-style inline execution.

  • Open the Pipeline Designer in Desktop GUI or VS Code
  • Write sequential SQL statements separated by semicolons
  • Highlight any statement and press Run for instant results
  • Browse the data catalog sidebar for schema reference
  • Preview data lineage across statements
2. Version Control

Every pipeline is a SQL file stored in a Git repository. Each workspace has its own repo with full branching support.

  • Pipelines saved as pipelines/<name>.sql
  • Commit, push, pull, and branch from the toolbar
  • View diffs and switch branches without leaving the editor
  • Create pull requests for code review
  • Full Git history for audit and rollback
3. Production

Promote pipelines through approval gates and schedule them for automatic execution on compute workers.

  • Approval gates ensure reviewed code runs in production
  • Cron-based scheduling with timezone support
  • Pipelines execute on selected compute workers
  • Git commit SHA recorded with each execution
  • Approval clears automatically when source changes

Workspace-Centric Development

Workspaces organize pipelines, permissions, and source control into a single governed unit

[Example workspace: patient-analytics (Team visibility, 3 members, 2 pipelines). Git repository: one per workspace, with main and feature/new-transform branches. Pipelines: ingest_patient_data.sql, transform_analytics.sql. Members & RBAC: Alice (Owner), Bob (Editor), Eve (Viewer); Private/Team/Public visibility; RBAC enforced on all pipeline operations. Scheduling: cron 0 6 * * MON-FRI on compute node worker-pool-1, approval required.]

One Workspace, Everything Connected

A workspace is the unit of collaboration. It groups pipelines with their Git repository, team permissions, and execution schedules.

One Git repository per workspace

All pipelines in a workspace share a single Git repo. Branch, merge, and review changes as a team.

Team-based access control

Owners, editors, and viewers. RBAC governs who can edit pipelines, approve for production, or view results.

Pipeline-level scheduling

Each pipeline has its own cron schedule, compute node assignment, and approval gate for production.

Approval gates

Mark pipelines as requiring approval before scheduled execution. Approval clears automatically when source code changes.

Full Git Integration, Built In

All Git operations are available directly from the pipeline editor toolbar — no terminal needed

Commit & Push

Save your pipeline changes to the catalog, then commit and push to the remote Git repository — all from the editor toolbar dropdown.

Branching

Create feature branches, switch between branches, and merge changes. The toolbar shows your current branch and sync status at a glance.

Pull & Sync

Pull the latest changes from remote before starting work. Branch status indicators show clean (green), modified (yellow), or conflict (red).

Diff & History

View diffs of your changes before committing. Full commit history is available for auditing and understanding how pipelines evolved over time.

Pull Requests

Create pull requests directly from the editor for team code review. Ensure pipeline changes are reviewed before they reach production.

Audit Trail

Every scheduled execution records the Git commit SHA. Trace any production result back to the exact version of SQL that produced it.

Automatic Data Lineage

Understand how data flows through your pipelines — built in, not bolted on

Table-Level Tracking

See which tables feed which downstream tables. Lineage maps the full data flow across your pipeline from source tables to final outputs.

Fully Automatic

Lineage is derived from your pipeline SQL — zero configuration, zero manual annotation. Write your SQL and the lineage graph appears.

Native Feature

Not an add-on or third-party integration. Data lineage is built into the Delta Forge platform from day one, available on every plan.

Flow Visualization

Visualize upstream and downstream dependency graphs directly in the editor. Understand at a glance how data moves through your pipeline.

Pipeline-Aware

Tracks data flow across multi-statement SQL pipelines. Temporary tables, CTEs, and intermediate results are followed across statement boundaries.
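For example, lineage through an intermediate temporary table would be followed across statement boundaries like these (the CREATE TEMP TABLE form and all table and column names are illustrative assumptions):

```sql
-- Statement 1: intermediate result, tracked as a lineage node
CREATE TEMP TABLE recent_sales AS
SELECT id, region, amount
FROM bronze.sales
WHERE sale_date >= $run_date;

-- Statement 2: final output; lineage links silver.sales_by_region
-- back through recent_sales to bronze.sales
CREATE DELTA TABLE IF NOT EXISTS silver.sales_by_region
AS SELECT region, SUM(amount) AS total
FROM recent_sales
GROUP BY region;
```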

Impact Analysis

Understand what breaks when a source table changes. Trace downstream dependencies to assess the blast radius of schema changes before they happen.

See the pipeline workflow in action

Git-native pipelines, automatic execution order, and zero orchestration code.