Skip to content
Architecture

Three tiers, your infrastructure

DeltaForge is software you install and operate. A control plane governs the catalog and access. Compute workers execute queries against your storage. Nothing runs outside your network.

Runs on your VMs, on-prem, or Kubernetes
No JVM, no Spark dependency
Data stays on storage you own
GUI / VS Code ODBC / Python MCP / CLI SQL Compute Worker native execution engine Control Plane catalog + RBAC auth read/write Your Storage S3 / ADLS / GCS / MinIO / Local Delta Lake tables Iceberg tables ODBC Power BI Tableau Excel

Three components, one platform

Each tier has a clear responsibility and runs on infrastructure you operate

Control Plane

Stores the catalog (schemas, tables, columns, views, pipelines), users, roles, storage credentials, audit log, cron-scheduled jobs, and the compute-worker registry. Metadata only: no data moves through it.

Compute Workers

Stateless native execution engines. Each query runs on one worker by default. Add more workers for more concurrency. Workers register with the control plane on startup and accept queries over HTTP.

Your Storage

Delta Lake and Iceberg tables stored as Parquet files on S3, ADLS Gen2, GCS, MinIO, or local disk. No proprietary format, no lock-in. Compatible engines read the same files.

Client Interfaces

Desktop GUI, VS Code extension, CLI, MCP server for Claude and Cursor, and an ODBC driver for Power BI, Tableau, Excel, .NET, Python, and R. All speak SQL to a worker over HTTP.

Where it runs

Customer-installed on infrastructure you already control

Cloud VMs

Deploy on Azure VMs, AWS EC2, or GCP Compute Engine alongside your existing cloud infrastructure. Storage talks directly to ADLS, S3, or GCS.

Kubernetes

Run control plane and workers as containers in your cluster. Helm charts ship with the product. Add more worker pods to increase query concurrency.

On-premises

Bare-metal servers, VMware, or Docker Compose on-prem. Pair with MinIO or a NAS mount for S3-compatible storage inside your data centre.

Security and access

Governance enforced at the query layer, not bolted on after

RBAC

Built-in and custom roles with GRANT and REVOKE at schema and table level. Workers check permissions before executing any read or write.

Audit log

Every query, credential access, and permission change is logged with user, timestamp, and outcome. Supports SOC 2 and HIPAA audit evidence workflows.

Credential vault

Storage credentials (SAS tokens, access keys, service account keys) are stored encrypted in the control plane and injected into workers at query time.

Deploy on your terms

You install and operate DeltaForge. No managed service, no external dependencies for your data path.