DeltaForge is software you install and operate. A control plane governs the catalog and access. Compute workers execute queries against your storage. Nothing runs outside your network.
Each tier has a clear responsibility and runs on infrastructure you operate
Stores the catalog (schemas, tables, columns, views, pipelines), users, roles, storage credentials, audit log, cron-scheduled jobs, and the compute-worker registry. Metadata only: no data moves through it.
Stateless native execution engines. Each query runs on one worker by default. Add more workers for more concurrency. Workers register with the control plane on startup and accept queries over HTTP.
Delta Lake and Iceberg tables stored as Parquet files on S3, ADLS Gen2, GCS, MinIO, or local disk. No proprietary format, no lock-in. Compatible engines read the same files.
Desktop GUI, VS Code extension, CLI, MCP server for Claude and Cursor, and an ODBC driver for Power BI, Tableau, Excel, .NET, Python, and R. All speak SQL to a worker over HTTP.
Customer-installed on infrastructure you already control
Deploy on Azure VMs, AWS EC2, or GCP Compute Engine alongside your existing cloud infrastructure. Storage talks directly to ADLS, S3, or GCS.
Run control plane and workers as containers in your cluster. Helm charts ship with the product. Add more worker pods to increase query concurrency.
Bare-metal servers, VMware, or Docker Compose on-prem. Pair with MinIO or a NAS mount for S3-compatible storage inside your data centre.
Governance enforced at the query layer, not bolted on after
Built-in and custom roles with GRANT and REVOKE at schema and table level. Workers check permissions before executing any read or write.
Every query, credential access, and permission change is logged with user, timestamp, and outcome. Supports SOC 2 and HIPAA audit evidence workflows.
Storage credentials (SAS tokens, access keys, service account keys) are stored encrypted in the control plane and injected into workers at query time.
You install and operate DeltaForge. No managed service, no external dependencies for your data path.