Skip to content
Connectors

Read any format. Write to Delta.

DeltaForge reads columnar, row, semi-structured, and industry formats through a unified SQL surface. Every read path lands in Delta tables on your storage, with schema inference and evolution handled automatically.

Delta, Iceberg, Parquet, ORC, Avro, CSV, JSON, XML, Excel, EDI, HL7, FHIR, Protobuf
Visual Flattener for nested formats: JSON, XML, EDI, HL7, FHIR, Protobuf
Schema inference across 40+ locales, schema evolution across files
Parquet / ORC CSV / JSON XML / Excel Avro / Protobuf EDI / HL7 / FHIR Iceberg Delta Forge SQL engine Delta Tables on your storage ACID, versioned 13 formats in. Delta out. Your storage.

Formats you can read and write

Every format listed here is a real code path in the engine, not a roadmap item.

Delta Lake and Iceberg

Delta tables with full ACID and time travel. Iceberg UniForm metadata written alongside Delta so Iceberg-native readers can consume your tables. Native Iceberg reader for tables written by other engines.

Columnar: Parquet, ORC, Avro

Column pruning and predicate pushdown for Parquet and ORC. Schema evolution across Avro files with type promotion and null-filling. Compression codec auto-detection in all three.

Text: CSV, JSON, NDJSON, XML, Excel

Culture-aware type inference across 40+ locales for CSV. Subtree capture for JSON and XML so nested objects stay as JSON columns. Multi-sheet Excel with header detection and per-sheet type inference.

Industry: EDI, HL7, FHIR, Protobuf

Segment-based EDI flattening, HL7 component and field aliasing, FHIR resource bundle unbundling, and Proto3 binary parsing with schema-driven decoding and enum resolution.

Visual Flattener for nested formats

Point at a file. Browse the tree. Decide what to flatten, explode, or keep as a JSON blob. The configuration persists to the table and applies on every query.

Interactive tree view

Discover all nested paths, types, and sample values automatically. Works identically across JSON, XML, EDI, HL7, FHIR, and Protobuf: one visual experience for all six.

Per-field control

Choose how each path lands in the table: pull it out as a column, explode array elements into separate rows, keep a nested subtree as a JSON blob, or let the engine flatten it automatically.

Schema evolution

Files with different structures are merged automatically. New paths null-fill older rows. Path aliases map multiple source paths to one output column.

Schema inference

No manual schema definitions required for CSV, JSON, or XML. The engine samples, infers, and generates the cast expressions.

Culture-aware parsing

German dates (DD.MM.YYYY), French decimals (1 234 567,89), US dates (MM/DD/YYYY): the inference engine detects locale-specific separators and formats automatically across 40+ locales.

Auto-generated transform views

Inferred types produce SQL cast expressions and a ready-to-use transform view. Schema merging is configurable: accept new columns, require exact agreement, or restrict to the common set. Null-fill and type widening are handled automatically.

Connect your data sources

Read from the formats you have. Write to the Delta tables you own. No format conversion layer in between.