DeltaForge reads columnar, row, semi-structured, and industry formats through a unified SQL surface. Every read path lands in Delta tables on your storage, with schema inference and evolution handled automatically.
Every format listed here is a real code path in the engine, not a roadmap item.
Delta tables with full ACID and time travel. Iceberg UniForm metadata written alongside Delta so Iceberg-native readers can consume your tables. Native Iceberg reader for tables written by other engines.
Column pruning and predicate pushdown for Parquet and ORC. Schema evolution across Avro files with type promotion and null-filling. Compression codec auto-detection in all three.
Culture-aware type inference across 40+ locales for CSV. Subtree capture for JSON and XML so nested objects stay as JSON columns. Multi-sheet Excel with header detection and per-sheet type inference.
Segment-based EDI flattening, HL7 component and field aliasing, FHIR resource bundle unbundling, and Proto3 binary parsing with schema-driven decoding and enum resolution.
Point at a file. Browse the tree. Decide what to flatten, explode, or keep as a JSON blob. The configuration persists to the table and applies on every query.
Discover all nested paths, types, and sample values automatically. Works identically across JSON, XML, EDI, HL7, FHIR, and Protobuf: one visual experience for all six.
Choose how each path lands in the table: pull it out as a column, explode array elements into separate rows, keep a nested subtree as a JSON blob, or let the engine flatten it automatically.
Files with different structures are merged automatically. New paths null-fill older rows. Path aliases map multiple source paths to one output column.
No manual schema definitions required for CSV, JSON, or XML. The engine samples, infers, and generates the cast expressions.
German dates (DD.MM.YYYY), French decimals (1 234 567,89), US dates (MM/DD/YYYY): the inference engine detects locale-specific separators and formats automatically across 40+ locales.
Inferred types produce SQL cast expressions and a ready-to-use transform view. Schema merging is configurable: accept new columns, require exact agreement, or restrict to the common set. Null-fill and type widening are handled automatically.
Read from the formats you have. Write to the Delta tables you own. No format conversion layer in between.